1 .. This work is licensed under a Creative Commons Attribution 4.0 International License.
2 .. http://creativecommons.org/licenses/by/4.0
3 .. Copyright (C) 2019 AT&T Intellectual Property
8 All notable changes to this project will be documented in this file.
10 The format is based on `Keep a Changelog <http://keepachangelog.com/>`__
11 and this project adheres to `Semantic Versioning <http://semver.org/>`__.
17 * Add counters of create/update/delete actions on policy types and instances
18 * Add Prometheus /metrics endpoint to report counter data
24 * Fix _send_msg method to free allocated RMR message buffers
25 * Adjust send-message methods to retry only on RMR_ERR_RETRY
26 * Extend send-message methods to log message state after send
27 * Use constants from ricxappframe.rmr instead of hardcoded strings
28 * Upgrade RMR to version 4.0.5
29 * Upgrade tavern to version 1.2.2
30 * Extend user guide with southbound API schemas
36 * Revise Dockerfile to set user as owner of .local dir with a1 package
37 * Rename console shell start script to run-a1 from run.py
38 * Extend start script to report webserver listening port
39 * Add tiny RMR routing table for use in demo and test
40 * Extend documentation for running a container locally
41 * Add documentation of start/init parameters to _RmrLoop class
42 * Add new environment variable USE_FAKE_SDL (`RIC-351 <https://jira.o-ran-sc.org/browse/RIC-351>`_)
43 * Respond with error if policy type ID differs from ID in object on create
44 * Upgrade integration tests to use Tavern version 1.0.0
50 * Upgrade to rmr 4.0.2
51 * Upgrade integration tests to xapp-frame-go version 0.4.8 which drops NNG
52 * Extend exception handler to return error details in HTTP response
53 * Ensure that policy type ID on path matches ID in object
54 * Add OpenAPI spec to RST documentation
61 * Switch to using rmr in the ricxappframe
68 * Switch to SI95 from NNG (rmr v3 vs rmr v1)
69 * The switch to SI95 led to a rabbit hole in which we eventually discovered that rmr_send may sometimes block for an arbitrary period of time. Because of this issue, a1's sends are now threaded. Please see the longer comment about this in a1rmr.
70 * Bump version of py xapp frame (SDL used only) in A1
71 * Bump version of go xapp frame (0.0.24 -> 0.4.2) in integration tests
72 * Add some additional logging in A1
78 * SDL Wrapper was moved into the python xapp framework; use it from there instead.
84 * This is a pretty big amount of work/changes, however no APIs were changed hence the semver patch
85 * Switches A1's three test receivers (integration tests) over to golang; this was mostly done to learn the go xapp framework and they are identical in functionality.
86 * Upgrades the version of rmr in A1 and all integration receivers to 1.13.*
87 * Uses a much fancier Docker build to reduce the size of a1's image. The python:3.7-alpine image itself is 98MB and A1 is now only ~116MB, so we're done optimizing A1's container size.
92 * Upgrades from sdl 2.0.2 to 2.0.3
93 * Integrates an sdl healthcheck into a1's healthcheck
99 * Upgrades from sdl 1.0.0 to 2.0.2
100 * Delete a1test_helpers because SDL 2.0.2 provides the mockup we need
101 * Remove general catch all from A1
107 * Represents a resillent version of 2.0.0 that uses Redis for persistence
108 * Now relies on SDL and dbaas; SDL is the python interface library to dbaas
109 * Adds a 503 http code to nearly all http methods, as A1 now depends on an upstream system
110 * Integration tests have a copy of a dbaas helm chart, however the goal is to simplify that deployment per https://jira.o-ran-sc.org/browse/RIC-45
111 * Unit tests have a mockup of SDL, however again the goal is to simplify as SDL grows per https://jira.o-ran-sc.org/browse/RIC-44
117 * Implements new logic around when instances are deleted. See flowcharts in docs/. Basically timeouts now trigger to actually delete instances from a1s database, and these timeouts are configurable.
118 * Eliminates the barrier to deleting an instance when no xapp evdr replied (via timeouts)
119 * Add two new ENV variables that control timeouts
120 * Make unit tests more modular so new workflows can be tested easily
121 * Fixes the API for ../status to return a richer structure. This is an (albeit tiny) API change.
122 * Clean up unused items in the integration tests helm chart
123 * Removed "RMR_RCV_RETRY_INTERVAL" leftovers since this isn't used anymore
124 * Uses the standard RIC logging library
125 * Switch the backend routing scheme to using subscription id with constant message types, per request.
126 * Given the above, policy type ids can be any valid 32bit greater than 0
127 * Decouple the API between northbound and A1 from A1 with xapps. This is now two seperate OpenAPI files
128 * Update example for AC Xapp
129 * Updgrade rmr and rmr-python to utilize new features; lots of cleanups because of that
130 * Implements a POLICY QUERY feature where A1 listens for queries for a policy type. A1 then responds via multiple RTS messages every policy instance of that policy type (and expects an ACK back from xapps as usual). This feature can be used for xapp recovery etc.
136 * Only external change here is to healthcheck the rmr thread as part of a1s healthcheck. k8s will now respin a1 if that is failing.
137 * Refactors (simplifies) how we wait for rmr initialization; it is now called as part of __init__
138 * Refactors (simplifies) how the thread is actually launched; it is now internal to the object and also a part of __init__
139 * Cleans up unit testing; a1rmr now exposes a replace_rcv_func; useful for unit testing, harmless if not called otherwise
140 * Upgrades to rmr-python 1.0.0 for simpler message allocation
146 * Move database cleanup (e.g., deleting instances based on statuses) into the polling loop
147 * Rework how unit testing works with the polling loop; prior, exceptions were being thrown silently from the thread but not printed. The polling thread has now been paramaterized with override functions for the purposes of testing
148 * Make type cleanup more efficient since we know exactly what instances were touched, and it's inefficient to iterate over all instances if they were not
149 * Bump rmr-python version, and bump rmr version
150 * Still an item left to do in this work; refactor the thread slightly to tie in a healthcheck with a1s healthcheck. We need k8s to restart a1 if that thread dies too.
156 * a1 now has a seperate, continuous polling thread, which will enable operations like database cleanup
157 (based on ACKs) and external notifications in real time, rather than when the API is invoked
158 * all rmr send and receive operations are now in this thread
159 * introduces a thread safe job queue between the two threads
160 * Not done yet: database cleanups in the thread
161 * Bump rmr python version
162 * Clean up some logging
168 * Moves the "database" access calls to mimick the SDL API, in preparation for moving to SDL
169 * Does not yet actually use SDL or Redis, but the transition to those will be much shorter after this change.
175 * Represents v1.0.0 of the A1 API for O-RAN-SC Release A
177 - Implement type DELETE
178 - Clean up where policy instance cleanups happen
186 * Upgrade rmr to 1.9.0
187 * Upgrade rmr-python to 0.13.2
188 * Use the new helpers module in rmr-python for the rec all functionality
189 * Switch rmr mode to a multithreaded mode that continuously reads from rmr and populates an internal queue of messages with a deterministic queue size (2048) which is better behavior for A1
190 * Fix a memory leak (python obj is garbage collected but not the underlying C memory allocation)
199 * Implement instance delete
200 * Moves away from the status vector and now aggregates statuses
201 * Pop through a1s mailbox "3x as often"; on all 3 kinds of instance GET since all such calls want the latest information
202 * Misc cleanups in controller (closures ftw)
203 * Add rmr-version.yaml for CICD jobs
210 * Implement GET all policy type ids
211 * Implement GET all policy instance ids for a policy type
212 * fix a tiny bug in integration test receiver
220 * switch to rmr 1.8.1 to pick up a non blocking variant of rmr that deals with bad routing tables (no hanging connections / blocking calls)
221 * improve test receiver to behave with this setup
222 * add integration test for this case
223 * this also switches past 1.5.x, which included another change that altered the behavior of rts; deal with this with a change to a1s helmchart (env: `RMR_SRC_ID`) that causes the sourceid to be set to a1s service name, which was not needed prior
224 * improve integration tests overall
234 * Remove RIC manifest
235 * Read type GET to get schema for instance PUT
236 * Remove Utils (no longer needed)
237 * lots more tests (unit and integration)
244 * This is on the road to release 1.0.0. It is not meant to be tested (E2E) as it's own release
245 * Implement the Release A spec in the openapi.yaml
246 * Rework A1 to follow that spec
247 * Remove rmr_mapping now that we use policyid as the mtype to send and a well known mtype for the ACKs
248 * Add the delay receiver test to the tavern integration tests
249 * Remove unneeded ENV variables from helm charts
250 * Switch away from builder images to avoid quicksand; upgrade rmr at our own pace
258 * Update to later rmr-python
259 * Add docs about upgrading rmr
260 * remove bombarder since tavern runs apache bench
268 * Update to later rmr-python
275 * Greatly reduce the size of A1 docker from 1.25GB to ~278MB.
276 * Add a seperate dockerfile for unit testing
284 * Rename all /ric/ URLs to be consistent with requirements of /a1-p/
292 * Implement the GET on policies
293 * Add a new endpoint for healthcheck. NOTE, it has been decided by oran architecture documents that this policy interface should be named a1-p in all URLS. In a future release the existing URLs will be renamed (existing URLs were not changed in this release).
301 * Fix the 400, which was in the API, but wasn't actually implemented
302 * Update the test fixture manifests to reflect the latest adm control, paves way for next feature coming which is a policy GET
311 * Use base Docker with NNG version 1.1.1
320 * Upgrade RMR due to a bug that was preventing rmr from init in kubernetes
329 * Run unit tests as part of docker build
338 * Convert docs to appropriate format
339 * Move rmr string to int mapping to a file
348 * Use tavern to test the actual running docker container
349 * Restructures the integration tests to run as a single tox command
350 * Re-ogranizes the README and splits out the Developers guide, which is not needed by users.
358 * Adds a defense mechanism against A1 getting queue-overflowed with messages A1 doesnt care about; A1 now ignores all incoming messages it's not waiting for, so it's queue size should now always be "tiny", i.e., never exceeding the number of valid requests it's waiting for ACKs back for
359 * Adds a test "bombarding" script that tests this
367 * Main purpose of this change is to fix a potential race condition where A1 sends out M1 expecting ACK1, and while waiting for ACK1, sends out M2 expecting ACK2, but gets back ACK2, ACK1. Prior to this change, A1 may have eaten ACK2 and never fufilled the ACK1 request.
368 * Fix a bug in the unit tests (found using a fresh container with no RIC manifest!)
369 * Fix a (critical) bug in a1rmr due to a rename in the last iteration (RMR_ERR_RMR_RCV_RETRY_INTERVAL)
370 * Make unit tests faster by setting envs in tox
371 * Move to the now publically available rmr-python
372 * Return a 400 if am xapp does not expect a body, but the PUT provides one
373 * Adds a new test policy to the example RIC manifest and a new delayed receiver to test the aformentiond race condition
381 * Upgrade to rmr 0.10.0
382 * Fix bad api spec RE GET
383 * Fix a (big) bug where transactionid wasn't being checked, which wouldn't have worked on sending two policies to the same downstream policy handler
391 * Rip some testing structures out of here that should have been in rmr (those are now in rmr 0.9.0, upgrade to that)
392 * Run Python BLACK for formatting
400 * Fix a blocking execution bug by moving from rmr's timeout to a non blocking call + retry loop + asyncronous sleep
401 * Changes the ENV RMR_RCV_TIMEOUT to RMR_RCV_RETRY_INTERVAL
409 * Update to rmr 0.8.3
410 * Change 503 to 504 for the case where downstream does not reply, per recommendation
411 * Add a 502 with different reasons if the xapp replies but with a bad/malformed/missing status
412 * Make testing much more modular, in anticipating of moving some unit test functionality into rmr itself
420 * Crash immediately if manifest isn't mounted
421 * Add unit tests for utils
430 * Upgrade A1 to rmr 0.8.0
431 * Go from deb RMR installation to git
432 * Remove obnoxious receiver logging
440 * Upgrade A1 to rmr 0.6.0
448 * Add license headers
456 * Introduce RIC Manifest
457 * Move some testing functionality into a helper module
458 * Read the policyname to rmr type mapping from manifest
459 * Do PUT payload validation based on the manifest
467 * Bump rmr python dep version
468 * Include a Dockerized test receiver
469 * Stencil out the mising GET
471 * Include a test docker compose file
479 * Initial Implementation