mirrors/onos - onos - gitea@git.xfx1.de

mirror of https://github.com/opennetworkinglab/onos.git synced 2026-01-04 16:11:27 +01:00

Author	SHA1	Message	Date
Carmelo Cascone	c32976e7d5	Various fixes in preparation of Stratum demo at ONS 2019 - Do not read counters with table entries for Barefoot drivers - If driver behavior setup fails, log which operation we are aborting - Remove unnecessary setup steps in Stratum-related drivers - Always get clients by their key in gRPC-based drivers - Log when P4Runtime group operation fails because of missing group in store - Fix polling of table entry counters for P4Runtime driver Change-Id: Ic9bf19b76d8cb5a191aec24852af4410fea8b998	2019-04-08 23:00:54 +00:00
Carmelo Cascone	a7e60c65c8	Various fixes for P4Runtime group handling - Workaround for PI bug that ignores max_group_size - Use max_group_size and not buckets size when translating groups Change-Id: Id12a12311b20ca8fb4e785e1c5a4f0f4215d1bbf	2019-03-23 01:04:10 +00:00
Carmelo Cascone	a1cabe50a5	Do not clear the P4Runtime device mirror when devices go offline Implementations of P4Runtime like Stratum are expected to persist table entries and other data plane state across reboots. Change-Id: I4395e9e60b395bfca85c71c9d3bc604a2269a3ce	2019-03-19 19:51:43 +00:00
Carmelo Cascone	b4863b3f40	Fix missing P4Runtime groups because of infinite delete/insert cycle The fix is simple: when cleaning up inconsistent entries from device, we update the mirror when the response is received, and not before sending the request. Otherwise, if delete goes wrong, writes happening right after reconciliation cycle might find an inconsistent mirror state. When writing entries (e.g. apply group/flow rule) we keep updating the mirror before sending the request to handled the case of back-to-back writes. Change-Id: I9e1cc5cac3f8746c67e93e2cee17aff78d3f1d7e	2019-03-14 06:10:59 +00:00
Carmelo Cascone	1ae25061fb	Add stratum-tofino driver This patch also introduces a new driver property flag to indicate whether a P4Runtime target supports default table entries or not. Stratum targets built against the current version of p4lang/PI do not. Change-Id: I1fbb57521516bee99057319ed1695cb05b68ee7c	2019-03-08 06:15:07 +00:00
Carmelo Cascone	6146946e16	Fix inconsistent update type during concurrent P4Runtime writes This is achieved by optimistically updating the P4Runtime mirror using the write request (instead of waiting for a response) and by serializing building write requests for the same device. This change requires updating the P4Runtime protocol classes to expose the content of the write request. It also includes: - force member weight to 1 when reading groups (some server implementation still fails to be compliant to the spec) - remove unused operation timeout handling in GDP (now all RPCz have a timeout) Change-Id: Ib4f99a6085c1283f46a2797e0c883d96954e02e9	2019-03-06 19:28:42 +00:00
Carmelo Cascone	3977ea4d9e	Refactor channel and mastership handling in P4Runtime This (big) change aims at solving the issue observed with mastership flapping and device connection/disconnection with P4Runtime. Channel handling is now based on the underlying gRPC channel state. Before, channel events (open/close/error) were generated as a consequence of P4Runtime StreamChannel events, making device availability dependent on mastership. Now Stream Channel events only affect mastership (MASTER/STANDBY or NONE when the SteamChannel RPC is not active). Mastership handling has been refactored to generate P4Runtime election IDs that are compatible with the mastership preference decided by the MastershipService. GeneralDeviceProvider has been re-implemented to support in-order device event processing and to reduce implementation complexity. Stats polling has been moved to a separate component, and netcfg handling updated to only depend on BasicDeviceConfig, augmented with a pipeconf field, and re-using the managementAddress field to set the gRPC server endpoints (e.g. grpc://myswitch.local:50051). Before it was depending on 3 different config classes, making hard to detect changes. Finally, this change affects some core interfaces: - Adds a method to DeviceProvider and DeviceHandshaker to check for device availability, making the meaning of availability device-specific. This is needed in cases where the device manager needs to change the availability state of a device (as in change #20842) - Support device providers not capable of reconciling mastership role responses with requests (like P4Runtime). - Clarify the meaning of "connection" in the DeviceConnect behavior. - Allows driver-based providers to check devices for reachability and availability without probing the device via the network. Change-Id: I7ff30d29f5d02ad938e3171536e54ae2916629a2	2019-03-06 19:28:42 +00:00
Carmelo Cascone	4c289b731d	New P4RuntimeClient implementation that supports batching and error reporting The new client API supports batching and provides detailed response for write requests (e.g. if entity already exists when inserting), which was not possible with the old one. This patch includes: - New more efficient implementation of P4RuntimeClient (no more locking, use native gRPC executor, use stub deadlines) - Ported all codecs to new AbstractCodec-based implementation (needed to implement codec cache in the future) - Uses batching in P4RuntimeFlowRuleProgrammable and P4RuntimeGroupActionProgrammable - Minor changes to PI framework runtime classes Change-Id: I3fac42057bb4e1389d761006a32600c786598683	2019-02-04 23:12:46 -08:00
Carmelo Cascone	99c59dbb03	ONOS-7898 Action profile group/member refactoring Also includes: - New abstract P4Runtime codec implementation. Currently used for action profile members/groups encoding/deconding, the plan is to handle all other codecs via this. - Improved read requests in P4RuntimeClientImpl - Removed handling of max group size in P4Runtime driver. Instead, added modified group translator to specify a max group size by using information from the pipeline model. Change-Id: I684bae0184d683bb448ba19863c561f9848479d2	2019-01-17 18:11:43 -08:00
Carmelo Cascone	cb4327a09e	ONOS-7887 Rename action profile-related entities Members can exist outside of a group. Previous naming was ambiguous about this. Action group -> action profile group Action group member -> action profile member Change-Id: I5097e92253353d355b864e689f9653df2d318230	2019-01-10 13:52:52 -08:00
Carmelo Cascone	a71b849708	Removes the gRPC channel if an exception occurs while instantiating the client Includes also other minor changes to gRPC channel creation/connection process, such as: - More compact logs showing the gRPC client key - GrpcChannelController.connectChannel() now returns the same StatusRuntime exception, no need to wrap it in an IOException - Wait for channel shutdown after initial connection error Change-Id: Ib7d2b728b8c82d9f9b2097cffcebd31cac891b27	2018-12-17 17:58:54 -08:00
Carmelo Cascone	a46f554fb1	Fix runtime of P4-related stuff with new karaf Change-Id: I139575d5b038224ca8ccce4ec71ec8a84b35dfeb	2018-12-12 23:41:01 -08:00
Yi Tseng	d771648025	Cherry pick gNMI and Stratum related changes to this branch Cherry picked commits: 20211 Update gNMI version and build script 20247 [ONOS-7829] Implement AbstractGrpcClient and AbstractGrpcClientControl 20233 [ONOS-7141][ONOS-7142] Add GnmiClient and GnmiController 20234 Refactor OpenConfig gNMI device description descovery 20260 [ONOS-7831] Implement GnmiHandshaker 20270 Add Stratum driver Change-Id: I81ad8bce45251af5909cfcac0edbcfd11c8ebf1d	2018-12-13 01:05:05 +00:00
Carmelo Cascone	b5324e7168	Improve fabric.p4 to reduce pipeline resources and refactor pipeconf impl This patch affects both the P4 pipeline implementation and the Java pipeconf. P4 PIPELINE - Less tables and smarter use of metadata to reduce inter-tables dependencies and favor parallel execution of tables. - Removed unused actions / renamed existing ones to make forwarding behavior clearer (e.g. ingress_port_vlan table) - Remove co-existence of simple and hansed table. Hashed should be the default one, but implementations that do not support action profiles might compile fabric.p4 to use the simple one. - Use @name annotations for match fields to make control plane independent of table implementation. - Use @hidden to avoid showing actions and table on the p4info that cannot be controlled at runtime. - First attempt to support double VLAN cross-connect (xconnect table). - New design has been tested with "fabric-refactoring" branch of fabric-p4test: github.com/opennetworkinglab/fabric-p4test/tree/fabric-refactoring JAVA PIPECONF This patch brings a major refactoring that reflects the experience gathered in the past months of working on fabric.p4 and reasoning on its pipeconf implementation. Indeed, the FlowObjective API is under-specified and sometimes ambiguous which makes the process of creating and maintaining a pipeliner implementation tedious. This refactoring brings a simplified implementation by removing unused/ unnecessary functionalities and by recognizing commonality when possible (e.g. by means of abstract and utility classes). It also makes design patterns more explicit and consistent. Overall, the goal is to reduce technical debt and to make it easier to support new features as we evolve fabric.p4 Changes include: - Changes in pipeliner/interpreter to reflect new pipeline design. - By default translate objective treatment to PiAction. This favors debuggability of flow rules in ONOS. - Support new NextObjective’s NextTreatment class. - Remove lots of unused/unnecessary code (e.g. async callback handling for pending objective install status in pipeliner as current implementation was always returning success) - Gather commonality in abstract classes and simplify implementation for objective translator (filtering, forwarding, next) - New implementation of ForwardingFunctionTypes (FFT) that looks at criterion instance values along with their types (to avoid relying on case-specific if-else conditions to recognize variants of an FFT) - Adaptive translation of NextObjective based on presence of simple or hashed table. - Support DENY FilteringObjective Also: - Fix onos-p4-gen-constants to avoid generating conflicting PiMatchFieldId variable names. - Install Graphviz tools in p4vm to generate p4c graphs - Generate p4c graphs by default when compiling fabric.p4 - Use more compact Hex string when printing PI values Change-Id: Ife79e44054dc5bc48833f95d0551a7370150eac5	2018-12-11 14:48:06 -08:00
Ray Milkey	fe6afd8c38	Sonar suggestions Change-Id: I68d400185ffc885bf92b558a2565a49af149ef0c	2018-11-26 23:15:53 +00:00
Ray Milkey	5739b2cc0c	Merge branch 'master' into dev-karaf-4.2.1 Change-Id: I260f0ee72fa87f1547fa790f031c4980176992ac	2018-11-06 14:11:34 -08:00
Yi Tseng	2a340f7cc6	[ONOS-7829] Implement AbstractGrpcClient and AbstractGrpcClientControl Change-Id: I39cba6834e7fe8d1b60b576b9934c0b3cfa7104b	2018-11-05 19:05:35 +00:00
Thomas Vachuska	07782d262b	Eliminated old features.xml and app.xml files required by no-longer supported Maven build. Change-Id: I060af47c85c1b83d35f6cc8e462d6d609872b15a	2018-11-01 20:41:00 +00:00
Ray Milkey	a218d43062	Merge branch 'master' into merge Change-Id: Id77bb2de77dd288404c83e331b076aaf9aafacc5	2018-10-30 12:44:55 -07:00
ghj0504520	ed7340c7c2	ONOS-7797 Support adding an arbitrary number of members to an existing P4Runtime group Change-Id: I7b9ed589ca15957ee1b2780934a6da8e998bace3	2018-10-27 00:25:03 +00:00
hjtsao	1e8a1bd5ce	[ONOS-7818] Implement a device behaviour to get table statistics. Make FlowRuleDriverProvider periodically call FlowRuleProviderService.pushTableStatistics(). Change-Id: Ife2b67b8c6c3582281704afe6f9a90538afbc798	2018-10-26 22:47:29 +00:00
Ray Milkey	956bb1691a	Merge remote-tracking branch 'origin/master' into dev-karaf-4.2.1 Change-Id: If611105632e508cb17352c6f5b1cbcfd85f716d1	2018-10-26 10:53:44 -07:00
steven30801	7632e150e1	[ONOS-7596] Support reading table entries with counter data in P4Runtime Change-Id: I85bacb1697a6c881dd69ba74a2162c73ec0b8aee	2018-10-26 17:25:13 +00:00
Thomas Vachuska	22ea6126e4	Removing BUCK files and other Buck-related auxiliaries. Change-Id: I23691380efcd9bce4906695605214ed59ada0e87	2018-10-24 20:02:31 +00:00
Ray Milkey	db57f1c326	Merge branch 'master' into dev-karaf-4.2.1 Change-Id: I86b9d80581cd76a7c20e05201023090f9692d1ab	2018-10-09 11:26:45 -07:00
Carmelo Cascone	e44592f246	More robust P4Runtime group handling This patch solves the PENDING_UPDATE and PENDING_ADD_RETRY issue observed on the ONS EU topology. The P4Runtime action profile group handling has been re-implemented to be robust against inconsistencies of the device mirror, which is now periodically synchronized with the device state. Similarly, we implement a routine in the P4RuntimeClient to cleanup unused action profile members. This patch includes also: - Refactor PI handle classes to allow creating handles without the entity instance - Use list instead of collections in P4RuntimeClient methods, as order of updates sent and/or entities received from the device is important Change-Id: I2e7964ce90f43d66680131b47ab52aca32ab55d2	2018-10-05 23:45:29 +00:00
Carmelo Cascone	50d195ffb3	Do not insert or delete default action entries in P4Runtime Spec says: the default entry for a table is always set. It can be set at compile-time by the P4 programmer - or defaults to NoAction (which is a no-op) otherwise - and assuming it is not declared as const, can be modified by the P4Runtime client. Because the default entry is always set, we do not allow INSERT and DELETE updates on the default entry and the P4Runtime server must return an INVALID_ARGUMENT error code if the client attempts one. With this patch we convert insert or delete operations into modify ones (unless specified by a driver property, to support non-compliant devices). For delete, we use the interpreter to suggest a default action that is the same as the one when the pipeline was originally deployed. Also, we introduce the capability of synchronizing the device mirror with the device state. Change-Id: I3758fc11780eb0f1cf4ed5a295bd98b54b182e29	2018-10-05 23:45:18 +00:00
Carmelo Cascone	c1fa8f2f0b	More permissive group operations in P4RuntimeGroupProgrammable Change-Id: Ib5d0bdaad88536727d27b39252cc55ccf1ac183c	2018-10-05 23:45:07 +00:00
Carmelo Cascone	26600970b0	Keep P4Runtime device groups in sync with translator/mirror state Change-Id: I7257c2ab5f3d4118f30ecf3ae3820d95e5afa4c8	2018-10-05 23:45:02 +00:00
Carmelo Cascone	c7639fb240	Flush device mirror in P4Runtime driver when pipeline status is unknown Change-Id: Ida5874adc68a5d6ba5f1b5063880ac31c0a2ee9b	2018-10-05 23:44:58 +00:00
Carmelo Cascone	055e9b2c4e	More robust handling of multicast groups in P4Runtime Since we cannot read the device state, try to delete and re-insert if insert or modify operation fails. Change-Id: I35733c7e0af509317e92f991978c0a4ef36b9bc8	2018-10-05 23:44:54 +00:00
Ray Milkey	d84f89ba3d	[WIP] Upgrade ONOS to karaf version 4.2.1 Change-Id: I7cd40c995bdf1c80f94b1895fb3344e32404c7fa	2018-09-28 11:20:57 -07:00
Carmelo Cascone	33b27bc190	Ignore entries from constant tables in P4Runtime Change-Id: I6ba8591a9674287832a18e258c5dd3801c588a32 (cherry picked from commit ba73da5bd09a6b40ab5727ef4a03767d437687e8)	2018-09-10 08:33:35 +00:00
Carmelo Cascone	9e4972c5d8	Refactor P4Runtime subsystem to implement async connection procedure This patch is an attempt to solve issues observed when restarting both switches and ONOS nodes. Most of the issues seemed to depend on a brittle mastership handling when deploying the pipeline. With this patch, GDP registers devices to the core with available=false (i.e. offline) and marks them online only when the P4 pipeline has been deployed to the device. A new PiPipeconfWatchdogService takes care of deploying pipelines and producing event when devices are ready. Moreover, we fix a race condition where pipeconf-related behaviors were not found. This was caused by GDP enforcing the merged driver name in the network config, while external entities (e.g. Mininet) were pushing a JSON blob with the base driver name. This patch removes the need to rely on such a trick and instead uses pipeconf-aware logic directly in the driver manager (change #19622). Finally, we fix issues in P4RuntimeClientImpl that were causing the stream channel not detecting unreachable devices. The solution is to follow gRPC APIs and re-instantiate a new channel once the first fails. Change-Id: I6fbc91859c0fb58a6db3bc197b7081a8fe9f97f7	2018-09-06 03:08:09 +00:00
Yi Tseng	fe13f3e634	Fix timeout problem of fabric pipeliner Including a workaround for ONOS-7785 Change-Id: I867161f5edf63e82c42a731a4b107ea326d4675c	2018-08-22 20:13:34 +00:00
Ray Milkey	761e80a427	Remove pom files from ONOS Change-Id: I53e2395466cba10222d638736635b11380421d98	2018-08-21 18:11:58 +00:00
Ray Milkey	312d987732	Starting snapshot 1.15.0-SNAPSHOT Change-Id: I9239c7695f7faf6dc184a0ff54a34dedf4761b2d	2018-08-20 16:09:54 -07:00
Ray Milkey	05d609aa02	buildifier reformatting of bazel files Change-Id: I1645613ec9dc57e659e7036634fcf3f3992b05a7	2018-08-15 17:55:00 +00:00
ONOS Jenkins User	03e5521fba	Starting snapshot 1.14.0-SNAPSHOT	2018-08-14 22:13:58 +00:00
ONOS Jenkins User	3594c24cb2	Tagging 1.14.0-rc2	2018-08-14 22:13:02 +00:00
ONOS Jenkins User	4e5f2d67ac	Starting snapshot 1.14.0-SNAPSHOT	2018-08-13 23:16:37 +00:00
ONOS Jenkins User	cff0bd8922	Tagging 1.14.0-rc1	2018-08-13 23:15:43 +00:00
Ray Milkey	4867af2f9c	Revert "Tagging 1.14.0-b1" This reverts commit 78ce40093ac008fd6d169ab83d2d5961f91f6e31. Change-Id: I8482f550c30f75d65333f6be9c9115a930abc56f	2018-08-10 16:53:10 -07:00
Carmelo Cascone	4164436785	More p4runtime-related bazel build files Change-Id: Icdd02a06d7905e4cd0a6c61d282fd522c7b75fe7	2018-08-10 18:42:44 +00:00
Ray Milkey	78ce40093a	Tagging 1.14.0-b1	2018-08-10 11:16:44 -07:00
Carmelo Cascone	58136816ab	ONOS-7739 Support for P4Runtime multicast programming Design doc: https://docs.google.com/document/d/13rkQlwr49M-uxQQEuxCMP7BFEPY2gtQ850Hn3gUfesU/edit#heading=h.lzdayib259sq Change-Id: Ief00bec89fe5a9784b0ee13fdaafa3ae58ab654f	2018-07-20 18:16:17 +00:00
Carmelo Cascone	7044efd07e	More bugfixes in P4Runtime subsystem - NPE when removing agent listeners - Don't block netcfg event dispatch thread on GDP - Avoid unnecessary warn logs during disconnection in GDP Change-Id: I612f7f7914579eea9ba393e952377a3933d92e8d	2018-07-06 13:01:36 +02:00
Carmelo Cascone	158b8c4f6b	More improvements and bugfixes in P4Runtime subsystem Most notably, we fix a bug in which some nodes were not able to find pipeconf-specific behaviors for a given device. The problem is not completelly solved but it's mitigated. There's a race condition caused by the fact that the GDP updates the cfg with the merged driver name before advertising the device to the core. Some nodes might receive the cfg update after the device has been advertised. We mitigate the problem by performing the pipeline deploy (slow operation) after the cfg update, giving more time for nodes to catch up. Perhaps we should listen for cfg update events before advertising the device to the core? Also: - NPE when getting P4Runtime client - Detect if a base driver is already merged in pipeconf manager - Longer timeouts in P4Runtime driver and protocol (for slow networks) - Configurable timeout in P4Runtime driver and GDP - NPE when adding/removing device agent listeners in P4Rtunime handshaker - Various exceptions due to race conditions in GDP when disconnecting devices (by serializing disconnect tasks per device) - NPE when cancelling polling tasks in GDP - Refactored PipeconfService to distinguish between driver merge, pipeconf map update, and cfg update (now performed in the GDP) - Fixed PipeconfManagerTest, not testing driver behaviours - Use Guava striped locks when possible (more memory-efficient than maps, and with strict atomicity guarantees w.r.t. to caches). Change-Id: I30f3887541ba0fd44439a86885e9821ac565b64c	2018-07-06 08:13:44 +00:00
Yi Tseng	f325a60933	[ONOS-7435] Add configurable properties in P4RuntimeGroupProgrammable Change-Id: Ic2d847f4962bef8d6b91732e590764cdef5c2e30	2018-07-02 10:45:22 +00:00
Carmelo Cascone	e5b2872eb6	Improve scalability of P4Runtime subsystem The P4Runtime client was hanging (deadlock) on a master arbitration request. As such, all other requests (e.g. table write) were waiting for the client's request lock to become available. Apart from fixing those deadlocks, this patch brings a number of improvements that all together allow to run networks of 100+ P4Runtime devices on a single ONOS instance (before only ~20 devices) Includes: - Asynchrounous mastership handling in DevicHandshaker (as defined in the P4Runtime and OpenFlow spec) - Refactored arbitration handling in the P4RuntimeClient to be consistent with the P4Runtime spec - Report suspect deadlocks in P4RuntimeClientImpl - Exploit write errors in P4RuntimeClient to quickly report channel/mastership errors to upper layers - Complete all futures with deadlines in P4Runtime driver - Dump all tables in one request - Re-purposed ChannelEvent to DeviceAgentEvent to carry also mastership response events - Fixed IntelliJ warnings - Various code and log clean-ups Change-Id: I9376793a9fe69d8eddf7e8ac2ef0ee4c14fbd198	2018-06-29 13:59:27 +00:00

1 2 3

105 Commits