Monitor the MLAG Service

The Cumulus NetQ UI enables operators to view the health of the MLAG service on a network-wide and a per-session basis, giving greater insight into all aspects of the service. Two card workflows provide this view, one for the service and one for the session. They are described separately here.

MLAG or CLAG? The Cumulus Linux implementation of MLAG is referred to by other vendors as CLAG, MC-LAG or VPC. The Cumulus NetQ UI uses the CLAG terminology.

Monitor the CLAG Service (All Sessions)

With NetQ, you can monitor the number of nodes running the CLAG service, view sessions running, and view alarms triggered by the CLAG service. For an overview of CLAG and how to configure it in your data center network, refer to Multi-Chassis Link Aggregation - MLAG.

CLAG Service Card Workflow Summary

The small CLAG Service card displays:

/images/download/thumbnails/12321372/image2019-3-3-16_22_4.png

Item

Description

/images/lh3.googleusercontent.com/9pKK7nvSXHVZ9B7ZhBwE4tJ0yTYz1Cnblgm_0e3-KDmA8qdySLR5COAhLPZomROlRUSRTTRlXlgwEKcjuVljew1z5zd6QRMCXQXjcVbkhXJiX63LgNYuAg5nC58u_mkmYpFHFK43

Indicates data is for all sessions of a Network Service or Protocol

Title

CLAG: All CLAG Sessions, or the CLAG Service

/images/lh3.googleusercontent.com/6savpoUbgaVzG-b8wvwDqPBL2SvepqAMmGuTy06oenbCU27_dmDBdmJobMvQD89-1iBJRpKB0lFW4cTzNEp3kdZuAOOs81DmptnyhBgCxeIwwuwkq91hLLOTSLACVkDRTfeYoMNe

Total number of switches with the CLAG service enabled during the designated time period

/images/lh3.googleusercontent.com/zfvCpsG-VHAjSrTbVj3Q9ZDhqhqlkG1MiPwZEFtSM10fC2F8AExk-7gTb3norOvIXhvQL6rLN5NWpDdrnx55YCLnzdn2Y2INmiKan-83QG6a9s4aw3LlOUm7zlOGoi4mK-yUNQDT

Total number of CLAG-related alarms received during the designated time period

Chart

Distribution of CLAG-related alarms received during the designated time period

The medium CLAG Service card displays:

/images/download/attachments/12321372/image2019-6-10-18_4_36.png

Item

Description

Time period

Range of time in which the displayed data was collected; applies to all card sizes

/images/lh3.googleusercontent.com/9pKK7nvSXHVZ9B7ZhBwE4tJ0yTYz1Cnblgm_0e3-KDmA8qdySLR5COAhLPZomROlRUSRTTRlXlgwEKcjuVljew1z5zd6QRMCXQXjcVbkhXJiX63LgNYuAg5nC58u_mkmYpFHFK43

Indicates data is for all sessions of a Network Service or Protocol

Title

Network Services | All CLAG Sessions

/images/lh3.googleusercontent.com/6savpoUbgaVzG-b8wvwDqPBL2SvepqAMmGuTy06oenbCU27_dmDBdmJobMvQD89-1iBJRpKB0lFW4cTzNEp3kdZuAOOs81DmptnyhBgCxeIwwuwkq91hLLOTSLACVkDRTfeYoMNe

Total number of switches with the CLAG service enabled during the designated time period

/images/lh3.googleusercontent.com/zfvCpsG-VHAjSrTbVj3Q9ZDhqhqlkG1MiPwZEFtSM10fC2F8AExk-7gTb3norOvIXhvQL6rLN5NWpDdrnx55YCLnzdn2Y2INmiKan-83QG6a9s4aw3LlOUm7zlOGoi4mK-yUNQDT

Total number of CLAG-related alarms received during the designated time period

/images/lh3.googleusercontent.com/OZ7aek87vE2GR9IP49ewj7F8XTSYe4ec-eyolCoWYf3aTc79K-j1EjF0gj6EL-YqPlLxkERQ7iNHi3FHQPBwczW2SNRAOmlbDM0RbJqTSADWDKcK3VWCzPTrC_0uvHVkRrm9J2Id

Total number of sessions with an inactive backup IP address during the designated time period

/images/lh4.googleusercontent.com/uez2nLxgyfGtFFzPWdwXXWg-pfu5-s6bSXW7ZsXKpF7SS_ZEtRhuDuToD9ojGhz0thfzzzCDrgpJyjlelL9kWtGyaL-SaRBFvOX1qGgLv7gjuN5U0dxmAtxHIUKYLECr50heq0HN

Total number of bonds with only a single connection during the designated time period

Total Nodes Running chart

Distribution of switches and hosts with the CLAG service enabled during the designated time period, and a total number of nodes running the service currently.

Note: The node count here may be different than the count in the summary bar. For example, the number of nodes running CLAG last week or last month might be more or less than the number of nodes running CLAG currently.

Total Open Alarms chart

Distribution of CLAG-related alarms received during the designated time period, and the total number of current CLAG-related alarms in the network.

Note: The alarm count here may be different than the count in the summary bar. For example, the number of new alarms received in this time period does not take into account alarms that have already been received and are still active. You might have no new alarms, but still have a total number of alarms present on the network of 10.

Total Sessions chart

Distribution of CLAG sessions running during the designated time period, and the total number of sessions running on the network currently

The large CLAG service card contains two tabs.

The All CLAG Sessions Summary tab displays:

/images/download/attachments/12321372/image2019-6-10-18_6_3.png

Item

Description

Time period

Range of time in which the displayed data was collected; applies to all card sizes

/images/lh3.googleusercontent.com/9pKK7nvSXHVZ9B7ZhBwE4tJ0yTYz1Cnblgm_0e3-KDmA8qdySLR5COAhLPZomROlRUSRTTRlXlgwEKcjuVljew1z5zd6QRMCXQXjcVbkhXJiX63LgNYuAg5nC58u_mkmYpFHFK43

Indicates data is for all sessions of a Network Service or Protocol

Title

All CLAG Sessions Summary

/images/lh3.googleusercontent.com/6savpoUbgaVzG-b8wvwDqPBL2SvepqAMmGuTy06oenbCU27_dmDBdmJobMvQD89-1iBJRpKB0lFW4cTzNEp3kdZuAOOs81DmptnyhBgCxeIwwuwkq91hLLOTSLACVkDRTfeYoMNe

Total number of switches with the CLAG service enabled during the designated time period

/images/lh3.googleusercontent.com/zfvCpsG-VHAjSrTbVj3Q9ZDhqhqlkG1MiPwZEFtSM10fC2F8AExk-7gTb3norOvIXhvQL6rLN5NWpDdrnx55YCLnzdn2Y2INmiKan-83QG6a9s4aw3LlOUm7zlOGoi4mK-yUNQDT

Total number of CLAG-related alarms received during the designated time period

Total Nodes Running chart

Distribution of switches and hosts with the CLAG service enabled during the designated time period, and a total number of nodes running the service currently.

Note: The node count here may be different than the count in the summary bar. For example, the number of nodes running CLAG last week or last month might be more or less than the number of nodes running CLAG currently.

Total Sessions chart

Distribution of CLAG sessions running during the designated time period, and the total number of sessions running on the network currently

Total Sessions with Inactive-backup-ip chart

Distribution of sessions without an active backup IP defined during the designated time period, and the total number of these sessions running on the network currently

Table/Filter options

When the Switches with Most Sessions filter is selected, the table displays switches running CLAG sessions in decreasing order of session count—devices with the largest number of sessions are listed first

When the Switches with Most Unestablished Sessions filter is selected, the table displays switches running CLAG sessions in decreasing order of unestablished session count—devices with the largest number of unestablished sessions are listed first

Show All Sessions

Link to view all CLAG sessions in the full screen card

The All CLAG Alarms tab displays:

Item

Description

Time period

Range of time in which the displayed data was collected; applies to all card sizes

/images/lh3.googleusercontent.com/zfvCpsG-VHAjSrTbVj3Q9ZDhqhqlkG1MiPwZEFtSM10fC2F8AExk-7gTb3norOvIXhvQL6rLN5NWpDdrnx55YCLnzdn2Y2INmiKan-83QG6a9s4aw3LlOUm7zlOGoi4mK-yUNQDT (in header)

Indicates alarm data for all CLAG sessions

Title

Network Services | All CLAG Alarms (visible when you hover over card)

Total number of switches with the CLAG service enabled during the designated time period

(in summary bar)

Total number of CLAG-related alarms received during the designated time period

Total Alarms chart

Distribution of CLAG-related alarms received during the designated time period, and the total number of current CLAG-related alarms in the network.

Note: The alarm count here may be different than the count in the summary bar. For example, the number of new alarms received in this time period does not take into account alarms that have already been received and are still active. You might have no new alarms, but still have a total number of alarms present on the network of 10.

Table/Filter options

When the Events by Most Active Device filter is selected, the table displays switches running CLAG sessions in decreasing order of alarm count; devices with the largest number of alarms are listed first

Show All Sessions

Link to view all CLAG sessions in the full screen card

The full screen CLAG Service card provides tabs for all switches, all sessions, and all alarms.

/images/download/attachments/12321372/image2019-3-3-16_49_25.png

Item

Description

Title

Network Services | CLAG

/images/lh4.googleusercontent0.com/DO5d-BvJ-vciNs7f0SlTY72rHmQgJpxHGUYsRkK0aDIMZ2VQP9ygWJzCZH5qouUZGI3MZOvOxdfvn8dt8xMxBI_4UvJVTZMJVnmb5Za0LEdQ3lOeqs01w942HG2AJ14kJm1sY56T

Closes full screen card and returns to workbench

Time period

Range of time in which the displayed data was collected; applies to all card sizes; select an alternate time period by clicking /images/lh5.googleusercontent.com/V88gxOaxuUjBWw5tni0vwGrNs2JBQsz0SwWFpQCdJTOSYfuUGQnpkWz8-cHDSF-jZsE4TeZfpRhaeIhOU7UIZZE2AwtP870d78GBCwuD0Kzqb7TbAiDnX5hgQh5DC68zoKgoLd5U

Results

Number of results found for the selected tab

All Switches tab

Displays all switches and hosts running the CLAG service. By default, the device list is sorted by hostname. This tab provides the following additional data about each device:

  • Agent

    • State: Indicates communication state of the NetQ Agent on a given device. Values include Fresh (heard from recently) and Rotten (not heard from recently).

    • Version: Software version number of the NetQ Agent on a given device. This should match the version number of the NetQ software loaded on your server or appliance; for example, 2.1.0.

  • ASIC

    • Core BW: Maximum sustained/rated bandwidth. Example values include 2.0 T and 720 G.

    • Model: Chip family. Example values include Tomahawk, Trident, and Spectrum.

    • Model Id: Identifier of networking ASIC model. Example values include BCM56960 and BCM56854.

    • Ports: Indicates port configuration of the switch. Example values include 32 x 100G-QSFP28, 48 x 10G-SFP+, and 6 x 40G-QSFP+.

    • Vendor: Manufacturer of the chip. Example values include Broadcom and Mellanox.

  • CPU

    • Arch: Microprocessor architecture type. Values include x86_64 (Intel and AMD), ARMv7 (Arm), and PowerPC.

    • Max Freq: Highest rated frequency for CPU. Example values include 2.40 GHz and 1.74 GHz.

    • Model: Chip family. Example values include Intel Atom C2538 and Intel Atom C2338.

    • Nos: Number of cores. Example values include 2, 4, and 8.

  • Disk Total Size: Total amount of storage space in physical disks (not total available). Example values: 10 GB, 20 GB, 30 GB.

  • License State: Indicator of validity. Values include ok and bad.

  • Memory Size: Total amount of local RAM. Example values include 8192 MB and 2048 MB.

  • OS

    • Vendor: Operating System manufacturer. Values include Cumulus Networks, RedHat, Ubuntu, and CentOS.

    • Version: Software version number of the OS. Example values include 3.7.3, 2.5.x, 16.04, 7.1.

    • Version Id: Identifier of the OS version. For Cumulus, this is the same as the Version (3.7.x).

  • Platform

    • Date: Date and time the platform was manufactured. Example values include 7/12/18 and 10/29/2015.

    • MAC: System MAC address. Example value: 17:01:AB:EE:C3:F5.

    • Model: Manufacturer's model name. Example values include AS7712-32X and S4048-ON.

    • Number: Manufacturer part number. Example values include FP3ZZ7632014A and 0J09D3.

    • Revision: Release version of the platform

    • Series: Manufacturer serial number. Example values include D2060B2F044919GD000060, CN046MRJCES0085E0004.

    • Vendor: Manufacturer of the platform. Example values include Cumulus Express, Dell, EdgeCore, Lenovo, Mellanox.

  • Time: Date and time the data was collected from device.

All Sessions tab

Displays all CLAG sessions network-wide. By default, the session list is sorted by hostname. This tab provides the following additional data about each session:

  • Backup Ip: IP address of the interface to use if the peerlink (or bond) goes down

  • Backup Ip Active: Indicates whether the backup IP address has been specified and is active (true) or not (false)

  • Bonds

    • Conflicted: Identifies the set of interfaces in a bond that do not match on each end of the bond

    • Single: Identifies a set of interfaces connecting to only one of the two switches

    • Dual: Identifies a set of interfaces connecting to both switches

    • Proto Down: Interface on the switch brought down by the clagd service. Value is blank if no interfaces are down due to clagd service.

  • Clag Sysmac: Unique MAC address for each bond interface pair. Note: Must be a value between 44:38:39:ff:00:00 and 44:38:39:ff:ff:ff.

  • DB State: Session state of the DB.

  • OPID: CLAG service identifier

  • Peer:

    • If: Name of the peer interface

    • Role: Role of the peer device. Values include primary and secondary.

    • State: Indicates if peer device is up (true) or down (false)

  • Role: Role of the host device. Values include primary and secondary.

  • Timestamp: Date and time the CLAG session was started, deleted, updated, or marked dead (device went down)

  • Vxlan Anycast: Anycast IP address used for VXLAN termination
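The Clag Sysmac range noted above can be checked programmatically, for example when reviewing values copied from the All Sessions tab. The following is a minimal sketch, not part of NetQ: the function names are hypothetical, and it assumes MAC addresses are written as six colon-separated hexadecimal octets.

```python
import string

# Range stated for Clag Sysmac in the All Sessions tab description.
RESERVED_LOW = "44:38:39:ff:00:00"
RESERVED_HIGH = "44:38:39:ff:ff:ff"


def mac_to_int(mac: str) -> int:
    """Convert a colon-separated MAC address to an integer (hypothetical helper)."""
    octets = mac.strip().lower().split(":")
    valid = len(octets) == 6 and all(
        len(octet) == 2 and all(ch in string.hexdigits for ch in octet)
        for octet in octets
    )
    if not valid:
        raise ValueError(f"not a MAC address: {mac!r}")
    return int("".join(octets), 16)


def is_valid_clag_sysmac(mac: str) -> bool:
    """Return True if mac falls inside the stated Clag Sysmac range."""
    try:
        value = mac_to_int(mac)
    except ValueError:
        return False
    return mac_to_int(RESERVED_LOW) <= value <= mac_to_int(RESERVED_HIGH)


print(is_valid_clag_sysmac("44:38:39:FF:00:01"))
print(is_valid_clag_sysmac("17:01:AB:EE:C3:F5"))
```

Comparing the addresses as integers avoids string-ordering surprises with mixed upper and lower case input.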

All Alarms tab

Displays all CLAG events network-wide. By default, the event list is sorted by time, with the most recent events listed first. The tab provides the following additional data about each event:

  • Message: Text description of a CLAG-related event. Example: Clag conflicted bond changed from swp7 swp8 to swp9 swp10

  • Source: Hostname of network device that generated the event

  • Severity: Importance of the event. Values include critical, warning, info, and debug.

  • Type: Network protocol or service generating the event. This always has a value of clag in this card workflow.

Export

Enables export of all or selected items in a CSV or JSON formatted file

/images/lh5.googleusercontent.com/TxyRotE-Ks3VoU0rMfISNSl_V0m0yXqQyq8cn7CI6da54YIrMvzU8ttAOXmnbpUJdXBIQBG9OothePcEuJ-DoNYR1SdJIpW6RAlGd5wXxJdRcI0HPR3eMMcrSwotbHTrjqUNFH3w

Enables manipulation of table display; choose columns to display and reorder columns

View Service Status Summary

A summary of the CLAG service is available from the CLAG Service card workflow, including the number of nodes running the service, the number of CLAG-related alarms, and a distribution of those alarms.

To view the summary, open the small CLAG Service card.

/images/download/thumbnails/12321372/image2019-3-3-16_22_4.png

For more detail, select a different size CLAG Service card.

View the Distribution of Sessions and Alarms

It is useful to know the number of network nodes running the CLAG protocol over a period of time, as it gives you insight into the amount of traffic associated with and breadth of use of the protocol. It is also useful to compare the number of nodes running CLAG with the alarms present at the same time to determine if there is any correlation between the issues and the ability to establish a CLAG session.

To view these distributions, open the medium CLAG Service card.

/images/download/attachments/12321372/image2019-6-10-18_4_36.png

If a visual correlation is apparent, you can dig a little deeper with the large CLAG Service card tabs.

View Devices with the Most CLAG Sessions

You can view the load from CLAG on your switches using the large CLAG Service card. This data enables you to see which switches are currently handling the most CLAG traffic, validate that this is what you expect based on your network design, and compare it with data from an earlier time to look for any differences.

To view switches and hosts with the most CLAG sessions:

  1. Open the large CLAG Service card.
  2. Select Switches with Most Sessions from the filter above the table.
    The table content is sorted by this characteristic, listing nodes running the most CLAG sessions at the top. Scroll down to view those with the fewest sessions.

    /images/download/attachments/12321372/image2019-4-5-11_16_49.png

To compare this data with the same data at a previous time:

  1. Open another large CLAG Service card.
  2. Move the new card next to the original card if needed.
  3. Change the time period for the data on the new card by hovering over the card and clicking /images/lh4.googleusercontent.com/fo-yr9tPWzyO_CQPDiddcB5tmuwuX1OS7nIlj4_9iA3_6xnmg_c-54SmAAdNJL_odR0UGjeTSxRHw2BHkeZ3YEfiwOhjpUeHVwPVd8s4F2XwOGnRWXD4NfaIzj86ETYgEU8frnzw .
  4. Select the time period that you want to compare with the current time.
    You can now see whether there are significant differences between this time period and the previous time period.

    /images/download/thumbnails/12321372/image2019-4-5-11_21_47.png /images/download/attachments/12321372/image2019-6-11-11_13_47.png

    If the changes are unexpected, you can investigate further by looking at another time frame, determining if more nodes are now running CLAG than previously, looking for changes in the topology, and so forth.
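If you prefer to compare the two time periods outside the UI, the same comparison can be sketched in a few lines. This is an illustrative example only: the hostnames and session counts are invented, and it assumes you have already copied the per-switch session counts for each period into dictionaries.

```python
from typing import Dict, List, Tuple


def session_count_changes(
    current: Dict[str, int], previous: Dict[str, int]
) -> List[Tuple[str, int]]:
    """Return (hostname, change) pairs for switches whose session count differs.

    A switch missing from one period is treated as having zero sessions then.
    Results are ordered by largest absolute change first, then by hostname.
    """
    hosts = set(current) | set(previous)
    deltas = ((host, current.get(host, 0) - previous.get(host, 0)) for host in hosts)
    changed = [(host, delta) for host, delta in deltas if delta != 0]
    return sorted(changed, key=lambda item: (-abs(item[1]), item[0]))


# Hypothetical counts read from two large CLAG Service cards.
current_period = {"leaf01": 4, "leaf02": 4, "leaf03": 2}
previous_period = {"leaf01": 4, "leaf02": 2, "leaf04": 1}

for host, delta in session_count_changes(current_period, previous_period):
    print(f"{host}: {delta:+d}")
```

Switches that appear in only one period stand out in the result, which is one way to spot nodes that started or stopped running CLAG.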

View Devices with the Most Unestablished CLAG Sessions

You can identify switches that are experiencing difficulties establishing CLAG sessions, both currently and in the past.

To view switches with the most unestablished CLAG sessions:

  1. Open the large CLAG Service card.
  2. Select Switches with Most Unestablished Sessions from the filter above the table. The table content is sorted by this characteristic, listing nodes with the most unestablished CLAG sessions at the top. Scroll down to view those with the fewest unestablished sessions.

    /images/download/attachments/12321372/image2019-6-11-11_15_7.png

Where to go next depends on what data you see, but a few options include:

  • Hover over any of the charts to focus on the number of switches or sessions with the chart characteristic during that smaller time slice.
    The table content changes to match the hovered content. Click on the chart to persist the table changes.

    /images/download/attachments/12321372/image2019-3-3-17_19_9.png
  • Change the time period for the data to compare with a prior time.

    /images/download/thumbnails/12321372/image2019-3-3-12_34_3.png /images/download/attachments/12321372/image2019-3-3-17_20_6.png

    If the same switches are consistently indicating the most unestablished sessions, you might want to look more carefully at those switches using the Switches card workflow to determine probable causes. Refer to Monitor Switches.

  • Click Show All Sessions to investigate all CLAG sessions with events in the full screen card.

View Devices with the Most CLAG Alarms

Switches experiencing a large number of CLAG alarms may indicate a configuration or performance issue that needs further investigation. You can view the switches sorted by the number of CLAG alarms and then use the Switches card workflow or the Alarms card workflow to gather more information about possible causes for the alarms.

To view switches with the most CLAG alarms:

  1. Open the large CLAG Service card.
  2. Hover over the header and click /images/lh3.googleusercontent0.com/zfvCpsG-VHAjSrTbVj3Q9ZDhqhqlkG1MiPwZEFtSM10fC2F8AExk-7gTb3norOvIXhvQL6rLN5NWpDdrnx55YCLnzdn2Y2INmiKan-83QG6a9s4aw3LlOUm7zlOGoi4mK-yUNQDT .
  3. Select Events by Most Active Device from the filter above the table.
    The table content is sorted by this characteristic, listing nodes with the most CLAG alarms at the top. Scroll down to view those with the fewest alarms.

    /images/download/attachments/12321372/image2019-6-10-18_7_49.png

Where to go next depends on what data you see, but a few options include:

  • Hover over the Total Alarms chart to focus on the switches exhibiting alarms during that smaller time slice.
    The table content changes to match the hovered content. Click on the chart to persist the table changes.

    /images/download/attachments/12321372/image2019-3-3-17_25_29.png
  • Change the time period for the data to compare with a prior time. If the same switches are consistently indicating the most alarms, you might want to look more carefully at those switches using the Switches card workflow.

    /images/download/thumbnails/12321372/image2019-3-3-12_34_3.png /images/download/attachments/12321372/image2019-3-3-17_26_37.png
  • Click Show All Sessions to investigate all CLAG sessions with alarms in the full screen card.

View All CLAG Events

The CLAG Service card workflow enables you to view all of the CLAG events in the designated time period.

To view all CLAG events:

  1. Open the full screen CLAG Service card.
  2. Click the All Alarms tab.

    /images/download/attachments/12321372/image2019-4-5-11_33_22.png

Where to go next depends on what data you see, but a few options include:

  • Open the All Switches or All Sessions tabs to look more closely at the alarms from the switch or session perspective.
  • Sort on other parameters:
    • by Message to determine the frequency of particular events
    • by Severity to determine the most critical events
    • by Time to find events that may have occurred at a particular time to try to correlate them with other system events
  • Export the data to a file by clicking Export, or by selecting a subset and clicking Export Selected in the edit menu
  • Return to your workbench by clicking /images/download/attachments/12321372/close-14.svg in the top right corner
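Once the events are exported, the sorting options above can also be reproduced offline. The sketch below is illustrative only: the column names (time, source, severity, message) are assumptions based on the fields described for the All Alarms tab, and the layout of an actual exported file may differ. The second sample message is invented.

```python
import csv
import io
from collections import Counter

# Severity values listed for the All Alarms tab, most important first.
SEVERITY_RANK = {"critical": 0, "warning": 1, "info": 2, "debug": 3}

# Stand-in for an exported CSV file; column names are assumed, not confirmed.
SAMPLE_EXPORT = "\n".join([
    "time,source,severity,message",
    "2019-06-10 18:01:00,leaf01,info,Clag conflicted bond changed from swp7 swp8 to swp9 swp10",
    "2019-06-10 18:05:00,leaf02,critical,Peer state changed to down",
    "2019-06-10 18:09:00,leaf01,info,Clag conflicted bond changed from swp7 swp8 to swp9 swp10",
])


def load_events(csv_text: str) -> list:
    """Parse exported CSV text into a list of dictionaries, one per event."""
    return list(csv.DictReader(io.StringIO(csv_text)))


def message_frequency(events: list) -> list:
    """Return (message, count) pairs, most frequent first."""
    return Counter(event["message"] for event in events).most_common()


def most_critical_first(events: list) -> list:
    """Sort events by severity; unknown severities are placed last."""
    return sorted(
        events,
        key=lambda event: SEVERITY_RANK.get(event["severity"], len(SEVERITY_RANK)),
    )


events = load_events(SAMPLE_EXPORT)
for message, count in message_frequency(events):
    print(count, message)
print(most_critical_first(events)[0]["source"])
```

To run this against a real export, replace SAMPLE_EXPORT with the contents of the exported file and adjust the column names to match its header row.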

View Detailed Information About All Switches Running CLAG

You can view all stored attributes of all switches running CLAG in your network in the full-screen card.

To view all switch details, open the full screen CLAG Service card, and click the All Switches tab.

/images/download/attachments/12321372/image2019-4-5-11_32_20.png

To return to your workbench, click /images/download/attachments/12321372/close-14.svg in the top right corner.

Take Actions on Data Displayed in Results List

In the full screen CLAG Service card, you can determine which results are displayed in the results list, and which are exported.

To take actions on the data, click in the blank column at the very left of a row. A checkbox appears, selecting that switch, session, or alarm, and an edit menu is shown at the bottom of the card (shown enlarged here).

/images/download/attachments/12321372/image2019-4-5-11_36_7.png /images/download/attachments/12321372/image2019-6-3-10_44_21.png

You can perform the following actions on the results list:

Option              Action or Behavior on Click
Select All          Selects all items in the results list
Clear All           Clears all existing selections of items in the results list. This also hides the edit menu.
Open Cards          Open the corresponding validation or trace result card.
Hide Selected       Hide selected items (switches, sessions, alarms, and so forth) from the results list.
Show Only Selected  Hide unselected items (switches, sessions, alarms, and so forth) from the results list.
Export Selected     Exports selected data into a .csv file. If you want to export to a .json file format, use the Export button.

To return to original display of results, click the associated tab.

Monitor a Single CLAG Session

With NetQ, you can monitor the number of nodes running the CLAG service, view switches with the most peers alive and not alive, and view alarms triggered by the CLAG service. For an overview of CLAG and how to configure it in your data center network, refer to Multi-Chassis Link Aggregation - MLAG.

To access the single session cards, you must open the full screen CLAG Service card, click the All Sessions tab, select the desired session, then click (Open Cards).

Granularity of Data Shown Based on Time Period

On the medium and large single CLAG session cards, the status of the peers is represented in heat maps stacked vertically: one for peers that are reachable (alive), and one for peers that are unreachable (not alive). The number of time blocks used to indicate the status varies with the time period of data on the card. A vertical stack of time blocks, one from each map, includes the results from all checks during that time. The results are shown by how saturated the color is for each block. If all peers were alive for the entire time block, then the alive (top) block is 100% saturated (white) and the not alive block is zero percent saturated (gray). As the saturation of the not alive block increases, the saturation of the alive block is proportionally reduced. An example heat map for a time period of 24 hours is shown here, and the table lists the most common time periods with their resulting time blocks.

/images/download/attachments/12321372/image2019-6-11-14_10_14.png
Time Period  Number of Runs  Number of Time Blocks  Amount of Time in Each Block
6 hours      18              6                      1 hour
12 hours     36              12                     1 hour
24 hours     72              24                     1 hour
1 week       504             7                      1 day
1 month      2,086           30                     1 day
1 quarter    7,000           13                     1 week
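
To make the relationship between runs, time blocks, and saturation concrete, the sketch below groups the 72 runs of a 24-hour period into 24 blocks of three runs each and computes a saturation pair for each block. It is an illustration under stated assumptions, not the NetQ implementation: it assumes saturation is simply the percentage of checks in a block that found the peer alive, and the function names are hypothetical.

```python
from typing import List, Tuple


def split_into_blocks(results: List[bool], num_blocks: int) -> List[List[bool]]:
    """Group per-run results (True = peer alive) into equally sized time blocks."""
    if num_blocks <= 0 or len(results) % num_blocks != 0:
        raise ValueError("results must divide evenly into the number of blocks")
    size = len(results) // num_blocks
    return [results[i * size:(i + 1) * size] for i in range(num_blocks)]


def block_saturation(checks: List[bool]) -> Tuple[float, float]:
    """Return (alive_pct, not_alive_pct) for one block; the two sum to 100."""
    if not checks:
        return (0.0, 0.0)
    alive_pct = sum(checks) / len(checks) * 100
    return (alive_pct, 100 - alive_pct)


# 24 hours: 72 runs in 24 blocks, so three runs per one-hour block.
results = [True] * 72
results[15] = False  # two failed checks during the sixth hour
results[16] = False

blocks = split_into_blocks(results, 24)
for hour, block in enumerate(blocks):
    alive_pct, not_alive_pct = block_saturation(block)
    if not_alive_pct > 0:
        print(f"hour {hour}: alive {alive_pct:.0f}%, not alive {not_alive_pct:.0f}%")
```

Because the two values always sum to 100, any increase in the not alive saturation reduces the alive saturation by the same amount, which matches the proportional behavior described above.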

CLAG Session Card Workflow Summary

The small CLAG Session card displays:

/images/download/thumbnails/12321372/image2019-6-11-11_43_54.png

Item

Description

/images/lh4.googleusercontent.com/8UFHoFOr6nLn2EDe7ck8pxl4sD8PUveRXriidckRXurN1fJg24JdReHZPseSaLV2V2OgQByxS3HYPVBUfAeejBHYeIT1BDCPCagb3lXEp6XphglqHJCSDmYZ75uMmnsAToZVdOx8

Indicates data is for a single session of a Network Service or Protocol

Title

CLAG Session

 

Device identifiers (hostname, IP address, or MAC address) for host and peer in session.

/images/lh4.googleusercontent.com/f3fEO9qiCUVGArdmjdJDPfxXlE1VstKvmsxIK1wKvD-7KUwAdcaMgnORW8SoqGn-etCMtkZwT6zFsxTT7dxyabBJ8_kloNeGJFGepuvlaEocY0BHZmePHPePVcLpv_-K4a4CVjCh , /images/lh3.googleusercontent.com/aGdfLOBGXXT2lrVhBrI0kU9noHk7uyLcGDNj_0A0CB97xxzMmtnmCzt_0BxhgJ2nG_9BoSofaC5QMkYmRk8-p2Po3vDKVZsElKuq2Xk0D_pTeN4hm1aDgUaJ0rSk8x2lLDrQOhUW

Indication of host role, primary /images/lh4.googleusercontent.com/f3fEO9qiCUVGArdmjdJDPfxXlE1VstKvmsxIK1wKvD-7KUwAdcaMgnORW8SoqGn-etCMtkZwT6zFsxTT7dxyabBJ8_kloNeGJFGepuvlaEocY0BHZmePHPePVcLpv_-K4a4CVjCh or secondary /images/lh3.googleusercontent.com/aGdfLOBGXXT2lrVhBrI0kU9noHk7uyLcGDNj_0A0CB97xxzMmtnmCzt_0BxhgJ2nG_9BoSofaC5QMkYmRk8-p2Po3vDKVZsElKuq2Xk0D_pTeN4hm1aDgUaJ0rSk8x2lLDrQOhUW

The medium CLAG Session card displays:

Item

Description

Time period

Range of time in which the displayed data was collected; applies to all card sizes

/images/lh4.googleusercontent.com/8UFHoFOr6nLn2EDe7ck8pxl4sD8PUveRXriidckRXurN1fJg24JdReHZPseSaLV2V2OgQByxS3HYPVBUfAeejBHYeIT1BDCPCagb3lXEp6XphglqHJCSDmYZ75uMmnsAToZVdOx8

Indicates data is for a single session of a Network Service or Protocol

Title

Network Services | CLAG Session

/images/lh6.googleusercontent.com/JaXSSELl_nz1tMCRgnfs7dN4bErlnOHc-uteFbWlsLyCqW2khDGqUCVZcZYusEdlvGhDj3cml3sU0YGQq0aST_mpgZe-p1d07LL4bP-GsWr7Xcozg7w7cDhpyYeCSgkrYRFDW9xx

Device identifiers (hostname, IP address, or MAC address) for host and peer in session. Arrow points from the host to the peer. Click on /images/lh6.googleusercontent.com/JaXSSELl_nz1tMCRgnfs7dN4bErlnOHc-uteFbWlsLyCqW2khDGqUCVZcZYusEdlvGhDj3cml3sU0YGQq0aST_mpgZe-p1d07LL4bP-GsWr7Xcozg7w7cDhpyYeCSgkrYRFDW9xx to open associated device card.

/images/lh4.googleusercontent.com/f3fEO9qiCUVGArdmjdJDPfxXlE1VstKvmsxIK1wKvD-7KUwAdcaMgnORW8SoqGn-etCMtkZwT6zFsxTT7dxyabBJ8_kloNeGJFGepuvlaEocY0BHZmePHPePVcLpv_-K4a4CVjCh , /images/lh3.googleusercontent.com/aGdfLOBGXXT2lrVhBrI0kU9noHk7uyLcGDNj_0A0CB97xxzMmtnmCzt_0BxhgJ2nG_9BoSofaC5QMkYmRk8-p2Po3vDKVZsElKuq2Xk0D_pTeN4hm1aDgUaJ0rSk8x2lLDrQOhUW

Indication of host role, primary /images/lh4.googleusercontent.com/f3fEO9qiCUVGArdmjdJDPfxXlE1VstKvmsxIK1wKvD-7KUwAdcaMgnORW8SoqGn-etCMtkZwT6zFsxTT7dxyabBJ8_kloNeGJFGepuvlaEocY0BHZmePHPePVcLpv_-K4a4CVjCh or secondary /images/lh3.googleusercontent.com/aGdfLOBGXXT2lrVhBrI0kU9noHk7uyLcGDNj_0A0CB97xxzMmtnmCzt_0BxhgJ2nG_9BoSofaC5QMkYmRk8-p2Po3vDKVZsElKuq2Xk0D_pTeN4hm1aDgUaJ0rSk8x2lLDrQOhUW

Time period

Range of time for data displayed in peer status chart

Peer Status chart

Distribution of peer availability, alive or not alive, during the designated time period. The number of time segments in a time period varies according to the length of the time period.

Role

Role that host device is playing. Values include primary and secondary.

CLAG sysmac

System MAC address of the CLAG session

Peer Role

Role that peer device is playing. Values include primary and secondary.

Peer State

Operational state of the peer, up (true) or down (false)

The large CLAG Session card contains two tabs.

The Session Summary tab displays:

/images/download/attachments/12321372/image2019-6-10-18_10_34.png

Item

Description

Time period

Range of time in which the displayed data was collected; applies to all card sizes

/images/lh4.googleusercontent.com/8UFHoFOr6nLn2EDe7ck8pxl4sD8PUveRXriidckRXurN1fJg24JdReHZPseSaLV2V2OgQByxS3HYPVBUfAeejBHYeIT1BDCPCagb3lXEp6XphglqHJCSDmYZ75uMmnsAToZVdOx8

Indicates data is for a single session of a Network Service or Protocol

Title

(Network Services | CLAG Session) Session Summary

/images/lh6.googleusercontent.com/JaXSSELl_nz1tMCRgnfs7dN4bErlnOHc-uteFbWlsLyCqW2khDGqUCVZcZYusEdlvGhDj3cml3sU0YGQq0aST_mpgZe-p1d07LL4bP-GsWr7Xcozg7w7cDhpyYeCSgkrYRFDW9xx

Device identifiers (hostname, IP address, or MAC address) for host and peer in session. Arrow points from the host to the peer. Click on /images/lh6.googleusercontent.com/JaXSSELl_nz1tMCRgnfs7dN4bErlnOHc-uteFbWlsLyCqW2khDGqUCVZcZYusEdlvGhDj3cml3sU0YGQq0aST_mpgZe-p1d07LL4bP-GsWr7Xcozg7w7cDhpyYeCSgkrYRFDW9xx to open associated device card.

/images/lh4.googleusercontent.com/f3fEO9qiCUVGArdmjdJDPfxXlE1VstKvmsxIK1wKvD-7KUwAdcaMgnORW8SoqGn-etCMtkZwT6zFsxTT7dxyabBJ8_kloNeGJFGepuvlaEocY0BHZmePHPePVcLpv_-K4a4CVjCh , /images/lh3.googleusercontent.com/aGdfLOBGXXT2lrVhBrI0kU9noHk7uyLcGDNj_0A0CB97xxzMmtnmCzt_0BxhgJ2nG_9BoSofaC5QMkYmRk8-p2Po3vDKVZsElKuq2Xk0D_pTeN4hm1aDgUaJ0rSk8x2lLDrQOhUW

Indication of host role, primary /images/lh4.googleusercontent.com/f3fEO9qiCUVGArdmjdJDPfxXlE1VstKvmsxIK1wKvD-7KUwAdcaMgnORW8SoqGn-etCMtkZwT6zFsxTT7dxyabBJ8_kloNeGJFGepuvlaEocY0BHZmePHPePVcLpv_-K4a4CVjCh or secondary /images/lh3.googleusercontent.com/aGdfLOBGXXT2lrVhBrI0kU9noHk7uyLcGDNj_0A0CB97xxzMmtnmCzt_0BxhgJ2nG_9BoSofaC5QMkYmRk8-p2Po3vDKVZsElKuq2Xk0D_pTeN4hm1aDgUaJ0rSk8x2lLDrQOhUW

Alarm Count Chart

Distribution and count of CLAG alarm events over the given time period.

Info Count Chart

Distribution and count of CLAG info events over the given time period.

Peer Status chart

Distribution of peer availability, alive or not alive, during the designated time period. The number of time segments in a time period varies according to the length of the time period.

Backup IP

IP address of the interface to use if the peerlink (or bond) goes down

Backup IP Active

Indicates whether the backup IP address has been specified and is active (true) or not (false)

CLAG SysMAC

System MAC address of the CLAG session

Peer State

Operational state of the peer, up (true) or down (false)

Count of Dual Bonds

Number of bonds connecting to both switches.

Count of Single Bonds

Number of bonds connecting to only one switch.

Count of Protocol Down Bonds

Number of bonds with interfaces that were brought down by the clagd service.

Count of Conflicted Bonds

Number of bonds which have a set of interfaces that are not the same on both switches

The Configuration File Evolution tab displays:

/images/download/attachments/12321372/image2019-7-2-13_51_25.png

Item

Description

Time period

Range of time in which the displayed data was collected; applies to all card sizes

/images/lh4.googleusercontent.com/8UFHoFOr6nLn2EDe7ck8pxl4sD8PUveRXriidckRXurN1fJg24JdReHZPseSaLV2V2OgQByxS3HYPVBUfAeejBHYeIT1BDCPCagb3lXEp6XphglqHJCSDmYZ75uMmnsAToZVdOx8

Indicates data is for a single session of a Network Service or Protocol

Title

(Network Services | CLAG Session) Configuration File Evolution

/images/lh6.googleusercontent.com/JaXSSELl_nz1tMCRgnfs7dN4bErlnOHc-uteFbWlsLyCqW2khDGqUCVZcZYusEdlvGhDj3cml3sU0YGQq0aST_mpgZe-p1d07LL4bP-GsWr7Xcozg7w7cDhpyYeCSgkrYRFDW9xx

Device identifiers (hostname, IP address, or MAC address) for host and peer in session. Click on /images/lh6.googleusercontent.com/JaXSSELl_nz1tMCRgnfs7dN4bErlnOHc-uteFbWlsLyCqW2khDGqUCVZcZYusEdlvGhDj3cml3sU0YGQq0aST_mpgZe-p1d07LL4bP-GsWr7Xcozg7w7cDhpyYeCSgkrYRFDW9xx to open associated device card.

/images/lh4.googleusercontent.com/f3fEO9qiCUVGArdmjdJDPfxXlE1VstKvmsxIK1wKvD-7KUwAdcaMgnORW8SoqGn-etCMtkZwT6zFsxTT7dxyabBJ8_kloNeGJFGepuvlaEocY0BHZmePHPePVcLpv_-K4a4CVjCh , /images/lh3.googleusercontent.com/aGdfLOBGXXT2lrVhBrI0kU9noHk7uyLcGDNj_0A0CB97xxzMmtnmCzt_0BxhgJ2nG_9BoSofaC5QMkYmRk8-p2Po3vDKVZsElKuq2Xk0D_pTeN4hm1aDgUaJ0rSk8x2lLDrQOhUW

Indication of host role, primary /images/lh4.googleusercontent.com/f3fEO9qiCUVGArdmjdJDPfxXlE1VstKvmsxIK1wKvD-7KUwAdcaMgnORW8SoqGn-etCMtkZwT6zFsxTT7dxyabBJ8_kloNeGJFGepuvlaEocY0BHZmePHPePVcLpv_-K4a4CVjCh or secondary /images/lh3.googleusercontent.com/aGdfLOBGXXT2lrVhBrI0kU9noHk7uyLcGDNj_0A0CB97xxzMmtnmCzt_0BxhgJ2nG_9BoSofaC5QMkYmRk8-p2Po3vDKVZsElKuq2Xk0D_pTeN4hm1aDgUaJ0rSk8x2lLDrQOhUW

Timestamps

When changes to the configuration file have occurred, the date and time are indicated. Click the time to see the changed file.

Configuration File

When File is selected, the configuration file as it was at the selected time is shown.

When Diff is selected, the configuration file at the selected time is shown on the left and the configuration file at the previous timestamp is shown on the right. Differences are highlighted.

The full screen CLAG Session card provides tabs for all CLAG sessions and all events.

/images/download/attachments/12321372/image2019-4-5-11_50_13.png

Item

Description

Title

Network Services | CLAG

/images/lh4.googleusercontent0.com/DO5d-BvJ-vciNs7f0SlTY72rHmQgJpxHGUYsRkK0aDIMZ2VQP9ygWJzCZH5qouUZGI3MZOvOxdfvn8dt8xMxBI_4UvJVTZMJVnmb5Za0LEdQ3lOeqs01w942HG2AJ14kJm1sY56T

Closes full screen card and returns to workbench

Time period

Range of time in which the displayed data was collected; applies to all card sizes; select an alternate time period by clicking /images/lh5.googleusercontent.com/V88gxOaxuUjBWw5tni0vwGrNs2JBQsz0SwWFpQCdJTOSYfuUGQnpkWz8-cHDSF-jZsE4TeZfpRhaeIhOU7UIZZE2AwtP870d78GBCwuD0Kzqb7TbAiDnX5hgQh5DC68zoKgoLd5U

Results

Number of results found for the selected tab

All CLAG Sessions tab

Displays all CLAG sessions associated with the two devices in the given session. By default, the session list is sorted by hostname. This tab provides the following additional data about each session:

  • Backup Ip: IP address of the interface to use if the peerlink (or bond) goes down

  • Backup Ip Active: Indicates whether the backup IP address has been specified and is active (true) or not (false)

  • Bonds

    • Conflicted: Identifies the set of interfaces in a bond that do not match on each end of the bond

    • Single: Identifies a set of interfaces connecting to only one of the two switches

    • Dual: Identifies a set of interfaces connecting to both switches

    • Proto Down: Interface on the switch brought down by the clagd service. Value is blank if no interfaces are down due to the clagd service.

  • Clag Sysmac: Unique MAC address for each bond interface pair. Note: Must be a value between 44:38:39:ff:00:00 and 44:38:39:ff:ff:ff.

  • DB State: Session state of the DB.

  • OPID: CLAG service identifier

  • Peer:

    • If: Name of the peer interface

    • Role: Role of the peer device. Values include primary and secondary.

    • State: Indicates if peer device is up (true) or down (false)

  • Role: Role of the host device. Values include primary and secondary.

  • Timestamp: Date and time the CLAG session was started, deleted, updated, or marked dead (device went down)

  • Vxlan Anycast: Anycast IP address used for VXLAN termination

All Events tab

Displays all events network-wide. By default, the event list is sorted by time, with the most recent events listed first. The tab provides the following additional data about each event:

  • Message: Text description of an event. Example: Clag conflicted bond changed from swp7 swp8 to swp9 swp10

  • Source: Hostname of network device that generated the event

  • Severity: Importance of the event. Values include critical, warning, info, and debug.

  • Type: Network protocol or service generating the event. This always has a value of clag in this card workflow.

Export

Enables export of all or selected items in a CSV or JSON formatted file

/images/lh5.googleusercontent.com/TxyRotE-Ks3VoU0rMfISNSl_V0m0yXqQyq8cn7CI6da54YIrMvzU8ttAOXmnbpUJdXBIQBG9OothePcEuJ-DoNYR1SdJIpW6RAlGd5wXxJdRcI0HPR3eMMcrSwotbHTrjqUNFH3w

Enables manipulation of table display; choose columns to display and reorder columns

View Session Status Summary

A summary of the CLAG session is available from the CLAG Session card workflow, showing the node and its peer and current status.

To view the summary:

  1. Open the full screen CLAG Service card.
  2. Select a session from the listing to view.
  3. Close the full screen card to view the medium CLAG Session card.

    /images/download/attachments/12321372/image2019-4-1-14_15_13.png /images/download/attachments/12321372/image2019-6-11-13_58_38.png

    In the left example, we see that the tor1 switch plays the secondary role in this session with the switch at 44:38:39:ff:01:01. In the right example, we see that the leaf03 switch plays the primary role in this session with leaf04.

View CLAG Session Peering State Changes

You can view the peering state for a given CLAG session from the medium and large CLAG Session cards. For a given time period, you can determine the stability of the CLAG session between two devices. If you experienced connectivity issues at a particular time, you can use these cards to help verify the state of the peer. If the peer was not alive more than it was alive, you can then investigate further into possible causes.

To view the state transitions for a given CLAG session:

  1. Open the full screen CLAG Service card.
  2. Select a session from the listing to view.
  3. Close the full screen card to view the medium CLAG Session card.

    /images/download/attachments/12321372/image2019-6-11-14_0_21.png

    In this example, the peer switch has been alive for the entire 24-hour period.

From this card, you can also view the node role, peer role and state, and CLAG system MAC address, which identify the session in more detail.

To view the peering state transitions for a given CLAG session on the large CLAG Session card, open that card.

/images/download/attachments/12321372/image2019-6-11-14_2_14.png

From this card, you can also view the alarm and info event counts; the node role; the peer role, state, and interface; the CLAG system MAC address; the active backup IP address; the single, dual, conflicted, and protocol down bonds; and the VXLAN anycast address. Together these identify the session in more detail.
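The comparison described in this section, whether the peer was not alive more than it was alive during a time period, can be sketched in a few lines of Python. This is an illustrative sketch only: it assumes you have transcribed the Peer Status chart into a list of booleans, one per time segment. The function names and the sample data are hypothetical and are not part of NetQ.

```python
def alive_fraction(segments):
    """Return the share of time segments in which the peer was alive."""
    if not segments:
        raise ValueError("no time segments to evaluate")
    return sum(1 for alive in segments if alive) / len(segments)


def count_transitions(segments):
    """Count how many times the peer state flipped between adjacent segments."""
    return sum(1 for prev, cur in zip(segments, segments[1:]) if prev != cur)


def needs_investigation(segments):
    """Return True when the peer was not alive more than it was alive."""
    return alive_fraction(segments) < 0.5


# Hypothetical data: True means the peer was alive in that time segment.
stable = [True] * 24
flapping = [True, False, False, True, False, False, False, True]

for name, segments in (("stable", stable), ("flapping", flapping)):
    print(name, alive_fraction(segments), count_transitions(segments),
          needs_investigation(segments))
```

Counting transitions alongside the alive fraction helps separate a peer that went down once from one that is flapping, even when the two show a similar share of alive time.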

View Changes to the CLAG Service Configuration File

Each time a change is made to the configuration file for the CLAG service, NetQ logs the change and enables you to compare it with the last version. This can be useful when you are troubleshooting potential causes for alarms or sessions losing their connections.

To view the configuration file changes:

  1. Open the large CLAG Session card.
  2. Hover over the card and click /images/download/thumbnails/12321372/file-setting-2.png to open the Configuration File Evolution tab.
  3. Select the time of interest on the left, such as a time when a change may have impacted performance. Scroll down if needed.

    /images/download/attachments/12321372/image2019-7-2-13_51_25.png
  4. Choose between the File view and the Diff view (selected option is dark; File by default).
    The File view displays the content of the file for you to review.

    /images/download/attachments/12321372/image2019-7-2-13_54_21.png

    The Diff view displays the changes between this version (on left) and the most recent version (on right) side by side. The changes are highlighted in red and green. In this example, we don’t have any changes after this first creation, so the same file is shown on both sides and no highlighting is present.

    /images/download/attachments/12321372/image2019-7-2-13_54_55.png
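If you copy two versions of the configuration file out of the File view, you can reproduce a similar comparison offline. The following is a minimal sketch using the Python standard library `difflib` module. The sample file content and the `diff_config` helper are hypothetical, and the output is a unified diff rather than the side-by-side layout the card displays.

```python
import difflib


def diff_config(old_text, new_text, old_label="previous", new_label="selected"):
    """Return unified-diff lines between two versions of a configuration file."""
    return list(difflib.unified_diff(
        old_text.splitlines(),
        new_text.splitlines(),
        fromfile=old_label,
        tofile=new_label,
        lineterm="",
    ))


# Hypothetical file content, for illustration only.
previous_version = (
    "clagd-peer-ip 169.254.1.2\n"
    "clagd-backup-ip 192.0.2.50\n"
    "clagd-sys-mac 44:38:39:ff:01:01\n"
)
selected_version = (
    "clagd-peer-ip 169.254.1.2\n"
    "clagd-backup-ip 192.0.2.51\n"
    "clagd-sys-mac 44:38:39:ff:01:01\n"
)

for line in diff_config(previous_version, selected_version):
    print(line)
```

When the two versions are identical, as in the Diff view example above, `diff_config` returns an empty list.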

View All CLAG Session Details

You can view all stored attributes of all of the CLAG sessions associated with the two devices on this card.

To view all session details, open the full screen CLAG Session card, and click the All CLAG Sessions tab.

/images/download/attachments/12321372/image2019-7-1-18_9_14.png

Where to go next depends on what data you see, but a few options include:

  • Open the All Events tab to look more closely at the alarm and info events in the network.
  • Sort on other parameters:
    • by Single Bonds to determine which interface sets are only connected to one of the switches
    • by Backup IP and Backup IP Active to determine if the correct backup IP address is specified for the service
  • Export the data to a file by clicking Export, or by selecting a subset and clicking Export Selected in the edit menu
  • Return to your workbench by clicking /images/download/attachments/12321372/close-14.svg in the top right corner
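After exporting the session list, you can script the same checks suggested above. The sketch below assumes a JSON export and uses hypothetical field names (`hostname`, `clag_sysmac`, `backup_ip_active`, `single_bonds`). The actual export schema may differ, so adjust the keys to match your file. The MAC address range is the one given in the Clag Sysmac description of the All CLAG Sessions tab.

```python
import json

# Range stated in the Clag Sysmac description: 44:38:39:ff:00:00 to 44:38:39:ff:ff:ff.
SYSMAC_LOW = 0x443839FF0000
SYSMAC_HIGH = 0x443839FFFFFF


def mac_to_int(mac):
    """Convert a colon-separated MAC address string to an integer."""
    return int(mac.replace(":", ""), 16)


def sysmac_in_reserved_range(mac):
    """Return True if the MAC address lies within the stated range."""
    return SYSMAC_LOW <= mac_to_int(mac) <= SYSMAC_HIGH


def review_sessions(sessions):
    """Return (hostname, issue) tuples for sessions that merit a closer look."""
    findings = []
    for session in sessions:
        host = session["hostname"]
        if not sysmac_in_reserved_range(session["clag_sysmac"]):
            findings.append((host, "sysmac outside reserved range"))
        if not session["backup_ip_active"]:
            findings.append((host, "backup IP not active"))
        if session["single_bonds"]:
            findings.append(
                (host, "single bonds: " + ", ".join(session["single_bonds"])))
    return findings


# Hypothetical export content; field names are assumptions, not the NetQ schema.
exported = json.loads("""
[
  {"hostname": "leaf03", "clag_sysmac": "44:38:39:ff:01:01",
   "backup_ip_active": true, "single_bonds": []},
  {"hostname": "leaf04", "clag_sysmac": "44:38:39:00:01:01",
   "backup_ip_active": false, "single_bonds": ["bond7"]}
]
""")

for host, issue in review_sessions(exported):
    print(host, "-", issue)
```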

View All Events

You can view all of the alarm and info events for the two devices on this card.

To view all events, open the full screen CLAG Session card, and click the All Events tab.

/images/download/attachments/12321372/image2019-4-4-10_58_7.png

Where to go next depends on what data you see, but a few options include:

  • Open the All CLAG Sessions tab to look more closely at the individual sessions.
  • Sort on other parameters:
    • by Message to determine the frequency of particular events
    • by Severity to determine the most critical events
    • by Time to find events that may have occurred at a particular time to try to correlate them with other system events
  • Export the data to a file by clicking Export, or by selecting a subset and clicking Export Selected in the edit menu
  • Return to your workbench by clicking /images/download/attachments/12321372/close-14.svg in the top right corner
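The sorts listed above can also be approximated offline on an exported event list. This sketch assumes a CSV export with hypothetical column names (`time`, `source`, `severity`, `type`, `message`) and ISO-style timestamps. Check the header row of your own export before relying on it. The second sample message is invented for illustration.

```python
import csv
import io
from collections import Counter

# Severity values listed for the All Events tab, most important first.
SEVERITY_ORDER = ["critical", "warning", "info", "debug"]


def summarize_events(rows):
    """Count events per message and per severity."""
    by_message = Counter(row["message"] for row in rows)
    by_severity = Counter(row["severity"] for row in rows)
    return by_message, by_severity


def most_severe_first(rows):
    """Sort events by severity (most critical first), then by time."""
    return sorted(
        rows,
        key=lambda row: (SEVERITY_ORDER.index(row["severity"]), row["time"]),
    )


# Hypothetical export content; column names and the second message are assumptions.
SAMPLE_CSV = """
time,source,severity,type,message
2019-07-01T10:00:00,leaf03,info,clag,Clag conflicted bond changed from swp7 swp8 to swp9 swp10
2019-07-01T10:05:00,leaf04,critical,clag,Peer state changed to down
2019-07-01T10:09:00,leaf03,info,clag,Clag conflicted bond changed from swp7 swp8 to swp9 swp10
"""

rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV.strip())))
by_message, by_severity = summarize_events(rows)

for message, count in by_message.most_common():
    print(count, message)
for row in most_severe_first(rows):
    print(row["time"], row["severity"], row["source"])
```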