[Docs] [txt|pdf] [Tracker] [WG] [Email] [Diff1] [Diff2] [Nits]
Versions: 00 01 02 04 05 06 07 RFC 4377
Network Working Group Thomas D. Nadeau
Internet Draft Monique Morrow
Expires: April 2005 George Swallow
Cisco Systems, Inc.
David Allan
Nortel Networks
Satoru Matsushima
Japan Telecom
September 2004
OAM Requirements for MPLS Networks
draft-ietf-mpls-oam-requirements-04.txt
Status of this Memo
By submitting this Internet-Draft, we certify that any applicable
patent or other IPR claims of which we are aware have been disclosed,
or will be disclosed, and any of which we become aware will be
disclosed, in accordance with RFC 3668.
This document is an Internet-Draft and is subject to all
provisions of section 3 of RFC 3667. By submitting this
Internet-Draft, each author represents that any applicable patent
or other IPR claims of which he or she is aware have been or will
be disclosed, and any of which he or she become aware will be
disclosed, in accordance with RFC 3668.
Internet-Drafts are working documents of the Internet Engineering
Task Force (IETF), its areas, and its working groups. Note that
other groups may also distribute working documents as
Internet-Drafts.
Internet-Drafts are draft documents valid for a maximum of six
months and may be updated, replaced, or obsoleted by other
documents at any time. It is inappropriate to use
Internet-Drafts as reference material or to cite them other than
as "work in progress."
The list of current Internet-Drafts can be accessed at
http://www.ietf.org/ietf/1id-abstracts.txt.
The list of Internet-Draft Shadow Directories can be accessed at
http://www.ietf.org/shadow.html.
Abstract
As transport of diverse traffic types such as voice, frame
MPLS Working Group Expires April 2005 [Page 1]
Internet Draft MPLS OAM Requirements September 2004
relay, and ATM over MPLS become more common, the ability to detect,
handle and diagnose control and data plane defects becomes
critical.
Detection and specification of how to handle those defects is not
only important because such defects may not only affect the
fundamental operation of an Multi-Protocol Label Switching network,
but also because they MAY impact service level specification
commitments for customers of that network.
This document describes requirements for user and data
plane operations and management for Multi-Protocol
Label Switching. These requirements have been gathered
from network operators who have extensive experience deploying
Multi-Protocol Label Switching networks, similarly some of these
requirements have appeared in other documents. This draft specifies
Operations and Management requirements for Multi-Protocol Label
Switching, as well as for applications of Multi-Protocol Label
Switching such as pseudowire voice and virtual private network
services. Those interested in specific issues relating to
instrumenting Multi-Protocol Label Switching for Operations
and Management purposes are directed to the Multi-Protocol Label
Switching Architecture specification.
Table of Contents
1. Introduction.....................................................2
2. Terminology......................................................2
3. Motivations......................................................3
4. Requirements.....................................................4
5. Security Considerations..........................................8
6. IANA Considerations..............................................8
7. References.......................................................9
8. Authors' Addresses..............................................10
9. Copyright Notice.................................................11
10. Intellectual Property...........................................11
11. Disclaimer of Validity..........................................12
12. Copyright Statement.............................................12
13. Acknowledgment..................................................13
1. Introduction
This document describes requirements for user and data
plane operations and management (OAM) for Multi-Protocol
Label Switching (MPLS). These requirements have been gathered
from network operators who have extensive experience deploying
MPLS networks. This draft specifies OAM requirements
for MPLS, as well as for applications of MPLS.
MPLS Working Group Expires April 2005 [Page 2]
Internet Draft MPLS OAM Requirements September 2004
No specific mechanisms are proposed to address these
requirements at this time. The goal of this draft is to
identify a commonly applicable set of requirements for MPLS
OAM at this time. Specifically, a set of requirements that apply
to the most common set of MPLS networks deployed by service
provider organizations today. These requirements can then be used
as a base for network management tool development and to guide
the evolution of currently specified tools, as well as the
specification of OAM functions that are intrinsic to protocols
used in MPLS networks.
Comments should be made directly to the MPLS mailing list
at mpls@lists.ietf.org.
2. Terminology
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT",
"SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this
document are to be interpreted as described in RFC 2119.
CE: Customer Edge
Defect: Any error condition that prevents an LSP
functioning correctly. For example, loss of an
IGP path will most likely also result in an LSP
not being able to deliver traffic to its
destination. Another example is the breakage of
a TE tunnel. These MAY be due to physical
circuit failures or failure of switching nodes
to operate as expected.
Multi-vendor/multi-provider network operation typically
requires agreed upon definitions of defects (when it is
broken and when it is not) such that both recovery
procedures and service level specification impacts can
be specified.
ECMP: Equal Cost Multipath
LSP: Label Switch Path
LSR: Label Switch Router
OAM: Operations and Management
Head-end Label Switch Router (LSR): The beginning of a label
switched path.
MPLS Working Group Expires April 2005 [Page 3]
Internet Draft MPLS OAM Requirements September 2004
probe-based-dectection: Active measurement using a tool such as
LSP ping.
collecting traffic: Passive measurement of network traffic.
3. Motivations
MPLS OAM has been tackled in numerous Internet drafts.
However as of this writing, existing drafts focus on single provider
solutions or focus on a single aspect of the MPLS architecture
or application of MPLS. For example, the use of RSVP or LDP
signaling and defects MAY be covered in some deployments,
and a corresponding SNMP MIB module exists to manage this
application; however, the handling of defects and specification
of which types of defects are interesting to operational
networks MAY not have been created in concert with those for
other applications of MPLS such as L3 VPN. This leads to
inconsistent and inefficient applicability across the MPLS
architecture, and/or requires significant modifications to
operational procedures and systems in order to provide consistent
and useful OAM functionality. As MPLS has matured, relationships
between providers has become more complex. Furthermore, the
deployment of multiple concurrent applications of MPLS is common
place, leading to a need to consider broader and more uniform
solutions, rather than very specific ad hoc point solutions.
4. Requirements
The following sections enumerate the OAM requirements
gathered from service providers who have deployed MPLS
and services based on MPLS networks. Each requirement is
specified in detail to clarify its applicability.
Although the requirements specified herein are defined by
the IETF, they have been harmonized with requirements
gathered by other standards bodies such as the ITU [Y1710].
4.1 Detection of Broken Label Switch Paths
The ability to detect a broken Label Switch Path (LSP)
SHOULD not require manual hop-by-hop troubleshooting of
each LSR used to switch traffic for that LSP. For example,
it is not desirable to manually visit each LSR along the data
plane path used to transport an LSP; instead,this function
SHOULD be automated and performed from the origination of that LSP.
This implies solutions that are interoperable as to allow for
such automatic operation. Furthermore, the automation of path
liveliness is desired in cases where large numbers of LSPs might
be tested. For example, automated ingress LSR to egress LSR testing
MPLS Working Group Expires April 2005 [Page 4]
Internet Draft MPLS OAM Requirements September 2004
functionality is desired for some LSPs. The goal is to detect LSP
problems before customers do, and this requires detection of
problems in a "reasonable" amount of time. One useful definition
of reasonable is both predictable and consistent.
Synchronization of detection time bounds by tools used to detect
broken LSPs is required. Failure to specifying defect detection
time bounds may result in an ambiguity in test results. If the
time to detect is known, then automated responses can be specified
both with respect to and with regard to resiliency and service
level specification reporting. Further, if synchronization of
detection time bounds is possible, an operational framework can be
established that can guide the design and specification of MPLS
applications.
Although ICMP-based ping can be sent through an LSP, the use of
this tool to verify the LSP path liveliness has the potential for
returning erroneous results (both positive and negative). For
example, failures may occur when
inconsistencies exist between the IP and MPLS forwarding tables,
inconsistencies in the MPLS control and data planes or problems
with the reply path (i.e., a reverse path does not exist). As an
example of a false positive, consider the case where the MPLS data
plane flows through a network node using a different output line
card than the data plane does to each the next neighbor. Also
assume that although the control plane is functional, the data
plane on the output line card where data traffic is programmed to
exit the device is defective. Now, if an LSP is signaled using
this node, any test based solely on the control plane's view of the
world (i.e., ICMP-based) will return with a false positive result
because although the control plane traffic at the node in the
example would be forwarded correctly, the actual data plane
switching at the node in the example would misroute or drop any
traffic transmitted onto that LSP. An example of a false
negative case would be when a functioning return path does not
exist. In this case, neither a positive nor a negative reply
will be received by the sender.
The OAM packet MUST follow exactly the customer data path in order
to reflect path liveliness used by customer data. Particular cases
of interest are forwarding mechanisms such as equal cost multipath
(ECMP) scenarios within the operator's network whereby flows are
load-shared across parallel (i.e., equal IGP cost) paths. Where
the customer traffic MAY be spread over multiple paths, what is
required is to be able to detect failures on any of the path
permutations. Where the spreading mechanism is payload specific,
payloads need to have forwarding that is common with the traffic
under test. Satisfying these requirements introduces complexity
into ensuring that ECMP connectivity permutations are exercised,
and that defect detection occurs in a reasonable amount of time.
MPLS Working Group Expires April 2005 [Page 5]
Internet Draft MPLS OAM Requirements September 2004
4.2 Diagnosis of a Broken Label Switch Path
The ability to diagnose a broken LSP and to isolate the failed
component (i.e., link or node) in the path is required. For
example, note that specifying recovery actions for misbranching
defects in an LDP network is a particularly difficult case.
Diagnosis of defects and isolation of the failed component is
best accomplished via a path trace function which can return the
the entire list of LSRs and links used by a certain LSP (or at
least the set of LSRs/links up to the location of the defect) is
required. The tracing capability SHOULD include the ability to
trace recursive paths, such as when nested LSPs are used, or when
LSPs enter and exit traffic-engineered tunnels. This path trace
function MUST also be capable of diagnosing LSP mis-merging by
permitting comparison of expected vs. actual forwarding behavior
at any LSR in the path. The path trace capability SHOULD be
capable of being executed from both the head-end Label Switch
Router (LSR) and any mid-point LSR. Additionally, the path trace
function MUST have the ability to support equal cost multipath
scenarios described above in section 4.1.
4.3 Path characterization
The path characterization function is the ability to reveal details
of LSR forwarding operations. These details can then be compared
later during subsequent testing relevant to OAM functionality.
This would include but is not limited to:
- consistent use of pipe or uniform TTL models by an LSR
- externally visible aspects of load spreading such as
ECMP, the specific algorithm(s) used, and examples of how
algorithm will spread traffic.
- data/control plane OAM capabilities of the LSR, such as
the ability of the data plane to reveal how it will forward
an LSP so that this may be compared with the control
plane notion of how this LSP is to be forwarded.
- stack operations performed by the LSR, such as pushes, pops,
and time to live (TTL) propigation at penultimate hop LSRs.
4.4 Service Level Agreement Measurement
Mechanisms are required to measure the diverse aspects of Service
Level Agreements:
- availability - in which the service is considered to be
available and the other aspects of performance measurement
listed below have meaning, or unavailable and other aspects
of performance measurement do not.
- latency - amount of time required for traffic to transit
MPLS Working Group Expires April 2005 [Page 6]
Internet Draft MPLS OAM Requirements September 2004
the network
- packet loss
- jitter - measurement of latency variation
Such measurements can be made independently of the user traffic
or via a hybrid of user traffic measurement and OAM probing.
At least one mechanism is required to measure the number
of OAM packets. In addition, the ability to measure the qualitative
aspects of OAM probing MUST be available to specifically compute
the latency of OAM packets generated and received at each end of a
tested LSP so as to accurately reflect the latency of data packets
traveling along that LSP. Note that latency is a measurable
parameter in service level specification reporting. Any method
considered SHOULD be capable of measuring the latency of an LSP
with minimal impact on network resources.
4.5 Frequency of OAM Execution
The operator MUST have the flexibility to configure OAM
parameters. This includes the frequency of the execution of any OAM
functions. The capability to synchronize OAM operations is desired
as to to permit consistent measurement of service level agreements.
To elaborate, there are defect conditions such as misbranching or
misdirection of traffic for which probe-based detection mechanisms
that incur significant mismatches in the probe rate MAY result in
flapping.
One observation would be that wide-spread deployment of MPLS, common
implementation of monitoring tools and the need for
inter-carrier synchronization of defect and service level
specification handling will drive specification of OAM parameters
to commonly agreed on values and such values will have to be
harmonized with the surrounding technologies (e.g. SONET/SDH,
ATM etc.) in order to be useful. This will become particularly
important as networks scale and misconfiguration can result in
churn, alarm flapping etc.
4.5 Alarm Suppression, Aggregation and Layer Coordination
Devices MUST provide alarm suppression functionality that prevents
the generation of superfluous generation of alarms by simply
discarding them (or not generating them in the first place), or by
aggregating them together, and thereby greatly reducing the number
of notifications emitted. When viewed in conjuction with
requirement 4.6 below, this typically requires fault notification
to the LSP egress that
MAY have specific time constraints if the application using the LSP
independently implements path continuity testing (for example ATM
MPLS Working Group Expires April 2005 [Page 7]
Internet Draft MPLS OAM Requirements September 2004
I.610 Continuity check (CC)[I610]). These constraints apply to
LSPs that are monitored. The nature of MPLS applications allows
for an arbitrary hierarchy of LSPs. This introduces the
opportunity to have multiple MPLS applications attempt to respond
to defects simultaneously. For example, layer-3 MPLS VPNs that
utilize Traffic Engineered tunnels, where a failure occurs on the
LSP carrying the Traffic Engineered tunnel. This failure would
affect he VPN traffic that uses the tunnel's LSP. Mechanisms are
required to coordinate network response to defects.
4.6 Support for OAM Interworking for Fault Notification
An LSR supporting the interworking of one or more networking
technologies over MPLS MUST be able to translate an MPLS defect
into the native technology's error condition. For example, errors
occurring over a MPLS transport LSP that supports an emulated
ATM VC MUST translate errors into native ATM OAM AIS cells at the
termination points of the LSP. The mechanism SHOULD consider
possible bounded detection time parameters, e.g., a "hold off"
function before reacting as to synchronize with the OAM functions.
One goal would be alarm suppression by the upper layer using
the LSP. As observed in section 4.5, this requires that MPLS
perform detection in a bounded timeframe in order to initiate
alarm suppression prior to the upper layer independently
detecting the defect.
4.7 Error Detection and Recovery.
Recovery from a fault can be facilitated by OAM procedures. It is
often desirable for such recovery to be automatic. It is a
requirement that faults be detected prior to customers detecting
them.
4.8 Standard Management Interfaces
The wide-spread deployment of MPLS requires common information
modeling of management and control of OAM functionality. This is
reflected in the the integration of standard MPLS-related MIBs
(e.g. [RFC3813][RFC3812][RFC3814]) for fault, statistics and
configuration management. These standard interfaces provide
operators with common programmatic interface access to
operations and management functions and their status.
4.9 Detection of Denial of Service Attacks
The ability to detect denial of service (DoS) attacks against the
data or control plaes MUST be part of any security management
related to MPLS OAM tools or techniques.
MPLS Working Group Expires April 2005 [Page 8]
Internet Draft MPLS OAM Requirements September 2004
4.10 Per-LSP Accounting Requirements
In an MPLS network, service providers (SPs) can measure traffic
from an LSR to the egress of the network using some MPLS related
MIBs, for example. This means that it is a reasonable to know how
much traffic is traveling from where to where (i.e., a traffic
matrix) by analyzing the flow of traffic. Therefore, traffic
accounting in an MPLS network can be summarized as the following
three items.
(1) Collecting information to design network
Providers and their customers MAY need to verify high-level
service level specifications, either to continuously
optimize their networks, or to offer guaranteed bandwidth
services. Therefore, traffic accounting to monitor MPLS
applications is required.
(2) Providing a Service Level Specification
For the purpose of optimized network design, a service
provider may offer the traffic informationr. Optimizing
network design needs this information.
(3) Inter-AS environment
Service providers that offer inter-as services require
accounting of those services.
These three motivations need to satisfy the following.
- In (1) and (2), collection of information on a per-LSP
basis is a minimum level of granularity of collecting
accounting information at both of ingress and egress
of an LSP.
- In (3), SP's ASBR carry out interconnection functions as an
intermediate LSR. Therefore, identifying a pair of ingress
and egress LSRs using each LSP is needed to determine the
cost of the service that a customer is using.
4.10.1 Requirements
Accounting on a per-LSP basis encompasses the following set of
functions:
(1) At an ingress LSR accounting of traffic through LSPs
beginning at each egress in question.
(2) At an intermediate LSR, accounting of traffic through
MPLS Working Group Expires April 2005 [Page 9]
Internet Draft MPLS OAM Requirements September 2004
LSPs for each pair of ingress to egress.
(3) At egress LSR, accounting of traffic through LSPs
for each ingress.
(4) All LSRs that contain LSPs that are being measuremented
need to have a common key to distinguish each LSP.
The key MUST be unique to each LSP, and its mapping to
LSP SHOULD be provided from whether manual or automatic
configuration.
In the case of non-merged LSPs, this can be achieved by
simply reading traffic counters for the label stack associated
with the LSP at any LSR along its path. However, in order to
measure merged LSPs, an LSR MUST have a means to distinguish
the source of each flow so as to disambiguate the statistics.
For example, the LSR might use an additional label as a key,
further inspect the packet to uncover its source address,
etc...
4.10.2 Scalability
It is not realistic to perform the just described operations by
LSRs in a network on all LSPs that exist in a network.
At a minimum, per-LSP based accounting SHOULD be performed on the
edges of the network -- at the edges of both LSPs and the MPLS
domain.
5. Security Considerations
Provisions to any of the tools designed to satisfy the requirements
described herin are required to prevent their unauthorized use.
Likewise, these tools MUST provide a means by which an operator
can prevent denial of service attacks if those tools are used in
such an attack.
LSP mis-merging has security implications beyond that of simply
being a network defect. LSP mis-merging can happen due to a number
of potential sources of failure, some of which (due to MPLS label
stacking) are new to MPLS.
The performance of diagnostic functions and path characterization
involve extracting a significant amount of information about
network construction which the network operator MAY consider
private.
6. IANA Considerations
This document creates no new requirements on IANA namespaces
[RFC2434].
MPLS Working Group Expires April 2005 [Page 10]
Internet Draft MPLS OAM Requirements September 2004
7. References
7.1 Informative References
[RFC3812] Srinivasan, C., Viswanathan, A. and T.
Nadeau, "MPLS Traffic Engineering Management
Information Base Using SMIv2", RFC3812, June 2004.
[RFC3813] Srinivasan, C., Viswanathan, A. and T.
Nadeau, "MPLS Label Switch Router Management
Information Base Using SMIv2", RFC3813, June 2004.
[RFC3814] Nadeau, T., Srinivasan, C., and A.
Viswanathan, "Multiprotocol Label Switching
(MPLS) FEC-To-NHLFE (FTN) Management
Information Base", RFC3814, June 2004.
[MPLS-TE] Awduche et. al., "RSVP-TE: Extensions to RSVP for LSP
Tunnels", RFC 3209, December 2001
[FRR] Pan et.al., "Fast Reroute Extensions to RSVP-TE for LSP
Tunnels", Internet draft,
<draft-ietf-mpls-rsvp-lsp-fastreroute-04.txt>,
February 2004.
[Y1710] ITU-T Recommendation Y.1710, "Requirements for
OAM Functionality In MPLS Networks"
[I610] ITU-T Recommendation I.610, "B-ISDN operations and
maintenance principles and functions", February 1999
[RFC2434] Narten, T. and H. Alvestrand, "Guidelines for Writing
an IANA Considerations section in RFCs", BCP 26, RFC
2434, October 1998.
8. Authors' Addresses
Thomas D. Nadeau
Cisco Systems, Inc.
300 Beaver Brook Road
Boxboro, MA 01719
Phone: +1-978-936-1470
Email: tnadeau@cisco.com
Monique Jeanne Morrow
MPLS Working Group Expires April 2005 [Page 11]
Internet Draft MPLS OAM Requirements September 2004
Cisco Systems, Inc.
Glatt-Com, 2nd Floor
CH-8301
Switzerland
Voice: (0)1 878-9412
Email: mmorrow@cisco.com
George Swallow
Cisco Systems, Inc.
300 Beaver Brook Road
Boxboro, MA 01719
Voice: +1-978-936-1398
Email: swallow@cisco.com
David Allan
Nortel Networks
3500 Carling Ave.
Ottawa, Ontario, CANADA
Voice: 1-613-763-6362
Email: dallan@nortelnetworks.com
Satoru Matsushima
Japan Telecom
4-7-1, Hatchobori, Chuo-ku
Tokyo, 104-8508 Japan
Phone: +81-3-5540-8214
Email: satoru@ft.solteria.net
9. Copyright Notice
Copyright (C) The Internet Society (2004). All Rights Reserved.
10. Intellectual Property Statement
The IETF takes no position regarding the validity or scope of any
Intellectual Property Rights or other rights that might be claimed to
pertain to the implementation or use of the technology described in
this document or the extent to which any license under such rights
might or might not be available; nor does it represent that it has
made any independent effort to identify any such rights. Information
on the procedures with respect to rights in RFC documents can be
found in BCP 78 and BCP 79.
Copies of IPR disclosures made to the IETF Secretariat and any
assurances of licenses to be made available, or the result of an
attempt made to obtain a general license or permission for the use of
such proprietary rights by implementers or users of this
specification can be obtained from the IETF on-line IPR repository at
http://www.ietf.org/ipr.
MPLS Working Group Expires April 2005 [Page 12]
Internet Draft MPLS OAM Requirements September 2004
The IETF invites any interested party to bring to its attention any
copyrights, patents or patent applications, or other proprietary
rights that may cover technology that may be required to implement
this standard. Please address the information to the IETF at
ietf-ipr@ietf.org.
11. Disclaimer of Validity
This document and the information contained herein are provided on an
"AS IS" basis and THE CONTRIBUTOR, THE ORGANIZATION HE/SHE REPRESENTS
OR IS SPONSORED BY (IF ANY), THE INTERNET SOCIETY AND THE INTERNET
ENGINEERING TASK FORCE DISCLAIM ALL WARRANTIES, EXPRESS OR IMPLIED,
INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF THE
INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED
WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE.
12. Copyright Statement
Copyright (C) The Internet Society (2004). This document is subject
to the rights, licenses and restrictions contained in BCP 78, and
except as set forth therein, the authors retain all their rights.
13. Acknowledgment
Funding for the RFC Editor function is currently provided by the
Internet Society.
The authors wish to acknowledge and thank the following
individuals for their valuable comments to this document:
Adrian Smith, British Telecom; Chou Lan Pok, SBC; Mr.
Ikejiri, NTT Communications and Mr.Kumaki of KDDI.
Hari Rakotoranto, Miya Khono, Cisco Systems; Luyuan Fang, AT&T;
Danny McPherson, TCB; Dr.Ken Nagami, Ikuo Nakagawa, Intec Netcore,
and David Meyer.
MPLS Working Group Expires April 2005 [Page 13]
Html markup produced by rfcmarkup 1.129d, available from
https://tools.ietf.org/tools/rfcmarkup/