Plugin Development Guide
This document describes how to understand, develop and contribute a plugin.
There are 2 kinds of plugin:
- Tracing plugin. Follow the distributed tracing concept to collect spans with tags and logs.
- Meter plugin. Collect numeric metrics in Counter, Gauge, and Histogram formats.
We also provide the plugin test tool to verify the data collected and reported by the plugin. If you plan to contribute any plugin to our main repo, the data would be verified by this tool too.
Tracing plugin
Concepts
Span
The span is an important and recognized concept in the distributed tracing system. Learn about the span from the Google Dapper Paper and OpenTracing.
SkyWalking has supported OpenTracing and OpenTracing-Java API since 2017. Our concepts of the span are similar to that of the Google Dapper Paper and OpenTracing. We have also extended the span.
There are three types of span:
1.1 EntrySpan
The EntrySpan represents a service provider. It is also the endpoint on the server end. As an APM system, our target is the application servers. Therefore, almost all the services and MQ consumers are EntrySpans.
1.2 LocalSpan
The LocalSpan represents a normal Java method that does not concern remote services. It is neither an MQ producer/consumer nor a service (e.g. HTTP service) provider/consumer.
1.3 ExitSpan
The ExitSpan represents a client of a service or an MQ producer. It was named the LeafSpan in the early versions of SkyWalking. For example, accessing a DB through JDBC and reading Redis/Memcached are classified as ExitSpans.
ContextCarrier
In order to implement distributed tracing, cross-process tracing has to be bound, and the context must propagate across the process. This is where the ContextCarrier comes in.
Here are the steps on how to use the ContextCarrier in an A->B distributed call.
- Create a new and empty ContextCarrier on the client end.
- Create an ExitSpan by ContextManager#createExitSpan or use ContextManager#inject to initialize the ContextCarrier.
- Place all items of ContextCarrier into headers (e.g. HTTP headers), attachments (e.g. Dubbo RPC framework) or messages (e.g. Kafka).
- The ContextCarrier propagates to the server end through the service call.
- On the server end, obtain all items from the headers, attachments or messages.
- Create an EntrySpan by ContextManager#createEntrySpan or use ContextManager#extract to bind the client and server ends.
See the following examples, where we use the Apache HTTPComponent client plugin and Tomcat 7 server plugin:
- Using the Apache HTTPComponent client plugin on the client end
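ContextCarrier contextCarrier = new ContextCarrier();
// createExitSpan fills the carrier with the context to propagate.
AbstractSpan span = ContextManager.createExitSpan("/span/operation/name", contextCarrier, "ip:port");
// Copy every carrier item into an HTTP request header.
CarrierItem next = contextCarrier.items();
while (next.hasNext()) {
    next = next.next();
    httpRequest.setHeader(next.getHeadKey(), next.getHeadValue());
}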
- Using the Tomcat 7 server plugin on the server end
ContextCarrier contextCarrier = new ContextCarrier();
// Fill every carrier item from the matching HTTP request header.
CarrierItem next = contextCarrier.items();
while (next.hasNext()) {
    next = next.next();
    next.setHeadValue(request.getHeader(next.getHeadKey()));
}
AbstractSpan span = ContextManager.createEntrySpan("/span/operation/name", contextCarrier);
ContextSnapshot
Besides cross-process tracing, cross-thread tracing has to be supported as well. For instance, both async process (in-memory MQ) and batch process are common in Java. Cross-process and cross-thread tracing are very similar in that they both require propagating context, except that cross-thread tracing does not require serialization.
Here are the three steps on cross-thread propagation:
- Use ContextManager#capture to get the ContextSnapshot object.
- Let the sub-thread access the ContextSnapshot through method arguments or by carrying it in existing arguments.
- Use ContextManager#continued in the sub-thread.
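The three steps above can be illustrated with a small, self-contained model. This is only a sketch: MiniContext and Snapshot are hypothetical stand-ins written for this example, not the agent's ContextManager or ContextSnapshot, and the thread-local storage is an assumption made to show why an explicit hand-over is needed.

```java
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

class CrossThreadSketch {
    /** Hypothetical immutable copy of the context; no serialization is involved. */
    static final class Snapshot {
        final String traceId;

        Snapshot(String traceId) {
            this.traceId = traceId;
        }
    }

    /** Hypothetical thread-bound context, standing in for the real tracing context. */
    static final class MiniContext {
        private static final ThreadLocal<String> TRACE_ID = new ThreadLocal<>();

        static void start(String traceId) { TRACE_ID.set(traceId); }
        static String currentTraceId() { return TRACE_ID.get(); }
        static void clear() { TRACE_ID.remove(); }

        /** Step 1: copy the calling thread's context into a snapshot. */
        static Snapshot capture() { return new Snapshot(TRACE_ID.get()); }

        /** Step 3: bind the snapshot to whichever thread calls this method. */
        static void continued(Snapshot snapshot) { TRACE_ID.set(snapshot.traceId); }
    }

    /** Runs a task on another thread and returns the trace ID that thread observed. */
    static String traceIdSeenByWorker(boolean handOver) {
        MiniContext.start("trace-1");
        Snapshot snapshot = MiniContext.capture();
        ExecutorService pool = Executors.newSingleThreadExecutor();
        try {
            // Step 2: the snapshot reaches the sub-thread as a captured argument.
            Future<String> seen = pool.submit(() -> {
                if (handOver) {
                    MiniContext.continued(snapshot);
                }
                try {
                    return MiniContext.currentTraceId();
                } finally {
                    MiniContext.clear();
                }
            });
            return seen.get();
        } catch (Exception e) {
            throw new IllegalStateException(e);
        } finally {
            pool.shutdown();
            MiniContext.clear();
        }
    }

    public static void main(String[] args) {
        System.out.println(traceIdSeenByWorker(true));  // prints "trace-1"
        System.out.println(traceIdSeenByWorker(false)); // prints "null"
    }
}
```

Without the hand-over, the worker thread starts with an empty context, which is the gap that capture and continued close in a real plugin.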
Core APIs
ContextManager
ContextManager provides all major and primary APIs.
- Create EntrySpan
public static AbstractSpan createEntrySpan(String endpointName, ContextCarrier carrier)
Create EntrySpan according to the operation name (e.g. service name, uri) and ContextCarrier.
- Create LocalSpan
public static AbstractSpan createLocalSpan(String endpointName)
Create LocalSpan according to the operation name (e.g. full method signature).
- Create ExitSpan
public static AbstractSpan createExitSpan(String endpointName, ContextCarrier carrier, String remotePeer)
Create ExitSpan according to the operation name (e.g. service name, uri) and the new ContextCarrier and peer address (e.g. ip+port, hostname+port).
AbstractSpan
/**
* Set the component id, which defines in {@link ComponentsDefine}
*
* @param component
* @return the span for chaining.
*/
AbstractSpan setComponent(Component component);
AbstractSpan setLayer(SpanLayer layer);
/**
* Set a key:value tag on the Span.
*
* @return this Span instance, for chaining
*/
AbstractSpan tag(String key, String value);
/**
* Record an exception event of the current walltime timestamp.
*
* @param t any subclass of {@link Throwable}, which occurs in this span.
* @return the Span, for chaining
*/
AbstractSpan log(Throwable t);
AbstractSpan errorOccurred();
/**
* Record an event at a specific timestamp.
*
* @param timestamp The explicit timestamp for the log record.
* @param event the events
* @return the Span, for chaining
*/
AbstractSpan log(long timestamp, Map<String, ?> event);
/**
* Sets the string name for the logical operation this span represents.
*
* @return this Span instance, for chaining
*/
AbstractSpan setOperationName(String endpointName);
Besides setting the operation name, tags and logs, two attributes must be set, namely the component and layer. This is especially important for the EntrySpan and ExitSpan.
SpanLayer is the type of span. There are 5 values:
- UNKNOWN (default)
- DB
- RPC_FRAMEWORK (designed for the RPC framework, rather than an ordinary HTTP call)
- HTTP
- MQ
Component IDs are defined and reserved by the SkyWalking project.
For extension of the component name/ID, please follow the OAP server Component library settings document.
Special Span Tags
All tags are available in the trace view. Meanwhile, in the OAP backend analysis, some special tags or tag combinations provide other advanced features.
Tag key http.status_code
The value should be an integer. The response code of OAL entities corresponds to this value.
Tag keys db.statement and db.type
The value of db.statement should be a string that represents the database statement, such as SQL, or [No statement]/+span#operationName if the value is empty.
When the exit span contains this tag, OAP samples the slow statements based on agent-analyzer/default/maxSlowSQLLength.
The threshold of slow statement is defined in accordance with agent-analyzer/default/slowDBAccessThreshold.
Check Slow Database Statement document of OAP server for details.
Extension logic endpoint: Tag key x-le
The logic endpoint is a concept that doesn’t represent a real RPC call, but still requires statistics.
The value of x-le should be in JSON format. There are two options:
- Define a separate logic endpoint. Provide its own endpoint name, latency and status. Suitable for entry and local spans.
{
  "name": "GraphQL-service",
  "latency": 100,
  "status": true
}
- Declare the current local span representing a logic endpoint.
{
  "logic-span": true
}
Virtual Database Related Tags
SkyWalking analyzes database (SQL-like) performance metrics through the following tags.
public static final StringTag DB_TYPE = new StringTag(3, "db.type");
public static final StringTag DB_STATEMENT = new StringTag(5, "db.statement");
- db.type records the database type, such as sql, cassandra, Elasticsearch.
- db.statement records the SQL statement of the database access.
Read backend’s virtual database doc for more details.
Virtual Cache Related Tags
SkyWalking analyzes cache performance related metrics through the following tags.
public static final StringTag CACHE_TYPE = new StringTag(15, "cache.type");
public static final StringTag CACHE_CMD = new StringTag(17, "cache.cmd");
public static final StringTag CACHE_OP = new StringTag(16, "cache.op");
public static final StringTag CACHE_KEY = new StringTag(18, "cache.key");
- cache.type indicates the cache type, usually the official name of the cache (e.g. Redis).
- cache.cmd indicates the cache command that would be sent to the cache server (e.g. setnx).
- cache.op indicates whether the command is used for a write or read operation; usually the value is converted from the command.
- cache.key indicates the cache key that would be sent to the cache server. This tag may be null, as usually only string-type keys are collected.
To decide flexibly which op a command should be converted to, it is better to provide a config property. For reference, see the Jedis-4.x-plugin.
Virtual Message Queue (MQ) Related Tags
SkyWalking analyzes MQ performance related metrics through the following tags.
public static final StringTag MQ_QUEUE = new StringTag(7, "mq.queue");
public static final StringTag MQ_TOPIC = new StringTag(9, "mq.topic");
public static final StringTag TRANSMISSION_LATENCY = new StringTag(15, "transmission.latency", false);
- mq.queue indicates the MQ queue name.
- mq.topic indicates the MQ topic name. It’s optional, as some MQs don’t have the concept of a topic.
- transmission.latency indicates the transmission latency from producer to consumer. Usually you don’t need to record this tag manually; instead, call contextCarrier.extensionInjector().injectSendingTimestamp(); to record the sendingTimestamp tag on the producer side, and SkyWalking will record this tag on the consumer side if the sw8-x context carrier (from the producer side) contains sendingTimestamp.
Notice: you should set peer on both sides (producer and consumer), and the value of peer should represent the MQ server cluster.
Advanced APIs
Async Span APIs
There is a set of advanced APIs in Span which is specifically designed for async use cases. When tags, logs, and attributes (including end time) of the span need to be set in another thread, you should use these APIs.
/**
* The span finish at current tracing context, but the current span is still alive, until {@link #asyncFinish}
* called.
*
* This method must be called<br/>
* 1. In original thread(tracing context).
* 2. Current span is active span.
*
* During alive, tags, logs and attributes of the span could be changed, in any thread.
*
* The execution times of {@link #prepareForAsync} and {@link #asyncFinish()} must match.
*
* @return the current span
*/
AbstractSpan prepareForAsync();
/**
* Notify the span, it could be finished.
*
* The execution times of {@link #prepareForAsync} and {@link #asyncFinish()} must match.
*
* @return the current span
*/
AbstractSpan asyncFinish();
- Call #prepareForAsync in the original context.
- Run ContextManager#stopSpan in the original context when your job in the current thread is complete.
- Propagate the span to any other thread.
- Once the above steps are all set, call #asyncFinish in any thread.
- When every span that called #prepareForAsync has also called #asyncFinish, the tracing context finishes and is reported to the backend (based on the count of API executions).
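The counting rule stated in the Javadoc (the number of prepareForAsync and asyncFinish calls must match) can be modelled in a few lines. This is a minimal sketch under stated assumptions: MiniTracingContext is a hypothetical stand-in that only counts calls, and it does not reproduce how the agent core actually tracks or reports spans.

```java
class AsyncFinishSketch {
    /** Hypothetical stand-in for a tracing context; it only counts API calls. */
    static final class MiniTracingContext {
        private int pendingAsync;
        private boolean stopped;
        private boolean reported;

        /** Called in the original thread: one more span is waiting for asyncFinish. */
        synchronized void prepareForAsync() {
            pendingAsync++;
        }

        /** Called in the original thread when its work is complete. */
        synchronized void stopSpan() {
            stopped = true;
            reportIfDone();
        }

        /** Called in any thread: one waiting span is done. */
        synchronized void asyncFinish() {
            pendingAsync--;
            reportIfDone();
        }

        synchronized boolean isReported() {
            return reported;
        }

        private void reportIfDone() {
            if (stopped && pendingAsync == 0) {
                reported = true;
            }
        }
    }

    /** Returns {reported after stopSpan, reported after asyncFinish in another thread}. */
    static boolean[] run() {
        MiniTracingContext context = new MiniTracingContext();
        context.prepareForAsync();
        context.stopSpan();
        boolean afterStop = context.isReported();

        Thread worker = new Thread(context::asyncFinish);
        worker.start();
        try {
            worker.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
        return new boolean[] {afterStop, context.isReported()};
    }

    public static void main(String[] args) {
        boolean[] result = run();
        System.out.println(result[0]); // prints "false"
        System.out.println(result[1]); // prints "true"
    }
}
```

In this model a missing asyncFinish call leaves the context unreported forever, which is why the two call counts have to match.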
Develop a plugin
Abstract
The basic method to trace is to intercept a Java method, by using byte code manipulation tech and AOP concept. SkyWalking has packaged the byte code manipulation tech and tracing context propagation, so you simply have to define the intercept point (a.k.a. aspect pointcut in Spring).
Intercept
SkyWalking provides two common definitions to intercept constructor, instance method and class method.
v1 APIs
- Extend ClassInstanceMethodsEnhancePluginDefine to define constructor intercept points and instance method intercept points.
- Extend ClassStaticMethodsEnhancePluginDefine to define class method intercept points.
Of course, you can extend ClassEnhancePluginDefine to set all intercept points, although it is uncommon to do so.
v2 APIs
v2 APIs provide an enhanced interceptor, which could propagate context through MIC (MethodInvocationContext).
- Extend ClassInstanceMethodsEnhancePluginDefineV2 to define constructor intercept points and instance method intercept points.
- Extend ClassStaticMethodsEnhancePluginDefineV2 to define class method intercept points.
Of course, you can extend ClassEnhancePluginDefineV2 to set all intercept points, although it is uncommon to do so.
Implement plugin
See the following demonstration on how to implement a plugin by extending ClassInstanceMethodsEnhancePluginDefine.
- Define the target class name.
protected abstract ClassMatch enhanceClass();
ClassMatch represents how to match the target classes. There are 4 ways:
- byName: Based on the full class names (package name + . + class name).
- byClassAnnotationMatch: Depends on whether there are certain annotations in the target classes.
- byMethodAnnotationMatch: Depends on whether there are certain annotations in the methods of the target classes.
- byHierarchyMatch: Based on the parent classes or interfaces of the target classes.
Attention:
- Never use ThirdPartyClass.class in the instrumentation definitions, such as takesArguments(ThirdPartyClass.class) or byName(ThirdPartyClass.class.getName()), because ThirdPartyClass does not necessarily exist in the target application and this will break the agent. We have import checks to assist in checking this in CI, but they don’t cover all scenarios of this limitation, so never try to work around it with something like a fully qualified class name (FQCN): takesArguments(full.qualified.ThirdPartyClass.class) and byName(full.qualified.ThirdPartyClass.class.getName()) will pass the CI check, but are still invalid in the agent code. Therefore, use a fully qualified class name string literal instead.
- Even if you are perfectly sure that the class to be intercepted exists in the target application (such as JDK classes), still, do not use *.class.getName() to get the class String name. We recommend using a literal string. This is to avoid ClassLoader issues.
- by*AnnotationMatch does not support inherited annotations.
- We do not recommend using byHierarchyMatch unless necessary. Using it may trigger the interception of many unexpected methods, which would cause performance issues.
Example:
@Override
protected ClassMatch enhanceClass() {
    return byName("org.apache.catalina.core.StandardEngineValve");
}
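The reason for matching by string literal can be seen in a tiny standalone model. This is a sketch only: ByName here is a hypothetical class written for the example (not the agent's ClassMatch implementation), and org.example.missing.ThirdPartyClass is a made-up name assumed to be absent from the classpath.

```java
class NameMatchSketch {
    /** Hypothetical name-based matcher: it stores only a string and never loads the target class. */
    static final class ByName {
        private final String className;

        ByName(String className) {
            this.className = className;
        }

        boolean matches(String candidateClassName) {
            return className.equals(candidateClassName);
        }
    }

    /** Returns true when the named class can be loaded in this JVM. */
    static boolean isLoadable(String className) {
        try {
            Class.forName(className);
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            return false;
        }
    }

    public static void main(String[] args) {
        String target = "org.example.missing.ThirdPartyClass"; // assumed absent
        ByName matcher = new ByName(target);
        System.out.println(matcher.matches(target)); // prints "true"
        System.out.println(isLoadable(target));      // prints "false" when the class is absent
    }
}
```

The matcher works even though the class cannot be loaded, whereas any code that references ThirdPartyClass.class would need the class to be present just to link.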
- Define an instance method intercept point.
public InstanceMethodsInterceptPoint[] getInstanceMethodsInterceptPoints();
public interface InstanceMethodsInterceptPoint {
    /**
     * class instance methods matcher.
     *
     * @return methods matcher
     */
    ElementMatcher<MethodDescription> getMethodsMatcher();

    /**
     * @return represents a class name, the class instance must instanceof InstanceMethodsAroundInterceptor.
     */
    String getMethodsInterceptor();

    boolean isOverrideArgs();
}
You may also use Matcher to set the target methods. Return true in isOverrideArgs if you want to change the argument ref in the interceptor.
Please refer to bytebuddy for details of defining ElementMatcher.
In SkyWalking, we provide 3 classes to facilitate ElementMatcher definition:
- AnnotationTypeNameMatch: Check on whether there is a certain annotation in the target method.
- ReturnTypeNameMatch: Check the return type name (package name + . + class name) of the target method.
- ArgumentTypeNameMatch: Check on the argument index and the type name (package name + . + class name) of the target method.
Attention:
- In case of using ReturnTypeNameMatch and ArgumentTypeNameMatch, use [Lxxx; (Java file format defined in JVM Specification) to define an Array type. For example, you should write [Ljava.lang.String; for java.lang.String[].
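If you are unsure how an array type is spelled in this notation, a throwaway program can print it. The sketch below uses only JDK classes and is meant to be run outside the plugin; the printed string is then copied into the plugin as a literal, in line with the earlier advice not to call *.class.getName() inside instrumentation definitions.

```java
class ArrayTypeNameSketch {
    /** Returns the name the JVM reports for a type, which uses the [Lxxx; form for object arrays. */
    static String jvmName(Class<?> type) {
        return type.getName();
    }

    public static void main(String[] args) {
        System.out.println(jvmName(String[].class));   // prints "[Ljava.lang.String;"
        System.out.println(jvmName(String[][].class)); // prints "[[Ljava.lang.String;"
    }
}
```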
The following sections will tell you how to implement the interceptor.
- Add plugin definition into the skywalking-plugin.def file.
tomcat-7.x/8.x=TomcatInstrumentation
- Set up witnessClasses and/or witnessMethods if the instrumentation has to be activated in specific versions.
Example:
// The plugin is activated only when the foo.Bar class exists.
@Override
protected String[] witnessClasses() {
    return new String[] {
        "foo.Bar"
    };
}

// The plugin is activated only when the foo.Bar#hello method exists.
@Override
protected List<WitnessMethod> witnessMethods() {
    List<WitnessMethod> witnessMethodList = new ArrayList<>();
    WitnessMethod witnessMethod = new WitnessMethod("foo.Bar", ElementMatchers.named("hello"));
    witnessMethodList.add(witnessMethod);
    return witnessMethodList;
}
For more examples, see WitnessTest.java
Implement an interceptor
As an interceptor for an instance method, it has to implement org.apache.skywalking.apm.agent.core.plugin.interceptor.enhance.InstanceMethodsAroundInterceptor
/**
 * A interceptor, which intercept method's invocation. The target methods will be defined in {@link
 * ClassEnhancePluginDefine}'s subclass, most likely in {@link ClassInstanceMethodsEnhancePluginDefine}
 */
public interface InstanceMethodsAroundInterceptor {
    /**
     * called before target method invocation.
     *
     * @param result change this result, if you want to truncate the method.
     * @throws Throwable
     */
    void beforeMethod(EnhancedInstance objInst, Method method, Object[] allArguments, Class<?>[] argumentsTypes,
        MethodInterceptResult result) throws Throwable;

    /**
     * called after target method invocation. Even method's invocation triggers an exception.
     *
     * @param ret the method's original return value.
     * @return the method's actual return value.
     * @throws Throwable
     */
    Object afterMethod(EnhancedInstance objInst, Method method, Object[] allArguments, Class<?>[] argumentsTypes,
        Object ret) throws Throwable;

    /**
     * called when occur exception.
     *
     * @param t the exception occur.
     */
    void handleMethodException(EnhancedInstance objInst, Method method, Object[] allArguments, Class<?>[] argumentsTypes,
        Throwable t);
}
Use the core APIs before and after calling the method, as well as during exception handling.
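The order in which the three callbacks fire can be sketched with a trimmed model. MiniInterceptor below is a hypothetical stand-in with far fewer parameters than the real interface, and the invoke method is an assumption about the wiring that follows the Javadoc above (afterMethod runs even when the target throws); it is not the agent's actual enhancement code.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Supplier;

class AroundOrderSketch {
    /** Trimmed, hypothetical stand-in for an around interceptor. */
    interface MiniInterceptor {
        void beforeMethod();

        Object afterMethod(Object ret);

        void handleMethodException(Throwable t);
    }

    /** Records the callbacks in the order they fire. */
    static final class Recording implements MiniInterceptor {
        final List<String> calls = new ArrayList<>();

        public void beforeMethod() { calls.add("beforeMethod"); }

        public Object afterMethod(Object ret) { calls.add("afterMethod"); return ret; }

        public void handleMethodException(Throwable t) { calls.add("handleMethodException"); }
    }

    /** Wraps the target call: before, then the target, then exception handling if needed, then after. */
    static Object invoke(MiniInterceptor interceptor, Supplier<Object> target) {
        interceptor.beforeMethod();
        Object ret = null;
        try {
            ret = target.get();
        } catch (RuntimeException e) {
            interceptor.handleMethodException(e);
            throw e;
        } finally {
            ret = interceptor.afterMethod(ret);
        }
        return ret;
    }

    /** Returns the recorded callback order for a target that succeeds or throws. */
    static List<String> callOrder(boolean targetThrows) {
        Recording recording = new Recording();
        try {
            invoke(recording, () -> {
                if (targetThrows) {
                    throw new IllegalStateException("boom");
                }
                return "ok";
            });
        } catch (IllegalStateException expected) {
            // The exception still reaches the caller; it is swallowed here only for the demo.
        }
        return recording.calls;
    }

    public static void main(String[] args) {
        System.out.println(callOrder(false)); // prints "[beforeMethod, afterMethod]"
        System.out.println(callOrder(true));  // prints "[beforeMethod, handleMethodException, afterMethod]"
    }
}
```

In a tracing plugin this ordering typically maps to creating the span in beforeMethod, logging the throwable in handleMethodException, and stopping the span in afterMethod.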
V2 APIs
The V2 interceptor uses MethodInvocationContext context to replace the MethodInterceptResult result in beforeMethod, and adds it as a new parameter to afterMethod and handleMethodException.
MethodInvocationContext context is shared only within a single method invocation, so it is safe to use under concurrent execution.
/**
 * A v2 interceptor, which intercept method's invocation. The target methods will be defined in {@link
 * ClassEnhancePluginDefineV2}'s subclass, most likely in {@link ClassInstanceMethodsEnhancePluginDefineV2}
 */
public interface InstanceMethodsAroundInterceptorV2 {
    /**
     * called before target method invocation.
     *
     * @param context the method invocation context including result context.
     */
    void beforeMethod(EnhancedInstance objInst, Method method, Object[] allArguments, Class<?>[] argumentsTypes,
        MethodInvocationContext context) throws Throwable;

    /**
     * called after target method invocation. Even method's invocation triggers an exception.
     *
     * @param ret the method's original return value. May be null if the method triggers an exception.
     * @return the method's actual return value.
     */
    Object afterMethod(EnhancedInstance objInst, Method method, Object[] allArguments, Class<?>[] argumentsTypes,
        Object ret, MethodInvocationContext context) throws Throwable;

    /**
     * called when occur exception.
     *
     * @param t the exception occur.
     */
    void handleMethodException(EnhancedInstance objInst, Method method, Object[] allArguments,
        Class<?>[] argumentsTypes, Throwable t, MethodInvocationContext context);
}
Bootstrap class instrumentation
SkyWalking has packaged the bootstrap instrumentation in the agent core. You can easily implement it by declaring it in the instrumentation definition.
Override the public boolean isBootstrapInstrumentation() and return true. For example:
public class URLInstrumentation extends ClassEnhancePluginDefine {
    private static final String CLASS_NAME = "java.net.URL";

    @Override
    protected ClassMatch enhanceClass() {
        return byName(CLASS_NAME);
    }

    @Override
    public ConstructorInterceptPoint[] getConstructorsInterceptPoints() {
        return new ConstructorInterceptPoint[] {
            new ConstructorInterceptPoint() {
                @Override
                public ElementMatcher<MethodDescription> getConstructorMatcher() {
                    return any();
                }

                @Override
                public String getConstructorInterceptor() {
                    return "org.apache.skywalking.apm.plugin.jre.httpurlconnection.Interceptor2";
                }
            }
        };
    }

    @Override
    public InstanceMethodsInterceptPoint[] getInstanceMethodsInterceptPoints() {
        return new InstanceMethodsInterceptPoint[0];
    }

    @Override
    public StaticMethodsInterceptPoint[] getStaticMethodsInterceptPoints() {
        return new StaticMethodsInterceptPoint[0];
    }

    @Override
    public boolean isBootstrapInstrumentation() {
        return true;
    }
}
ClassEnhancePluginDefineV2 is provided in v2 APIs; #isBootstrapInstrumentation works there too.
NOTE: Bootstrap instrumentation should be used only where necessary. During its actual execution, it mostly affects the JRE core(rt.jar). Defining it other than where necessary could lead to unexpected results or side effects.
Provide custom config for the plugin
A plugin can behave differently based on its configuration. The SkyWalking plugin mechanism provides the configuration injection and initialization system in the agent core.
Every plugin could declare one or more classes to represent the config by using the @PluginConfig annotation. The agent core could initialize the static fields of this class through system environment variables, system properties, and the agent.config static file.
The #root() method in the @PluginConfig annotation requires declaring the root class for the initialization process. Typically, SkyWalking prefers to use nested inner static classes for the hierarchy of the configuration. We recommend using Plugin/plugin-name/config-key as the nested classes structure of the config class.
NOTE: because of the Java ClassLoader mechanism, the @PluginConfig annotation should be added on the real class used in the interceptor codes.
In the following example, @PluginConfig(root = SpringMVCPluginConfig.class) indicates that initialization should start with using SpringMVCPluginConfig as the root. Then, the config key of the attribute USE_QUALIFIED_NAME_AS_ENDPOINT_NAME should be plugin.springmvc.use_qualified_name_as_endpoint_name.
public class SpringMVCPluginConfig {
    public static class Plugin {
        // NOTE, if move this annotation on the `Plugin` or `SpringMVCPluginConfig` class, it no longer has any effect.
        @PluginConfig(root = SpringMVCPluginConfig.class)
        public static class SpringMVC {
            /**
             * If true, the fully qualified method name will be used as the endpoint name instead of the request URL,
             * default is false.
             */
            public static boolean USE_QUALIFIED_NAME_AS_ENDPOINT_NAME = false;

            /**
             * This config item controls that whether the SpringMVC plugin should collect the parameters of the
             * request.
             */
            public static boolean COLLECT_HTTP_PARAMS = false;
        }

        @PluginConfig(root = SpringMVCPluginConfig.class)
        public static class Http {
            /**
             * When {@link Plugin.SpringMVC#COLLECT_HTTP_PARAMS} is enabled, how many characters to keep and send
             * to the OAP backend, use negative values to keep and send the complete parameters, NB. this config item is
             * added for the sake of performance
             */
            public static int HTTP_PARAMS_LENGTH_THRESHOLD = 1024;
        }
    }
}
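The mapping from the nested class structure to the config key can be reproduced with a few lines of reflection. This is a sketch under assumptions: DemoConfig is a trimmed, hypothetical copy of the structure above, and keyOf only illustrates the naming rule (nested class names plus field name, lower-cased and joined by dots); it is not the agent core's initializer.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.Locale;

class ConfigKeySketch {
    /** Trimmed, hypothetical config holder that mirrors the nesting used in this guide. */
    static class DemoConfig {
        static class Plugin {
            static class SpringMVC {
                static boolean USE_QUALIFIED_NAME_AS_ENDPOINT_NAME = false;
            }
        }
    }

    /** Joins the nested class names below root and the field name, lower-cased, with dots. */
    static String keyOf(Class<?> root, Class<?> holder, String fieldName) {
        Deque<String> parts = new ArrayDeque<>();
        parts.push(fieldName);
        // Walk outwards from the holder class until the root class is reached.
        for (Class<?> c = holder; c != null && c != root; c = c.getDeclaringClass()) {
            parts.push(c.getSimpleName());
        }
        return String.join(".", parts).toLowerCase(Locale.ROOT);
    }

    public static void main(String[] args) {
        String key = keyOf(DemoConfig.class, DemoConfig.Plugin.SpringMVC.class,
            "USE_QUALIFIED_NAME_AS_ENDPOINT_NAME");
        System.out.println(key); // prints "plugin.springmvc.use_qualified_name_as_endpoint_name"
    }
}
```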
Meter Plugin
Java agent plugin could use meter APIs to collect metrics for backend analysis.
Counter API represents a single monotonically increasing counter which automatically collects data and reports to the backend.
import org.apache.skywalking.apm.agent.core.meter.MeterFactory;

Counter counter = MeterFactory.counter(meterName).tag("tagKey", "tagValue").mode(Counter.Mode.INCREMENT).build();
counter.increment(1d);
- MeterFactory.counter creates a new counter builder with the meter name.
- Counter.Builder.tag(String key, String value) marks a tag key/value pair.
- Counter.Builder.mode(Counter.Mode mode) changes the counter mode. RATE mode means the reporting rate to the backend.
- Counter.Builder.build() builds a new Counter which is collected and reported to the backend.
- Counter.increment(double count) increments the Counter by count. The count should be a positive value.
Gauge API represents a single numerical value.
import org.apache.skywalking.apm.agent.core.meter.MeterFactory;

ThreadPoolExecutor threadPool = ...;
// getActiveCount() returns an int, so cast it to match Supplier<Double>.
Gauge gauge = MeterFactory.gauge(meterName, () -> (double) threadPool.getActiveCount()).tag("tagKey", "tagValue").build();
- MeterFactory.gauge(String name, Supplier<Double> getter) creates a new gauge builder with the meter name and supplier function. This function must return a double value.
- Gauge.Builder.tag(String key, String value) marks a tag key/value pair.
- Gauge.Builder.build() builds a new Gauge which is collected and reported to the backend.
Histogram API represents a summary of sample observations with customized buckets.
import java.util.Arrays;
import org.apache.skywalking.apm.agent.core.meter.MeterFactory;

// steps takes a List<Double>, so the values are written as double literals.
Histogram histogram = MeterFactory.histogram("test").tag("tagKey", "tagValue").steps(Arrays.asList(1d, 5d, 10d)).minValue(0).build();
histogram.addValue(3);
- MeterFactory.histogram(String name) creates a new histogram builder with the meter name.
- Histogram.Builder.tag(String key, String value) marks a tag key/value pair.
- Histogram.Builder.steps(List<Double> steps) sets up the max value of every histogram bucket.
- Histogram.Builder.minValue(double value) sets up the minimal value of this histogram. Default is 0.
- Histogram.Builder.build() builds a new Histogram which is collected and reported to the backend.
- Histogram.addValue(double value) adds a value into the histogram, and automatically analyzes which bucket count needs to be incremented. Rule: count into [step1, step2).
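The [step1, step2) rule can be checked by hand with a short standalone sketch. It rests on two assumptions that the text above does not spell out: the minimal value acts as the lowest bound, and values at or above the last step fall into an open-ended bucket. bucketOf is a hypothetical helper written for this example, not part of the meter API.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

class HistogramBucketSketch {
    /** Returns the half-open interval [lower, upper) that receives the value. */
    static String bucketOf(double minValue, List<Double> steps, double value) {
        List<Double> bounds = new ArrayList<>();
        // Assumption: the minimal value is the lowest bound when it lies below the first step.
        if (steps.isEmpty() || minValue < steps.get(0)) {
            bounds.add(minValue);
        }
        bounds.addAll(steps);
        Collections.sort(bounds);

        if (value < bounds.get(0)) {
            return "below range";
        }
        for (int i = 0; i < bounds.size() - 1; i++) {
            if (value >= bounds.get(i) && value < bounds.get(i + 1)) {
                return "[" + bounds.get(i) + ", " + bounds.get(i + 1) + ")";
            }
        }
        // Assumption: the last bucket has no upper bound.
        return "[" + bounds.get(bounds.size() - 1) + ", +inf)";
    }

    public static void main(String[] args) {
        List<Double> steps = Arrays.asList(1d, 5d, 10d);
        System.out.println(bucketOf(0, steps, 3));  // prints "[1.0, 5.0)"
        System.out.println(bucketOf(0, steps, 5));  // prints "[5.0, 10.0)"
        System.out.println(bucketOf(0, steps, 12)); // prints "[10.0, +inf)"
    }
}
```

With the steps from the example above, histogram.addValue(3) would therefore be counted in the interval that starts at 1 and ends before 5.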
Plugin Test Tool
The Apache SkyWalking Agent Test Tool Suite is a test tool suite that is available in a wide variety of agent languages. It includes the mock collector and validator. The mock collector is a SkyWalking receiver, like the OAP server.
You could learn how to use this tool to test the plugin in this doc. This is a must if you want to contribute plugins to the SkyWalking official repo.
Contribute plugins to the Apache SkyWalking repository
We welcome everyone to contribute their plugins.
Please follow these steps:
- Submit an issue for your plugin, including any supported versions.
- Create sub modules under apm-sniffer/apm-sdk-plugin or apm-sniffer/optional-plugins, and the name should include supported library name and versions.
- Follow this guide to develop. Make sure comments and test cases are provided.
- Develop and test.
- Provide the automatic test cases. Learn how to write the plugin test case from this doc.
- Send a pull request and ask for review.
- The plugin committers will approve your plugin once the plugin CI-with-IT, e2e, and the plugin tests have passed.
- The plugin is accepted by SkyWalking.