1 微服务架构下的问题
- 在大型系统的微服务化构建中,一个系统会被拆分成许多模块。这些模块负责不同的功能,组合成系统,最终可以提供丰富的功能。在这种架构中,一次请求往往需要涉及到多个服务。互联网应用构建在不同的软件模块集上,这些软件模块,可能是由不同团队开发、可能使用不同的编程语言来实现、可能部署在几千台服务器上,横跨多个不同的数据中心,也就意味着这种架构形式也会存在一些问题:
如何快速的发现问题?
如何判断故障影响范围?
如何梳理服务依赖以及依赖的合理性?
如何分析链路性能问题以及实时容量规划?
- 分布式链路追踪(Distributed Tracing),就是将一次分布式请求还原成调用链路,进行日志记录,性能监控并将一次分布式请求的调用情况集中展示。比如各个服务节点上的耗时、请求具体到达那台机器上、每个服务节点的请求状态等等。
- 目前业界比较流行的链路追踪系统如:Twitter的Zipkin,阿里的鹰眼,美团的Mtrace,大众点评的cat等,大部分都是基于Google发表的Dapper。Dapper阐述了分布式系统,特别是微服务架构中链路追踪的概念、数据展示、埋点、传递、收集、存储和展示等技术细节。
2 Sleuth概述
2.1 简介
- Spring Cloud Sleuth主要功能就是在分布式系统中提供追踪解决方案,并且兼容支持了zipkin,只需要在pom文件中引入相应的依赖即可。
2.2 相关概念
- Spring Cloud Sleuth为Spring Cloud提供了分布式追踪解决方案。它大量借用了Google的Dapper的设计。需要先了解一下Sleuth中的术语和相关概念。
- Spring Cloud Sleuth采用的是Google的开源项目Dapper的专业术语。
- Span:基本工作单元,例如:在一个新建的span中发送一个RPC等同于发送一次回应请求给RPC,span通过一个64位ID唯一标识,trace以另一个64位ID标识,span还有其他数据信息,比如摘要、时间戳事件、关键值注释(tags)、span的ID以及进度ID(通常是IP地址),span在不断的启动和停止的同时记录了时间信息,当你创建了一个span,你必须在未来的某个时刻停止它。
- Trace:一系列span组成的一个树状结构。例如,当你正在跑一个分布式大数据工程,你可能需要创建Trace。
- Annotation:用来及时记录一个事件的存在,一些核心annotations用来定义一个请求的开始和结束。
- cs-Client Server:客户端发送一个请求,这个annotation描述了这个span的开始。
- sr-Server Received:服务端获得请求并准备开始处理它,如果将其srj减去cs时间戳便可得到网络延迟。
- ss-Server Sent:注解表明请求处理的完成(当请求返回客户端),如果ss减去sr时间戳便可得到服务端需要的处理请求时间。
- cr-Client Received:表明span的结束,客户端成功接收到服务端的回复,如果cr减去cs时间戳便可得到客户端从服务端获取回复的所有所需时间。

3 链路追踪Sleuth入门
3.1 在网关层、订单微服务、商品微服务导入Sleuth的依赖
<dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-sleuth</artifactId></dependency>
<?xml version="1.0" encoding="UTF-8"?><project xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://maven.apache.org/POM/4.0.0" xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd"> <parent> <artifactId>spring_cloud_demo</artifactId> <groupId>org.sunxiaping</groupId> <version>1.0</version> </parent> <modelVersion>4.0.0</modelVersion> <artifactId>api_gateway_server7007</artifactId> <dependencies> <!-- Spring Cloud Gateway使用的web框架是webflux,和SpringMVC不兼容。引入的限流组件是Hystrix。Redis底层不再使用jedis,而是lettuce。 --> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-gateway</artifactId> </dependency> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-netflix-eureka-client</artifactId> </dependency> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-sleuth</artifactId> </dependency> </dependencies></project>
<?xml version="1.0" encoding="UTF-8"?><project xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://maven.apache.org/POM/4.0.0" xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd"> <parent> <artifactId>spring_cloud_demo</artifactId> <groupId>org.sunxiaping</groupId> <version>1.0</version> </parent> <modelVersion>4.0.0</modelVersion> <artifactId>order_service8003</artifactId> <dependencies> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-openfeign</artifactId> </dependency> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-web</artifactId> </dependency> <!-- 导入Eureka Client对应的坐标 --> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-netflix-eureka-client</artifactId> </dependency> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-actuator</artifactId> </dependency> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-sleuth</artifactId> </dependency> </dependencies></project>
<?xml version="1.0" encoding="UTF-8"?><project xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://maven.apache.org/POM/4.0.0" xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd"> <parent> <artifactId>spring_cloud_demo</artifactId> <groupId>org.sunxiaping</groupId> <version>1.0</version> </parent> <modelVersion>4.0.0</modelVersion> <artifactId>product_service9004</artifactId> <dependencies> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-web</artifactId> </dependency> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-data-jpa</artifactId> </dependency> <dependency> <groupId>mysql</groupId> <artifactId>mysql-connector-java</artifactId> </dependency> <!-- 导入Eureka Client对应的坐标 --> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-netflix-eureka-client</artifactId> </dependency> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-actuator</artifactId> </dependency> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-sleuth</artifactId> </dependency> </dependencies></project>
3.2 修改网关层、订单微服务、商品微服务的配置文件
logging: level: root: INFO org.springframework.web.servlet.DispatcherServlet: DEBUG org.springframework.cloud.sleuth: DEBUG
server: port: 7007spring: application: name: api-gateway-server # 配置 Spring Cloud Gateway cloud: gateway: discovery: locator: enabled: true # 开启从注册中心动态创建路由的功能,利用微服务名进行路由 lower-case-service-id: true # 微服务名称以小写形式呈现 routes: # 配置路由: 路由id,路由到微服务的uri,断言(判断条件) - id: product-service # 路由id # uri: http://localhost:9004 uri: lb://service-product # 路由到微服务的uri。 lb://xxx,lb代表从注册中心获取服务列表,xxx代表需要转发的微服务的名称 predicates: # 断言(判断条件) # - Path=/product/** - Path=/product-service/** filters: # 配置路由过滤器 http://localhost:7007/product-service/product/findById/1 --> http://localhost:7007/product/findById/1 - RewritePath=/product-service/(?<segment>.*), /$\{segment} # 路径重写的过滤器# 配置 eurekaeureka: instance: # 主机名称:服务名称修改,其实就是向eureka server中注册的实例id instance-id: api-gateway-server:${server.port} # 显示IP信息 prefer-ip-address: true client: service-url: # 此处修改为 Eureka Server的集群地址 defaultZone: http://eureka7001.com:7001/eureka/,http://eureka7002.com:7002/eureka/,http://eureka7003.com:7003/eureka/logging: level: root: INFO org.springframework.web.servlet.DispatcherServlet: DEBUG org.springframework.cloud.sleuth: DEBUG org.springframework.cloud.gateway: trace org.springframework.http.server.reactive: debug org.springframework.web.reactive: debug reactor.ipc.netty: debug
server: port: 8003 # 微服务的端口号spring: application: name: service-order # 微服务的名称# 配置 eurekaeureka: instance: # 主机名称:服务名称修改,其实就是向eureka server中注册的实例id instance-id: service-order:${server.port} # 显示IP信息 prefer-ip-address: true client: service-url: # 此处修改为 Eureka Server的集群地址 defaultZone: http://eureka7001.com:7001/eureka/,http://eureka7002.com:7002/eureka/,http://eureka7003.com:7003/eureka/feign: hystrix: # 开启Feign中的Hystrix enabled: true# 暴露所有端点management: endpoints: web: exposure: include: '*'hystrix: command: default: execution: isolation: thread: timeoutInMilliseconds: 3000 # 默认的连接超时时间为1秒,如果1秒没有返回数据,就自动触发降级逻辑# 微服务info内容详细信息info: app.name: xxx company.name: xxx build.artifactId: $project.artifactId$ build.version: $project.version$logging: level: root: INFO org.springframework.web.servlet.DispatcherServlet: DEBUG org.springframework.cloud.sleuth: DEBUG org.springframework.cloud.gateway: trace org.springframework.http.server.reactive: debug org.springframework.web.reactive: debug reactor.ipc.netty: debug
server: port: 9004 # 微服务的端口号spring: application: name: service-product # 微服务的名称 datasource: url: jdbc:mysql://192.168.217.100:3306/test?useUnicode=true&characterEncoding=UTF-8&autoReconnect=true&useSSL=false&serverTimezone=GMT%2B8&allowPublicKeyRetrieval=true driver-class-name: com.mysql.cj.jdbc.Driver username: root password: 123456 jpa: generate-ddl: true show-sql: true open-in-view: true database: mysql# 配置 eurekaeureka: instance: # 主机名称:服务名称修改,其实就是向eureka server中注册的实例id instance-id: service-product:${server.port} # 显示IP信息 prefer-ip-address: true client: service-url: # 此处修改为 Eureka Server的集群地址 defaultZone: http://eureka7001.com:7001/eureka/,http://eureka7002.com:7002/eureka/,http://eureka7003.com:7003/eureka/logging: level: root: INFO org.springframework.web.servlet.DispatcherServlet: DEBUG org.springframework.cloud.sleuth: DEBUG org.springframework.cloud.gateway: trace org.springframework.http.server.reactive: debug org.springframework.web.reactive: debug reactor.ipc.netty: debug# 微服务info内容详细信息info: app.name: xxx company.name: xxx build.artifactId: $project.artifactId$ build.version: $project.version$
3.3 重启网关层、订单微服务、商品微服务
- 重启之后,我们可以在控制台观察到Sleuth的日志输出。

- 其中,
81a807d076c2a9f6是TraceId,后面跟着的是SpanId,依次调用有一个全局的TranceId,将调用链路串起来。仔细分析每个微服务的日志,不难看出请求的具体过程。 - 查看日志文件并不是一个很好的方法,当微服务越来越多日志文件也会越来越多,通过ZipKin可以将日志聚合,并进行可视化展示和全文检索。
4 ZipKin概述
- Zipkin是Twitter的一个开源项目,它基于Google Dapper实现,它致力于收集服务的定时数据,以解决微服务架构中的延迟问题,包括数据的收集、存储、查找和展现。我们可以使用它来收集各个服务器上请求链路的跟踪数据,并通过它提供的REST API接口来辅助我们查询跟踪以实现分布式系统的监控程序,从而及时的发现系统中出现的延迟升高的问题并找出系统性能瓶颈的根源。除了面向开发的API接口之外,它也提供了方便的UI组件来帮助我们直观的搜索跟踪信息和分析请求链路明细,比如:可以查询某段时间内各用户请求的处理时间等。Zipkin提供了可插拔数据存储方式:In-Memory、MySQL、Cassandra以及ElasticSearch。

- 上图展示了Zipkin的基础架构,它主要由4个核心组件构成:
Collector:收集器组件,它主要用于处理从外部系统发送过来的跟踪信息,将这些信息转换为Zipkin内部处理的span格式,以支持后续的存储、分析、展示等功能。Storage:存储组件,它主要对处理收集器接收到的跟踪信息,默认会将这些信息存储在内存中,我们也可以修改此存储策略,将跟踪信息存储到数据库中。Restful API:API组件,它主要用来提供外部访问接口。比如给客户端展示跟踪信息,或者外接系统访问以实现监控等。Web UI:UI组件,基于API组件实现的上层应用。通过UI组件用户可以方便而直观的查询和分析跟踪信息。
- Zipkin分为两端,一个是Zipkin服务端,一个是Zipkin客户端,客户端也就是微服务的应用。客户端会配置服务端的URL地址,一旦发生服务间的调用时候,会被配置在微服务里面的Sleuth的监听器监听,并生成相应的Trace和Span信息发送给服务端。
- 发送的方式有两种:一种是HTTP报文的方式,另外一种是消息总线的方式如RabbitMQ。
- 不论哪种方式,我们都需要:
- 一个Eureka服务注册中心。
- 一个Zipkin服务端。
- 多个微服务,这些微服务中配置了Zipkin客户端。
5 Zipkin Server的部署和配置
5.1 Zipkin Server下载
- 从SpringBoot2.0开始,官方就不再支持使用自建Zipkin Server的方式进行服务链路追踪,而是直接提供了编译好的jar包来给我们使用。可以从官方网站上下载Zipkin的Web UI。
5.2 启动
- 在命令行输入
java -jar zipkin-server-2.12.9-exec.jar启动Zipkin Server。

5.3 使用Docker启动Zipkin Server
docker run -d -p 9411:9411 --name zipkin openzipkin/zipkin:2.12.9
6 客户端Zipkin和Sleuth整合
6.1 概述
- 通过查看日志分析微服务的调用链路并不是一个很直观的方案,结合Zipkin可以很直观的显示微服务之间的调用关系。
6.2 客户端Zipkin和Sleuth整合
6.2.1 网关层、订单微服务和商品微服务添加Zipkin的依赖
<dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-zipkin</artifactId></dependency>
<?xml version="1.0" encoding="UTF-8"?><project xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://maven.apache.org/POM/4.0.0" xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd"> <parent> <artifactId>spring_cloud_demo</artifactId> <groupId>org.sunxiaping</groupId> <version>1.0</version> </parent> <modelVersion>4.0.0</modelVersion> <artifactId>api_gateway_server7007</artifactId> <dependencies> <!-- Spring Cloud Gateway使用的web框架是webflux,和SpringMVC不兼容。引入的限流组件是Hystrix。Redis底层不再使用jedis,而是lettuce。 --> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-gateway</artifactId> </dependency> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-netflix-eureka-client</artifactId> </dependency> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-sleuth</artifactId> </dependency> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-zipkin</artifactId> </dependency> </dependencies></project>
<?xml version="1.0" encoding="UTF-8"?><project xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://maven.apache.org/POM/4.0.0" xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd"> <parent> <artifactId>spring_cloud_demo</artifactId> <groupId>org.sunxiaping</groupId> <version>1.0</version> </parent> <modelVersion>4.0.0</modelVersion> <artifactId>order_service8003</artifactId> <dependencies> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-openfeign</artifactId> </dependency> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-web</artifactId> </dependency> <!-- 导入Eureka Client对应的坐标 --> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-netflix-eureka-client</artifactId> </dependency> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-actuator</artifactId> </dependency> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-sleuth</artifactId> </dependency> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-zipkin</artifactId> </dependency> </dependencies></project>
<?xml version="1.0" encoding="UTF-8"?><project xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://maven.apache.org/POM/4.0.0" xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd"> <parent> <artifactId>spring_cloud_demo</artifactId> <groupId>org.sunxiaping</groupId> <version>1.0</version> </parent> <modelVersion>4.0.0</modelVersion> <artifactId>product_service9004</artifactId> <dependencies> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-web</artifactId> </dependency> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-data-jpa</artifactId> </dependency> <dependency> <groupId>mysql</groupId> <artifactId>mysql-connector-java</artifactId> </dependency> <!-- 导入Eureka Client对应的坐标 --> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-netflix-eureka-client</artifactId> </dependency> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-actuator</artifactId> </dependency> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-sleuth</artifactId> </dependency> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-zipkin</artifactId> </dependency> </dependencies></project>
6.2.2 网关层、订单微服务和商品微服务修改配置文件
spring: # zipkin zipkin: base-url: http://192.168.217.100:9411/ # Zipkin Server端的请求地址 sender: type: web # 数据的传输方式,以HTTP的形式向Zipkin Server端发送数据 sleuth: sampler: probability: 1 # 采样比 默认为0.1,即10%,这里配置1,是记录全部的sleuth信息,是为了收集到更多的数据(仅供测试使用)
server: port: 7007spring: application: name: api-gateway-server # 配置 Spring Cloud Gateway cloud: gateway: discovery: locator: enabled: true # 开启从注册中心动态创建路由的功能,利用微服务名进行路由 lower-case-service-id: true # 微服务名称以小写形式呈现 routes: # 配置路由: 路由id,路由到微服务的uri,断言(判断条件) - id: product-service # 路由id # uri: http://localhost:9004 uri: lb://service-product # 路由到微服务的uri。 lb://xxx,lb代表从注册中心获取服务列表,xxx代表需要转发的微服务的名称 predicates: # 断言(判断条件) # - Path=/product/** - Path=/product-service/** filters: # 配置路由过滤器 http://localhost:7007/product-service/product/findById/1 --> http://localhost:7007/product/findById/1 - RewritePath=/product-service/(?<segment>.*), /$\{segment} # 路径重写的过滤器 # zipkin zipkin: base-url: http://192.168.217.100:9411/ # Zipkin Server端的请求地址 sender: type: web # 数据的传输方式,以HTTP的形式向Zipkin Server端发送数据 sleuth: sampler: probability: 1 # 采样比 默认为0.1,即10%,这里配置1,是记录全部的sleuth信息,是为了收集到更多的数据(仅供测试使用)# 配置 eurekaeureka: instance: # 主机名称:服务名称修改,其实就是向eureka server中注册的实例id instance-id: api-gateway-server:${server.port} # 显示IP信息 prefer-ip-address: true client: service-url: # 此处修改为 Eureka Server的集群地址 defaultZone: http://eureka7001.com:7001/eureka/,http://eureka7002.com:7002/eureka/,http://eureka7003.com:7003/eureka/logging: level: root: INFO org.springframework.web.servlet.DispatcherServlet: DEBUG org.springframework.cloud.sleuth: DEBUG org.springframework.cloud.gateway: trace org.springframework.http.server.reactive: debug org.springframework.web.reactive: debug reactor.ipc.netty: debug
server: port: 8003 # 微服务的端口号spring: application: name: service-order # 微服务的名称 # zipkin zipkin: base-url: http://192.168.217.100:9411/ # Zipkin Server端的请求地址 sender: type: web # 数据的传输方式,以HTTP的形式向Zipkin Server端发送数据 sleuth: sampler: probability: 1 # 采样比 默认为0.1,即10%,这里配置1,是记录全部的sleuth信息,是为了收集到更多的数据(仅供测试使用)# 配置 eurekaeureka: instance: # 主机名称:服务名称修改,其实就是向eureka server中注册的实例id instance-id: service-order:${server.port} # 显示IP信息 prefer-ip-address: true client: service-url: # 此处修改为 Eureka Server的集群地址 defaultZone: http://eureka7001.com:7001/eureka/,http://eureka7002.com:7002/eureka/,http://eureka7003.com:7003/eureka/feign: hystrix: # 开启Feign中的Hystrix enabled: true# 暴露所有端点management: endpoints: web: exposure: include: '*'hystrix: command: default: execution: isolation: thread: timeoutInMilliseconds: 3000 # 默认的连接超时时间为1秒,如果1秒没有返回数据,就自动触发降级逻辑# 微服务info内容详细信息info: app.name: xxx company.name: xxx build.artifactId: $project.artifactId$ build.version: $project.version$logging: level: root: INFO org.springframework.web.servlet.DispatcherServlet: DEBUG org.springframework.cloud.sleuth: DEBUG org.springframework.cloud.gateway: trace org.springframework.http.server.reactive: debug org.springframework.web.reactive: debug reactor.ipc.netty: debug
server: port: 9004 # 微服务的端口号spring: application: name: service-product # 微服务的名称 datasource: url: jdbc:mysql://192.168.217.100:3306/test?useUnicode=true&characterEncoding=UTF-8&autoReconnect=true&useSSL=false&serverTimezone=GMT%2B8&allowPublicKeyRetrieval=true driver-class-name: com.mysql.cj.jdbc.Driver username: root password: 123456 jpa: generate-ddl: true show-sql: true open-in-view: true database: mysql # zipkin zipkin: base-url: http://192.168.217.100:9411/ # Zipkin Server端的请求地址 sender: type: web # 数据的传输方式,以HTTP的形式向Zipkin Server端发送数据 sleuth: sampler: probability: 1 # 采样比 默认为0.1,即10%,这里配置1,是记录全部的sleuth信息,是为了收集到更多的数据(仅供测试使用)# 配置 eurekaeureka: instance: # 主机名称:服务名称修改,其实就是向eureka server中注册的实例id instance-id: service-product:${server.port} # 显示IP信息 prefer-ip-address: true client: service-url: # 此处修改为 Eureka Server的集群地址 defaultZone: http://eureka7001.com:7001/eureka/,http://eureka7002.com:7002/eureka/,http://eureka7003.com:7003/eureka/logging: level: root: INFO org.springframework.web.servlet.DispatcherServlet: DEBUG org.springframework.cloud.sleuth: DEBUG org.springframework.cloud.gateway: trace org.springframework.http.server.reactive: debug org.springframework.web.reactive: debug reactor.ipc.netty: debug# 微服务info内容详细信息info: app.name: xxx company.name: xxx build.artifactId: $project.artifactId$ build.version: $project.version$
6.2.3 测试
- 分别重启网关层、订单微服务和商品微服务,通过浏览器发送一次微服务请求。打开Zipkin的Web UI控制台,我们可以根据条件追踪每次请求调用过程。


7 分析Zipkin整合Sleuth的问题

- 由上图可知:
链路数据如何持久化保存(当前数据保存在Zipkin服务端的内存中,断电易丢失)。
如何优化数据采集过程(HTTP方式是同步阻塞方式,一旦出现网络波动等情况,可能会波及业务系统)。
8 存储跟踪数据
8.1 概述
- Zipkin Server默认是将追踪数据信息保存到内存中,这种方式不适合生产环境。因为一旦Zipkin Server端关闭重启或者服务崩溃,就会导致历史数据小时。Zipkin支持将追踪数据持久化到MySQL数据库中或者存储到ElasticSearch中。
8.2 准备数据库
- 可以从官网中找到Zipkin Server持久化的MySQL数据库脚本:
SET NAMES utf8mb4;SET FOREIGN_KEY_CHECKS = 0;-- ------------------------------ Table structure for zipkin_annotations-- ----------------------------DROP TABLE IF EXISTS `zipkin_annotations`;CREATE TABLE `zipkin_annotations` ( `trace_id_high` bigint(20) NOT NULL DEFAULT 0 COMMENT 'If non zero, this means the trace uses 128 bit traceIds instead of 64 bit', `trace_id` bigint(20) NOT NULL COMMENT 'coincides with zipkin_spans.trace_id', `span_id` bigint(20) NOT NULL COMMENT 'coincides with zipkin_spans.id', `a_key` varchar(255) CHARACTER SET utf8mb4 COLLATE utf8mb4_general_ci NOT NULL COMMENT 'BinaryAnnotation.key or Annotation.value if type == -1', `a_value` blob NULL COMMENT 'BinaryAnnotation.value(), which must be smaller than 64KB', `a_type` int(11) NOT NULL COMMENT 'BinaryAnnotation.type() or -1 if Annotation', `a_timestamp` bigint(20) NULL DEFAULT NULL COMMENT 'Used to implement TTL; Annotation.timestamp or zipkin_spans.timestamp', `endpoint_ipv4` int(11) NULL DEFAULT NULL COMMENT 'Null when Binary/Annotation.endpoint is null', `endpoint_ipv6` binary(16) NULL DEFAULT NULL COMMENT 'Null when Binary/Annotation.endpoint is null, or no IPv6 address', `endpoint_port` smallint(6) NULL DEFAULT NULL COMMENT 'Null when Binary/Annotation.endpoint is null', `endpoint_service_name` varchar(255) CHARACTER SET utf8mb4 COLLATE utf8mb4_general_ci NULL DEFAULT NULL COMMENT 'Null when Binary/Annotation.endpoint is null', UNIQUE INDEX `trace_id_high`(`trace_id_high`, `trace_id`, `span_id`, `a_key`, `a_timestamp`) USING BTREE COMMENT 'Ignore insert on duplicate', INDEX `trace_id_high_2`(`trace_id_high`, `trace_id`, `span_id`) USING BTREE COMMENT 'for joining with zipkin_spans', INDEX `trace_id_high_3`(`trace_id_high`, `trace_id`) USING BTREE COMMENT 'for getTraces/ByIds', INDEX `endpoint_service_name`(`endpoint_service_name`) USING BTREE COMMENT 'for getTraces and getServiceNames', INDEX `a_type`(`a_type`) USING BTREE COMMENT 'for getTraces and autocomplete values', INDEX `a_key`(`a_key`) USING BTREE COMMENT 'for getTraces and autocomplete values', INDEX `trace_id`(`trace_id`, `span_id`, `a_key`) USING BTREE COMMENT 'for dependencies job') ENGINE = InnoDB CHARACTER SET = utf8mb4 COLLATE = utf8mb4_general_ci ROW_FORMAT = Compressed;-- ------------------------------ Table structure for zipkin_dependencies-- ----------------------------DROP TABLE IF EXISTS `zipkin_dependencies`;CREATE TABLE `zipkin_dependencies` ( `day` date NOT NULL, `parent` varchar(255) CHARACTER SET utf8mb4 COLLATE utf8mb4_general_ci NOT NULL, `child` varchar(255) CHARACTER SET utf8mb4 COLLATE utf8mb4_general_ci NOT NULL, `call_count` bigint(20) NULL DEFAULT NULL, `error_count` bigint(20) NULL DEFAULT NULL, PRIMARY KEY (`day`, `parent`, `child`) USING BTREE) ENGINE = InnoDB CHARACTER SET = utf8mb4 COLLATE = utf8mb4_general_ci ROW_FORMAT = Compressed;-- ------------------------------ Table structure for zipkin_spans-- ----------------------------DROP TABLE IF EXISTS `zipkin_spans`;CREATE TABLE `zipkin_spans` ( `trace_id_high` bigint(20) NOT NULL DEFAULT 0 COMMENT 'If non zero, this means the trace uses 128 bit traceIds instead of 64 bit', `trace_id` bigint(20) NOT NULL, `id` bigint(20) NOT NULL, `name` varchar(255) CHARACTER SET utf8mb4 COLLATE utf8mb4_general_ci NOT NULL, `remote_service_name` varchar(255) CHARACTER SET utf8mb4 COLLATE utf8mb4_general_ci NULL DEFAULT NULL, `parent_id` bigint(20) NULL DEFAULT NULL, `debug` bit(1) NULL DEFAULT NULL, `start_ts` bigint(20) NULL DEFAULT NULL COMMENT 'Span.timestamp(): epoch micros used for endTs query and to implement TTL', `duration` bigint(20) NULL DEFAULT NULL COMMENT 'Span.duration(): micros used for minDuration and maxDuration query', PRIMARY KEY (`trace_id_high`, `trace_id`, `id`) USING BTREE, INDEX `trace_id_high`(`trace_id_high`, `trace_id`) USING BTREE COMMENT 'for getTracesByIds', INDEX `name`(`name`) USING BTREE COMMENT 'for getTraces and getSpanNames', INDEX `remote_service_name`(`remote_service_name`) USING BTREE COMMENT 'for getTraces and getRemoteServiceNames', INDEX `start_ts`(`start_ts`) USING BTREE COMMENT 'for getTraces ordering and range') ENGINE = InnoDB CHARACTER SET = utf8mb4 COLLATE = utf8mb4_general_ci ROW_FORMAT = Compressed;SET FOREIGN_KEY_CHECKS = 1;
8.3 命令行方式配置启动服务端
# STORAGE_TYPE 存储类型 # MYSQL_HOST MySQL主机地址# MYSQL_TCP_PORT MySQL端口# MYSQL_DB MySQL数据库名称# MYSQL_USER MySQL用户名# MYSQL_PASS MySQL地址java -jar zipkin-server-2.12.9-exec.jar --STORAGE_TYPE=mysql --MYSQL_HOST=192.168.217.100 --MYSQL_TCP_PORT=3306 --MYSQL_DB=zipkin --MYSQL_USER=root --MYSQL_PASS=123456
8.4 Docker方式配置启动服务端
docker run -d \-p 9411:9411 \--restart always \-v /etc/localtime:/etc/localtime:ro \-e MYSQL_USER=root \-e MYSQL_PASS=123456 \-e MYSQL_HOST=192.168.217.100 \-e STORAGE_TYPE=mysql \-e MYSQL_DB=zipkin \-e MYSQL_TCP_PORT=3306 \--name zipkin \openzipkin/zipkin:2.12.9
8.5 测试
- 配置好服务daunt之后,可以在浏览器中请求几次。回到数据库查看,会发现数据已经持久化到MySQL中了。

9 基于消息中间件收集数据
9.1 概述
- 在默认情况下,Zipkin客户端和Zipkin服务端之间是使用HTTP请求的方式进行通信(即同步的请求方式)。
- 在网络波动,Zipkin服务端异常等情况下,可能会存在信息收集不机制的问题。
- Zipkin支持和RabbitMQ整合完整异步消息传输。

9.2 步骤
准备RabbitMQ中间件。
修改Zipkin客户端,将消息发送到MQ服务器。
修改Zipkin服务daunt,从MQ中拉取消息。
9.3 RabbitMQ的安装和启动
docker run -d --name rabbit -p 15672:15672 -p 5672:5672 rabbitmq:management
9.4 命令行方式配置启动服务端
# STORAGE_TYPE 存储类型 # MYSQL_HOST MySQL主机地址# MYSQL_TCP_PORT MySQL端口# MYSQL_DB MySQL数据库名称# MYSQL_USER MySQL用户名# MYSQL_PASS MySQL地址# RABBIT_ADDRESSES 指定Rabbitmq的地址# RABBIT_USER 用户名(默认为guest)# RABBIT_PASSWORD 密码(默认为guest)java -jar zipkin-server-2.12.9-exec.jar --STORAGE_TYPE=mysql --MYSQL_HOST=192.168.217.100 --MYSQL_TCP_PORT=3306 --MYSQL_DB=zipkin --MYSQL_USER=root --MYSQL_PASS=123456 --RABBIT_ADDRESSES=192.168.217.100:5672 --RABBIT_USER=guest --RABBIT_PASSWORD=guest
9.5 Docker方式启动配置启动服务端
docker run -d \-p 9411:9411 \--restart always \-v /etc/localtime:/etc/localtime:ro \-e MYSQL_USER=root \-e MYSQL_PASS=123456 \-e MYSQL_HOST=192.168.217.100 \-e STORAGE_TYPE=mysql \-e MYSQL_DB=zipkin \-e MYSQL_TCP_PORT=3306 \-e RABBIT_ADDRESSES=192.168.217.100:5672 \-e RABBIT_USER=guest \-e RABBIT_PASSWORD=guest \--name zipkin \openzipkin/zipkin:2.12.9

9.6 客户端配置
9.6.1 导入相关jar包的Maven坐标
<dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-sleuth</artifactId></dependency><dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-zipkin</artifactId></dependency><dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-sleuth-zipkin</artifactId></dependency><dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-amqp</artifactId></dependency>
<?xml version="1.0" encoding="UTF-8"?><project xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://maven.apache.org/POM/4.0.0" xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd"> <parent> <artifactId>spring_cloud_demo</artifactId> <groupId>org.sunxiaping</groupId> <version>1.0</version> </parent> <modelVersion>4.0.0</modelVersion> <artifactId>api_gateway_server7007</artifactId> <dependencies> <!-- Spring Cloud Gateway使用的web框架是webflux,和SpringMVC不兼容。引入的限流组件是Hystrix。Redis底层不再使用jedis,而是lettuce。 --> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-gateway</artifactId> </dependency> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-netflix-eureka-client</artifactId> </dependency> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-sleuth</artifactId> </dependency> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-zipkin</artifactId> </dependency> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-sleuth-zipkin</artifactId> </dependency> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-amqp</artifactId> </dependency> </dependencies></project>
<?xml version="1.0" encoding="UTF-8"?><project xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://maven.apache.org/POM/4.0.0" xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd"> <parent> <artifactId>spring_cloud_demo</artifactId> <groupId>org.sunxiaping</groupId> <version>1.0</version> </parent> <modelVersion>4.0.0</modelVersion> <artifactId>order_service8003</artifactId> <dependencies> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-openfeign</artifactId> </dependency> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-web</artifactId> </dependency> <!-- 导入Eureka Client对应的坐标 --> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-netflix-eureka-client</artifactId> </dependency> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-actuator</artifactId> </dependency> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-sleuth</artifactId> </dependency> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-zipkin</artifactId> </dependency> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-sleuth-zipkin</artifactId> </dependency> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-amqp</artifactId> </dependency> </dependencies></project>
<?xml version="1.0" encoding="UTF-8"?><project xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://maven.apache.org/POM/4.0.0" xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd"> <parent> <artifactId>spring_cloud_demo</artifactId> <groupId>org.sunxiaping</groupId> <version>1.0</version> </parent> <modelVersion>4.0.0</modelVersion> <artifactId>product_service9004</artifactId> <dependencies> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-web</artifactId> </dependency> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-data-jpa</artifactId> </dependency> <dependency> <groupId>mysql</groupId> <artifactId>mysql-connector-java</artifactId> </dependency> <!-- 导入Eureka Client对应的坐标 --> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-netflix-eureka-client</artifactId> </dependency> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-actuator</artifactId> </dependency> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-sleuth</artifactId> </dependency> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-starter-zipkin</artifactId> </dependency> <dependency> <groupId>org.springframework.cloud</groupId> <artifactId>spring-cloud-sleuth-zipkin</artifactId> </dependency> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-amqp</artifactId> </dependency> </dependencies></project>
9.6.2 配置消息中间件的地址
spring: # zipkin zipkin: # base-url: http://192.168.217.100:9411/ # Zipkin Server端的请求地址 sender: type: rabbit # type: web # 数据的传输方式,以HTTP的形式向Zipkin Server端发送数据 sleuth: sampler: probability: 1 # 采样比 默认为0.1,即10%,这里配置1,是记录全部的sleuth信息,是为了收集到更多的数据(仅供测试使用) rabbitmq: host: 192.168.217.100 port: 5672 username: guest password: guest # 重试机制 listener: direct: retry: enabled: true simple: retry: enabled: true
server: port: 7007spring: application: name: api-gateway-server # 配置 Spring Cloud Gateway cloud: gateway: discovery: locator: enabled: true # 开启从注册中心动态创建路由的功能,利用微服务名进行路由 lower-case-service-id: true # 微服务名称以小写形式呈现 routes: # 配置路由: 路由id,路由到微服务的uri,断言(判断条件) - id: product-service # 路由id # uri: http://localhost:9004 uri: lb://service-product # 路由到微服务的uri。 lb://xxx,lb代表从注册中心获取服务列表,xxx代表需要转发的微服务的名称 predicates: # 断言(判断条件) # - Path=/product/** - Path=/product-service/** filters: # 配置路由过滤器 http://localhost:7007/product-service/product/findById/1 --> http://localhost:7007/product/findById/1 - RewritePath=/product-service/(?<segment>.*), /$\{segment} # 路径重写的过滤器 # zipkin zipkin: # base-url: http://192.168.217.100:9411/ # Zipkin Server端的请求地址 sender: type: rabbit # type: web # 数据的传输方式,以HTTP的形式向Zipkin Server端发送数据 sleuth: sampler: probability: 1 # 采样比 默认为0.1,即10%,这里配置1,是记录全部的sleuth信息,是为了收集到更多的数据(仅供测试使用) rabbitmq: host: 192.168.217.100 port: 5672 username: guest password: guest # 重试机制 listener: direct: retry: enabled: true simple: retry: enabled: true# 配置 eurekaeureka: instance: # 主机名称:服务名称修改,其实就是向eureka server中注册的实例id instance-id: api-gateway-server:${server.port} # 显示IP信息 prefer-ip-address: true client: service-url: # 此处修改为 Eureka Server的集群地址 defaultZone: http://eureka7001.com:7001/eureka/,http://eureka7002.com:7002/eureka/,http://eureka7003.com:7003/eureka/logging: level: root: INFO org.springframework.web.servlet.DispatcherServlet: DEBUG org.springframework.cloud.sleuth: DEBUG org.springframework.cloud.gateway: trace org.springframework.http.server.reactive: debug org.springframework.web.reactive: debug reactor.ipc.netty: debug
server: port: 8003 # 微服务的端口号spring: application: name: service-order # 微服务的名称 # zipkin zipkin: # base-url: http://192.168.217.100:9411/ # Zipkin Server端的请求地址 sender: type: rabbit # type: web # 数据的传输方式,以HTTP的形式向Zipkin Server端发送数据 sleuth: sampler: probability: 1 # 采样比 默认为0.1,即10%,这里配置1,是记录全部的sleuth信息,是为了收集到更多的数据(仅供测试使用) rabbitmq: host: 192.168.217.100 port: 5672 username: guest password: guest # 重试机制 listener: direct: retry: enabled: true simple: retry: enabled: true# 配置 eurekaeureka: instance: # 主机名称:服务名称修改,其实就是向eureka server中注册的实例id instance-id: service-order:${server.port} # 显示IP信息 prefer-ip-address: true client: service-url: # 此处修改为 Eureka Server的集群地址 defaultZone: http://eureka7001.com:7001/eureka/,http://eureka7002.com:7002/eureka/,http://eureka7003.com:7003/eureka/feign: hystrix: # 开启Feign中的Hystrix enabled: true# 暴露所有端点management: endpoints: web: exposure: include: '*'hystrix: command: default: execution: isolation: thread: timeoutInMilliseconds: 3000 # 默认的连接超时时间为1秒,如果1秒没有返回数据,就自动触发降级逻辑# 微服务info内容详细信息info: app.name: xxx company.name: xxx build.artifactId: $project.artifactId$ build.version: $project.version$logging: level: root: INFO org.springframework.web.servlet.DispatcherServlet: DEBUG org.springframework.cloud.sleuth: DEBUG org.springframework.cloud.gateway: trace org.springframework.http.server.reactive: debug org.springframework.web.reactive: debug reactor.ipc.netty: debug
server: port: 9004 # 微服务的端口号spring: application: name: service-product # 微服务的名称 datasource: url: jdbc:mysql://192.168.217.100:3306/test?useUnicode=true&characterEncoding=UTF-8&autoReconnect=true&useSSL=false&serverTimezone=GMT%2B8&allowPublicKeyRetrieval=true driver-class-name: com.mysql.cj.jdbc.Driver username: root password: 123456 jpa: generate-ddl: true show-sql: true open-in-view: true database: mysql # zipkin zipkin: # base-url: http://192.168.217.100:9411/ # Zipkin Server端的请求地址 sender: type: rabbit # type: web # 数据的传输方式,以HTTP的形式向Zipkin Server端发送数据 sleuth: sampler: probability: 1 # 采样比 默认为0.1,即10%,这里配置1,是记录全部的sleuth信息,是为了收集到更多的数据(仅供测试使用) rabbitmq: host: 192.168.217.100 port: 5672 username: guest password: guest # 重试机制 listener: direct: retry: enabled: true simple: retry: enabled: true# 配置 eurekaeureka: instance: # 主机名称:服务名称修改,其实就是向eureka server中注册的实例id instance-id: service-product:${server.port} # 显示IP信息 prefer-ip-address: true client: service-url: # 此处修改为 Eureka Server的集群地址 defaultZone: http://eureka7001.com:7001/eureka/,http://eureka7002.com:7002/eureka/,http://eureka7003.com:7003/eureka/logging: level: root: INFO org.springframework.web.servlet.DispatcherServlet: DEBUG org.springframework.cloud.sleuth: DEBUG org.springframework.cloud.gateway: trace org.springframework.http.server.reactive: debug org.springframework.web.reactive: debug reactor.ipc.netty: debug# 微服务info内容详细信息info: app.name: xxx company.name: xxx build.artifactId: $project.artifactId$ build.version: $project.version$
9.7 测试
- 关闭Zipkin Server,并随意发送请求。打开RabbitMQ管理后台可以看到,消费已经推送到了RabbitMQ中。
- 当Zipkin Server启动时,会自动从RabbitMQ获取消息并消费,展示追踪数据。