微服务在注册中心被剔除分为两种情况,
- 一种正常的下架,即客户端发送一个Http请求告诉注册中心,这个服务可以下架
 - 另一种是,注册中心长时间没有收到某个客户端的心跳,注册中心会定时剔除
 
1.正常情况下下架
@DELETEpublic Response cancelLease(@HeaderParam(PeerEurekaNode.HEADER_REPLICATION) String isReplication) {try {// 服务下架请求boolean isSuccess = registry.cancel(app.getName(), id,"true".equals(isReplication));if (isSuccess) {logger.debug("Found (Cancel): {} - {}", app.getName(), id);return Response.ok().build();} else {logger.info("Not Found (Cancel): {} - {}", app.getName(), id);return Response.status(Status.NOT_FOUND).build();}} catch (Throwable e) {logger.error("Error (cancel): {} - {}", app.getName(), id, e);return Response.serverError().build();}}
@Overridepublic boolean cancel(final String appName, final String id,final boolean isReplication) {if (super.cancel(appName, id, isReplication)) {// 下架成功后同步replicateToPeers(Action.Cancel, appName, id, null, null, isReplication);synchronized (lock) {if (this.expectedNumberOfRenewsPerMin > 0) {// Since the client wants to cancel it, reduce the threshold (1 for 30 seconds, 2 for a minute)this.expectedNumberOfRenewsPerMin = this.expectedNumberOfRenewsPerMin - 2;this.numberOfRenewsPerMinThreshold =(int) (this.expectedNumberOfRenewsPerMin * serverConfig.getRenewalPercentThreshold());}}return true;}return false;}
public boolean cancel(String appName, String id, boolean isReplication) {return internalCancel(appName, id, isReplication);}
protected boolean internalCancel(String appName, String id, boolean isReplication) {try {read.lock();CANCEL.increment(isReplication);// 通过微服务名拿到服务组Map<String, Lease<InstanceInfo>> gMap = registry.get(appName);Lease<InstanceInfo> leaseToCancel = null;if (gMap != null) {// 根据实例id将服务费实例从服务组中剔除leaseToCancel = gMap.remove(id);}synchronized (recentCanceledQueue) {recentCanceledQueue.add(new Pair<Long, String>(System.currentTimeMillis(), appName + "(" + id + ")"));}InstanceStatus instanceStatus = overriddenInstanceStatusMap.remove(id);if (instanceStatus != null) {logger.debug("Removed instance id {} from the overridden map which has value {}", id, instanceStatus.name());}if (leaseToCancel == null) {CANCEL_NOT_FOUND.increment(isReplication);logger.warn("DS: Registry: cancel failed because Lease is not registered for: {}/{}", appName, id);return false;} else {// 记录服务被剔除的时间leaseToCancel.cancel();InstanceInfo instanceInfo = leaseToCancel.getHolder();String vip = null;String svip = null;if (instanceInfo != null) {instanceInfo.setActionType(ActionType.DELETED);recentlyChangedQueue.add(new RecentlyChangedItem(leaseToCancel));instanceInfo.setLastUpdatedTimestamp();vip = instanceInfo.getVIPAddress();svip = instanceInfo.getSecureVipAddress();}invalidateCache(appName, vip, svip);logger.info("Cancelled instance {}/{} (replication={})", appName, id, isReplication);return true;}} finally {read.unlock();}}
2.客户端发生故障下架(服务剔除)
Eureka Server会启动一个定时器(默认15分钟),定时判断注册在上面的客户端是否过期。
定时器启动后会调用AbstractInstanceRegistry类中的evict方法
public void evict() {evict(0l);}
public void evict(long additionalLeaseMs) {logger.debug("Running the evict task");if (!isLeaseExpirationEnabled()) {logger.debug("DS: lease expiration is currently disabled.");return;}// We collect first all expired items, to evict them in random order. For large eviction sets,// if we do not that, we might wipe out whole apps before self preservation kicks in. By randomizing it,// the impact should be evenly distributed across all applications.// 定义一个list集合接收过期的微服务List<Lease<InstanceInfo>> expiredLeases = new ArrayList<>();// 遍历注册中心中所有的微服务for (Entry<String, Map<String, Lease<InstanceInfo>>> groupEntry : registry.entrySet()) {Map<String, Lease<InstanceInfo>> leaseMap = groupEntry.getValue();if (leaseMap != null) {for (Entry<String, Lease<InstanceInfo>> leaseEntry : leaseMap.entrySet()) {Lease<InstanceInfo> lease = leaseEntry.getValue();if (lease.isExpired(additionalLeaseMs) && lease.getHolder() != null) {// 拿到服务实例对象,判断服务实例对象是否过期(90s没有发送心跳的微服务实例),// 过期则加入到expiredLeases.add(lease);}}}}// To compensate for GC pauses or drifting local time, we need to use current registry size as a base for// triggering self-preservation. Without that we would wipe out full registry.// 拿到所有注册在Eureka Server上的微服务实例对象的数量int registrySize = (int) getLocalRegistrySize();// 微服务数量的阈值 = registrySize * 0.85int registrySizeThreshold = (int) (registrySize * serverConfig.getRenewalPercentThreshold());// 剔除服务极限值 = registrySize * 0.15,即15%的微服务数量int evictionLimit = registrySize - registrySizeThreshold;// 要被剔除服务的数量,每次不能超过微服务数量的15%int toEvict = Math.min(expiredLeases.size(), evictionLimit);if (toEvict > 0) {logger.info("Evicting {} items (expired={}, evictionLimit={})", toEvict, expiredLeases.size(), evictionLimit);Random random = new Random(System.currentTimeMillis());for (int i = 0; i < toEvict; i++) {// Pick a random item (Knuth shuffle algorithm)int next = i + random.nextInt(expiredLeases.size() - i);Collections.swap(expiredLeases, i, next);// 随机获取过期集合中的过期的服务实例Lease<InstanceInfo> lease = expiredLeases.get(i);String appName = lease.getHolder().getAppName();String id = lease.getHolder().getId();EXPIRED.increment();logger.warn("DS: Registry: expired lease for {}/{}", appName, id);// 调用服务下架方法internalCancel(appName, id, false);}}}
步骤梳理:
- 启动一个定时器(默认15分钟一次),调用AbstractInstanceRegistry类中的evict方法
 - 定义一个过期的实例集合用于接收过期的微服务实例
 - 遍历注册中心上的所有的微服务实例,判断实例是否过期(90s没有发送心跳的微服务实例),过期则加入到过期集合中
 - 由于Eureka Server的自我保护机制,每次剔除不超过15%的注册中心的总量的过期实例
 - 随机从过期集合中剔除不超过15%的注册中心的总量的过期实例
 
