2020-12-25-Tomcat-Handling-Concurrent-HTTP-Requests-with-a-Thread-Pool

How Tomcat Handles Concurrent HTTP Requests with a Thread Pool

Studying how Tomcat handles concurrent requests led me to thread pools, locks, queues, and the Unsafe class. The code below comes mainly from the following classes.

JRE:
sun.misc.Unsafe
java.util.concurrent.ThreadPoolExecutor
java.util.concurrent.ThreadPoolExecutor.Worker
java.util.concurrent.locks.AbstractQueuedSynchronizer
java.util.concurrent.locks.AbstractQueuedLongSynchronizer
java.util.concurrent.LinkedBlockingQueue

Tomcat:
org.apache.tomcat.util.net.NioEndpoint
org.apache.tomcat.util.threads.ThreadPoolExecutor
org.apache.tomcat.util.threads.TaskThreadFactory
org.apache.tomcat.util.threads.TaskQueue

ThreadPoolExecutor

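ThreadPoolExecutor is a thread pool implementation. It manages worker threads and reuses them across tasks, which cuts the cost of creating threads and improves task throughput.

Its constructor takes the following parameters: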

public ThreadPoolExecutor(
    int corePoolSize,
    int maximumPoolSize,
    long keepAliveTime,
    TimeUnit unit,
    BlockingQueue<Runnable> workQueue,
    ThreadFactory threadFactory,
    RejectedExecutionHandler handler) {

}

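corePoolSize: the number of core threads
maximumPoolSize: the maximum number of threads
keepAliveTime: the maximum idle time of a non-core thread; a thread idle for longer is terminated
unit: the time unit of keepAliveTime
workQueue: the queue where tasks wait when there are too many to run immediately
threadFactory: the factory used to create threads
handler: the rejection policy, which decides what happens when there are too many tasks and the queue cannot accept any more. It is an interface, so you can supply your own handling.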
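To make the parameter list concrete, here is a minimal sketch that builds a plain java.util.concurrent.ThreadPoolExecutor with all seven arguments spelled out. The class name PoolConstructionDemo and the chosen sizes (2 core threads, 4 maximum, a queue of 8) are assumptions for illustration only; they are not Tomcat's settings.

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.Executors;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

// Hypothetical demo class; the sizes below are illustrative values.
class PoolConstructionDemo {
    static ThreadPoolExecutor newPool() {
        return new ThreadPoolExecutor(
                2,                                      // corePoolSize
                4,                                      // maximumPoolSize
                60L, TimeUnit.SECONDS,                  // keepAliveTime and unit
                new ArrayBlockingQueue<>(8),            // workQueue (bounded)
                Executors.defaultThreadFactory(),       // threadFactory
                new ThreadPoolExecutor.AbortPolicy());  // handler: throw on rejection
    }

    public static void main(String[] args) {
        ThreadPoolExecutor pool = newPool();
        pool.execute(() ->
                System.out.println("running on " + Thread.currentThread().getName()));
        pool.shutdown();
    }
}
```

AbortPolicy is the JDK's default handler; it is passed explicitly here only so that every constructor argument is visible.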

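ThreadPoolExecutor in Tomcat's HTTP request handling

Tomcat has its own thread pool class, org.apache.tomcat.util.threads.ThreadPoolExecutor, which extends java.util.concurrent.ThreadPoolExecutor. After Tomcat receives a remote request, it hands each request to this pool as a separate task by calling execute(Runnable). The class overrides execute to add a few features. One of them helps decide whether there are enough workers: when there are not, non-core worker threads are added.

Part of the extension code in org.apache.tomcat.util.threads.ThreadPoolExecutor: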

private final AtomicInteger submittedCount = new AtomicInteger(0); // number of submitted tasks (queued or running)

// Overrides execute(Runnable command)
public void execute(Runnable command) {
    execute(command, 0, TimeUnit.MILLISECONDS);
}

public void execute(Runnable command, long timeout, TimeUnit unit) {
    submittedCount.incrementAndGet(); // count + 1 before the task is submitted
    try {
        super.execute(command);
    } catch (RejectedExecutionException rx) {
        // ... (handling omitted in this excerpt)
    }
}

// Overrides afterExecute to add logic that runs after a task completes
@Override
protected void afterExecute(Runnable r, Throwable t) {
    if (!(t instanceof StopPooledThreadException)) {
        submittedCount.decrementAndGet(); // count - 1 after the task completes
    }
    if (t == null) {
        stopCurrentThreadIfNeeded();
    }
}

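The code above is the key part of how Tomcat's pool decides whether to add non-core threads: when workQueue.offer is called, submittedCount is one of the inputs used to decide whether to add a worker. See the workQueue.offer section below.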
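The counting idea can be tried in isolation. The following is a simplified sketch, not Tomcat's class: CountingExecutor is a hypothetical name, it uses a plain LinkedBlockingQueue, and it leaves out Tomcat's StopPooledThreadException handling and its retry logic on rejection.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.RejectedExecutionException;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical class for illustration; not Tomcat's implementation.
class CountingExecutor extends ThreadPoolExecutor {
    private final AtomicInteger submittedCount = new AtomicInteger();

    CountingExecutor(int core, int max) {
        super(core, max, 60L, TimeUnit.SECONDS, new LinkedBlockingQueue<>());
    }

    int getSubmittedCount() {
        return submittedCount.get();
    }

    @Override
    public void execute(Runnable command) {
        submittedCount.incrementAndGet();      // count the task before handing it over
        try {
            super.execute(command);
        } catch (RejectedExecutionException e) {
            submittedCount.decrementAndGet();  // the task never entered the pool
            throw e;
        }
    }

    @Override
    protected void afterExecute(Runnable r, Throwable t) {
        submittedCount.decrementAndGet();      // the task has finished
    }

    // Returns {count while tasks are blocked, count after the pool terminated}.
    static int[] demo() {
        CountingExecutor pool = new CountingExecutor(1, 1);
        CountDownLatch gate = new CountDownLatch(1);
        for (int i = 0; i < 3; i++) {
            pool.execute(() -> {
                try {
                    gate.await();
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                }
            });
        }
        int during = pool.getSubmittedCount(); // one running task plus two queued
        gate.countDown();
        pool.shutdown();
        try {
            pool.awaitTermination(5, TimeUnit.SECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return new int[] { during, pool.getSubmittedCount() };
    }
}
```

The counter covers both queued and running tasks, which is what lets a queue compare it against the current pool size.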

Initialization

org.apache.tomcat.util.net.NioEndpoint

Creating the thread pool

NioEndpoint creates the thread pool during initialization:

public void createExecutor() {
    internalExecutor = true;
    TaskQueue taskqueue = new TaskQueue();
    // TaskQueue is unbounded, so tasks can always be added and the rejection handler is effectively unused
    TaskThreadFactory tf = new TaskThreadFactory(getName() + "-exec-", daemon, getThreadPriority());
    executor = new ThreadPoolExecutor(getMinSpareThreads(), getMaxThreads(), 60, TimeUnit.SECONDS, taskqueue, tf);
    taskqueue.setParent((ThreadPoolExecutor) executor);
}

Creating the worker threads

When the pool is created, prestartAllCoreThreads() is called to create and start the core worker threads:

public int prestartAllCoreThreads() {
    int n = 0;
    while (addWorker(null, true))
        ++n;
    return n;
}

Once the number of workers reaches corePoolSize, addWorker(null, true) returns false and no more worker threads are created.
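This behavior can be observed on the JDK's own ThreadPoolExecutor. A small sketch, assuming a pool with 3 core threads (PrestartDemo and the sizes are illustrative names and values):

```java
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

// Hypothetical demo class with illustrative sizes.
class PrestartDemo {
    // Returns {pool size before, threads started, threads started by a second call, pool size after}.
    static int[] demo() {
        ThreadPoolExecutor pool = new ThreadPoolExecutor(
                3, 10, 60L, TimeUnit.SECONDS, new LinkedBlockingQueue<>());
        int before = pool.getPoolSize();               // no threads exist yet
        int started = pool.prestartAllCoreThreads();   // starts every core thread
        int again = pool.prestartAllCoreThreads();     // nothing left to start
        int after = pool.getPoolSize();
        pool.shutdown();
        return new int[] { before, started, again, after };
    }

    public static void main(String[] args) {
        System.out.println(java.util.Arrays.toString(demo())); // [0, 3, 0, 3]
    }
}
```

The second call returns 0 because the worker count already equals corePoolSize.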

addWorker also starts the worker thread:

private boolean addWorker(Runnable firstTask, boolean core) {
    // ... (the checks that decide whether a worker may be added are omitted)

    boolean workerStarted = false;
    boolean workerAdded = false;
    Worker w = null;
    try {
        w = new Worker(firstTask); // 1. create the worker and its thread
        final Thread t = w.thread;
        if (t != null) {
            final ReentrantLock mainLock = this.mainLock;
            mainLock.lock();
            try {
                // ... (run-state check omitted)
                workers.add(w);
                workerAdded = true;
            } finally {
                mainLock.unlock();
            }
            if (workerAdded) {
                t.start(); // 2. the worker was added, so start its thread
                workerStarted = true; // addWorker will return true
            }
        }
    } finally {
        if (!workerStarted)
            addWorkerFailed(w);
    }
    return workerStarted;
}

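Accepting tasks and putting them in the queue

Each client (HTTP) request results in one task submission. The call chain starts in the Poller's run method: run() -> processKey() -> processSocket() -> executor.execute()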

// org.apache.tomcat.util.net.NioEndpoint.Poller.run()
@Override
public void run() {
    // Loop until destroy() is called
    while (true) {
        // ...
        NioSocketWrapper socketWrapper = (NioSocketWrapper) sk.attachment();
        if (socketWrapper != null) {
            // 1. call processKey
            processKey(sk, socketWrapper);
        }
        // ...
    }
}

// org.apache.tomcat.util.net.NioEndpoint.Poller.processKey(SelectionKey, NioSocketWrapper)
protected void processKey(SelectionKey sk, NioSocketWrapper socketWrapper) {
    try {
        // ...
        // 2. call processSocket
        processSocket(socketWrapper, SocketEvent.OPEN_WRITE, true);
        // ...
    }
    // ... (catch blocks omitted)
}

// org.apache.tomcat.util.net.AbstractEndpoint.processSocket(SocketWrapperBase<S>, SocketEvent, boolean)
public boolean processSocket(SocketWrapperBase<S> socketWrapper,
        SocketEvent event, boolean dispatch) {
    try {
        // ...
        Executor executor = getExecutor();
        if (dispatch && executor != null) {
            executor.execute(sc); // 3. call ThreadPoolExecutor.execute to submit the new request task
        } else {
            sc.run();
        }
        // ...
    }
    // ... (catch blocks omitted)
    return true;
}

ThreadPoolExecutor.execute

Workers take tasks from the queue and run them. The code below is the logic that puts a task into the queue.

ThreadPoolExecutor.execute(Runnable) submits a task:

public void execute(Runnable command) {
    if (command == null)
        throw new NullPointerException();

    int c = ctl.get();
    // Is the worker count below the core pool size? After Tomcat's initialization this
    // condition is usually false, so no worker is added here.
    if (workerCountOf(c) < corePoolSize) {
        if (addWorker(command, true))
            return;
        c = ctl.get();
    }
    // workQueue.offer(command) adds the task to the queue
    if (isRunning(c) && workQueue.offer(command)) {
        int recheck = ctl.get();
        if (!isRunning(recheck) && remove(command))
            reject(command);
        else if (workerCountOf(recheck) == 0)
            addWorker(null, false);
    }
    else if (!addWorker(command, false)) // workQueue.offer returned false: add a non-core thread
        reject(command);
}

When Tomcat handles remote HTTP requests, workQueue.offer(command) is what finally completes the task submission.
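The order of the three branches in execute (core thread, then queue, then non-core thread, then rejection) can be watched with a deliberately tiny JDK pool. This sketch assumes 1 core thread, 2 threads at most, and a queue of capacity 1; ExecuteOrderDemo is a hypothetical name, and the bounded ArrayBlockingQueue stands in for Tomcat's TaskQueue only to make offer fail.

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.RejectedExecutionException;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;

// Hypothetical demo class; sizes are chosen so that every branch of execute is hit.
class ExecuteOrderDemo {
    // Records "poolSize/queueSize" after each accepted submission.
    static String demo() {
        ThreadPoolExecutor pool = new ThreadPoolExecutor(
                1, 2, 60L, TimeUnit.SECONDS, new ArrayBlockingQueue<>(1));
        CountDownLatch gate = new CountDownLatch(1);
        Runnable blocker = () -> {
            try {
                gate.await(); // keep the worker busy until released
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        };
        StringBuilder log = new StringBuilder();
        for (int i = 0; i < 4; i++) {
            try {
                pool.execute(blocker);
                log.append(pool.getPoolSize()).append('/')
                   .append(pool.getQueue().size()).append(' ');
            } catch (RejectedExecutionException e) {
                log.append("rejected"); // queue full and pool at maximum size
            }
        }
        gate.countDown();
        pool.shutdown();
        return log.toString();
    }

    public static void main(String[] args) {
        System.out.println(demo()); // 1/0 1/1 2/1 rejected
    }
}
```

The first task starts the core thread, the second is queued, the third finds the queue full and starts a non-core thread, and the fourth is rejected.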

workQueue.offer

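TaskQueue is a concrete BlockingQueue implementation. When offer is called, it first checks a few conditions; if TaskQueue decides there are not enough workers, it returns false so that the pool adds a worker, though not a core thread. For example, with corePoolSize = 10 and maximumPoolSize = 200, light load is normally served by the 10 core threads. Under very heavy load the pool creates at most 200 worker threads; once the work is done, the 190 non-core threads are reclaimed after sitting idle and the worker count returns to 10. The actual code behind workQueue.offer(command):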

// TaskQueue
@Override
public boolean offer(Runnable o) {
    if (parent.getSubmittedCount() <= (parent.getPoolSize())) return super.offer(o);
    // Too many tasks submitted: outstanding tasks (submittedCount) > thread count and
    // poolSize < maximumPoolSize. Returning false makes ThreadPoolExecutor call
    // addWorker(command, false) to add a worker thread.
    if (parent.getPoolSize() < parent.getMaximumPoolSize()) return false;
    return super.offer(o);
}

// super.offer: LinkedBlockingQueue
public boolean offer(E e) {
    if (e == null) throw new NullPointerException();
    final AtomicInteger count = this.count;
    if (count.get() == capacity)
        return false;
    int c = -1;
    Node<E> node = new Node<E>(e);
    final ReentrantLock putLock = this.putLock;
    putLock.lock();
    try {
        if (count.get() < capacity) {
            enqueue(node); // the task is added to the queue here
            c = count.getAndIncrement();
            if (c + 1 < capacity)
                notFull.signal();
        }
    } finally {
        putLock.unlock();
    }
    if (c == 0)
        signalNotEmpty();
    return c >= 0;
}

// Adds the task to the queue
/**
 * Links node at end of queue.
 *
 * @param node the node
 */
private void enqueue(Node<E> node) {
    // assert putLock.isHeldByCurrentThread();
    // assert last.next == null;
    last = last.next = node; // linked list: last.next = node; last = node
}
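The effect of an offer method that returns false can be reproduced outside Tomcat. Below is a simplified sketch of the same idea, not Tomcat's TaskQueue: EagerQueue and EagerQueueDemo are hypothetical names, the submitted-task counter is a plain AtomicInteger shared with an anonymous executor subclass, and the sizes (1 core thread, 3 maximum) are illustrative.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.LinkedBlockingQueue;
import java.util.concurrent.ThreadPoolExecutor;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical classes for illustration; not Tomcat's implementation.
class EagerQueueDemo {
    // Unbounded queue that refuses tasks while the pool is still allowed to grow.
    static class EagerQueue extends LinkedBlockingQueue<Runnable> {
        private final AtomicInteger submitted;
        private volatile ThreadPoolExecutor parent;

        EagerQueue(AtomicInteger submitted) { this.submitted = submitted; }

        void setParent(ThreadPoolExecutor parent) { this.parent = parent; }

        @Override
        public boolean offer(Runnable task) {
            ThreadPoolExecutor p = parent;
            if (p == null) return super.offer(task);
            int poolSize = p.getPoolSize();
            if (submitted.get() <= poolSize) return super.offer(task); // an idle worker exists
            if (poolSize < p.getMaximumPoolSize()) return false;       // makes execute add a worker
            return super.offer(task);                                  // pool is full: queue the task
        }
    }

    // Returns {pool size, queue size} after four blocking tasks were submitted.
    static int[] demo() {
        AtomicInteger submitted = new AtomicInteger();
        EagerQueue queue = new EagerQueue(submitted);
        ThreadPoolExecutor pool = new ThreadPoolExecutor(1, 3, 60L, TimeUnit.SECONDS, queue) {
            @Override public void execute(Runnable r) {
                submitted.incrementAndGet();
                super.execute(r);
            }
            @Override protected void afterExecute(Runnable r, Throwable t) {
                submitted.decrementAndGet();
            }
        };
        queue.setParent(pool);
        CountDownLatch gate = new CountDownLatch(1);
        for (int i = 0; i < 4; i++) {
            pool.execute(() -> {
                try {
                    gate.await();
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                }
            });
        }
        int[] result = { pool.getPoolSize(), queue.size() };
        gate.countDown();
        pool.shutdown();
        return result;
    }
}
```

With a plain unbounded LinkedBlockingQueue the pool would never grow past its core size, because offer would always succeed. The sketch submits from a single thread; with concurrent submitters, offer could return false just as the pool reaches its maximum, and a real implementation has to handle the resulting rejection.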

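After that the worker takes over: in its run method it calls getTask() to fetch the task submitted here and executes it.

How the thread pool processes newly submitted tasks

Once the workers have been added and tasks start arriving, every task goes into the queue because the worker count has already reached corePoolSize. Each worker's run method loops, taking tasks from the queue whenever it is not empty.

The worker's run method: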

/** Delegates main run loop to outer runWorker */
public void run() {
    runWorker(this);
}

Looping over the tasks in the queue

The loop portion of runWorker(worker):

final void runWorker(Worker w) {
    Thread wt = Thread.currentThread();
    Runnable task = w.firstTask;
    w.firstTask = null;
    w.unlock(); // allow interrupts
    boolean completedAbruptly = true;
    try {
        while (task != null || (task = getTask()) != null) { // loop, fetching tasks from the queue
            w.lock(); // acquire the worker lock
            try {
                // hook that runs before the task
                beforeExecute(wt, task);
                Throwable thrown = null; // exception capture omitted in this excerpt
                // the queued task starts executing
                task.run();
                // hook that runs after the task
                afterExecute(task, thrown);
            } finally {
                task = null;
                w.completedTasks++;
                w.unlock(); // release the worker lock
            }
        }
        completedAbruptly = false;
    } finally {
        processWorkerExit(w, completedAbruptly);
    }
}

task.run() executes the task.
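Stripped of pool bookkeeping, the worker loop is a thread that keeps taking Runnables from a blocking queue and running them. A minimal sketch of that pattern, assuming a single worker and a sentinel task (STOP) to end the loop; MiniWorkerDemo and STOP are hypothetical names, and the real ThreadPoolExecutor ends the loop through getTask() returning null instead.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;

// Hypothetical demo class; a simplified stand-in for the Worker/runWorker pair.
class MiniWorkerDemo {
    private static final Runnable STOP = () -> { }; // sentinel that ends the loop

    static List<Integer> demo() {
        BlockingQueue<Runnable> queue = new LinkedBlockingQueue<>();
        List<Integer> results = Collections.synchronizedList(new ArrayList<>());
        Thread worker = new Thread(() -> {
            try {
                Runnable task;
                // take() blocks while the queue is empty, so the loop does not spin
                while ((task = queue.take()) != STOP) {
                    task.run();
                }
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        worker.start();
        for (int i = 1; i <= 3; i++) {
            final int n = i;
            queue.offer(() -> results.add(n * n)); // submit a task
        }
        queue.offer(STOP);
        try {
            worker.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return results;
    }

    public static void main(String[] args) {
        System.out.println(demo()); // [1, 4, 9]
    }
}
```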

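Use of locks

A lock keeps a sequence of steps orderly: once a block of code is locked, only one thread at a time can execute it.

In the work queue that ThreadPoolExecutor uses (a LinkedBlockingQueue), locks mainly guarantee two things:
1. While a task is being added to the queue, no other thread can add a task until the lock is released.
2. While a task is being taken from the queue, no other thread can take a task until the lock is released.
Under high concurrency, these locks keep request processing orderly instead of chaotic.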
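The lock/try/finally shape used by the queue is the same one application code uses with ReentrantLock. A minimal sketch, assuming a shared counter incremented by four threads (LockedCounterDemo is a hypothetical name):

```java
import java.util.concurrent.locks.ReentrantLock;

// Hypothetical demo class: a counter guarded by a ReentrantLock.
class LockedCounterDemo {
    private final ReentrantLock lock = new ReentrantLock();
    private int count;

    void increment() {
        lock.lock();           // only one thread at a time gets past this point
        try {
            count++;
        } finally {
            lock.unlock();     // always release, even if the body throws
        }
    }

    int get() {
        lock.lock();
        try {
            return count;
        } finally {
            lock.unlock();
        }
    }

    static int demo() {
        LockedCounterDemo counter = new LockedCounterDemo();
        Thread[] threads = new Thread[4];
        for (int i = 0; i < threads.length; i++) {
            threads[i] = new Thread(() -> {
                for (int j = 0; j < 1000; j++) {
                    counter.increment();
                }
            });
            threads[i].start();
        }
        for (Thread t : threads) {
            try {
                t.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }
        return counter.get();
    }

    public static void main(String[] args) {
        System.out.println(demo()); // 4000
    }
}
```

Without the lock, count++ is a read-modify-write sequence that can lose updates when threads interleave.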

Locking when adding a task to the queue

public boolean offer(E e) {
    if (e == null) throw new NullPointerException();
    final AtomicInteger count = this.count;
    if (count.get() == capacity)
        return false;
    int c = -1;
    Node<E> node = new Node<E>(e);
    final ReentrantLock putLock = this.putLock;
    putLock.lock(); // acquire the lock
    try {
        if (count.get() < capacity) {
            enqueue(node);
            c = count.getAndIncrement();
            if (c + 1 < capacity)
                notFull.signal();
        }
    } finally {
        putLock.unlock(); // release the lock
    }
    if (c == 0)
        signalNotEmpty();
    return c >= 0;
}

Locking when taking a task from the queue

private Runnable getTask() {
    boolean timedOut = false; // Did the last poll() time out?
    for (;;) {
        // ... (state checks and the computation of timed are omitted)
        try {
            Runnable r = timed ?
                workQueue.poll(keepAliveTime, TimeUnit.NANOSECONDS) :
                workQueue.take(); // take one task from the queue
            if (r != null)
                return r;
            timedOut = true;
        } catch (InterruptedException retry) {
            timedOut = false;
        }
    }
}

// java.util.concurrent.LinkedBlockingQueue.take()
public E take() throws InterruptedException {
    E x;
    int c = -1;
    final AtomicInteger count = this.count;
    final ReentrantLock takeLock = this.takeLock;
    takeLock.lockInterruptibly(); // acquire the lock
    try {
        while (count.get() == 0) {
            notEmpty.await(); // wait while the queue has no tasks
        }
        x = dequeue();
        c = count.getAndDecrement();
        if (c > 1)
            notEmpty.signal();
    } finally {
        takeLock.unlock(); // release the lock
    }
    if (c == capacity)
        signalNotFull();
    return x;
}

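Other topics

volatile

This keyword is very common on member variables in concurrent code.

Its main purpose is to make a change to a shared variable by one thread visible to other threads right away.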
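A typical use is a stop flag written by one thread and polled by another. A minimal sketch (VolatileFlagDemo is a hypothetical name):

```java
// Hypothetical demo class: a volatile stop flag shared between two threads.
class VolatileFlagDemo {
    private volatile boolean running = true; // written by one thread, read by another

    void stop() {
        running = false;
    }

    // Returns true if the spinning thread noticed the flag change and exited.
    static boolean demo() {
        VolatileFlagDemo flag = new VolatileFlagDemo();
        Thread spinner = new Thread(() -> {
            while (flag.running) {   // each iteration re-reads the volatile field
                Thread.yield();
            }
        });
        spinner.start();
        flag.stop();
        try {
            spinner.join(5000);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return !spinner.isAlive();
    }
}
```

Without volatile, the Java memory model does not guarantee that the spinning thread ever sees the write to running.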

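sun.misc.Unsafe: the API behind the concurrency classes

The thread pool code uses the Unsafe class frequently. In concurrent code this class provides atomic CAS operations as well as blocking (park) and waking (unpark) threads.

sun.misc.Unsafe is a low-level class; its source is available in OpenJDK.

Atomic data operations

java.util.concurrent.locks.AbstractQueuedSynchronizer contains code that guarantees an atomic update: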

protected final boolean compareAndSetState(int expect, int update) {
    // See below for intrinsics setup to support this
    return unsafe.compareAndSwapInt(this, stateOffset, expect, update);
}

The corresponding code in the Unsafe class:

// At the lowest Java level this is a native method, implemented in C++
/**
 * Atomically update Java variable to <tt>x</tt> if it is currently
 * holding <tt>expected</tt>.
 * @return <tt>true</tt> if successful
 */
public final native boolean compareAndSwapInt(Object o, long offset,
                                              int expected,
                                              int x);

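Put simply, this method updates a value atomically. Suppose you want to modify a field of object o, where offset is the memory offset of that field within the object. To stay correct under high concurrency, the value you read must be the current one, and no other thread may change it between your read and your write.

That is, the expected value is compared with the value in memory. If expected == the value in memory, the value is updated to x and true is returned to indicate that the update succeeded.

Otherwise the expected value differs from the value in memory, which means another thread has modified it. The value is not updated to x, and false is returned to tell the caller that this atomic update failed.

Note that AbstractQueuedSynchronizer lives in the locks package: ReentrantLock is built on the Unsafe class, using both this CAS operation and the park and unpark methods described below. Because the atomic operation returns true or false, a thread can use the result to define whether it acquired the lock or not.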
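Application code normally reaches CAS through the atomic classes rather than through Unsafe. The sketch below uses AtomicInteger.compareAndSet as a stand-in for compareAndSwapInt to show both ideas from the text: a try-lock whose success is defined by the CAS result, and a retry loop that re-reads the value when another thread interfered. CasLockDemo and addWithCas are hypothetical names.

```java
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical demo class; AtomicInteger stands in for the Unsafe-based state field.
class CasLockDemo {
    private final AtomicInteger state = new AtomicInteger(0); // 0 = free, 1 = held

    // Succeeds only if the state is still 0 at the moment of the update.
    boolean tryLock() {
        return state.compareAndSet(0, 1);
    }

    void unlock() {
        state.set(0);
    }

    // CAS retry loop: re-read and retry until no other thread changed the value in between.
    static int addWithCas(AtomicInteger value, int delta) {
        for (;;) {
            int expected = value.get();
            int updated = expected + delta;
            if (value.compareAndSet(expected, updated)) {
                return updated;
            }
        }
    }

    public static void main(String[] args) {
        CasLockDemo lock = new CasLockDemo();
        System.out.println(lock.tryLock()); // true: the state changed from 0 to 1
        System.out.println(lock.tryLock()); // false: the state is already 1
        lock.unlock();
        System.out.println(addWithCas(new AtomicInteger(5), 3)); // 8
    }
}
```

This try-lock never blocks; a full lock such as ReentrantLock additionally parks threads that fail to acquire it.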

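Blocking and waking threads

ThreadPoolExecutor is designed so that when the task queue is empty, worker threads wait in a suspended state instead of being destroyed. This avoids pointless spinning, saves hardware resources, and makes better use of threads.

A worker loops fetching tasks from the queue. If the queue is empty, worker.run keeps waiting and the thread does not exit: the code calls notEmpty.await() to suspend the worker thread and put it into a queue of waiting threads (distinct from the task queue). When a new task arrives, notEmpty.signal() wakes the thread.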
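The await/signal pairing can be shown with a small hand-written queue. This is a sketch under simplifying assumptions, not LinkedBlockingQueue: it uses a single lock instead of separate put and take locks, it is unbounded, and ConditionQueueDemo is a hypothetical name.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.concurrent.atomic.AtomicReference;
import java.util.concurrent.locks.Condition;
import java.util.concurrent.locks.ReentrantLock;

// Hypothetical demo class: a minimal queue whose take() waits on a Condition.
class ConditionQueueDemo<T> {
    private final ReentrantLock lock = new ReentrantLock();
    private final Condition notEmpty = lock.newCondition();
    private final Deque<T> items = new ArrayDeque<>();

    void put(T item) {
        lock.lock();
        try {
            items.addLast(item);
            notEmpty.signal();          // wake one thread waiting in take()
        } finally {
            lock.unlock();
        }
    }

    T take() throws InterruptedException {
        lock.lock();
        try {
            while (items.isEmpty()) {   // loop guards against spurious wakeups
                notEmpty.await();       // releases the lock while waiting
            }
            return items.removeFirst();
        } finally {
            lock.unlock();
        }
    }

    static String demo() {
        ConditionQueueDemo<String> queue = new ConditionQueueDemo<>();
        AtomicReference<String> received = new AtomicReference<>();
        Thread consumer = new Thread(() -> {
            try {
                received.set(queue.take());
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        consumer.start();
        queue.put("task-1");
        try {
            consumer.join(5000);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return received.get();
    }
}
```

The consumer gets the item whether it starts waiting before or after put runs, because take() checks the queue under the lock before it waits.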

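Underneath, these map to the following methods.

park

unsafe.park() blocks (suspends) the current thread: public native void park(boolean isAbsolute, long time);

unpark

unsafe.unpark() wakes (resumes) a thread: public native void unpark(Object thread);

The two operations are paired. To block, the thread is first put into a queue and then parked; to wake, the blocked thread is taken out of the queue and unpark(thread) wakes that specific thread.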
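Outside the JDK internals, the same pair is available through java.util.concurrent.locks.LockSupport. A minimal sketch, assuming one waiting thread and a flag that tells it when to proceed (ParkDemo is a hypothetical name):

```java
import java.util.concurrent.atomic.AtomicBoolean;
import java.util.concurrent.locks.LockSupport;

// Hypothetical demo class: one thread parks until another thread unparks it.
class ParkDemo {
    // Returns true if the waiting thread was woken and finished.
    static boolean demo() {
        AtomicBoolean ready = new AtomicBoolean(false);
        AtomicBoolean woke = new AtomicBoolean(false);
        Thread waiter = new Thread(() -> {
            while (!ready.get()) {     // park may return spuriously, so re-check the flag
                LockSupport.park();    // block the current thread
            }
            woke.set(true);
        });
        waiter.start();
        ready.set(true);
        LockSupport.unpark(waiter);    // wake that specific thread
        try {
            waiter.join(5000);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return woke.get() && !waiter.isAlive();
    }
}
```

If unpark runs before the waiter reaches park, the call leaves a permit behind and the later park returns immediately, so the wakeup is not lost.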

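In java.util.concurrent.locks.AbstractQueuedLongSynchronizer.ConditionObject (AbstractQueuedSynchronizer.ConditionObject, the class ReentrantLock actually uses, contains the same code)

Thread information is stored in a linked list: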

// Adds a blocked (waiting) thread
private Node addConditionWaiter() {
    Node t = lastWaiter;
    // If lastWaiter is cancelled, clean out.
    if (t != null && t.waitStatus != Node.CONDITION) {
        unlinkCancelledWaiters();
        t = lastWaiter;
    }
    Node node = new Node(Thread.currentThread(), Node.CONDITION);
    if (t == null)
        firstWaiter = node;
    else
        t.nextWaiter = node;
    lastWaiter = node; // the newly blocked thread goes to the tail of the list
    return node;
}

// Takes out one blocked thread
public final void signal() {
    if (!isHeldExclusively())
        throw new IllegalMonitorStateException();
    Node first = firstWaiter; // the first blocked thread in the list
    if (first != null)
        doSignal(first);
}

// Once the node is obtained, wake its thread (simplified excerpt)
final boolean transferForSignal(Node node) {
    LockSupport.unpark(node.thread);
    return true;
}

// java.util.concurrent.locks.LockSupport
public static void unpark(Thread thread) {
    if (thread != null)
        UNSAFE.unpark(thread);
}

Note that park and compareAndSwapInt are two completely different things. They can be used separately or together; ReentrantLock, for example, needs both to implement locking.