Paper Info Reviews Meta-review Author Feedback Post-Rebuttal Meta-reviews

Authors

Zhuowei Li, Qing Xia, Zhiqiang Hu, Wenji Wang, Lijian Xu, Shaoting Zhang

Abstract

Vessel centerline extraction is fundamental for plentiful medical applications. Majority of current methods require pre-segmentations, distance maps or similar sorts of scanning whole volume action and followed by minimal-path or skeletonization algorithms. In this paper, we demonstrate a deep reinforced tree-traversal agent that automatically traces tree-structure centerlines assuming no post-prune or post-merging. It takes raw images as input and generates tree-structure centerlines naturally. To this end, road mark and dynamic reward mechanisms are proposed to make tree-structure vessels learnable and impart the agent how to learn correspondingly. Besides, a multi-task discriminator is raised to simultaneously detect bifurcations and decide terminations. We experimentally show that traced centerlines have an overlap of more than 90% and a distance less than 0.25mm with annotated reference centerlines on coronary arteries. Beyond the promising accuracy, the proposed method also surpasses other existing methods by a large margin in terms of the time and memory efficiency. And a flexible trade-off between accuracy and time efficiency is exhibited at the inference. Codes are available at https://github.com/LzVv123456/Deep-Reinforced-Tree-Traversal.

Link to paper

DOI: https://doi.org/10.1007/978-3-030-87240-3_40

SharedIt: https://rdcu.be/cyl6e

Link to the code repository

https://github.com/LzVv123456/Deep-Reinforced-Tree-Traversal

Link to the dataset(s)

N/A


Reviews

Review #1

  • Please describe the contribution of the paper

    The paper presents a framework to extract the coronary artery centerline without post-prune or fusion by combining a reinforcement learning model with the tree-traversal process. The authors proposed a multi-task discriminator to detect the bifurcations of the tree structures and when to terminate. By using a dynamic rewarding mechanism, the model is able to trace and find the correct directions. Specifically, the authors designed road mark mechanism to ensure the tracing not getting into traps.

  • Please list the main strengths of the paper; you should write about a novel formulation, an original way to use data, demonstration of clinical feasibility, a novel application, a particularly strong evaluation, or anything else that is a strong aspect of this work. Please provide details, for instance, if a method is novel, explain what aspect is novel and why this is interesting.

    The proposed method combines the deep reinforced learning (DRL) and the tree-traversal process to extract centerlines for tubular structures with side branches. 1.Coronary artery has more complex structures compared to published paper using DRL on aortic artery. 2.The method automatically identifies the bifurcation and terminate the tracing which was a challenge 3.The method uses a road map to mark the location that being visited to avoid the trapping situation often happened in the tracing task.

  • Please list the main weaknesses of the paper. Please provide details, for instance, if you think a method is not novel, explain why and provide a reference to prior work.

    The discriminator for detecting the bifurcations or the termination of the tracing is trained separately from the Agent which makes the inference a bit complex.

  • Please rate the clarity and organization of this paper

    Very Good

  • Please comment on the reproducibility of the paper. Note, that authors have filled out a reproducibility checklist upon submission. Please be aware that authors are not required to meet all criteria on the checklist - for instance, providing code and data is a plus, but not a requirement for acceptance

    Overall, the paper has high reproducibility 1.Based on the reproducibility checklist, the authors make the train/evaluation codes available together with readme file to make a full description 2.The authors have a relatively detailed descriptions regarding the training and inference: (1)The pre-processing steps (2)The split of the train and validation datasets (3)The performance metrics (4)Some of the parameters set in their experiments

  • Please provide detailed and constructive comments for the authors. Please also refer to our Reviewer’s guide on what makes a good review: https://miccai2021.org/en/REVIEWER-GUIDELINES.html

    Overall, the paper is well written with clear structure and detailed description of the methodology and results. I appreciate the work from the authors. 1.The author proposes a road mark mechanism to remind the agent of trajectories that being visited. Page 4, section “Road Mark” describes that “the road mark, a cube with size n and value c, will be left to the raw environment where the agent has passed”. My question is how the road mark being used? Do you set all passed location with road marks? And how the value c will be applied during the tracing? Please Clarify. 2.In the section of “Dataset”, the authors state that “Most of these patients contain a certain degree of stenosis and plaques”. (1)Do you have a distribution of the severity of the stenosis or plaques? Severe stenosis or plaques will sure affect the lumen of the coronary artery and therefore affect the centerlines. (2) Did you evaluate the performance of your method regarding the different severity of the stenosis or plaques? 3.The last line in the first paragraph, page6 “The whole dataset is split into 140 training and 140 validation”. In earlier part of this section, it says that the 280 patients were from four clinical institutions. (1) How do you split the train and validation, by randomly split these 280 patients? (2) If randomly, how do you verify your method to deal with a different distribution, such as from a fifth institution? Please add some details. 4.In the metrics section, the authors describe two metrics, the overlap and the distance between the extracted and the ground truth. The metrics calculate the overlap and also the distance using both directions, from extracted centerline to ground truth, and from ground truth to the extracted centerline to make the values more robust, similar to a point set registration results evaluation. But the description needs to be improved to make it more readable, especially that the terms Rt, Rf, Tt,Tf, and similary Dr, Dt which actually confuses me more. 5.Table 2 listed the results for using different step-size, and the author concludes in the last two lines, section “coronary artery centerline extraction”, page 7, that step-size equaling to 1.0 generates the highest cost performance. Why didn’t the author chooses the step-size as 1.2 which has higher overlap accuracy, and less time cost with only 0.01mm average distance increase compared to the step-size as 1.0. Should the reader need to interpret the table in a different way? 6.Some settings of the experimental parameters are missing, such as the maximum searching steps, the cubic size for the road mark, etc. adding these values will increase the reproducibility of the proposed method.

  • Please state your overall opinion of the paper

    accept (8)

  • Please justify your recommendation. What were the major factors that led you to your overall score for this paper?
    1. The proposed method combines the deep reinforced learning (DRL) and the tree-traversal process to extract centerlines for tubular structures with side branches.
    2. The paper is well written with clear structure and detailed description of the methodology and results.
  • What is the ranking of this paper in your review stack?

    1

  • Number of papers in your stack

    5

  • Reviewer confidence

    Confident but not absolutely certain



Review #2

  • Please describe the contribution of the paper

    This paper is about segmentation of coronary arteries with a traversal agent i.e. starting from a reference point and exploring the tree. It deals with the challenge of bifurcation.

  • Please list the main strengths of the paper; you should write about a novel formulation, an original way to use data, demonstration of clinical feasibility, a novel application, a particularly strong evaluation, or anything else that is a strong aspect of this work. Please provide details, for instance, if a method is novel, explain what aspect is novel and why this is interesting.

    The main strength of this paper is to expand use the traversal agent for this segmentation task. Traversal agent have the capability to process very efficiently since they do not need to explore the full volume. The focus on bifucation is pretty useful since it is an important challenge of this approach.

  • Please list the main weaknesses of the paper. Please provide details, for instance, if you think a method is not novel, explain why and provide a reference to prior work.

    The paper suffers from several weaknesses:

    • the internal behavior of the traversal agent is neither explained. What is the criteria used to analyze the pixels values? It is mentioned that it is trained episodically (page 5). I have a hard time to understand that it means.
    • The concept of “road mark” used to avoid circular trajectories lacks clarity. You write “‘As indicated by the name, a distinctive mark (a cube with size n and value c in our setting) is left to the raw environment where the agent has passed.’ what does it mean? Are you modifying the image content ?

    • You paper does not mention at all the concept of scale. How do you deal with vessels of different diameters which are likely to show up as you go through a bifurcation?
  • Please rate the clarity and organization of this paper

    Satisfactory

  • Please comment on the reproducibility of the paper. Note, that authors have filled out a reproducibility checklist upon submission. Please be aware that authors are not required to meet all criteria on the checklist - for instance, providing code and data is a plus, but not a requirement for acceptance

    In my view, main limitation of the reproducibility is that some key algorithmic elements such as the internal design of the traversal agent are not explained at all. Did I miss a cross-reference?

  • Please provide detailed and constructive comments for the authors. Please also refer to our Reviewer’s guide on what makes a good review: https://miccai2021.org/en/REVIEWER-GUIDELINES.html

    The concept of “reference point” and “reference centerlines” is used before being introduced even intuitively. The reading of the paragraph “Reward design” is very difficult. It is only later that it looks to be the ground-truth. Why not calling it simply ground-truth?

    closet point → closest point

    Why taking into account the length of the step I.e   xt   in the definition of the reward function? It is not explained in the text. Is it just to make a scale normalization. Is the length of a step constant or not ?

    In defining T for the reward as the difference between the norm of v1 and v2, you treat equally two vector components the parallel and the orthogonal to the trajectory. In my mind, they play a different role. Indeed both curves are discretized and so the parallel component is the consequence of using discrete step to model the centerline. Consequently it shall not account the same way.

    In the presented result, the ratio of coverage of traced centerline an the ratio for the reference centerline are only given as a sum of both. Consequently, the reader cannot estimate if there is tendency to over segment or under segment. The included mp4 shows several segmented branches without reference centerline.

  • Please state your overall opinion of the paper

    Probably accept (7)

  • Please justify your recommendation. What were the major factors that led you to your overall score for this paper?

    I think that the work is interesting although the writing of the paper is weak and lacks key details.

  • What is the ranking of this paper in your review stack?

    1

  • Number of papers in your stack

    5

  • Reviewer confidence

    Very confident



Review #3

  • Please describe the contribution of the paper

    The authors propose a deep reinforcement learning method to segment vessel centerline in 3D medical images. Starting with a manual seed point, a trained agent is traversing patch by patch the image in order to segment the vascular tree. They design a reward system and mechanisms to choose right bifurcations order and avoid going backward during the tracing. The orientations to choose and the finding of bifurcations are inspired from previous methods. The evaluation on annotated cardiac CTA shows a better robustness to the state-of-the-art, and a faster and more efficient computation. Ablation study is also done to control how useful are the different rewards.

  • Please list the main strengths of the paper; you should write about a novel formulation, an original way to use data, demonstration of clinical feasibility, a novel application, a particularly strong evaluation, or anything else that is a strong aspect of this work. Please provide details, for instance, if a method is novel, explain what aspect is novel and why this is interesting.
    • It is a novel method using deep reinforcement learning combining previous method designs and new designs. The method is fully sequential which explain the fast and efficient computation.
    • There is comparisons with state-of-the-art methods and robustness is the best for their method.
    • Ablation study is done
    • Evaluation data is sufficient
  • Please list the main weaknesses of the paper. Please provide details, for instance, if you think a method is not novel, explain why and provide a reference to prior work.
    • The method is not fully clear to me
      • How the road mark is combined with dynamic reward and buffer?
      • How is combined the reward metric with the loss function of the agent? Is the CNN tracker neural network pre-trained beforehand? or at the same time as an agent?
    • For the reproducibility of the methods, a lot of used parameters are missing for the experiments (see reproducibility part), but it is stated that code will be shared.
  • Please rate the clarity and organization of this paper

    Good

  • Please comment on the reproducibility of the paper. Note, that authors have filled out a reproducibility checklist upon submission. Please be aware that authors are not required to meet all criteria on the checklist - for instance, providing code and data is a plus, but not a requirement for acceptance
    • It is stated that code will be available which is nice
    • In the paper itself, there is no information on the following parameters used during the experiments: gamma (loss function), k (target reference point), cube size n (road mark), Z (buffer zone time step), T (multi-task discriminator), safe range and maximum steps (end of trajectory), discriminator regression parameters? optimizer + parameters used by the agent? number of episodes during the training?
    • Data is not shared
  • Please provide detailed and constructive comments for the authors. Please also refer to our Reviewer’s guide on what makes a good review: https://miccai2021.org/en/REVIEWER-GUIDELINES.html

    Page 1 Abstract

    • “no post-prone” -> Do you mean post-prune?
    • “assuming no post-prune or merging” -> This is two times in two consecutive sentences, maybe rephrase this part.
    • “and a distance less than 0.25mm” -> average distance?
    • “with annotated reference centerlines on coronary arteries” -> Can you precise if you work with 2D or 3D images
    • “surpasses other existing methods by a large margin in terms of efficiency” -> Efficiency is too vague. Can you please rephrase.

    Page 2

    • “Despite the heuristic effect of the work” -> What do you mean by heuristic effect of the work?
    • “post-prone and merging” -> post-prune?
    1. Methods
      • Equation 1: is r_{t+1} the reward?

    Page 3

    • Figure 1
      • What is the purpose of “Null ?” as you just push p_{t+1}? Maybe the null should be on the green line? Or the backtrack routine is computed in parallel with the sequential tracing process?
      • It is not clear to me what’s happening when “Stop?” is No and at the same time “Bifurcation?” is Yes. In this case p_{t+1} will be traced two times? Maybe the Road Mark mechanism could be added in the figure to understand how previous done bifurcation are discarded.
      • Is it possible to have two bifurcations at the same time for one point p_t?

      Reward Design

    • “Here we propose a target that merges two subgoals into one by directly pulling current proposed centerline point p_t to next target reference point g_{t+k}” -> It is not clear to me why g_{t+1} is not used all the time. Can you give the intuition on why g_{t+k} with k > 1 is relevant?
    • “by finding closet point” -> closest?
    • Equation 2 & 3: you should define v1, v2 and x_t (even though it is in Fig. 2)
    • Equation 3: why do you divide by 2?

    Page 4

    • Figure 2
      • W.r.t to T, I would have chosen v1 as the vector from p_t to g_t. Can you give the intuition on the reason why v1 is the vector from p_t to g_{t+k}?
    • The auxiliary signal A is a sigmoid. So the closest you are from the reference point the closest A is to 0.5. The farther it is and A reaches 1. So you get more reward when the point is far from the reference point. It is not what we expect. What am I doing wrong in my reasoning?

      Dynamic Reward.

    • Is there any interference with the road mark and the dynamic reward? when all potential candidates are collected you do traverse the buzzer zone several times?
    • “in what order should the agent trace at bifurcations” -> How a bifurcation is defined? When to decide to start a buffer zone?
    • “Then accumulated rewards Rn” -> maybe for clarity add that n \in [0,N-1]

    Page 5 Dynamic Reward

    • “When the agent backtracks to the same bifurcation, the reference centerline already used will be disregarded from the candidate pool, then the same strategy is executed again.” -> Is the Road Mark mechanism used here to disregard candidate? How the Road Mark is done here with the bifurcation point, the buffer? Do you remove the Road Mark there when you go back here to do the other paths?

      Multi-task discriminator.

    • “jiitering” -> jittering

    Page 6 Metric.

    • “and it’s the corresponding point” -> and its corresponding point

    Page 7

    • Table 1: Could you discuss the cases with minimum overlap (e.g. the one with 54.93%)?
    • Table 1: Your 3D image has been resampled with a spacing of 0.5mm. It would be nice in the dataset section that you mention what is the pixel size and slice thickness of your raw image. The ground truth centerline is extracted with the original image and here the methods extract the centerline with the 0.5mm image sampling. When we look at the results, the maximum distance is 0.41mm. So below the voxel sampling. It makes me wonder if all these numbers are significant between all the methods as they are all pretty good and below the voxel sampling.

      Coronary Artery Centerline Extraction.

    • “The architecture used in our work is the same as what was proposed in the CNN Tracker to perform an impartial comparison. Ostium locations are provided for both CNN Tracker and our method for the sake of justice” -> In your work, where this network is used? Is this the target network of the agent? I am a bit confused with target and policy network. Are they the same network?
    • “Here we measure step-size from 0.5 to 2.0” -> What unit is it? What step-size did you use at training time. Do you mean the step-size between p_t to p_{t+1}?

    Page 8

    • “According to our off-line visualization results, the agent will always trace the same path and sometimes traces loops without a road mark” -> Does the tracing continue endlessly sometime?
  • Please state your overall opinion of the paper

    accept (8)

  • Please justify your recommendation. What were the major factors that led you to your overall score for this paper?

    The major factor is providing a novel method combining robustness, accuracy, speed and memory efficiency all at the same time.

  • What is the ranking of this paper in your review stack?

    1

  • Number of papers in your stack

    5

  • Reviewer confidence

    Confident but not absolutely certain




Primary Meta-Review

  • Please provide your assessment of this work, taking into account all reviews. Summarize the key strengths and weaknesses of the paper and justify your recommendation. In case you deviate from the reviewers’ recommendations, explain in detail the reasons why. In case of an invitation for rebuttal, clarify which points are important to address in the rebuttal.

    This paper presents a method for extracting the coronary artery centerline. The new framework uses a reinforcement learning model with the tree-traversal agent. Bifurcations are detected by a multi-task discriminator, which is interesting. The method has been evaluated, and the results are good. The reviewers are all positive about this paper. After reading the comments raised by the reviewers, my recommendation is given in Q3.

  • What is the ranking of this paper in your stack? Use a number between 1 (best paper in your stack) and n (worst paper in your stack of n papers).

    1




Author Feedback

  1. Road-mark in detail (R1, R2, R3) We leave the road-mark (a cube with size n=3 and value c and c > 1) at every time step in the raw environment (CTA volume). Since HU value of CTA image is normalized to (0,1). So the road-mark is visualized as a highlight cube and we modify the input environment as train or inference process progressing.

  2. Dataset in detail (R1) We randomly split 280 patients into 140 train and 140 validation. We do not comprehensively or statistically evaluate how stenosis or plaques can impact our method at this stage. However according to off-table visualizations. Extremely severe stenosis or poor image quality do can cause early stop of the agent.

  3. Agent in detail (R2) Both the tracing agent and the discriminator is a relatively shallow CNN.

  4. Scales in our method (R2) We leverage the agent (CNN) to implicitly learn vessels with different scales. However, the input patch size is designed to cover vessels with the largest diameter in order to guarantee a full observation of vessels.

  5. Reward & loss in detail (R3) The agent (CNN) is trained with a standard DRL framework. So the agent is trained with rewards solely. Rewards is calculated according to reference centerlines.



back to top