Back to home page

EIC code displayed by LXR

 
 

    


Warning, /east/docs/_content/proposal.md is written in an unsupported language. File is not indexed.

0001 ---
0002 title: "eAST proposal"
0003 layout: base
0004 name: proposal
0005 ---
0006 
0007 <h1>Project eAST proposal</h1>
0008 
0009 ---
0010 
0011 * TOC
0012 {:toc}
0013 
0014 ---
0015 
0016 ## Motivation 
0017 
0018 
0019 Large-scale detector systems for the EIC are designed by larger
0020 communities. The simulation efforts typically start as standalone
0021 exercises for each detector component with various levels of maturity,
0022 analytical calculations, simplified Monte Carlo exercises (fast
0023 simulations), Geant4-based Monte Carlo approaches (full simulations),
0024 and are later being extended in several different frameworks.
0025 
0026 It is critical in the longer term to perform studies where the
0027 information from various detector elements and also the interaction
0028 region, support structures and other dead material is taken into
0029 account, which is only possible in a comprehensive simulation. This is
0030 essential to understand, e.g., the performance at the edges of the
0031 detector system or the effect of combining different technologies for
0032 the electro-magnetic calorimeter. For studies of the physics reach and
0033 detection capabilities, it is important to be able to switch detector
0034 options with varying levels of detail, combining full simulations for
0035 some detector components with fast simulations for the rest of the
0036 detector system. It is also critical to build a sustainable effort for
0037 the entire time scale of the experiments with common tunes and commonly
0038 validated results for both fast and full simulations.
0039 
0040 However, such detailed studies are time consuming, both from people and
0041 computing view. To ease leveraging new and rapidly evolving computing
0042 technologies, we plan to implement a common and integrated approach for
0043 fast and full simulations in Geant4 with a plug and play modular
0044 approach.
0045 
0046 ---
0047 
0048 ## Requirements 
0049 
0050 A comprehensive and centrally maintained simulation tool based on Geant4
0051 for both fast and full simulations with a library of potential detector
0052 options has to be developed. The initial focus on the development will
0053 be:
0054 
0055 -   ability to reuse existing simulation works,
0056 -   ease of switching detector options with comparable levels of detail,
0057 -   ease of switching between detailed and coarse detector descriptions,
0058 -   ease of leveraging new and rapidly evolving technologies
0059 (e.g., AI/ML) and computing hardware (e.g. heterogeneous architectures)
0060 
0061 After validation of the Geant4 simulations, they will be simplified
0062 (replacing some physics aspects) for fast simulations. This approach
0063 will allow us to use everywhere the same geometry (very important to
0064 reduce debugging and development time) and to combine full simulation
0065 for a subset of detectors with fast simulations for the rest of the
0066 integrated detector. Geant4 itself and various NP and HEP experiments
0067 already provide sub-systems for fast simulations. This will reduce the
0068 development time of the fast simulations substantially.
0069 
0070 The detector simulation tool will be modular, and its development will
0071 be targeted. It will provide an interface to the output of Monte Carlo
0072 event generators but no further work on generators. The simulation of
0073 detector effects and detector responses (digitization) will be clearly
0074 separated. There will be a common geometry interface between the
0075 detector simulation and reconstruction.
0076 
0077 The timely development of a comprehensive, unified and centrally
0078 maintained detector simulation tool for both fast and full simulations
0079 to serve the needs of the detector collaborations or groups is no small
0080 task. It must take place in the context of strong development teams in
0081 the labs together with important contributors in the universities, and
0082 go beyond legacy software to a new common effort to be successful. This
0083 effort builds on a sustainable common project based on Geant4 and
0084 tailored to needs of NP experiments as they exist today and will evolve
0085 to the 2030s.
0086 
0087 Geant4 has the capability to support both fast and full simulation,
0088 which we want to use for large-scale detector systems. Geant4 through
0089 its multi-threaded reengineering has already been able to support high
0090 concurrency heterogeneous architectures, with excellent results in the
0091 memory economizations achieved. Leveraging and evolving this capability
0092 as heterogeneous architectures become ever more prevalent is of great
0093 importance for data processing and analyses for large-scale detector
0094 systems at the EIC.
0095 
0096 ---
0097 ## Work program and deliverables
0098 
0099 ### Create CAD interface to the detector simulation tool
0100 
0101 Fast and detailed simulations need to be able to implement updates on
0102 detector layout fast, where these updates need to be based on detailed
0103 design drawings with parametric surfaces in 3D. This requires as crucial
0104 *first step* compatibility with the CAD applications being used at BNL
0105 and JLab, for easy exchange. The most flexible format for Geant4 to
0106 import CAD geometry is GDML. STEP is the file format commonly available
0107 to many CAD systems. Several tools exist that convert STEP files to GDML
0108 files. We will work with experts at BNL and JLab to conduct a survey if
0109 STEP is available for the variety of CAD applications used. These break
0110 down into two broad categories: mesh vs. nurbs. Mesh CAD applications
0111 represent a structure using a collection of flat polygons that emulate
0112 curved surfaces that can be used to represent volumes. The granularity
0113 of the polygon surfaces can be scaled up or down to create smoother or
0114 coarser surfaces, respectively. Examples are AutoCad and Sketchup which
0115 are excellent for expansive structural/architectural representations of
0116 components and buildings. Nurbs models use parameters that describe the
0117 shape and extent of a surface, the curvature of each element, and the
0118 thickness of the material. In effect they can produce very close
0119 approximations of real-life entities but are difficult to exchange or
0120 import by non-nurbs systems. Examples are CREO, SolidWorks, and Siemens
0121 NX which are used for mechanical design. GDML also supports various
0122 shapes and extruded solids, some of them may correspond to nurbs.
0123 Lastly, CAD files often only contain sparse information about the
0124 composed material, often just the name of the materials. On the other
0125 hand, detailed detector simulations require the full composition of each
0126 material, often down to the isotope composition level. We will develop a
0127 mechanism, e.g., through a macro file, to associate such additional
0128 information needed to the detector simulation.
0129 
0130 *Deliverables:
0131 1) Development of a CAD Interface for Detector
0132 Simulations.
0133 2) Macro file that allows material composition information to be easily ported to detector simulations.*
0134 
0135 ### Create an initial version of the fast and full detector simulation tool
0136 
0137 The detector simulation tool has to be capable of doing both fast and
0138 full simulations in one application, easily configurable for each
0139 detector component. Also, the detector simulation tool has to easily
0140 integrate (“plug-in”) already existing standalone simulation
0141 applications. These requirements will be fulfilled by utilizing the
0142 “region” mechanism of Geant4. Each detector component is represented as
0143 a region, where geometry description including different levels of
0144 detail, physics options including fast simulation and unique physics
0145 model configurations, and detector responses based on geometry and
0146 physics options are taken care of. We will develop an initial version of
0147 such a detector simulation tool.
0148 
0149 *Deliverable: First running version of detector simulation tool.*
0150 
0151 ### Communicate with detector study groups
0152 
0153 The existing standalone simulation applications that are to be adapted
0154 to the new detector simulation tool have to be examined and converted to
0155 be “plug-able” using the “region” mechanism. We will communicate with
0156 detector study groups and work with them to drive the efforts of
0157 integration. Through these interactions the requirements to the detector
0158 simulation tool will be refined as needed. The detector simulation tool
0159 may also integrate beam-test configurations if applicable. The
0160 deliverable will be a prototype integration of an existing simulation,
0161 the specific target to be chosen in the first month on the basis of
0162 importance and available developer effort during the initial funded
0163 period. Other detector components will be added by the detector
0164 community based on the experience from the prototype.
0165 
0166 *Deliverable: Documented prototype integration of an existing simulation.*
0167 
0168 ### Develop and deliver a common physics list 
0169 
0170 Geant4 offers several alternative physics models. To make comparisons
0171 over different detector configurations, one has to use a common set of
0172 physics models that is appropriate to NP detectors. We will develop and
0173 deliver such a physics list. We will consult with detector study groups
0174 and advise on validations of the physics models with beam-test results.
0175 
0176 *Deliverable: Documented common physics list.*
0177 
0178 ### Integrate with overall software efforts
0179 
0180 We will compile the requirements of detector simulation to the software
0181 infrastructure (e.g., common geometry and data formats) to be shared
0182 across the whole software chain including physics event generation,
0183 simulation, digitization, reconstruction and analysis programs.
0184 
0185 *Deliverable: Requirements document. *
0186 
0187 ### Deliver a detector simulation tool extensible to heterogeneous architectures
0188 
0189 <span id="_4d34og8" class="anchor"></span>In the future, simulation jobs
0190 may run on computers with heterogeneous hardware configurations, e.g.,
0191 with GPGPU and/or FPGA, and with cutting edge IT technologies such as
0192 AI/ML. Enabling the effective use of such technologies requires design
0193 and implementation choices in the detector simulation tool. We will
0194 ensure the extensibility of the detector simulation tool through the use
0195 of a tasking mechanism (either PTL or TBB). We will also support the
0196 developers of detector components of the thread-safety of their code.
0197 
0198 *Deliverable: Proven use of a tasking mechanism.*
0199 
0200 ---
0201 ## Project leader
0202 
0203 <span id="_2s8eyo1" class="anchor"></span>This project aims to take
0204 advantage of the proven value for detailed nuclear and particle physics
0205 experiment simulations by the HEP-developed Geant4 software and the
0206 growing prospects of use of advanced computing techniques. Any
0207 large-scale detector system will benefit from the project on “*Fast and
0208 full simulations in Geant4 for large-scale detector systems with a plug
0209 and play modular approach”*, with the foreseen EIC detectors prime
0210 examples. For example, EIC detector simulation development must be able
0211 to progress rapidly to provide the EIC user community proper flexible
0212 and forward-looking tools to meet the simulation needs of the detailed
0213 detector design and EIC science performance studies. This requires a
0214 common simulation framework and effort if the EIC community is to be
0215 properly served by a unified effort drawing on pooled expertise. This
0216 also requires simulation tools ready to take advantage of the growing
0217 use of heterogeneous computing environments and the prospects they
0218 offer. This work will seed a new common effort through the leadership of
0219 an expert of unique stature and technical expertise, who has the trust
0220 and confidence of the full community to establish a common effort, while
0221 also being a technical expert. The *“Fast and full simulations in Geant4
0222 for large-scale detector systems with a plug and play modular approach”*
0223 will ensure that any work on AI/ML and running on computers with
0224 heterogeneous hardware configurations can be directly applied.
0225 
0226 In Dr. Makoto Asai we identify the individual who can uniquely fill this
0227 role. Dr. Asai has been both a Geant4 project leader and deep technical
0228 expert for over 20 years. He is well known to and respected by the EIC
0229 community with which he has collaborated for many years on EIC Geant4
0230 simulation needs. He is the designer and principal developer of Geant4's
0231 capability to support both fast and full simulation, which we want to
0232 use for the EIC. He led Geant4 through its multi-threaded reengineering
0233 to support high concurrency heterogeneous architectures, with excellent
0234 results in the memory economizations achieved. Leveraging and evolving
0235 this capability as heterogeneous architectures become ever more
0236 prevalent is of great importance for the EIC, and together with the
0237 integrated fast simulation support opens the door to leveraging AI/ML in
0238 the unified simulation. Dr. Asai is also an expert in the geometry and
0239 modular detector description tools that will be the basis of unified
0240 geometry in detector simulations. He is also an expert in the Geant4
0241 physics models that have to be tuned for NP experiments, having presided
0242 over their development and integration for much of the past 20 years. No
0243 other individual has an array of attributes better tailored to the
0244 leader we are looking for. Finally and crucially, Dr. Asai is available
0245 for this work if we act promptly.
0246 
0247 ---
0248 ## Appendix
0249 ### Notes from the [*March 25 EIC software meeting*](https://indico.bnl.gov/event/11102/) discussion (Torre)
0250 
0251 -   CAD work is a distinct project, \~1mo duration, it goes first both
0252     because Elke and Rolf need it and because it gives us a month to
0253     plan the simu project specifics before it launches
0254 -   discussion about time ordering and priorities. Note that there is no
0255     time or priority ordering implicit in the ordering of deliverables
0256     in the proposal.
0257 -   comments from several that physics list should be highest/earliest
0258     priority
0259 -   test beams should be considered and included. Test beam data is
0260     essential to validating physics lists. The simulation software
0261     should support test beam configurations.
0262     -   comment asking whether factoring out digitization is compatible
0263         with physics list tuning, which can be dependent on digi
0264         tuning, e.g. Birk’s constant.
0265     -   tuning won’t happen without actual data, will rely on test beam
0266     -   noted that we do already have relevant test beam data already in
0267         hand
0268     -   comment that in talking about ‘tuning’ we should really be
0269         calling it ‘validation’
0270 -   discussion about the extent to which (very) fast 4vector based
0271     simulation like eic-smear should be integrated. Would Geant4
0272     integration slow it down? Why carry Geant4 dependencies for what
0273     would otherwise have lightweight dependencies?
0274     -   in writing the proposal we expected that eic-smear type
0275         simulations would not be integrated into Geant4 (because it
0276         would slow them down), but would be incorporated in as
0277         seamlessly as possible for ease of use, e.g. uniformity of
0278         user interfaces, of course uniformity of MCEG interfaces
0279     -   Makoto commented that it is possible to integrate such very fast
0280         simulations in Geant4 without a performance hit.
0281 -   comment that it should be possible for the user to “own the event
0282     loop”, treating the simulation software fully as an on-call
0283     toolkit rather than handing over control, especially for fast
0284     simulation
0285 -   comment on whether we should put such emphasis on Geant4; what if a
0286     new simu toolkit comes along or ‘GeantV’ emerges.
0287 
0288     -   ‘GeantV’ as a rewrite/reengineering of Geant4 has been laid to
0289         rest in recent years, as a useful learning exercise that was
0290         successful when it worked to augment Geant4 rather than
0291         replace it (e.g. VecGeom came out of it). There is not a
0292         prospective Geant4 replacement in sight, and there is a wide
0293         range of Geant4-directed R&D underway, including new projects
0294         in utilizing GPUs, particularly for EM/photon physics.
0295 -   comment that it is difficult to evaluate the intended scope of
0296     the proposal. How will it integrate with the simu activities of
0297     the proto-collaborations.
0298     -   one reason the scope is somewhat fuzzy is that the duration
0299         is fuzzy. The present commitment is 4 months. Clearly the
0300         complete program envisioned can’t be done in 4 months (the
0301         specific deliverables though are intended to be achievable in
0302         4 months). We’ll seek support for extending it if it looks
0303         successful after the first couple of months.
0304     -   the project isn’t intended to have (and can’t
0305         realistically have) deliverables targeting the
0306         proto-collaborations in 2021, ie pre detector proposals (such
0307         is our expectation in writing it at least). The 2021 work in
0308         both the project and the proto-collaborations can focus on
0309         2022 convergence.
0310         -   possible/probable exception is providing an EIC physics list
0311         asap, could be used this year.
0312 -   this simulation tool software should be in self-consistent repo(s)
0313     in the EIC GitHub
0314 -   should be containerized and easily distributable/usable
0315 
0316 ### Notes from the [*March 2 EIC software meeting*](https://indico.bnl.gov/event/11151/) discussion (Markus)
0317 
0318 -   Wouter asked about the steps after the community input. We clarified
0319     that there will be regular updates in the weekly EIC Software
0320     meetings as well as regular meetings by the developers.
0321 
0322 -   Jin asked to publish the validation setup as part of the deliverable
0323     (see his comment above). We will add this to the deliverable. We
0324     also summarized last week’s discussion on the topic, in particular
0325     the requirement for test-beam data and Elke’s suggestion to
0326     collect information from the EIC Generic Detector R&D consortia on
0327     tuning and validation of detector simulations in Geant4.
0328 
0329 -   Dmitry asked how the plug and play modular approach will
0330     be implemented. He also asked about the user interface, in
0331     particular using the Geant4 macros or Python bindings.