Developing a grid based system for distributed analysis. This enables physicists at different institutes to use the computing resources available at various sites (including Tier 1 and Tier 2 centres). EasyGrid is an intermediate layer between user software and grid middleware. EasyGrid is responsible for job submission, copy executables at grid Storage Elements, recovery results and collect all information for post-mortem analysis (if something goes wrong). Users have to know nothing about grid to use EasyGrid and do their analysis. It suits users whose main interest is not in computer science but in High Energy Physics.
After install LCG software, grid manager needs to configure the site following Standards and configuration - making all transparent.
Installing BaBar software from scratch
These pages describe how to install BaBar software in a storage system (AFS or NFS).
The diagram shows the relationship between sites, and contains the links between the application, the configuration file and the physical data for each type of information. The path points to the directories and the configuration files. You will need this information to customise each package for your own environment.
The use of web pages to store specifications allows updates and immediate distribution for all users, as soon as new ideas and suggestions came from new tests and users. This section contains the available specification and describes how it was implemented in the prototype and in production, as well as how it will improve as project goes on. It will be part of users' manual as well, which makes its use more wide than a specification document only.
Important news: There are discussions about the final implementation for UK. See http://hepunx.rl.ac.uk/BaBar/uk-spgrid/babar-env-setup.html for more details.
The meeting 2nd June gave new directions for the project:
--> EasyGrid will not optimise grid load, only provide a list of CE in the requirements clause. This function will be performed in the future by Resource Broker/RAL.
--> Instead use Glue Schema to publish site resources, will be use tags. RAL will update resources tags.
--> Datasets will be published by tags, with site resources and not more by RLS. RAL will update datasets tags.
--> dCache will be part of BookKeeping manager through some change in KanAccess.cfg. This is transparent to grid and babar software. Its implementation follows xroot approach and will be done by Bbk developers.
At the end of the process (0.5 day work), you will be able to run analyses using release 14.5.2 and BaBar CM2 software.
There are several ways to represent the information from the experiment, depending the user's knowledge of the reaction and the data available. The initial study uses histograms (mono and bi-parametric) to represent the number of events for some physical variable. Its possible to plot the number of particles with rest mass between 100MeV and 200MeV from the combination of 2 or more particles or gammas. Histograms are static because they do not allow a different study using their data. One can only plot different sections and using colors and superposition of histograms. Another possibility is to recover events with some specific characteristic, e.g. which could be selected through "cuts", and to generate ntuples. This is a more interesting way to store information, because as far as users know what events they want, it is possible to test several models from the selected events, with no need to search all the dataset. Each program is an independent approach for one specific problem. Probably, the user will need to join more than one of these programs to write a useful analysis software.
The grid available at Manchester University consists of 6 Worker Nodes, one Computer Element running, one resource broker, one information system, and one storage element running LCG2. The user interface is a remote computer. Data transfer is done through NFS. See the hardware configuration here.
The project is a test analysis to assess the job submission prototype and to acquire information to write the specification for a production job submission system using grid infrastructure in a reliable way. It covers a range of branching rates, and physical results were assessed from previous experiments. Pi0 physics and decay is very well understood by QED, and is a common decay of tau leptons. This project will help newbies understand how to select events using cuts and selectors, and what criteria should be used in the selection process.
The results of this project were discussed in the BaBar Users Meeting
on 9th February 2005, where I got users feedback to improve the
specification. The total time to develop it was 4 weeks to show the
proposal, write the code, select events twice over 81 million events,
and analyse the results.
I hope PhD students will be able to do
this as well, and use the remaining 2 years and 11 months of their
scholarship developing and learning high energy physics.
Results from Benchmark 2 using discriminate function: [doc] [pdf] [ps]
Benchmark 3 - Searching for anti-deuteron.
The idea of this grid benchmark is to look for anti-deuterons in all tracks from all events. This job requires a huge amount of computing and storage resources, and will provide all necessary information to study grid performance and bottlenecks.
There are evidences of anti deuterons production in HERA and argus (1,2). There are also recent models about coalescent process.
This benchmark applies functional gridification in genetic programming evolving discriminate functions to distinguish neutral pions from background in hadronic tau decays using EasyGrid job submission system. Accurate discrimination is important to obtain high purity samples with small systematic errors that allows ratify theoretical predictions with experimental results. Four different data selections criteria according to gammas energies and conventional cut were used to validate our method. A comparison between real data using the same data selection criteria and the discriminate shows our approach is better to discriminate real neutral pions from background than the conventional approach. Monte Carlo and experiment data adherence on neutral pion energy distribution proofs methods effectiveness. The complete analysis is available at Elementary particle identification using evolvable discriminate function and grid and IoP2007 poster has the beatiful plots.
EasyGrid performance and Grid bottlenecks.
Pi0 project, eta decays with generic root application, and searching for anti-deuteron benchmarks provided enough data to support EasyGrid as a submission system for high energy physics in general.
EasyGrid provides the output (if the job runs OK), or abend listings (if the job fails), or abort listings (if grid fails) in 100% of time. Users will have all necessary information in dataset.histo file to find out what to do, and ask for support.
Programming techniques provided in the technical report allows use grid efficiently.
Despite I have succeeded doing distributed analysis, I still have the following concerns about grid implementation by GridPP project:
Project start point: BabarGrid Roger Barlow's talk at GridPP10 (Switzerland, 2nd June 2004). General Overview: Enabling Grid Computer for HEP [ppt] [ps] Talks at BabarGrid UK (Manchester, 8th Sept 2004), * GridPP11 (Liverpool,14th Sept 2004), and BaBar Meeting (Dresden, 20th Sept 2004) New BaBar Software Installation (This project was cancelled by Roger Barlow) Group meeting 27/10/2004 at Manchester [ppt] [pdf] Christmas meeting / Manchester 2005: [ppt] [ps] MetaData Workshop / Grenoble France 2005: [ppt] [pdf] Grid2005 Workshop / Seattle, Washington, USA 2005 (preliminar): [doc] ICFA Workshop / Daegu, Korea 2005: [ppt] Babar Collaboration Meeting / Elba, Italy 2005: [doc] I will use the web pages and run grid remotely alive to show the resources and how easy is to use it. Babar Grid UK Meeting / Manchester, 2nd June 2005: Tier1 Grid from users point of view: urge of standards Internal report: How to succeed using grid in High Energy Physics experiments. BaBar Collaboration,Grid Applications for High Energy Physics Experiments Accepted to the Grid 2005 - 6th IEEE/ACM International Workshop on Grid Computing, November 13-14, 2005, Seattle, Washington, USA Babar Grid UK Meeting / Liverpool, 30th November 2005: EasyGrid, LCG testbed and gridification techniques [Ppt] [Ps] [PDF] Christmas meeting / Manchester 2005: [ppt] [pdf] Gridpp15 meeting - Rutherford Appleton Laboratory: ppt IoP 2006 - University of Warwick: ppt pdf ps Also available at GridPP web page (third poster from third row) GridPP17 - National e-Science Centre - Edinburgh Distributed analysis Christmas meeting Manchester 2006: Grid Neutral Pions Discriminate Function Gridpp18 meeting - University of Glasgow: EasyGrid BaBar Grid Project IoP 2007 - University of Surrey: ppt EGEE User Forum Manchester 9-11 May, 2007 PPT PDF Registered at EGEE-II Technical Pages - Related projects GridPP Report: 3Q2004 [PDF] [Doc] 4Q2004 [PDF] [Doc] 1Q2005 [PDF] [Doc] 2Q2005 [PDF] [Doc] 3Q2005 [PDF] [Doc] 4Q2005 [PDF] [Doc] 1Q2006 [PDF] [Doc] 2Q2006 [PDF] [Doc] 3Q2006 [PDF] [Doc] 4Q2006 [PDF] [Doc] 1Q2007 [PDF] [Doc] GridPP Logbook
|
|
Feedback to: jamwer@hep.man.ac.uk |