%% Computer Physics Communications class file, slightly modified %% for reduced inter-paragraph and inter-section spacing. \documentclass{elsartmod} % Include graphics package. \usepackage{axodraw} % Standardised indentation. \newcommand{\cindent}{\hspace*{10mm}~} %Vertical positioning of page %\setlength{\topmargin}{20mm} \begin{document} \begin{frontmatter} \begin{flushright} arXiv:0710.3820\\ CERN-LCGAPP-2007-04\\ LU TP 07-28\\ FERMILAB-PUB-07-512-CD-T\\ October 2007\\ \end{flushright} \title{A Brief Introduction to PYTHIA 8.1} \author[a,b]{Torbj\"orn Sj\"ostrand\thanksref{author}}, \author[c]{Stephen Mrenna}, \author[a,c]{Peter Skands} \thanks[author]{Corresponding author, e-mail: torbjorn@thep.lu.se} \address[a]{CERN/PH, CH--1211 Geneva 23, Switzerland} \address[b]{Department of Theoretical Physics, Lund University,\\ S\"olvegatan 14A, SE-223 62 Lund, Sweden} \address[c]{Fermi National Accelerator Laboratory, Batavia, IL 60510, USA} \begin{abstract} The \textsc{Pythia} program is a standard tool for the generation of high-energy collisions, comprising a coherent set of physics models for the evolution from a few-body hard process to a complex multihadronic final state. It contains a library of hard processes and models for initial- and final-state parton showers, multiple parton-parton interactions, beam remnants, string fragmentation and particle decays. It also has a set of utilities and interfaces to external programs. While previous versions were written in Fortran, \textsc{Pythia}~8 represents a complete rewrite in C++. The current release is the first main one after this transition, and does not yet in every respect replace the old code. It does contain some new physics aspects, on the other hand, that should make it an attractive option especially for LHC physics studies. \begin{flushleft} %Insert your suggested PACS number here PACS: 13.66.-a, 13.85.-t, 12.38.-t, 12.15.-y, 12.60.-i \end{flushleft} \begin{keyword} event generators, multiparticle production, parton showers, multiple interactions, hadronisation \end{keyword} \end{abstract} \dedicated{\rule{0mm}{8mm}Dedicated to the memory of\\[3mm] \textbf{\textit{\Large Hans-Uno Bengtsson}}\\[1mm] 1953 -- 2007\\ The father of PYTHIA} \end{frontmatter} \clearpage % Computer program descriptions should contain the following % PROGRAM SUMMARY. {\bf NEW VERSION PROGRAM SUMMARY} %Delete as appropriate. \begin{small} \noindent {\em Manuscript Title:A Brief Introduction to \textsc{Pythia} 8.1} \\ {\em Authors:Torbj\"orn Sj\"ostrand, Stephen Mrenna, Peter Skands} \\ {\em Program Title: \textsc{Pythia} 8.1} \\ {\em Journal Reference:} \\ %Leave blank, supplied by Elsevier. {\em Catalogue identifier:} \\ %Leave blank, supplied by Elsevier. {\em Licensing provisions: GPL version 2} \\ %enter "none" if CPC non-profit use license is sufficient. {\em Programming language: C++} \\ {\em Computer: commodity PCs} \\ %Computer(s) for which program has been designed. {\em Operating systems: Linux; should also work on other systems} \\ %Operating system(s) for which program has been designed. {\em RAM: 8} megabytes \\ %RAM in bytes required to execute program with typical data. %{\em Number of processors used:} \\ % %If more than one processor. %{\em Supplementary material:} \\ % % Fill in if necessary, otherwise leave out. {\em Keywords: event generators, multiparticle production, parton showers, multiple interactions, hadronisation} \\ % Please give some freely chosen keywords that we can use in a % cumulative keyword index. {\em PACS: 13.66.-a, 13.85.-t, 12.38.-t, 12.15.-y, 12.60.-i} \\ % see http://www.aip.org/pacs/pacs.html {\em Classification: 11.2 Phase Space and Event Simulation} \\ %Classify using CPC Program Library Subject Index, see ( % http://cpc.cs.qub.ac.uk/subjectIndex/SUBJECT_index.html) %e.g. 4.4 Feynman diagrams, 5 Computer Algebra. %{\em External routines/libraries: none} \\ % % Fill in if necessary, otherwise leave out. %{\em Subprograms used:} \\ % %Fill in if necessary, otherwise leave out. {\em Catalogue identifier of previous version: ADNN\_v1\_0} \\ %Only required for a New Version summary, otherwise leave out. {\em Journal reference of previous version: T. Sj\"ostrand, P. Ed\'en, C. Friberg, L. L\"onnblad, G. Miu, S. Mrenna and E. Norrbin, Computer Physics Commun. {\bf 135} (2001) 238} \\ %Only required for a New Version summary, otherwise leave out. {\em Does the new version supersede the previous version?: yes, partly} \\ %Only required for a New Version summary, otherwise leave out. \\ {\em Nature of problem: high-energy collisions between elementary particles normally give rise to complex final states, with large multiplicities of hadrons, leptons, photons and neutrinos. The relation between these final states and the underlying physics description is not a simple one, for two main reasons. Firstly, we do not even in principle have a complete understanding of the physics. Secondly, any analytical approach is made intractable by the large multiplicities. }\\ %Describe the nature of the problem here. \\ {\em Solution method: complete events are generated by Monte Carlo methods. The complexity is mastered by a subdivision of the full problem into a set of simpler separate tasks. All main aspects of the events are simulated, such as hard-process selection, initial- and final-state radiation, beam remnants, fragmentation, decays, and so on. Therefore events should be directly comparable with experimentally observable ones. The programs can be used to extract physics from comparisons with existing data, or to study physics at future experiments.}\\ %Describe the method solution here. \\ {\em Reasons for the new version: improved and expanded physics models, transition from Fortran to C++}\\ %Only required for a New Version summary, otherwise leave out. \\ {\em Summary of revisions: new user interface, transverse-momentum-ordered showers, interleaving with multiple interactions, and much more}\\ %Only required for a New Version summary, otherwise leave out. \\ {\em Restrictions: depends on the problem studied}\\ %Describe any restrictions on the complexity of the problem here. \\ {\em Unusual features: none}\\ %Describe any unusual features of the program/problem here. \\ %{\em Additional comments:}\\ %Provide any additional comments here. % \\ {\em Running time: 10--1000 events per second, depending on process studied}\\ %Give an indication of the typical running time here. % \\ %{\em References:} %\begin{refnummer} %\item Reference 1 % This is the reference list of the Program Summary %\item Reference 2 % Type references in text as [1], [2], etc. %\item Reference 3 % This list is different from the bibliography, which % % you can use in the Long Write-Up. %\end{refnummer} \end{small} \clearpage % In program descriptions the main text of the paper is listed under % the heading LONG WRITE-UP. \hspace{1pc} {\bf LONG WRITE-UP} \section{Introduction} The development of \textsc{Jetset} \cite{jetset}, containing several of the components that later were merged with \textsc{Pythia} \cite{pythiaearly}, was begun in 1978. Thus the current \textsc{Pythia}~6 generator \cite{pythiasixone, pythiasixfour} is the product of almost thirty years of development, and some of the code has not been touched in a very long time. New options have been added, but old ones seldom removed. The basic structure has been expanded in different directions, well beyond what it was once intended for, making it rather cumbersome by now. {}From the onset, all code has been written in Fortran~77. For the LHC era, the experimental community has made the decision to move heavy computing completely to C++. Fortran support may be poor to non-existing, and young experimenters will not be conversant in Fortran any longer. Therefore it is logical also to migrate \textsc{Pythia} to C++, and in the process clean up and modernise various aspects. A first attempt in this direction was the \textsc{Pythia}~7 project \cite{pythiaseven}. However, priority came to be given to the development of a generic administrative structure, renamed \textsc{ThePEG} \cite{thepeg} and adopted by the \textsc{Herwig++} \cite{herwigpp} group, while the physics parts of \textsc{Pythia}~7 remained underdeveloped. \textsc{Pythia}~8 is a clean new start, to provide a successor to \textsc{Pythia}~6. It is a completely standalone generator, thus not relying on \textsc{ThePEG} or any other external library. Some optional hooks for links to other programs are provided, however. The version presented here is the first operational one in the \textsc{Pythia}~8 series. As such it is not yet tested and tuned enough to have reached the same level of maturity as \textsc{Pythia}~6, so we expect the two to coexist for a while. It is only by an increasing use of the new version that it will mature, however, so we encourage a critical try-out, and look forward to feedback. The intention is to release a version in time for comparisons with first LHC data. Thus some areas, like $\gamma\mathrm{p}$ and $\gamma\gamma$ physics, are not yet addressed. Further, some intended processes remain to be implemented. Actually, with the rise of automatic matrix-element code generation and phase-space sampling, input of process-level events via the Les Houches Accord (LHA) \cite{lha} and with Les Houches Event Files (LHEF) \cite{lhef} reduces the need to have an extensive process library inside \textsc{Pythia} itself. Thus emphasis is more on providing a good description of subsequent steps of the story, involving elements such as initial- and final-state parton showers, multiple parton--parton interactions, string fragmentation, and decays. The current article provides an introduction to \textsc{Pythia}~8. The programming aspects are covered in more detail in a set of interlinked HTML (or alternatively PHP) pages that comes in the same package as the program files, see below. Much of the physics aspects are unchanged relative to the \textsc{Pythia}~6.4 manual \cite{pythiasixfour}, and so we refer to it and to other physics articles for that. Instead what we here give is an overview for potential users who already have some experience with event generators and want to understand how to get going with \textsc{Pythia}~8. Section \ref{sec:physics} contains an ultra-brief summary of the physics of \textsc{Pythia}~8, with particular emphasis on aspects that are different relative to \textsc{Pythia}~6. The program structure (including flow, files, documentation, and a few important warnings) is described in section \ref{sec:structure}; summaries of the main user methods, including the event record and particle classes, in section \ref{sec:main}. Section \ref{sec:databases} is concerned with the databases of flags, modes, parameters, processes, and particle data which exist in \textsc{Pythia}~8. Those who wish to link to external programs, e.g.\ to gain access to external parton distributions, standard input/output formats, and much more, will find the relevant information in section \ref{sec:external}. A brief how-to on getting going is then included in section \ref{sec:how-to}. Section \ref{sec:outlook} rounds off with an outlook. \section{Physics Summary \label{sec:physics}} This article is not intended to provide a complete description of the physics content. For this we primarily refer to the \textsc{Pythia}~6 manual \cite{pythiasixfour} and associated literature. We would like to draw attention to some key points of difference, however. Further details are available on the HTML/PHP pages in the program distribution. Some new physics aspects will eventually be covered in separate articles. The physics components are controlled by many parameters. These have been assigned sensible default values, based on previous experience with \textsc{Pythia}~6 and some first studies with the new code. We look forward to more extensive tunes by the experimental community, however. \subsection{Hard processes} Currently the program only works with $\mathrm{p}\mathrm{p}$, $\overline{\mathrm{p}}\mathrm{p}$, $\mathrm{e}^+\mathrm{e}^-$ and $\mu^+\mu^-$ incoming beams. In particular, there is no provision for $\mathrm{e}\mathrm{p}$ collisions or for incoming photon beams, neither on their own nor as flux around an electron. The list of processes currently implemented is summarised further down; it corresponds to most of the ones in \textsc{Pythia}~6, with the exception of the Supersymmetry and Technicolor sectors, which are yet to come. The cross-section expressions should be identical, but default scale choices have been changed, so that cross sections may be somewhat different for that reason. The default parton distribution remains CTEQ 5L, but ones found in the \textsc{LhaPdf} library \cite{lhapdf} can easily be linked. It is now possible to use separate PDF sets for the hard interaction, on one hand, and for the subsequent showers and multiple interactions, on the other. \subsection{Parton showers} The initial- and final-state algorithms are based on the new $p_{\perp}$-ordered evolution introduced in \textsc{Pythia}~6.3 \cite{ptshowers}, while the older mass-ordered ones have not been implemented. It is now additionally possible to have a branching of a photon to a fermion pair as part of the final-state evolution. Already in \textsc{Pythia}~6.3 the initial-state evolution and the multiple interactions were interleaved into one common decreasing $p_{\perp}$ sequence. Now also the final-state evolution is interleaved with the other two. In this context, some of that final-state radiation gets to be associated with dipoles stretched between a final-state parton and the ``hole'' left by an initial-state one, which therefore now can take a recoil. The initial-state-radiation algorithm remains unchanged in this respect, with recoils always taken by the hard scattering subsystem as a whole. \subsection{Multiple interactions and beam remnants} The multiple-interactions machinery as such contains the full functionality introduced in \textsc{Pythia}~6.3 \cite{newremnants}. Rescaled parton densities are defined after the first interaction, that take into account the nature of the previous partons extracted. Currently there is only one scenario for colour-reconnection in the final state, in which there is a certain probability for the partons of two subscatterings to have their colours interarranged in a way that reduces the total string length. (This is intermediate in character between the original strategy \cite{zijl} and the more recent ones \cite{wicke}.) The description of beam remnants is based on the new framework. In addition to the standard QCD $2 \to 2$ processes, the possibility of multiple interactions producing prompt photons, charmonia and bottomonia, low-mass Drell-Yan pairs, and $t$-channel $\gamma^*/\mathrm{Z}^0/\mathrm{W}^{\pm}$ exchange is now also included. For dedicated studies of two low-rate processes in coincidence, two hard interactions can now be set in the same event. There are no Sudakov factors included for these two interactions, similarly to normal events with one hard interaction. \subsection{Hadronisation} Hadronisation is based solely on the Lund string fragmentation framework \cite{lundreview}; older alternative descriptions have been left out. Particle data have been updated in agreement with the 2006 PDG tables \cite{pdg}. This also includes a changed content of the scalar meson multiplet. Some updated charm and bottom decay tables have been obtained from the DELPHI and LHCb collaborations. The BE$_{32}$ model for Bose--Einstein effects \cite{boseeinstein} has been implemented, but is not on by default. \subsection{Other program components} Standardised procedures have been introduced to link the program to various external programs for specific tasks, see section \ref{sec:external}. Finally, some of the old jet finders and other analysis routines are made available. Also included is a utility to generate, display and save simple one-dimensional histograms. \section{Program Structure \label{sec:structure}} \subsection{Program flow} The physics topics that have to come together in a complete event generator can crudely be subdivided into three stages: \begin{enumerate} \item The generation of a ``process'' that decides the nature of the event. Often it would be a ``hard process'', such as $\mathrm{g} \mathrm{g} \to \mathrm{h}^0 \to \mathrm{Z}^0 \mathrm{Z}^0 \to \mu^+ \mu^- \mathrm{q} \overline{\mathrm{q}}$, that is calculated in perturbation theory, but a priori we impose no requirement that a hard scale must be involved. Only a very small set of partons/particles is defined at this level, so only the main aspects of the event structure are covered. \item The generation of all subsequent activity on the partonic level, involving initial- and final-state radiation, multiple parton--parton interactions and the structure of beam remnants. Much of the phenomena are under an (approximate) perturbative control, but nonperturbative physics aspects are also important. At the end of this step, a realistic partonic structure has been obtained, e.g. with broadened jets and an underlying-event activity. \item The hadronisation of this parton configuration, by string fragmentation, followed by the decays of unstable particles. This part is almost completely nonperturbative, and so requires extensive modelling and tuning or, especially for decays, parametrisations of existing data. It is only at the end of this step that realistic events are available, as they could be observed by a detector. \end{enumerate} This division of tasks is not watertight --- parton distributions span and connect the two first steps, to give one example --- but it still helps to focus the discussion. \begin{figure}[t] \begin{picture}(430,370)(-215,10) \GBox(-215,350)(215,380){0.9} \Text(0,365)[]{The User ($\approx$ Main Program)} \GBox(-215,300)(215,330){0.9} \Text(0,315)[]{\texttt{Pythia}} \GBox(-215,250)(-170,280){0.9} \Text(-192.5,265)[]{\texttt{Info}} \GBox(-130,250)(-20,280){0.9} \Text(-75,265)[]{\texttt{Event~~process}} \GBox(20,250)(215,280){0.9} \Text(105,265)[]{\texttt{Event~~event}} \GBox(-215,110)(-85,230){0.9}\Line(-215,200)(-85,200) \Text(-150,215)[]{\texttt{ProcessLevel}} \Text(-150,185)[]{\texttt{ProcessContainer}} \Text(-150,165)[]{\texttt{PhaseSpace}} \Text(-150,145)[]{\texttt{LHAinit, LHAevnt}} \Text(-150,125)[]{\texttt{ResonanceDecays}} \GBox(-65,110)(65,230){0.9}\Line(-65,200)(65,200) \Text(0,215)[]{\texttt{PartonLevel}} \Text(0,185)[]{\texttt{TimeShower}} \Text(0,165)[]{\texttt{SpaceShower}} \Text(0,145)[]{\texttt{MultipleInteractions}} \Text(0,125)[]{\texttt{BeamRemnants}} \GBox(85,110)(215,230){0.9}\Line(85,200)(215,200) \Text(150,215)[]{\texttt{HadronLevel}} \Text(150,185)[]{\texttt{StringFragmentation}} \Text(150,165)[]{\texttt{MiniStringFrag\ldots}} \Text(150,145)[]{\texttt{ParticleDecays}} \Text(150,125)[]{\texttt{BoseEinstein}} \GBox(-130,60)(-20,90){0.9} \Text(-75,75)[]{\texttt{BeamParticle}} \GBox(20,60)(200,90){0.9} \Text(110,75)[]{\texttt{SigmaProcess, SigmaTotal}} \GBox(-215,10)(215,40){0.9} \Text(0,25)[]{\texttt{Vec4, Rndm, Hist, Settings, % ParticleDataTable, ResonanceWidths, \ldots}} \SetWidth{2} \LongArrow(0,350)(0,332) \LongArrow(-150,300)(-150,232) \LongArrow(0,300)(0,232) \Line(150,300)(150,280) \DashLine(150,280)(150,250){4} \LongArrow(150,250)(150,232) \SetWidth{1} \LongArrow(-192.5,230)(-192.5,248) \Line(-192.5,280)(-192.5,300) \DashLine(-192.5,300)(-192.5,330){4} \LongArrow(-192.5,330)(-192.5,348) \LongArrow(-107.5,230)(-107.5,248) \LongArrow(-42.5,250)(-42.5,232) \Line(-42.5,280)(-42.5,300) \DashLine(-42.5,300)(-42.5,330){4} \LongArrow(-42.5,330)(-42.5,348) \LongArrow(42.5,230)(42.5,248) \LongArrow(107.5,250)(107.5,232) \LongArrow(182.5,230)(182.5,248) \Line(182.5,280)(182.5,300) \DashLine(182.5,300)(182.5,330){4} \LongArrow(182.5,330)(182.5,348) \LongArrow(-107.5,100)(-107.5,108) \LongArrow(-107.5,100)(-107.5,92) \LongArrow(-42.5,100)(-42.5,108) \LongArrow(-42.5,100)(-42.5,92) \LongArrow(42.5,100)(42.5,108) \LongArrow(42.5,100)(42.5,92) \LongArrow(-160,50)(-160,108) \Line(-160,50)(0,50) \LongArrow(0,50)(19,59) \end{picture} \caption{The relationship between the main classes in \textsc{Pythia}~8. The thick arrows show the flow of commands to carry out different physics tasks, whereas the thinner show the flow of information between the tasks. The bottom box contains common utilities that may be used anywhere. Obviously the picture is strongly simplified. \label{fig:generatorstructure}} \hrulefill \end{figure} The structure of the \textsc{Pythia}~8 generator, as illustrated in Fig.~\ref{fig:generatorstructure}, is based on this subdivision. The main class for all user interaction is called \texttt{Pythia}. It calls on the three classes \texttt{ProcessLevel}, \texttt{PartonLevel} and \texttt{HadronLevel}, corresponding to points 1, 2 and 3 above. Each of these, in their turn, call on further classes that perform the separate kinds of physics tasks. Information is flowing between the different program elements in various ways, the most important being the event record, represented by the \texttt{Event} class. Actually, there are two objects of this class, one called \texttt{process}, that only covers the few partons of the ``hard'' process of point 1 above (i.e., containing information corresponding to what might be termed the ``matrix element'' level), and another called \texttt{event}, that covers the full story from the incoming beams to the final hadrons. A small \texttt{Info} class keeps track of useful one-of-a-kind information, such as kinematical variables of the hard process. There are also two incoming \texttt{BeamParticle}s, that keep track of the partonic content left in the beams after a number of interactions and initial-state radiations, and rescales parton distributions accordingly. The process library, as well as parametrisations of total, elastic and diffractive cross sections, are used both by the hard-process selection machinery and the multiple-interactions one. The \texttt{Settings} database keeps track of all integer, double, boolean and string variables that can be changed by the user to steer the performance of \textsc{Pythia}, except that \texttt{ParticleDataTable} is its own separate database. Finally, a number of utilities can be used just about anywhere, for Lorentz four-vectors, random numbers, jet finding and simple histograms, and for a number of other ``minor'' tasks. Orthogonally to the subdivision above, there is another, more technical classification, whereby the user interaction with the generator occurs in three phases: \begin{itemize} \item Initialisation, where the tasks to be performed are specified. \item Generation of individual events (the ``event loop''). \item Finishing, where final statistics is made available. \end{itemize} Again the subdivision (and orthogonality) is not strict, with many utilities and tasks stretching across the borders, and with no finishing step required for many aspects. Nevertheless, as a rule, these three phases are represented by different methods inside the class of a specific physics task. \subsection{Program files and documentation} The code is subdivided into a set of files, mainly by physics task. Each file typically contains one main class, but often with a few related helper classes that are not used elsewhere in the program. Normally the files come in pairs. \begin{enumerate} \item A header file, \texttt{.h} in the \texttt{include} subdirectory, where the public interface of the class is declared, and inline methods are defined. \item A source code file, \texttt{.cc} in the \texttt{src} subdirectory, where the lengthier methods are implemented. \end{enumerate} During compilation, related dependency files, \texttt{.d}, and compiled code, \texttt{.o} are created in the \texttt{tmp} subdirectory. In part the \texttt{.xml} documentation files in the \texttt{xmldoc} subdirectory have matching names, but the match is broken by the desire to group topics more by user interaction than internal operation. These files contain information on all settings and particle data, but not in a convenient-to-read format. Instead they are translated into a corresponding set of \texttt{.html} files in the \texttt{htmldoc} subdirectory and a set of \texttt{.php} files in \texttt{phpdoc}. The former set can easily be read if you open the \texttt{htmldoc/Welcome.html} file in your favourite web browser, but offers no interactivity. The latter set must be installed under a webserver (like a homepage) to function properly, and then provides a simple Graphical User Interface if you open the \texttt{phpdoc/Welcome.php} file in a web browser. For output to the \textsc{HepMC} event record format \cite{hepmc}, an interface is provided in the \texttt{hepmcinterface} subdirectory. There are also interfaces to allow parton distribution functions to be used from the \textsc{LhaPdf} library \cite{lhapdf} and hard processes from external programs. The installation procedure is described in a \texttt{README} file; it involves running a \texttt{configure} script, whereafter an ordinary \texttt{Makefile} is used. The former should be invoked with command-line arguments (or be edited) to provide the path to the \textsc{HepMC} library if this is going to be used. Compiled libraries are put in the \texttt{lib} subdirectory. Default is to build archive libraries, but optionally also shared-object ones can be built. The standard setup is intended for Linux systems, but a simplified alternative is provided for Windows users. Finally, some examples of main programs, along with input files, or ``cards'', for them, are found in the \texttt{examples} subdirectory. This directory contains its own \texttt{configure} script and \texttt{Makefile} which will allow you to build executables, see the \texttt{examples/README} file. As above, command-line arguments or brute-force editing allows you to set the \textsc{LhaPdf} and \textsc{Pythia}~6.4 paths, if so required. The executables are placed in the \texttt{bin} directory, but with links from \texttt{examples}. \subsection{Important warnings} Playing with the files in the \texttt{examples} subdirectory is encouraged, to familiarise oneself with the program. Modifying the \texttt{configure} files may be required during installation. For the rest, files should not be modified, at least not without careful consideration of consequences. In particular, the \texttt{.xml} files are set read-only, and should not be tampered with. Interspersed in them, there are lines beginning with \texttt{<flag}, \texttt{<mode}, \texttt{<parm}, \texttt{<word}, \texttt{<particle}, \texttt{<channel}, or \texttt{<a}. They contain instructions from which \texttt{Settings} and \texttt{ParticleDataTable} build up their respective databases of user-accessible variables, see further below. Any stupid changes here will cause difficult-to-track errors! Further, sometimes you will see two question marks, ``??'', in the text or code. This is for internal usage, to indicate loose ends or preliminary thoughts. Please disregard. \section{Main Program and Event Information \label{sec:main}} \subsection{The \texttt{Pythia} class} The \texttt{Pythia} class is the main means of communication between the user and the event-generation process. We here present the key methods for the user to call, ordered by context. Firstly, at the top of the main program, the proper header file must be included:\\ \cindent \texttt{\#include "Pythia8/Pythia.h"}\\ To simplify typing, it also makes sense to declare\\ \cindent \texttt{using namespace Pythia8;}\\ Given this, the first step in the main program is to create a generator object, e.g. with\\ \cindent \texttt{Pythia pythia;}\\ In the following we will assume that the \texttt{pythia} object has been created with this name, but of course you are free to pick another one. When this object is declared, the \texttt{Pythia} constructor initialises all the default values for the \texttt{Settings} and the \texttt{ParticleDataTable} data bases. These data are now present in memory and can be modified in a number of ways before the generator is initialised (see below). Most conveniently, \textsc{Pythia}'s settings and parameters can be changed by the two methods\\ \cindent \texttt{pythia.readString(string);}\\ for changing a single variable, and\\ \cindent \texttt{pythia.readFile(fileName);}\\ for changing a set of variables, one per line in the input file. The allowed form for a string/line will be explained as we consider the databases in the next section. Further, methods will be introduced to list all or only the changed settings and particle data. At this stage you can also optionally hook up with some external facilities, see section \ref{sec:external}. After this, in the initialisation call all remaining details of the generation are to be specified. The \texttt{pythia.init(...)} method allows a few different input formats, so you can pick the one convenient for you:\\ \cindent \texttt{pythia.init(idA, idB, eA, eB);}\\ lets you specify the identities and energies of the two incoming beam particles, with A (B) assumed moving in the $+z$ ($-z$) direction;\\ \cindent \texttt{pythia.init(idA, idB, eCM);}\\ is similar, but you specify the CM energy, and you are assumed in the rest frame;\\ \cindent \texttt{pythia.init(LHAinit*, LHAevnt*);}\\ assumes LHA initialisation information is available in an \texttt{LHAinit} class object, and that LHA event information will be provided by the \texttt{LHAevnt} class object, see below;\\ \cindent \texttt{pythia.init(fileName);}\\ assumes that the file obeys the LHEF standard format and that information can be extracted from it accordingly; and finally\\ \cindent \texttt{pythia.init();}\\ will take its values from the beam specification stored in the \texttt{Settings} database. It is when the \texttt{init(...)} call is executed that all the settings values are propagated to the various program elements, and used to precalculate quantities that will be used at later stages of the generation. Further settings changed after the \texttt{init(...)} call will be ignored (unless methods are used to force a partial or complete re-initialisation). By contrast, the particle properties database is queried all the time, and so a later change would take effect immediately, for better or worse. The bulk of the code is concerned with the event generation proper. However, all the information on how this should be done has already been specified. Therefore only a command\\ \cindent \texttt{pythia.next();}\\ is required to generate the next event. This method would be located inside an event loop, where a required number of events are to be generated. The key output of the \texttt{pythia.next()} command is the event record found in \texttt{pythia.event}, see below. A process-level summary of the event is stored in \texttt{pythia.process}. When problems are encountered, in \texttt{init(...)} or \texttt{next()}, they can be assigned one of three degrees of severity. Abort is the highest. In that case the call could not complete its tasks, and returns the value \texttt{false}. If this happens in \texttt{init(...)} it is then not possible to generate any events at all. If it happens in \texttt{next()} only the current event must be skipped. In a few cases the abort may be predictable and desirable, e.g.\ when a file of LHA events comes to an end. Errors are less severe, and the program can usually work around them, e.g.\ by backing up one step and trying again. Should that not succeed, an abort may result. Warnings are of informative character only, and do not require any corrective actions (except, in the longer term, to find more reliable algorithms). At the end of the generation process, you can call\\ \cindent \texttt{pythia.statistics();}\\ to get some run statistics, both on cross sections for the subprocesses generated and on the number of aborts, errors and warnings issued. \subsection{The event record} The \texttt{Event} class for event records is not much more than a wrapper for a vector of \texttt{Particle}s. This vector can expand to fit the event size. The index operator is overloaded, so that \texttt{event[i]} corresponds to the \texttt{i}'th particle of an \texttt{Event} object called \texttt{event}. For instance, given that the PDG identity code \cite{pdg} of a particle is provided by the \texttt{id()} method, \texttt{event[i].id()} returns the identity of the \texttt{i}'th particle. Line 0 is used to represent the event as a whole, with its total four-momentum and invariant mass, but does not form part of the event history, and only contains redundant information. When you translate to another event-record format where the first particle is assigned index 1, such as \textsc{HepMC}, this line should therefore be dropped so as to keep the rest of the indices synchronised. It is only with lines 1 and 2, which contain the two incoming beams, that the history tracing begins. That way unassigned mother and daughter indices can be put 0 without ambiguity. In this section, first the \texttt{Particle} methods are surveyed, and then the further aspects of the event record. \subsubsection{The particle} A \texttt{Particle} corresponds to one entry/slot/line in the event record. Its properties therefore mix ones belonging to a particle-as-such, like its identity code or four-momentum, and ones related to the event-as-a-whole, like which mother it has. The following properties are stored for each particle, listed by the member functions you can use to extract the information: \begin{itemize} \item \texttt{id()} : the identity of a particle, according to the PDG particle codes. \item \texttt{status()} : status code. The full set of codes provides information on where and why a given particle was produced. The key feature is that a particle is assigned a positive status code when it is created, which then is negated if later it branches into other particles. The mechanism of this branching can be inferred from the status code of the daughters. Thus, at any given stage of the event-generation process, the current final state consists of the particles with positive status code. \item \texttt{mother1(), mother2()} : the indices in the event record where the first and last mothers are stored, if any. A few different cases are possible, to allow for one or many mothers. The \texttt{motherList(i)} method (see below) can return a vector with all the mother indices, based on this info. \item \texttt{daughter1(), daughter2()} : the indices in the event record where the first and last daughters are stored, if any. A few different cases are possible, to allow for one or many daughters. The \texttt{daughterList(i)} method (see below) can return a vector with all the daughter indices, based on this info. \item \texttt{col(), acol()} : the colour and anticolour tags, LHA style. \item \texttt{px(), py(), pz(), e()} : the particle four-momentum components (in GeV, with $c = 1$), alternatively extracted as a \texttt{Vec4 p()}. \item \texttt{m()} : the particle mass (in GeV). \item \texttt{scale()} : the scale at which a parton was produced (in GeV); model-specific but relevant in the processing of an event. \item \texttt{xProd(), yProd(), zProd(), tProd()} : the production vertex coordinates (in mm or mm/$c$), alternatively extracted as a \texttt{Vec4 vProd()}. \item \texttt{tau()} : the proper lifetime (in mm/$c$). \end{itemize} The same method names, with a value inserted between the brackets, set these quantities. In addition, a number of derived quantities can easily be obtained, but cannot be set, such as: \begin{itemize} \item \texttt{isFinal()} : \texttt{true} for a remaining particle, i.e. one with positive status code, else \texttt{false}. \item \texttt{pT(), pT2()} : (squared) transverse momentum. \item \texttt{mT(), mT2()} : (squared) transverse mass. \item \texttt{pAbs(), pAbs2()} : (squared) three-momentum magnitude. \item \texttt{theta(), phi()} : polar and azimuthal angle (in radians). \item \texttt{y(), eta()} : rapidity and pseudorapidity. \item \texttt{xDec(), yDec(), zDec(), tDec()} : the decay vertex coordinates, assuming free-streaming propagation, alternatively extracted as a \texttt{Vec4 vDec()}. \end{itemize} Each \texttt{Particle} contains a pointer to the respective \texttt{ParticleDataEntry} object in the particle data tables. This pointer gives access to properties of the particle species as such. It is there mainly for convenience, and should be thrown if an event is written to disk, to avoid any problems of object persistency. This pointer is used by member functions such as: \begin{itemize} \item \texttt{name()} : the name of the particle, as a string. \item \texttt{spinType()} : $2 s + 1$, or 0 where undefined spin. \item \texttt{charge(), chargeType()} : charge, and three times it to make an integer. \item \texttt{isCharged(), isNeutral()} : \texttt{bool}s for charged or not. \item \texttt{colType()} : 0 for colour singlets, 1 for triplets, $-1$ for antitriplets and 2 for octets. \item \texttt{m0()} : the nominal mass of the particle species. \end{itemize} \subsubsection{Other methods in the event record} While the \texttt{Particle} vector is the key component of an \texttt{Event}, a few further methods are available. The event size can be found with \texttt{size()}, i.e. valid particles are stored in the range $0 \leq $\texttt{i}$ <$ \texttt{event.size()}. A listing of the whole event is obtained with \texttt{list()}. The basic identity, status, mother, daughter, colour, four-momentum and mass data are always given, but optional arguments can be set to provide further information, on the complete lists of mothers and daughters, and on production vertices. The user would normally be concerned with the \texttt{Event} object that is a public member \texttt{event} of the \texttt{Pythia} class. Thus \texttt{pythia.event[i].id()} would be used to return the identity of the \texttt{i}'th particle, and \texttt{pythia.event.size()} to give the size of the event record. A \texttt{Pythia} object contains a second event record for the hard process alone, similar to the LHA process specification, called \texttt{process}. This record is used as input for the generation of the complete event. Thus one may e.g. call either \texttt{pythia.process.list()} or \texttt{pythia.event.list()}. To distinguish those two rapidly at visual inspection, the ``Pythia Event Listing'' header is printed out differently, adding either ``(hard process)'' or ``(complete event)''. There are also a few methods with an individual particle index \texttt{i} as input, but requiring some search operations in the event record, and therefore not possible to define as methods of the \texttt{Particle} class. The most important ones are \texttt{motherList(i)}, \texttt{daughterList(i)} and \texttt{sisterList(i)}. These return a \texttt{vector<int>} containing a list of all the mothers, daughters or sisters of a particle. This list may be empty or arbitrarily large, and is given in ascending order. One data member in an Event object is used to keep track of the largest \texttt{col()} or \texttt{acol()} tag set so far, so that new ones do not clash. The event record also contains two further sets of vectors. These are intended for the expert user only, so only a few words on each. The first is a vector of junctions, i.e.\ vertices where three string pieces meet. This list is often empty or else contains only a very few per event. The second is a storage area for parton indices, classified by subsystem. Such information is needed to interleave multiple interactions, initial-state showers, final-state showers and beam remnants. It can also be used in the hadronisation. \subsection{Other event information} A set of one-of-a-kind pieces of event information is stored in the \texttt{info} object (an instance of the class \texttt{Info}) in the \texttt{Pythia} class. This is mainly intended for processes generated internally, but some of the information is also available for external processes. You can use \texttt{pythia.info.method()} to extract e.g.\ the following information: \begin{itemize} \item \texttt{list()} : list some information on the current event. \item \texttt{eCM(), s()} : the cm energy and its square. \item \texttt{name(), code()} : the name and code of the subprocess. \item \texttt{id1(), id2()} : the identities of the two partons coming in to the hard subprocess. \item \texttt{x1(), x2()} : $x$ fractions of the two partons coming in to the hard subprocess. \item \texttt{pdf1(), pdf2(), QFac(), Q2Fac()} : parton densities $x \, f_i(x,Q^2 )$ evaluated for the two incoming partons, and the associated factorisation scale $Q$ and its square. \item \texttt{mHat(), sHat(), tHat(), uHat()} : the invariant mass of the hard subprocess and the Mandelstam variables for $2 \to 2$ processes. \item \texttt{pTHat(), thetaHat()} : transverse momentum and polar scattering angle of the hard subprocess for $2 \to 2$ processes. \item \texttt{alphaS(), alphaEM(), QRen(), Q2Ren()} : $\alpha_{\mathrm{s}}$ and $\alpha_{\mathrm{em}}$ values for the hard process, and the associated renormalisation scale $Q$ and its square. \item \texttt{nTried(), nAccepted(), sigmaGen(), sigmaErr()} : the number of trial and accepted events, and the resulting estimated cross section and estimated statistical error, in units of mb, summed over the included processes. \end{itemize} In other classes there are also methods that can be called to do a sphericity or thrust analysis or search for jets with a clustering or simple cone jet finder. These take the event record as input. \section{Databases \label{sec:databases}} Inevitably one wants to be able to modify the default behaviour of a generator. Currently there are two \textsc{Pythia}~8 databases with modifiable values. One deals with general settings, the other specifically with particle data. The key method to set a new value is\\ \cindent \texttt{pythia.readString(string);}\\ The typical form of a string is\\ \cindent \texttt{"variable = value"}\\ where the equal sign is optional and the variable begins with a letter for settings and a digit for particle data. A string not beginning with either is considered as a comment and ignored. Therefore inserting an initial !, \#, \$, \%, or another such character, is a good way to comment out a command. For non-commented strings, the match of the name to the database is case-insensitive. Strings that do begin with a letter or digit and still are not recognised cause a warning to be issued, unless a second argument \texttt{false} is used in the call. Any further text after the value is ignored, so the rest of the string can be used for any comments. For variables with an allowed range, values below the minimum or above the maximum are set at the respective border. For \texttt{bool} values, the following notation may be used interchangeably: \texttt{true} = \texttt{on} = \texttt{yes} = \texttt{ok} = 1. Everything else gives \texttt{false} (including but not limited to \texttt{false}, \texttt{off}, \texttt{no} and \texttt{0}). The \texttt{readString(...)} method is convenient for changing one or two settings, but becomes cumbersome for more extensive modifications. In addition, a recompilation and relinking of the main program is necessary for any change of values. Alternatively, the changes can therefore be collected in a file, for historical reasons often called a ``card file'', where each line is a character string defined in the same manner as above (without quotation marks). The whole file can then be read and processed with a command\\ \cindent \texttt{pythia.readFile(fileName);}\\ As above, comments can be freely interspersed. \subsection{Settings} We distinguish four kinds of user-modifiable variables, by the way they have to be stored: \begin{enumerate} \item A \texttt{Flag} is an on/off switch, and is stored as a \texttt{bool}. \item A \texttt{Mode} corresponds to an enumeration of separate options, and is stored as an \texttt{int}. \item A \texttt{Parm} --- short for parameter --- takes a continuum of values, and is stored as a \texttt{double}. \item A \texttt{Word} is a text string (with no embedded blanks) and is stored as as a \texttt{string}. \end{enumerate} Collectively the four above kinds of variables are called settings. Not surprisingly, the class that stores them is called \texttt{Settings}. Each variable stored in \texttt{Settings} is associated with a few pieces of information. These are: \begin{itemize} \item The variable name, of the form \texttt{class:name} (or \texttt{file:name}, or \texttt{task:name}, usually these agree), e.g. \texttt{TimeShower:pTmin}. \item The default value, set in the original declaration, and intended to represent a reasonable choice. This value is not user modifiable. \item The current value. During construction of the \texttt{Settings} object, this value is set equal to the default value. It can subsequently be modified, e.g.\ by the \texttt{pythia.readString()} or \texttt{pythia.readFile()} methods discussed above. During the \texttt{pythia.init()} initialisation this value will be stored as a local copy in the class(es) where it is used, and thereby also control the subsequent generation. \item An allowed range of values, represented by meaningful minimum and maximum values. This has no sense for a flag or a word, is usually rather well-defined for a mode, but less so for a parameter. Either of the minimum and maximum may be left free, giving an open-ended range. Often the allowed range exaggerates the uncertainty in our current knowledge, so as not to restrict too much what the user can do. All the same, this information should not be modified by the user. \end{itemize} Technically, the \texttt{Settings} class is implemented with the help of four separate maps, one for each kind of variable, with the name used as key. The default values are taken from the \texttt{.xml} files in the \texttt{xmldoc} subdirectory. The \texttt{Settings} class is purely static, i.e.\ exists only as one global copy, that you can interact with directly by \texttt{Settings::command(argument)}. However, a \texttt{settings} object is a public member of the \texttt{Pythia} class, so an alternative notation would be \texttt{pythia.settings.command(argument)}. As already mentioned, for input the \texttt{pythia.readString(...)} method is to be preferred, since it also can handle particle data. A typical example would be\\ \cindent\texttt{pythia.readString("TimeShower:pTmin = 1.0");} You may obtain a listing of all variables in the database by calling\\ \cindent \texttt{pythia.settings.listAll();}\\ The listing is strictly alphabetical, which at least means that names in the same area are kept together, but otherwise may not be so well-structured: important and unimportant ones will appear mixed. A useful alternative is\\ \cindent \texttt{pythia.settings.listChanged();}\\ which will only print out a list of those variables that differ from their defaults. \subsection{Processes} All internal processes available in \textsc{Pythia}~8 can be switched on and off via the ordinary settings machinery just discussed, using flags of the generic type \texttt{ProcessGroup:ProcessName}. A complete list of processes currently implemented is given in Table~\ref{processes}. By default all processes are off. A whole group can be turned on by a \texttt{ProcessGroup:all = on} command, then overriding the individual flags. \begin{table} \caption{Currently implemented processes, complete with respect to groups, but with some individual processes missing for lack of space (represented by ``...''). In the names, a ``2'' separates initial and final state, an ``(s:X)'', ``(t:X)'' or ``(l:X)'' occasionally appends info on an $s$- or $t$-channel- or loop-exchanged particle $X$. \protect\label{processes}} \vspace{2mm} \texttt{ \begin{tabular}{|l|l|@{\protect\rule[-1mm]{0mm}{6mm}}} \hline ProcessGroup & ProcessName\\ \hline SoftQCD & minBias,elastic, singleDiffractive,\\ & doubleDiffractive\\ \hline HardQCD & gg2gg, gg2qqbar, qg2qg, qq2qq, qqbar2gg,\\ & qqbar2qqbarNew, gg2ccbar, qqbar2ccbar,\\ & gg2bbbar, qqbar2bbbar\\ \hline PromptPhoton & qg2qgamma, qqbar2ggamma, gg2ggamma,\\ & ffbar2gammagamma, gg2gammagamma\\ \hline WeakBosonExchange & ff2ff(t:gmZ), ff2ff(t:W)\\ \hline WeakSingleBoson & ffbar2gmZ, ffbar2W, ffbar2ffbar(s:gm)\\ \hline WeakDoubleBoson & ffbar2gmZgmZ, ffbar2ZW, ffbar2WW\\ \hline WeakBosonAndParton & qqbar2gmZg, qg2gmZq, ffbar2gmZgm, fgm2gmZf\\ & qqbar2Wg, qg2Wq, ffbar2Wgm, fgm2Wf\\ \hline Charmonium & gg2QQbar[3S1(1)]g, qg2QQbar[3PJ(8)]q, \ldots\\ \hline Bottomonium & gg2QQbar[3S1(1)]g, gg2QQbar[3P2(1)]g, \ldots\\ \hline Top & gg2ttbar, qqbar2ttbar, qq2tq(t:W), \\ & ffbar2ttbar(s:gmZ), ffbar2tqbar(s:W) \\ \hline \multicolumn{2}{|l|@{\protect\rule[-1mm]{0mm}{6mm}}}% {FourthBottom, FourthTop, FourthPair \textrm{(fourth generation)}} \\ \hline HiggsSM & ffbar2H, gg2H, ffbar2HZ, ff2Hff(t:WW), \ldots\\ \hline HiggsBSM & \textrm{h, H and A as above, charged Higgs, pairs}\\ \hline SUSY & qqbar2chi0chi0 \textrm{(not yet completed)}\\ \hline NewGaugeBoson & ffbar2gmZZprime, ffbar2Wprime, ffbar2R0\\ \hline LeftRightSymmmetry & ffbar2ZR, ffbar2WR, ffbar2HLHL, \ldots\\ \hline LeptoQuark & ql2LQ, qg2LQl, gg2LQLQbar, qqbar2LQLQbar\\ \hline ExcitedFermion & dg2dStar, qq2uStarq, qqbar2muStarmu, \ldots\\ \hline ExtraDimensionsG* & gg2G*, qqbar2G*, \ldots\\ \hline \end{tabular} } \end{table} Note that processes in the \texttt{SoftQCD} group are of a kind that cannot be input via the LHA, while essentially all other kinds could. Each process is assigned an integer code. This code is not used in the internal administration of events; it is only intended to allow a simpler user separation of different processes. Also the process name is available, as a string. For many processes it makes sense to apply phase space cuts. The ones currently available (in the \texttt{Settings} database) in particular include \begin{itemize} \item \texttt{PhaseSpace:mHatMin, PhaseSpace:mHatMax} : the range of invariant masses of the scattering process. \item \texttt{PhaseSpace:pTHatMin, PhaseSpace:pTHatMax} : the range of transverse momenta in the rest frame of the process for $2 \to 2$ and $2 \to 3$ processes (for each of the products). \end{itemize} In addition, for any resonance with a Breit-Wigner mass distribution, the allowed mass range of that particle species is taken into account, both for $2 \to 1$, $2 \to 2$ and $2 \to 3$ processes, thereby providing a further cut possibility. Note that the \texttt{SoftQCD} processes do not use any cuts but generate their respective cross sections in full. \subsection{Particle data} The following particle properties are stored in the \texttt{ParticleDataTable} class for a given PDG particle identity code \texttt{id}, here presented by the method used to access this property: \begin{itemize} \item \texttt{name(id)} : particle and antiparticle names are stored separately, the sign of \texttt{id} determines which of the two is returned, with ``void'' used to indicate the absence of an antiparticle. \item \texttt{hasAnti(id)} : \texttt{bool} whether a distinct antiparticle exists or not. \item \texttt{spinType(id)} : $2 s + 1$ for particles with defined spin, else 0. \item \texttt{chargeType(id)} : three times the charge (to make it an integer); can also be read as a \texttt{double charge(id) = chargeType(id)/3}. \item \texttt{colType(id)} : the colour type, with 0 uncoloured, 1 triplet, $-1$ antitriplet and 2 octet. \item \texttt{m0(id)} : the nominal mass $m_0$ (in GeV). \item \texttt{mWidth(id)} : the width $\Gamma$ of the Breit-Wigner mass distribution (in GeV). \item \texttt{mMin(id), mMax(id)} : the allowed mass range generated by the Breit-Wigner, $m_{\mathrm{min}} < m < m_{\mathrm{max}}$ (in GeV). \item \texttt{tau0(id)} : the nominal proper lifetime $\tau_0$ (in mm/$c$). \item \texttt{constituentMass(id)} : the constituent mass for a quark, hardcoded as $m_{\mathrm{u}} = m_{\mathrm{d}} = 0.325$, $m_{\mathrm{s}} = 0.50$, $m_{\mathrm{c}} = 1.60$ and $m_{\mathrm{b}} = 5.0$ GeV, for a diquark the sum of quark constituent masses, and for everything else the same as the ordinary mass. \item \texttt{mRun(id, massScale)} : the running mass for quarks, else the same as the nominal mass. \item \texttt{mayDecay(id)} : a flag telling whether a particle species may decay or not, offering the main user switch (whether a given particle of this kind then actually will decay also depends on other flags in the \texttt{ParticleDecays} class). \end{itemize} Similar methods can also be used to set most of these properties. Each particle kind in the \texttt{ParticleDataTable} also has a a vector of \texttt{DecayChannel}s associated with it. The following properties are stored for each decay channel: \begin{itemize} \item \texttt{onMode()} : whether a channel is on (1) or off (0), or on only for particles (2) or antiparticles (3). \item \texttt{bRatio()} : the branching ratio. \item \texttt{meMode()} : the mode of processing this channel, possibly with matrix-element information; 0 gives isotropic phase space. \item \texttt{multiplicity()} : the number of decay products in a channel, at most 8. \item \texttt{product(i)} : a list of the decay products, 8 products $0 \leq $\texttt{i}$ < 8$, with trailing unused ones set to 0. \end{itemize} The original particle data and decay table is read in from the \texttt{ParticleData.xml} file. The \texttt{ParticleDataTable} class is purely static, i.e. exists as one global copy, that you can interact directly with by \texttt{ParticleDataTable::command(argument)}. However, a \texttt{particleData} object of the \texttt{ParticleDataTable} class is a public member of the \texttt{Pythia} class, which offers an alternatively notation. As already mentioned, for input the \texttt{pythia.readString(string)} method is to be preferred, since it also can handle settings. It is only the form of the \texttt{string} that needs to be specified slightly differently than for settings, as\\ \cindent \texttt{id:property = value}.\\ The \texttt{id} part is the standard PDG particle code, i.e.\ a number, and \texttt{property} is one of the ones already described above, with a few minor differences: \texttt{name}, \texttt{antiName}, \texttt{spinType}, \texttt{chargeType}, \texttt{colType}, \texttt{m0}, \texttt{mWidth}, \texttt{mMin}, \texttt{mMax}, \texttt{tau0}, \texttt{mayDecay}, \texttt{isResonance}, \texttt{isVisible}, \texttt{doExternalDecay}, and \texttt{doForceWidth}. As before, several commands can be stored as separate lines in a file, say\\ \cindent \texttt{111:name = piZero ! normal notation pi0}\\ \cindent \texttt{3122:mayDecay = false ! Lambda0 stable}\\ \cindent \texttt{431:tau0 = 0.15 ! D\_s proper lifetime}\\ and then be read with \texttt{pythia.readFile(fileName)}. For major changes of the properties of a particle, the above one-at-a-time changes can become rather cumbersome. Therefore a few extended input formats are available, where a whole set of properties can be given after the equal sign, separated by blanks and/or by commas. One line like\\ \cindent \texttt{id:all = name antiName spinType chargeType % colType m0 mWidth mMin mMax tau0}\\ replaces all the current information on the particle itself, but keeps its decay channels, if any, while using \texttt{new} instead of \texttt{all} also removes any previous decay channels. (The flags \texttt{mayDecay}, \texttt{isResonance}, \texttt{isVisible}, \texttt{doExternalDecay}, and \texttt{doForceWidth} are in either case reset to their defaults and would have to be changed separately.) In order to change the decay data, the decay channel number needs to be given right after the particle number, i.e. the command form becomes\\ \cindent \texttt{id:channel:property = value}.\\ Recognised properties are \texttt{onMode}, \texttt{bRatio}, \texttt{meMode} and \texttt{products}, where the latter expects a list of all the decay products, separated by blanks, up until the end of the line, or until a non-number is encountered. The property \texttt{all} will replace all the information on the channel, i.e.\\ \cindent \texttt{id:channel:all = onMode bRatio meMode products }\\ To add a new channel at the end, use\\ \cindent \texttt{id:addChannel = onMode bRatio meMode products } \\ To remove all existing channels and force decays into one new channel, use\\ \cindent \texttt{id:oneChannel = onMode bRatio meMode products } \\ A first \texttt{oneChannel} command could be followed by several subsequent \texttt{addChannel} ones, to build up a completely new decay table for an existing particle.\\ It is currently not possible to remove a channel selectively, but setting its branching ratio vanishing is as effective. Often one may want to allow only a specific subset of decay channels for a particle. This can be achieved e.g.\ by a repeated use of \texttt{id:channel:onMode} commands, but there also is a set of commands that initiates a loop over all decay channels and allows a matching to be carried out. The \texttt{id:onMode} command can switch \texttt{on} or \texttt{off} all channels. The \texttt{id:onIfAny} and \texttt{id:offIfAny} will switch on/off all channels that contain any of the enumerated particles. For instance\\ \cindent \texttt{23:onMode = off}\\ \cindent \texttt{23:onIfAny = 1 2 3 4 5}\\ first switches off all $\mathrm{Z}^0$ decay modes and then switches back on any that contains one of the five lighter quarks. Other methods are \texttt{id:onIfAll} and \texttt{id:offIfAll}, and \texttt{id:onIfMatch} and \texttt{id:offIfMatch}, where all the enumerated products must be present for a decay channel to be switched on/off. The difference is that the former two allow further non-matched particles in a decay channel while the latter two do not. There are also further methods to switch on channels selectively either for the particle or for the antiparticle. When a particle is to be decayed, the branching ratios of the allowed channels are always rescaled to unit sum. There are also methods for by-hand rescaling of branching ratios. You may obtain a listing of all the particle data by calling\\ \cindent \texttt{pythia.particleData.listAll()}.\\ The listing is by increasing \texttt{id} number. To list only those particles that have been changed, instead use\\ \cindent \texttt{pythia.particleData.listChanged()}.\\ To list only one specific particle \texttt{id}, use \texttt{list(id)}. It is also possible to \texttt{list} a \texttt{vector<int>} of \texttt{id}'s. \section{Links to external programs \label{sec:external}} While \textsc{Pythia}~8 itself is self-contained and can be run without reference to any external library, often one does want to make use of other programs that are specialised on some aspect of the generation process. The HTML/PHP documentation accompanying the code contains full information on how the different links should be set up. Here the purpose is mainly to point out the possibilities that exist. \subsection{The Les Houches interface} The Les Houches Accord for user processes (LHA) \cite{lha} is the standard way to input parton-level information from a matrix-elements-based generator into \textsc{Pythia}. The conventions for which information should be stored has been defined in a Fortran context, as two commonblocks. Here a C++ equivalent is defined, as two separate classes. The \texttt{LHAinit} and \texttt{LHAevnt} classes are base classes, containing reading and printout methods, plus a pure virtual method \texttt{set()} each. Derived classes have to provide these two virtual methods to do the actual work. Currently the only examples are for reading information at runtime from the respective Fortran commonblock or for reading it from a Les Houches Event File (LHEF) \cite{lhef}. The \texttt{LHAinit} class stores information equivalent to the \texttt{/HEPRUP/} commonblock, as required to initialise the event-generation chain. The \texttt{LHAevnt} class stores information equivalent to the \texttt{/HEPEUP/} commonblock, as required to hand in the next parton-level configuration for complete event generation. The \texttt{LHAinitFortran} and \texttt{LHAevntFortran} are two derived classes, containing \texttt{set()} members that read the respective LHA Fortran commonblock for initialisation and event information. This can be used for a runtime link to a Fortran library. As an example, an interface is provided to the \textsc{Pythia}~6.4 process library. The \texttt{LHAinitLHEF} and \texttt{LHAevntLHEF} are two other derived classes, that can read a file with initialisation and event information, assuming that the file has been written in the LHEF format. You do not need to declare these classes yourself, since a shortcut is provided by the \texttt{pythia.init(fileName)} command. If you create \texttt{LHAinit} and \texttt{LHAevnt} objects yourself, pointers to those should be handed in with the \texttt{init(...)} call, then of the form \texttt{pythia.init(LHAinit*, LHAevnt*)}. \subsection{Semi-internal processes and resonances} When you implement new processes via the Les Houches Accord you do all flavour, colour and phase-space selection externally, before your process-level events are input for further processing by \textsc{Pythia}. However, it is also possible to implement a new process in exactly the same way as the internal \textsc{Pythia} ones, thus making use of the internal phase-space selection machinery to sample an externally provided cross-section expression. The matrix-element information has to be put in a new class that derives from one of the existing classes, \texttt{Sigma1Process} for $2 \to 1$ processes, \texttt{Sigma2Process} for $2 \to 2$ ones, and \texttt{Sigma3Process} for $2 \to 3$ ones, which in their turn derive from the \texttt{SigmaProcess} base class. Note that \texttt{Pythia} is rather good at handling the phase space of $2 \to 1$ and $2 \to 2$ processes, is more primitive for $2 \to 3$ ones and does not at all address higher multiplicities. This limits the set of processes that you can implement in this framework. The produced particles may be resonances, however, so it is possible to end up with bigger "final" multiplicities through sequential decays, and to include further matrix-element weighting in those decays. In your new class you have to implement a number of methods. Chief among them is one to return the matrix-element weight for an already specified kinematics configuration and another one to set up the final-state flavours and colour flow of the process. Further methods exist, some of more informative character, such as providing the name of the process. Should you actually go ahead, it is strongly recommended to shop around for a similar process that has already been implemented, and to use that existing code as a template. Once a class has been written, a pointer of type \texttt{SigmaProcess*} to a \texttt{new} instance of your class needs to be created in the main program, and handed in with the \texttt{pythia.setSigmaPtr(...)} method. {}From there on the process will be handled on equal footing with internally implemented processes. If your new process introduces a new particle you have to add it and its decay channels to the particle database, as already explained. This only allows for a fixed width and fixed branching ratios, however, with only some minor generalisations. To obtain a dynamical calculation, where the width and the branching ratios can vary as a function of the currently chosen mass, you must also create a new class for it that derives from the \texttt{ResonanceWidths} class. In it you have to implement a method that returns the partial width for each of the possible decay channels. The structure is simpler than for the \texttt{SigmaProcess} case, but again it may be convenient to use a similar existing resonance as a template. You then hand in a pointer to an instance of this new class with the \texttt{pythia.setResonancePtr(...)} method. \subsection{Parton distribution functions} The \texttt{PDF} class is the base class for all parton distribution function parametrisations, from which specific \texttt{PDF} classes are derived. Currently the selection of sets that comes with the program is very limited; for protons only CTEQ 5L (default) and GRV 94L are available. However, a built-in interface to the \textsc{LhaPdf} library \cite{lhapdf} allows a much broader selection, if only \textsc{LhaPdf} is linked together with \textsc{Pythia}. Should this not be enough, it is possible to write your own class derived from the \texttt{PDF} base class, wherein you implement the \texttt{xfUpdate(...)} member to do the actual updating of PDFs. Once you have created two distinct \texttt{PDF} objects, \texttt{pdfA} and \texttt{pdfB}, you should supply pointers to these as arguments in a \texttt{pythia.setPDFPtr(pdfA*, pdfB*)} call. A word of warning: to switch to a new PDF set implies that a complete retuning of the generator may be required, since the underlying-event activity from multiple interactions and parton showers is changed. There is an option that allows a replacement of the PDF for the hard process only, so that this is not required. Inconsistent but convenient. \subsection{External decay packages} While \texttt{Pythia} is set up to handle any particle decays, decay products are often (but not always) distributed isotropically in phase space, i.e.\ polarisation effects and nontrivial matrix elements usually are neglected in \textsc{Pythia}. Especially for the $\tau$ lepton and for some $\mathrm{B}$ mesons it is therefore common practice to rely on dedicated decay packages \cite{tauola, evtgen}. To this end, \texttt{DecayHandler} is a base class for the external handling of decays. The user-written derived class is called if a pointer to it has been given with the \texttt{pythia.setDecayPtr(DecayHandler*, vector<int>)} method. The second argument to this method should contain the \texttt{id} codes of all the particles that should be decayed by the external program. It is up to the author of the derived class to send different of these particles on to separate packages, if so desired. The \texttt{decay(...)} method in the user-written \texttt{DecayHandler} class should do the decay, or return \texttt{false} if it fails. In the latter case \texttt{Pythia} will try to do the decay itself. Thus one may implement some decay channels externally and leave the rest for \texttt{Pythia}, assuming the \texttt{Pythia} decay tables are adjusted accordingly. \subsection{User hooks} Sometimes it may be convenient to step in during the generation process: to modify the built-in cross sections, to veto undesirable events or simply to collect statistics at various stages of the evolution. There is a base class \texttt{UserHooks} that gives you this access at a few selected places. This class in itself does nothing; the idea is that you should write your own derived class for your task. A few very simple derived classes come with the program, mainly as illustration. There are four distinct sets of routines. Ordered by increasing complexity, rather than by their appearance in the event-generation sequence, they are: \begin{itemize} \item Ones that gives you access to the event record in between the process-level and parton-level steps, or in between the parton-level and hadron-level ones. You can study the event record and decide whether to veto this event. \item Ones that allow you to set a scale at which the combined multiple-interactions, initial-state and final-state parton-shower downwards evolution in $p_{\perp}$ is temporarily interrupted, so the event can be studied and either vetoed or allowed to continue the evolution. \item Similar ones that instead gives you access after the first few parton-shower branchings of the hardest subprocess. \item Ones that gives you access to the properties of the trial hard process, so that you can modify the internal \textsc{Pythia} cross section by your own correction factors. \end{itemize} \subsection{Random-number generators} \texttt{RndmEngine} is a base class for the external handling of random-number generation. The user-written derived class is called if a pointer to it has been handed in with the \texttt{pythia.setRndmEnginePtr(RndmEngine*)} method. Since the default Marsaglia-Zaman algorithm is quite good, there is absolutely no physics reason to replace it, but this may still be required for consistency with other program elements in big experimental frameworks. \subsection{The \textsc{HepMC} event format} The \textsc{HepMC} event format \cite{hepmc} is a standard format for the storage of events in several major experiments. The translation from the \textsc{Pythia}~8 \texttt{Event} format should be done after \texttt{pythia.next()} has generated an event. Therefore there is no need for a tight linkage, but only to call the\\ \texttt{HepMC::I\_Pythia8::fill\_next\_event( pythia.event, hepmcevt )}\\ conversion routine from the main program written by the user. Version 1 of \textsc{HepMC} makes use of the \textsc{CLHep} library \cite{clhep} for four-vectors, while version 2 is standalone; this requires some adjustments in the interface code based on which version is used. \subsection{SUSY parameter input} \textsc{Pythia}~8 does not contain a machinery for calculating masses and couplings of supersymmetric particles from some small set of input parameters. Instead the SUSY Les Houches Accord (SLHA) \cite{slha} is used to provide this information, as calculated by some external program. You need to supply the name of the file where the SLHA information is stored, in an appropriate setting, and then the rest is taken care of automatically. (Or at least will be, once SUSY processes are implemented.) \subsection{Parton showers} It is possible to replace the existing timelike and/or spacelike showers in the program by your own. This is truly for experts, since it requires a rather strict adherence to a wide set of rules. These are described in detail in the HTML/PHP documentation accompanying the code. The \textsc{Vincia} program \cite{vincia} offers a first example of a plug-in of an external (timelike) shower. \section{Getting Going \label{sec:how-to}} After you download the \texttt{pythia8100.tgz} (or later) package from the \textsc{Pythia} webpage,\\ \cindent \texttt{http://www.thep.lu.se/}$\sim$% \texttt{torbjorn/Pythia.html}\\ you can unpack it with \texttt{tar xvfz pythia8100.tgz}, into a new subdirectory \texttt{pythia8100}. The rest of the installation procedure is described in the \texttt{README} file in that directory. It is assumed you are on a Linux system; so far there is hardly any multiplatform support. After this, the main program is up to the user to write. A worksheet (found on the webpage) takes you through as step-by-step procedure, and sample main programs are provided in the \texttt{examples} subdirectory. These programs are included to serve as inspiration when starting to write your own program, by illustrating the principles involved. The information available if you open \texttt{htmldoc/Welcome.html} in your web browser will help you explore the program possibilities further. If you install the \texttt{phpdoc} subdirectory under a web server you will also get extra help to build a file of commands to the \texttt{Settings} and \texttt{ParticleDataTable} machineries, to steer the execution of your main program. Such "cards files" are separate from the main programs proper, so that minor changes can be made without any recompilation. It is then convenient to collect in the same place some run parameters, such as the number of events to generate, that could be used inside the main program. Therefore some such have been predefined, e.g.\ \texttt{Main:numberOfEvents}. Whether they actually are used is up to the author of a main program to decide. \section{Outlook \label{sec:outlook}} As already explained in the introduction, \textsc{Pythia}~8.1 is not yet a complete replacement of \textsc{Pythia}~6.4, but it is getting there, and already contains some new features not found elsewhere. In many cases the quality of the physics should be comparable between the two versions, but obviously the objective is that soon \textsc{Pythia}~8 should offer the overall better alternative. This will occur by further improvements of the existing framework and by the gradual addition of new features. \ack The support and kind hospitality of the SFT group at CERN is gratefully acknowledged by TS. Mikhail Kirsanov has developed the configure files, the makefiles and the interface to \textsc{HepMC}, and made several valuable suggestions. Ben Lloyd has written the PHP webpage framework. Bertrand Bellenot has provided a simple makefile for Win32/NMAKE. Marc Montull has helped write the extended Higgs sector. SM and PS are supported by Fermi Research Alliance, LLC, under Contract No.~DE-AC02-07CH11359 with the United States Department of Energy. This work was supported in part by the European Union Marie Curie Research Training Network MCnet under contract MRTN-CT-2006-035606. \begin{thebibliography}{99} \bibitem{jetset} T. Sj\"ostrand, Computer Physics Commun. {\bf 27} (1982) 243, {\bf 28} (1983) 229, {\bf 39} (1986) 347;\\ T. Sj\"ostrand and M. Bengtsson, Computer Physics Commun. {\bf 43} (1987) 367 \bibitem{pythiaearly} H.-U. Bengtsson, Computer Physics Commun. {\bf 31} (1984) 323;\\ H.-U. Bengtsson and G. Ingelman, Computer Physics Commun. {\bf 34} (1985) 251;\\ H.-U. Bengtsson and T. Sj\"ostrand, Computer Physics Commun. {\bf 46} (1987) 43;\\ T. Sj\"ostrand, Computer Physics Commun. {\bf 82} (1994) 74 \bibitem{pythiasixone} T. Sj\"ostrand, P. Ed\'en, C. Friberg, L. L\"onnblad, G. Miu, S. Mrenna and E. Norrbin, Computer Physics Commun. {\bf 135} (2001) 238 \bibitem{pythiasixfour} T. Sj\"ostrand, S. Mrenna and P. Skands, JHEP {\bf 05} (2006) 026 [hep-ph/0603175] \bibitem{pythiaseven} L. L\"onnblad, Computer Physics Commun. {\bf 118} (1999) 213;\\ M. Bertini, L. L\"onnblad and T. Sj\"ostrand, Computer Physics Commun. {\bf 134} (2001) 365 \bibitem{thepeg} see webpage \texttt{http://www.thep.lu.se/ThePEG/} \bibitem{herwigpp} S. Gieseke, A. Ribon, M.H. Seymour, P. Stephens and B.R. Webber, JHEP {\bf 02} (2004) 005;\\ see webpage \texttt{http://hepforge.cedar.ac.uk/herwig/} \bibitem{lha} E. Boos et al., in the Proceedings of the Workshop on Physics at TeV Colliders,\\ Les Houches, France, 21 May - 1 Jun 2001 [hep-ph/0109068] \bibitem{lhef} J. Alwall et al., Computer Physics Comm. {\bf 176} (2007) 300 \bibitem{lhapdf} M.R. Whalley, D. Bourilkov and R.C. Group, in `HERA and the LHC',\\ eds. A. De Roeck and H. Jung, CERN-2005-014, p. 575 [hep-ph/0508110] \bibitem{ptshowers} T. Sj\"ostrand and P. Skands, Eur. Phys. J {\bf C39} (2005) 129 \bibitem{newremnants} T. Sj\"ostrand and P. Skands, JHEP {\bf 03} (2004) 053 \bibitem{zijl} T. Sj\"ostrand and M. van Zijl, Phys. Rev. {\bf D36} (1987) 2019 \bibitem{wicke} P. Skands and D. Wicke, Eur. Phys. J. {\bf C52} (2007) 133 \bibitem{lundreview} B. Andersson, G. Gustafson, G. Ingelman and T. Sj\"ostrand, Phys. Rep. {\bf 97} (1983) 31 \bibitem{pdg} Particle Data Group, W.-M. Yao et al., J. Phys. {\bf G33} (2006) 1 \bibitem{boseeinstein} L. L\"onnblad and T. Sj\"ostrand, Eur. Phys. J. {\bf C2} (1998) 165 \bibitem{hepmc} M. Dobbs and J.B. Hansen, Computer Physics Comm. {\bf 134} (2001) 41 \bibitem{slha} P. Skands et al., JHEP {\bf 07} (2004) 036 \bibitem{tauola} S. Jadach, Z. W\c{a}s, R. Decker and J.H. K\"uhn, Computer Physics Commun. {\bf 76} (1993) 361 \bibitem{evtgen} D.J. Lange, Nucl. Instrum. Meth. {\bf A462} (2001) 152 \bibitem{clhep} see webpage \texttt{http://proj-clhep.web.cern.ch/proj-clhep/} \bibitem{vincia} W.T. Giele, D.A. Kosower and P.Z. Skands, arXiv:0707.3652 \end{thebibliography} \end{document}