1@node Program Basics, Processes, Signal Handling, Top 2@c %MENU% Writing the beginning and end of your program 3@chapter The Basic Program/System Interface 4 5@cindex process 6@cindex program 7@cindex address space 8@cindex thread of control 9@dfn{Processes} are the primitive units for allocation of system 10resources. Each process has its own address space and (usually) one 11thread of control. A process executes a program; you can have multiple 12processes executing the same program, but each process has its own copy 13of the program within its own address space and executes it 14independently of the other copies. Though it may have multiple threads 15of control within the same program and a program may be composed of 16multiple logically separate modules, a process always executes exactly 17one program. 18 19Note that we are using a specific definition of ``program'' for the 20purposes of this manual, which corresponds to a common definition in the 21context of Unix systems. In popular usage, ``program'' enjoys a much 22broader definition; it can refer for example to a system's kernel, an 23editor macro, a complex package of software, or a discrete section of 24code executing within a process. 25 26Writing the program is what this manual is all about. This chapter 27explains the most basic interface between your program and the system 28that runs, or calls, it. This includes passing of parameters (arguments 29and environment) from the system, requesting basic services from the 30system, and telling the system the program is done. 31 32A program starts another program with the @code{exec} family of system calls. 33This chapter looks at program startup from the execee's point of view. To 34see the event from the execor's point of view, see @ref{Executing a File}. 35 36@menu 37* Program Arguments:: Parsing your program's command-line arguments 38* Environment Variables:: Less direct parameters affecting your program 39* Auxiliary Vector:: Least direct parameters affecting your program 40* System Calls:: Requesting service from the system 41* Program Termination:: Telling the system you're done; return status 42@end menu 43 44@node Program Arguments, Environment Variables, , Program Basics 45@section Program Arguments 46@cindex program arguments 47@cindex command line arguments 48@cindex arguments, to program 49 50@cindex program startup 51@cindex startup of program 52@cindex invocation of program 53@cindex @code{main} function 54@findex main 55The system starts a C program by calling the function @code{main}. It 56is up to you to write a function named @code{main}---otherwise, you 57won't even be able to link your program without errors. 58 59In @w{ISO C} you can define @code{main} either to take no arguments, or to 60take two arguments that represent the command line arguments to the 61program, like this: 62 63@smallexample 64int main (int @var{argc}, char *@var{argv}[]) 65@end smallexample 66 67@cindex argc (program argument count) 68@cindex argv (program argument vector) 69The command line arguments are the whitespace-separated tokens given in 70the shell command used to invoke the program; thus, in @samp{cat foo 71bar}, the arguments are @samp{foo} and @samp{bar}. The only way a 72program can look at its command line arguments is via the arguments of 73@code{main}. If @code{main} doesn't take arguments, then you cannot get 74at the command line. 75 76The value of the @var{argc} argument is the number of command line 77arguments. The @var{argv} argument is a vector of C strings; its 78elements are the individual command line argument strings. The file 79name of the program being run is also included in the vector as the 80first element; the value of @var{argc} counts this element. A null 81pointer always follows the last element: @code{@var{argv}[@var{argc}]} 82is this null pointer. 83 84For the command @samp{cat foo bar}, @var{argc} is 3 and @var{argv} has 85three elements, @code{"cat"}, @code{"foo"} and @code{"bar"}. 86 87In Unix systems you can define @code{main} a third way, using three arguments: 88 89@smallexample 90int main (int @var{argc}, char *@var{argv}[], char *@var{envp}[]) 91@end smallexample 92 93The first two arguments are just the same. The third argument 94@var{envp} gives the program's environment; it is the same as the value 95of @code{environ}. @xref{Environment Variables}. POSIX.1 does not 96allow this three-argument form, so to be portable it is best to write 97@code{main} to take two arguments, and use the value of @code{environ}. 98 99@menu 100* Argument Syntax:: By convention, options start with a hyphen. 101* Parsing Program Arguments:: Ways to parse program options and arguments. 102@end menu 103 104@node Argument Syntax, Parsing Program Arguments, , Program Arguments 105@subsection Program Argument Syntax Conventions 106@cindex program argument syntax 107@cindex syntax, for program arguments 108@cindex command argument syntax 109 110POSIX recommends these conventions for command line arguments. 111@code{getopt} (@pxref{Getopt}) and @code{argp_parse} (@pxref{Argp}) make 112it easy to implement them. 113 114@itemize @bullet 115@item 116Arguments are options if they begin with a hyphen delimiter (@samp{-}). 117 118@item 119Multiple options may follow a hyphen delimiter in a single token if 120the options do not take arguments. Thus, @samp{-abc} is equivalent to 121@samp{-a -b -c}. 122 123@item 124Option names are single alphanumeric characters (as for @code{isalnum}; 125@pxref{Classification of Characters}). 126 127@item 128Certain options require an argument. For example, the @option{-o} option 129of the @command{ld} command requires an argument---an output file name. 130 131@item 132An option and its argument may or may not appear as separate tokens. (In 133other words, the whitespace separating them is optional.) Thus, 134@w{@option{-o foo}} and @option{-ofoo} are equivalent. 135 136@item 137Options typically precede other non-option arguments. 138 139The implementations of @code{getopt} and @code{argp_parse} in @theglibc{} 140normally make it appear as if all the option arguments were 141specified before all the non-option arguments for the purposes of 142parsing, even if the user of your program intermixed option and 143non-option arguments. They do this by reordering the elements of the 144@var{argv} array. This behavior is nonstandard; if you want to suppress 145it, define the @code{_POSIX_OPTION_ORDER} environment variable. 146@xref{Standard Environment}. 147 148@item 149The argument @option{--} terminates all options; any following arguments 150are treated as non-option arguments, even if they begin with a hyphen. 151 152@item 153A token consisting of a single hyphen character is interpreted as an 154ordinary non-option argument. By convention, it is used to specify 155input from or output to the standard input and output streams. 156 157@item 158Options may be supplied in any order, or appear multiple times. The 159interpretation is left up to the particular application program. 160@end itemize 161 162@cindex long-named options 163GNU adds @dfn{long options} to these conventions. Long options consist 164of @option{--} followed by a name made of alphanumeric characters and 165dashes. Option names are typically one to three words long, with 166hyphens to separate words. Users can abbreviate the option names as 167long as the abbreviations are unique. 168 169To specify an argument for a long option, write 170@option{--@var{name}=@var{value}}. This syntax enables a long option to 171accept an argument that is itself optional. 172 173Eventually, @gnusystems{} will provide completion for long option names 174in the shell. 175 176@node Parsing Program Arguments, , Argument Syntax, Program Arguments 177@subsection Parsing Program Arguments 178 179@cindex program arguments, parsing 180@cindex command arguments, parsing 181@cindex parsing program arguments 182If the syntax for the command line arguments to your program is simple 183enough, you can simply pick the arguments off from @var{argv} by hand. 184But unless your program takes a fixed number of arguments, or all of the 185arguments are interpreted in the same way (as file names, for example), 186you are usually better off using @code{getopt} (@pxref{Getopt}) or 187@code{argp_parse} (@pxref{Argp}) to do the parsing. 188 189@code{getopt} is more standard (the short-option only version of it is a 190part of the POSIX standard), but using @code{argp_parse} is often 191easier, both for very simple and very complex option structures, because 192it does more of the dirty work for you. 193 194@menu 195* Getopt:: Parsing program options using @code{getopt}. 196* Argp:: Parsing program options using @code{argp_parse}. 197* Suboptions:: Some programs need more detailed options. 198* Suboptions Example:: This shows how it could be done for @code{mount}. 199@end menu 200 201@c Getopt and argp start at the @section level so that there's 202@c enough room for their internal hierarchy (mostly a problem with 203@c argp). -Miles 204 205@include getopt.texi 206@include argp.texi 207 208@node Suboptions, Suboptions Example, Argp, Parsing Program Arguments 209@c This is a @section so that it's at the same level as getopt and argp 210@subsubsection Parsing of Suboptions 211 212Having a single level of options is sometimes not enough. There might 213be too many options which have to be available or a set of options is 214closely related. 215 216For this case some programs use suboptions. One of the most prominent 217programs is certainly @code{mount}(8). The @code{-o} option take one 218argument which itself is a comma separated list of options. To ease the 219programming of code like this the function @code{getsubopt} is 220available. 221 222@deftypefun int getsubopt (char **@var{optionp}, char *const *@var{tokens}, char **@var{valuep}) 223@standards{???, stdlib.h} 224@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}} 225@c getsubopt ok 226@c strchrnul dup ok 227@c memchr dup ok 228@c strncmp dup ok 229 230The @var{optionp} parameter must be a pointer to a variable containing 231the address of the string to process. When the function returns, the 232reference is updated to point to the next suboption or to the 233terminating @samp{\0} character if there are no more suboptions available. 234 235The @var{tokens} parameter references an array of strings containing the 236known suboptions. All strings must be @samp{\0} terminated and to mark 237the end a null pointer must be stored. When @code{getsubopt} finds a 238possible legal suboption it compares it with all strings available in 239the @var{tokens} array and returns the index in the string as the 240indicator. 241 242In case the suboption has an associated value introduced by a @samp{=} 243character, a pointer to the value is returned in @var{valuep}. The 244string is @samp{\0} terminated. If no argument is available 245@var{valuep} is set to the null pointer. By doing this the caller can 246check whether a necessary value is given or whether no unexpected value 247is present. 248 249In case the next suboption in the string is not mentioned in the 250@var{tokens} array the starting address of the suboption including a 251possible value is returned in @var{valuep} and the return value of the 252function is @samp{-1}. 253@end deftypefun 254 255@node Suboptions Example, , Suboptions, Parsing Program Arguments 256@subsection Parsing of Suboptions Example 257 258The code which might appear in the @code{mount}(8) program is a perfect 259example of the use of @code{getsubopt}: 260 261@smallexample 262@include subopt.c.texi 263@end smallexample 264 265 266@node Environment Variables, Auxiliary Vector, Program Arguments, Program Basics 267@section Environment Variables 268 269@cindex environment variable 270When a program is executed, it receives information about the context in 271which it was invoked in two ways. The first mechanism uses the 272@var{argv} and @var{argc} arguments to its @code{main} function, and is 273discussed in @ref{Program Arguments}. The second mechanism uses 274@dfn{environment variables} and is discussed in this section. 275 276The @var{argv} mechanism is typically used to pass command-line 277arguments specific to the particular program being invoked. The 278environment, on the other hand, keeps track of information that is 279shared by many programs, changes infrequently, and that is less 280frequently used. 281 282The environment variables discussed in this section are the same 283environment variables that you set using assignments and the 284@code{export} command in the shell. Programs executed from the shell 285inherit all of the environment variables from the shell. 286@c !!! xref to right part of bash manual when it exists 287 288@cindex environment 289Standard environment variables are used for information about the user's 290home directory, terminal type, current locale, and so on; you can define 291additional variables for other purposes. The set of all environment 292variables that have values is collectively known as the 293@dfn{environment}. 294 295Names of environment variables are case-sensitive and must not contain 296the character @samp{=}. System-defined environment variables are 297invariably uppercase. 298 299The values of environment variables can be anything that can be 300represented as a string. A value must not contain an embedded null 301character, since this is assumed to terminate the string. 302 303 304@menu 305* Environment Access:: How to get and set the values of 306 environment variables. 307* Standard Environment:: These environment variables have 308 standard interpretations. 309@end menu 310 311@node Environment Access 312@subsection Environment Access 313@cindex environment access 314@cindex environment representation 315 316The value of an environment variable can be accessed with the 317@code{getenv} function. This is declared in the header file 318@file{stdlib.h}. 319@pindex stdlib.h 320 321Libraries should use @code{secure_getenv} instead of @code{getenv}, so 322that they do not accidentally use untrusted environment variables. 323Modifications of environment variables are not allowed in 324multi-threaded programs. The @code{getenv} and @code{secure_getenv} 325functions can be safely used in multi-threaded programs. 326 327@deftypefun {char *} getenv (const char *@var{name}) 328@standards{ISO, stdlib.h} 329@safety{@prelim{}@mtsafe{@mtsenv{}}@assafe{}@acsafe{}} 330@c Unguarded access to __environ. 331This function returns a string that is the value of the environment 332variable @var{name}. You must not modify this string. In some non-Unix 333systems not using @theglibc{}, it might be overwritten by subsequent 334calls to @code{getenv} (but not by any other library function). If the 335environment variable @var{name} is not defined, the value is a null 336pointer. 337@end deftypefun 338 339@deftypefun {char *} secure_getenv (const char *@var{name}) 340@standards{GNU, stdlib.h} 341@safety{@prelim{}@mtsafe{@mtsenv{}}@assafe{}@acsafe{}} 342@c Calls getenv unless secure mode is enabled. 343This function is similar to @code{getenv}, but it returns a null 344pointer if the environment is untrusted. This happens when the 345program file has SUID or SGID bits set. General-purpose libraries 346should always prefer this function over @code{getenv} to avoid 347vulnerabilities if the library is referenced from a SUID/SGID program. 348 349This function is a GNU extension. 350@end deftypefun 351 352 353@deftypefun int putenv (char *@var{string}) 354@standards{SVID, stdlib.h} 355@safety{@prelim{}@mtunsafe{@mtasuconst{:@mtsenv{}}}@asunsafe{@ascuheap{} @asulock{}}@acunsafe{@acucorrupt{} @aculock{} @acsmem{}}} 356@c putenv @mtasuconst:@mtsenv @ascuheap @asulock @acucorrupt @aculock @acsmem 357@c strchr dup ok 358@c strndup dup @ascuheap @acsmem 359@c add_to_environ dup @mtasuconst:@mtsenv @ascuheap @asulock @acucorrupt @aculock @acsmem 360@c free dup @ascuheap @acsmem 361@c unsetenv dup @mtasuconst:@mtsenv @asulock @aculock 362The @code{putenv} function adds or removes definitions from the environment. 363If the @var{string} is of the form @samp{@var{name}=@var{value}}, the 364definition is added to the environment. Otherwise, the @var{string} is 365interpreted as the name of an environment variable, and any definition 366for this variable in the environment is removed. 367 368If the function is successful it returns @code{0}. Otherwise the return 369value is nonzero and @code{errno} is set to indicate the error. 370 371The difference to the @code{setenv} function is that the exact string 372given as the parameter @var{string} is put into the environment. If the 373user should change the string after the @code{putenv} call this will 374reflect automatically in the environment. This also requires that 375@var{string} not be an automatic variable whose scope is left before the 376variable is removed from the environment. The same applies of course to 377dynamically allocated variables which are freed later. 378 379This function is part of the extended Unix interface. You should define 380@var{_XOPEN_SOURCE} before including any header. 381@end deftypefun 382 383 384@deftypefun int setenv (const char *@var{name}, const char *@var{value}, int @var{replace}) 385@standards{BSD, stdlib.h} 386@safety{@prelim{}@mtunsafe{@mtasuconst{:@mtsenv{}}}@asunsafe{@ascuheap{} @asulock{}}@acunsafe{@acucorrupt{} @aculock{} @acsmem{}}} 387@c setenv @mtasuconst:@mtsenv @ascuheap @asulock @acucorrupt @aculock @acsmem 388@c add_to_environ @mtasuconst:@mtsenv @ascuheap @asulock @acucorrupt @aculock @acsmem 389@c strlen dup ok 390@c libc_lock_lock @asulock @aculock 391@c strncmp dup ok 392@c realloc dup @ascuheap @acsmem 393@c libc_lock_unlock @aculock 394@c malloc dup @ascuheap @acsmem 395@c free dup @ascuheap @acsmem 396@c mempcpy dup ok 397@c memcpy dup ok 398@c KNOWN_VALUE ok 399@c tfind(strcmp) [no @mtsrace guarded access] 400@c strcmp dup ok 401@c STORE_VALUE @ascuheap @acucorrupt @acsmem 402@c tsearch(strcmp) @ascuheap @acucorrupt @acsmem [no @mtsrace or @asucorrupt guarded access makes for mtsafe and @asulock] 403@c strcmp dup ok 404The @code{setenv} function can be used to add a new definition to the 405environment. The entry with the name @var{name} is replaced by the 406value @samp{@var{name}=@var{value}}. Please note that this is also true 407if @var{value} is the empty string. To do this a new string is created 408and the strings @var{name} and @var{value} are copied. A null pointer 409for the @var{value} parameter is illegal. If the environment already 410contains an entry with key @var{name} the @var{replace} parameter 411controls the action. If replace is zero, nothing happens. Otherwise 412the old entry is replaced by the new one. 413 414Please note that you cannot remove an entry completely using this function. 415 416If the function is successful it returns @code{0}. Otherwise the 417environment is unchanged and the return value is @code{-1} and 418@code{errno} is set. 419 420This function was originally part of the BSD library but is now part of 421the Unix standard. 422@end deftypefun 423 424@deftypefun int unsetenv (const char *@var{name}) 425@standards{BSD, stdlib.h} 426@safety{@prelim{}@mtunsafe{@mtasuconst{:@mtsenv{}}}@asunsafe{@asulock{}}@acunsafe{@aculock{}}} 427@c unsetenv @mtasuconst:@mtsenv @asulock @aculock 428@c strchr dup ok 429@c strlen dup ok 430@c libc_lock_lock @asulock @aculock 431@c strncmp dup ok 432@c libc_lock_unlock @aculock 433Using this function one can remove an entry completely from the 434environment. If the environment contains an entry with the key 435@var{name} this whole entry is removed. A call to this function is 436equivalent to a call to @code{putenv} when the @var{value} part of the 437string is empty. 438 439The function returns @code{-1} if @var{name} is a null pointer, points to 440an empty string, or points to a string containing a @code{=} character. 441It returns @code{0} if the call succeeded. 442 443This function was originally part of the BSD library but is now part of 444the Unix standard. The BSD version had no return value, though. 445@end deftypefun 446 447There is one more function to modify the whole environment. This 448function is said to be used in the POSIX.9 (POSIX bindings for Fortran 44977) and so one should expect it did made it into POSIX.1. But this 450never happened. But we still provide this function as a GNU extension 451to enable writing standard compliant Fortran environments. 452 453@deftypefun int clearenv (void) 454@standards{GNU, stdlib.h} 455@safety{@prelim{}@mtunsafe{@mtasuconst{:@mtsenv{}}}@asunsafe{@ascuheap{} @asulock{}}@acunsafe{@aculock{} @acsmem{}}} 456@c clearenv @mtasuconst:@mtsenv @ascuheap @asulock @aculock @acsmem 457@c libc_lock_lock @asulock @aculock 458@c free dup @ascuheap @acsmem 459@c libc_lock_unlock @aculock 460The @code{clearenv} function removes all entries from the environment. 461Using @code{putenv} and @code{setenv} new entries can be added again 462later. 463 464If the function is successful it returns @code{0}. Otherwise the return 465value is nonzero. 466@end deftypefun 467 468 469You can deal directly with the underlying representation of environment 470objects to add more variables to the environment (for example, to 471communicate with another program you are about to execute; 472@pxref{Executing a File}). 473 474@deftypevar {char **} environ 475@standards{POSIX.1, unistd.h} 476The environment is represented as an array of strings. Each string is 477of the format @samp{@var{name}=@var{value}}. The order in which 478strings appear in the environment is not significant, but the same 479@var{name} must not appear more than once. The last element of the 480array is a null pointer. 481 482This variable is declared in the header file @file{unistd.h}. 483 484If you just want to get the value of an environment variable, use 485@code{getenv}. 486@end deftypevar 487 488Unix systems, and @gnusystems{}, pass the initial value of 489@code{environ} as the third argument to @code{main}. 490@xref{Program Arguments}. 491 492@node Standard Environment 493@subsection Standard Environment Variables 494@cindex standard environment variables 495 496These environment variables have standard meanings. This doesn't mean 497that they are always present in the environment; but if these variables 498@emph{are} present, they have these meanings. You shouldn't try to use 499these environment variable names for some other purpose. 500 501@comment Extra blank lines make it look better. 502@table @code 503@item HOME 504@cindex @code{HOME} environment variable 505@cindex home directory 506 507This is a string representing the user's @dfn{home directory}, or 508initial default working directory. 509 510The user can set @code{HOME} to any value. 511If you need to make sure to obtain the proper home directory 512for a particular user, you should not use @code{HOME}; instead, 513look up the user's name in the user database (@pxref{User Database}). 514 515For most purposes, it is better to use @code{HOME}, precisely because 516this lets the user specify the value. 517 518@c !!! also USER 519@item LOGNAME 520@cindex @code{LOGNAME} environment variable 521 522This is the name that the user used to log in. Since the value in the 523environment can be tweaked arbitrarily, this is not a reliable way to 524identify the user who is running a program; a function like 525@code{getlogin} (@pxref{Who Logged In}) is better for that purpose. 526 527For most purposes, it is better to use @code{LOGNAME}, precisely because 528this lets the user specify the value. 529 530@item PATH 531@cindex @code{PATH} environment variable 532 533A @dfn{path} is a sequence of directory names which is used for 534searching for a file. The variable @code{PATH} holds a path used 535for searching for programs to be run. 536 537The @code{execlp} and @code{execvp} functions (@pxref{Executing a File}) 538use this environment variable, as do many shells and other utilities 539which are implemented in terms of those functions. 540 541The syntax of a path is a sequence of directory names separated by 542colons. An empty string instead of a directory name stands for the 543current directory (@pxref{Working Directory}). 544 545A typical value for this environment variable might be a string like: 546 547@smallexample 548:/bin:/etc:/usr/bin:/usr/new/X11:/usr/new:/usr/local/bin 549@end smallexample 550 551This means that if the user tries to execute a program named @code{foo}, 552the system will look for files named @file{foo}, @file{/bin/foo}, 553@file{/etc/foo}, and so on. The first of these files that exists is 554the one that is executed. 555 556@c !!! also TERMCAP 557@item TERM 558@cindex @code{TERM} environment variable 559 560This specifies the kind of terminal that is receiving program output. 561Some programs can make use of this information to take advantage of 562special escape sequences or terminal modes supported by particular kinds 563of terminals. Many programs which use the termcap library 564(@pxref{Finding a Terminal Description,Find,,termcap,The Termcap Library 565Manual}) use the @code{TERM} environment variable, for example. 566 567@item TZ 568@cindex @code{TZ} environment variable 569 570This specifies the time zone. @xref{TZ Variable}, for information about 571the format of this string and how it is used. 572 573@item LANG 574@cindex @code{LANG} environment variable 575 576This specifies the default locale to use for attribute categories where 577neither @code{LC_ALL} nor the specific environment variable for that 578category is set. @xref{Locales}, for more information about 579locales. 580 581@ignore 582@c I doubt this really exists 583@item LC_ALL 584@cindex @code{LC_ALL} environment variable 585 586This is similar to the @code{LANG} environment variable. However, its 587value takes precedence over any values provided for the individual 588attribute category environment variables, or for the @code{LANG} 589environment variable. 590@end ignore 591 592@item LC_ALL 593@cindex @code{LC_ALL} environment variable 594 595If this environment variable is set it overrides the selection for all 596the locales done using the other @code{LC_*} environment variables. The 597value of the other @code{LC_*} environment variables is simply ignored 598in this case. 599 600@item LC_COLLATE 601@cindex @code{LC_COLLATE} environment variable 602 603This specifies what locale to use for string sorting. 604 605@item LC_CTYPE 606@cindex @code{LC_CTYPE} environment variable 607 608This specifies what locale to use for character sets and character 609classification. 610 611@item LC_MESSAGES 612@cindex @code{LC_MESSAGES} environment variable 613 614This specifies what locale to use for printing messages and to parse 615responses. 616 617@item LC_MONETARY 618@cindex @code{LC_MONETARY} environment variable 619 620This specifies what locale to use for formatting monetary values. 621 622@item LC_NUMERIC 623@cindex @code{LC_NUMERIC} environment variable 624 625This specifies what locale to use for formatting numbers. 626 627@item LC_TIME 628@cindex @code{LC_TIME} environment variable 629 630This specifies what locale to use for formatting date/time values. 631 632@item NLSPATH 633@cindex @code{NLSPATH} environment variable 634 635This specifies the directories in which the @code{catopen} function 636looks for message translation catalogs. 637 638@item _POSIX_OPTION_ORDER 639@cindex @code{_POSIX_OPTION_ORDER} environment variable. 640 641If this environment variable is defined, it suppresses the usual 642reordering of command line arguments by @code{getopt} and 643@code{argp_parse}. @xref{Argument Syntax}. 644 645@c !!! GNU also has COREFILE, CORESERVER, EXECSERVERS 646@end table 647 648@node Auxiliary Vector 649@section Auxiliary Vector 650@cindex auxiliary vector 651 652When a program is executed, it receives information from the operating 653system about the environment in which it is operating. The form of this 654information is a table of key-value pairs, where the keys are from the 655set of @samp{AT_} values in @file{elf.h}. Some of the data is provided 656by the kernel for libc consumption, and may be obtained by ordinary 657interfaces, such as @code{sysconf}. However, on a platform-by-platform 658basis there may be information that is not available any other way. 659 660@subsection Definition of @code{getauxval} 661@deftypefun {unsigned long int} getauxval (unsigned long int @var{type}) 662@standards{???, sys/auxv.h} 663@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}} 664@c Reads from hwcap or iterates over constant auxv. 665This function is used to inquire about the entries in the auxiliary 666vector. The @var{type} argument should be one of the @samp{AT_} symbols 667defined in @file{elf.h}. If a matching entry is found, the value is 668returned; if the entry is not found, zero is returned and @code{errno} is 669set to @code{ENOENT}. 670@end deftypefun 671 672For some platforms, the key @code{AT_HWCAP} is the easiest way to inquire 673about any instruction set extensions available at runtime. In this case, 674there will (of necessity) be a platform-specific set of @samp{HWCAP_} 675values masked together that describe the capabilities of the cpu on which 676the program is being executed. 677 678@node System Calls 679@section System Calls 680 681@cindex system call 682A system call is a request for service that a program makes of the 683kernel. The service is generally something that only the kernel has 684the privilege to do, such as doing I/O. Programmers don't normally 685need to be concerned with system calls because there are functions in 686@theglibc{} to do virtually everything that system calls do. 687These functions work by making system calls themselves. For example, 688there is a system call that changes the permissions of a file, but 689you don't need to know about it because you can just use @theglibc{}'s 690@code{chmod} function. 691 692@cindex kernel call 693System calls are sometimes called kernel calls. 694 695However, there are times when you want to make a system call explicitly, 696and for that, @theglibc{} provides the @code{syscall} function. 697@code{syscall} is harder to use and less portable than functions like 698@code{chmod}, but easier and more portable than coding the system call 699in assembler instructions. 700 701@code{syscall} is most useful when you are working with a system call 702which is special to your system or is newer than @theglibc{} you 703are using. @code{syscall} is implemented in an entirely generic way; 704the function does not know anything about what a particular system 705call does or even if it is valid. 706 707The description of @code{syscall} in this section assumes a certain 708protocol for system calls on the various platforms on which @theglibc{} 709runs. That protocol is not defined by any strong authority, but 710we won't describe it here either because anyone who is coding 711@code{syscall} probably won't accept anything less than kernel and C 712library source code as a specification of the interface between them 713anyway. 714 715 716@code{syscall} is declared in @file{unistd.h}. 717 718@deftypefun {long int} syscall (long int @var{sysno}, @dots{}) 719@standards{???, unistd.h} 720@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}} 721 722@code{syscall} performs a generic system call. 723 724@cindex system call number 725@var{sysno} is the system call number. Each kind of system call is 726identified by a number. Macros for all the possible system call numbers 727are defined in @file{sys/syscall.h} 728 729The remaining arguments are the arguments for the system call, in 730order, and their meanings depend on the kind of system call. Each kind 731of system call has a definite number of arguments, from zero to five. 732If you code more arguments than the system call takes, the extra ones to 733the right are ignored. 734 735The return value is the return value from the system call, unless the 736system call failed. In that case, @code{syscall} returns @code{-1} and 737sets @code{errno} to an error code that the system call returned. Note 738that system calls do not return @code{-1} when they succeed. 739@cindex errno 740 741If you specify an invalid @var{sysno}, @code{syscall} returns @code{-1} 742with @code{errno} = @code{ENOSYS}. 743 744Example: 745 746@smallexample 747 748#include <unistd.h> 749#include <sys/syscall.h> 750#include <errno.h> 751 752@dots{} 753 754int rc; 755 756rc = syscall(SYS_chmod, "/etc/passwd", 0444); 757 758if (rc == -1) 759 fprintf(stderr, "chmod failed, errno = %d\n", errno); 760 761@end smallexample 762 763This, if all the compatibility stars are aligned, is equivalent to the 764following preferable code: 765 766@smallexample 767 768#include <sys/types.h> 769#include <sys/stat.h> 770#include <errno.h> 771 772@dots{} 773 774int rc; 775 776rc = chmod("/etc/passwd", 0444); 777if (rc == -1) 778 fprintf(stderr, "chmod failed, errno = %d\n", errno); 779 780@end smallexample 781 782@end deftypefun 783 784 785@node Program Termination 786@section Program Termination 787@cindex program termination 788@cindex process termination 789 790@cindex exit status value 791The usual way for a program to terminate is simply for its @code{main} 792function to return. The @dfn{exit status value} returned from the 793@code{main} function is used to report information back to the process's 794parent process or shell. 795 796A program can also terminate normally by calling the @code{exit} 797function. 798 799In addition, programs can be terminated by signals; this is discussed in 800more detail in @ref{Signal Handling}. The @code{abort} function causes 801a signal that kills the program. 802 803@menu 804* Normal Termination:: If a program calls @code{exit}, a 805 process terminates normally. 806* Exit Status:: The @code{exit status} provides information 807 about why the process terminated. 808* Cleanups on Exit:: A process can run its own cleanup 809 functions upon normal termination. 810* Aborting a Program:: The @code{abort} function causes 811 abnormal program termination. 812* Termination Internals:: What happens when a process terminates. 813@end menu 814 815@node Normal Termination 816@subsection Normal Termination 817 818A process terminates normally when its program signals it is done by 819calling @code{exit}. Returning from @code{main} is equivalent to 820calling @code{exit}, and the value that @code{main} returns is used as 821the argument to @code{exit}. 822 823@deftypefun void exit (int @var{status}) 824@standards{ISO, stdlib.h} 825@safety{@prelim{}@mtunsafe{@mtasurace{:exit}}@asunsafe{@asucorrupt{}}@acunsafe{@acucorrupt{} @aculock{}}} 826@c Access to the atexit/on_exit list, the libc_atexit hook and tls dtors 827@c is not guarded. Streams must be flushed, and that triggers the usual 828@c AS and AC issues with streams. 829The @code{exit} function tells the system that the program is done, which 830causes it to terminate the process. 831 832@var{status} is the program's exit status, which becomes part of the 833process' termination status. This function does not return. 834@end deftypefun 835 836Normal termination causes the following actions: 837 838@enumerate 839@item 840Functions that were registered with the @code{atexit} or @code{on_exit} 841functions are called in the reverse order of their registration. This 842mechanism allows your application to specify its own ``cleanup'' actions 843to be performed at program termination. Typically, this is used to do 844things like saving program state information in a file, or unlocking 845locks in shared data bases. 846 847@item 848All open streams are closed, writing out any buffered output data. See 849@ref{Closing Streams}. In addition, temporary files opened 850with the @code{tmpfile} function are removed; see @ref{Temporary Files}. 851 852@item 853@code{_exit} is called, terminating the program. @xref{Termination Internals}. 854@end enumerate 855 856@node Exit Status 857@subsection Exit Status 858@cindex exit status 859 860When a program exits, it can return to the parent process a small 861amount of information about the cause of termination, using the 862@dfn{exit status}. This is a value between 0 and 255 that the exiting 863process passes as an argument to @code{exit}. 864 865Normally you should use the exit status to report very broad information 866about success or failure. You can't provide a lot of detail about the 867reasons for the failure, and most parent processes would not want much 868detail anyway. 869 870There are conventions for what sorts of status values certain programs 871should return. The most common convention is simply 0 for success and 1 872for failure. Programs that perform comparison use a different 873convention: they use status 1 to indicate a mismatch, and status 2 to 874indicate an inability to compare. Your program should follow an 875existing convention if an existing convention makes sense for it. 876 877A general convention reserves status values 128 and up for special 878purposes. In particular, the value 128 is used to indicate failure to 879execute another program in a subprocess. This convention is not 880universally obeyed, but it is a good idea to follow it in your programs. 881 882@strong{Warning:} Don't try to use the number of errors as the exit 883status. This is actually not very useful; a parent process would 884generally not care how many errors occurred. Worse than that, it does 885not work, because the status value is truncated to eight bits. 886Thus, if the program tried to report 256 errors, the parent would 887receive a report of 0 errors---that is, success. 888 889For the same reason, it does not work to use the value of @code{errno} 890as the exit status---these can exceed 255. 891 892@strong{Portability note:} Some non-POSIX systems use different 893conventions for exit status values. For greater portability, you can 894use the macros @code{EXIT_SUCCESS} and @code{EXIT_FAILURE} for the 895conventional status value for success and failure, respectively. They 896are declared in the file @file{stdlib.h}. 897@pindex stdlib.h 898 899@deftypevr Macro int EXIT_SUCCESS 900@standards{ISO, stdlib.h} 901This macro can be used with the @code{exit} function to indicate 902successful program completion. 903 904On POSIX systems, the value of this macro is @code{0}. On other 905systems, the value might be some other (possibly non-constant) integer 906expression. 907@end deftypevr 908 909@deftypevr Macro int EXIT_FAILURE 910@standards{ISO, stdlib.h} 911This macro can be used with the @code{exit} function to indicate 912unsuccessful program completion in a general sense. 913 914On POSIX systems, the value of this macro is @code{1}. On other 915systems, the value might be some other (possibly non-constant) integer 916expression. Other nonzero status values also indicate failures. Certain 917programs use different nonzero status values to indicate particular 918kinds of "non-success". For example, @code{diff} uses status value 919@code{1} to mean that the files are different, and @code{2} or more to 920mean that there was difficulty in opening the files. 921@end deftypevr 922 923Don't confuse a program's exit status with a process' termination status. 924There are lots of ways a process can terminate besides having its program 925finish. In the event that the process termination @emph{is} caused by program 926termination (i.e., @code{exit}), though, the program's exit status becomes 927part of the process' termination status. 928 929@node Cleanups on Exit 930@subsection Cleanups on Exit 931 932Your program can arrange to run its own cleanup functions if normal 933termination happens. If you are writing a library for use in various 934application programs, then it is unreliable to insist that all 935applications call the library's cleanup functions explicitly before 936exiting. It is much more robust to make the cleanup invisible to the 937application, by setting up a cleanup function in the library itself 938using @code{atexit} or @code{on_exit}. 939 940@deftypefun int atexit (void (*@var{function}) (void)) 941@standards{ISO, stdlib.h} 942@safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{} @asulock{}}@acunsafe{@aculock{} @acsmem{}}} 943@c atexit @ascuheap @asulock @aculock @acsmem 944@c cxa_atexit @ascuheap @asulock @aculock @acsmem 945@c __internal_atexit @ascuheap @asulock @aculock @acsmem 946@c __new_exitfn @ascuheap @asulock @aculock @acsmem 947@c __libc_lock_lock @asulock @aculock 948@c calloc dup @ascuheap @acsmem 949@c __libc_lock_unlock @aculock 950@c atomic_write_barrier dup ok 951The @code{atexit} function registers the function @var{function} to be 952called at normal program termination. The @var{function} is called with 953no arguments. 954 955The return value from @code{atexit} is zero on success and nonzero if 956the function cannot be registered. 957@end deftypefun 958 959@deftypefun int on_exit (void (*@var{function})(int @var{status}, void *@var{arg}), void *@var{arg}) 960@standards{SunOS, stdlib.h} 961@safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{} @asulock{}}@acunsafe{@aculock{} @acsmem{}}} 962@c on_exit @ascuheap @asulock @aculock @acsmem 963@c new_exitfn dup @ascuheap @asulock @aculock @acsmem 964@c atomic_write_barrier dup ok 965This function is a somewhat more powerful variant of @code{atexit}. It 966accepts two arguments, a function @var{function} and an arbitrary 967pointer @var{arg}. At normal program termination, the @var{function} is 968called with two arguments: the @var{status} value passed to @code{exit}, 969and the @var{arg}. 970 971This function is included in @theglibc{} only for compatibility 972for SunOS, and may not be supported by other implementations. 973@end deftypefun 974 975Here's a trivial program that illustrates the use of @code{exit} and 976@code{atexit}: 977 978@smallexample 979@include atexit.c.texi 980@end smallexample 981 982@noindent 983When this program is executed, it just prints the message and exits. 984 985@node Aborting a Program 986@subsection Aborting a Program 987@cindex aborting a program 988 989You can abort your program using the @code{abort} function. The prototype 990for this function is in @file{stdlib.h}. 991@pindex stdlib.h 992 993@deftypefun void abort (void) 994@standards{ISO, stdlib.h} 995@safety{@prelim{}@mtsafe{}@asunsafe{@asucorrupt{}}@acunsafe{@aculock{} @acucorrupt{}}} 996@c The implementation takes a recursive lock and attempts to support 997@c calls from signal handlers, but if we're in the middle of flushing or 998@c using streams, we may encounter them in inconsistent states. 999The @code{abort} function causes abnormal program termination. This 1000does not execute cleanup functions registered with @code{atexit} or 1001@code{on_exit}. 1002 1003This function actually terminates the process by raising a 1004@code{SIGABRT} signal, and your program can include a handler to 1005intercept this signal; see @ref{Signal Handling}. 1006@end deftypefun 1007 1008@node Termination Internals 1009@subsection Termination Internals 1010 1011The @code{_exit} function is the primitive used for process termination 1012by @code{exit}. It is declared in the header file @file{unistd.h}. 1013@pindex unistd.h 1014 1015@deftypefun void _exit (int @var{status}) 1016@standards{POSIX.1, unistd.h} 1017@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}} 1018@c Direct syscall (exit_group or exit); calls __task_terminate on hurd, 1019@c and abort in the generic posix implementation. 1020The @code{_exit} function is the primitive for causing a process to 1021terminate with status @var{status}. Calling this function does not 1022execute cleanup functions registered with @code{atexit} or 1023@code{on_exit}. 1024@end deftypefun 1025 1026@deftypefun void _Exit (int @var{status}) 1027@standards{ISO, stdlib.h} 1028@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}} 1029@c Alias for _exit. 1030The @code{_Exit} function is the @w{ISO C} equivalent to @code{_exit}. 1031The @w{ISO C} committee members were not sure whether the definitions of 1032@code{_exit} and @code{_Exit} were compatible so they have not used the 1033POSIX name. 1034 1035This function was introduced in @w{ISO C99} and is declared in 1036@file{stdlib.h}. 1037@end deftypefun 1038 1039When a process terminates for any reason---either because the program 1040terminates, or as a result of a signal---the 1041following things happen: 1042 1043@itemize @bullet 1044@item 1045All open file descriptors in the process are closed. @xref{Low-Level I/O}. 1046Note that streams are not flushed automatically when the process 1047terminates; see @ref{I/O on Streams}. 1048 1049@item 1050A process exit status is saved to be reported back to the parent process 1051via @code{wait} or @code{waitpid}; see @ref{Process Completion}. If the 1052program exited, this status includes as its low-order 8 bits the program 1053exit status. 1054 1055 1056@item 1057Any child processes of the process being terminated are assigned a new 1058parent process. (On most systems, including GNU, this is the @code{init} 1059process, with process ID 1.) 1060 1061@item 1062A @code{SIGCHLD} signal is sent to the parent process. 1063 1064@item 1065If the process is a session leader that has a controlling terminal, then 1066a @code{SIGHUP} signal is sent to each process in the foreground job, 1067and the controlling terminal is disassociated from that session. 1068@xref{Job Control}. 1069 1070@item 1071If termination of a process causes a process group to become orphaned, 1072and any member of that process group is stopped, then a @code{SIGHUP} 1073signal and a @code{SIGCONT} signal are sent to each process in the 1074group. @xref{Job Control}. 1075@end itemize 1076