1@node Program Basics, Processes, Signal Handling, Top
2@c %MENU% Writing the beginning and end of your program
3@chapter The Basic Program/System Interface
4
5@cindex process
6@cindex program
7@cindex address space
8@cindex thread of control
9@dfn{Processes} are the primitive units for allocation of system
10resources.  Each process has its own address space and (usually) one
11thread of control.  A process executes a program; you can have multiple
12processes executing the same program, but each process has its own copy
13of the program within its own address space and executes it
14independently of the other copies.  Though it may have multiple threads
15of control within the same program and a program may be composed of
16multiple logically separate modules, a process always executes exactly
17one program.
18
19Note that we are using a specific definition of ``program'' for the
20purposes of this manual, which corresponds to a common definition in the
21context of Unix systems.  In popular usage, ``program'' enjoys a much
22broader definition; it can refer for example to a system's kernel, an
23editor macro, a complex package of software, or a discrete section of
24code executing within a process.
25
26Writing the program is what this manual is all about.  This chapter
27explains the most basic interface between your program and the system
28that runs, or calls, it.  This includes passing of parameters (arguments
29and environment) from the system, requesting basic services from the
30system, and telling the system the program is done.
31
32A program starts another program with the @code{exec} family of system calls.
33This chapter looks at program startup from the execee's point of view.  To
34see the event from the execor's point of view, see @ref{Executing a File}.
35
36@menu
37* Program Arguments::           Parsing your program's command-line arguments
38* Environment Variables::       Less direct parameters affecting your program
39* Auxiliary Vector::            Least direct parameters affecting your program
40* System Calls::                Requesting service from the system
41* Program Termination::         Telling the system you're done; return status
42@end menu
43
44@node Program Arguments, Environment Variables, , Program Basics
45@section Program Arguments
46@cindex program arguments
47@cindex command line arguments
48@cindex arguments, to program
49
50@cindex program startup
51@cindex startup of program
52@cindex invocation of program
53@cindex @code{main} function
54@findex main
55The system starts a C program by calling the function @code{main}.  It
56is up to you to write a function named @code{main}---otherwise, you
57won't even be able to link your program without errors.
58
59In @w{ISO C} you can define @code{main} either to take no arguments, or to
60take two arguments that represent the command line arguments to the
61program, like this:
62
63@smallexample
64int main (int @var{argc}, char *@var{argv}[])
65@end smallexample
66
67@cindex argc (program argument count)
68@cindex argv (program argument vector)
69The command line arguments are the whitespace-separated tokens given in
70the shell command used to invoke the program; thus, in @samp{cat foo
71bar}, the arguments are @samp{foo} and @samp{bar}.  The only way a
72program can look at its command line arguments is via the arguments of
73@code{main}.  If @code{main} doesn't take arguments, then you cannot get
74at the command line.
75
76The value of the @var{argc} argument is the number of command line
77arguments.  The @var{argv} argument is a vector of C strings; its
78elements are the individual command line argument strings.  The file
79name of the program being run is also included in the vector as the
80first element; the value of @var{argc} counts this element.  A null
81pointer always follows the last element: @code{@var{argv}[@var{argc}]}
82is this null pointer.
83
84For the command @samp{cat foo bar}, @var{argc} is 3 and @var{argv} has
85three elements, @code{"cat"}, @code{"foo"} and @code{"bar"}.
86
87In Unix systems you can define @code{main} a third way, using three arguments:
88
89@smallexample
90int main (int @var{argc}, char *@var{argv}[], char *@var{envp}[])
91@end smallexample
92
93The first two arguments are just the same.  The third argument
94@var{envp} gives the program's environment; it is the same as the value
95of @code{environ}.  @xref{Environment Variables}.  POSIX.1 does not
96allow this three-argument form, so to be portable it is best to write
97@code{main} to take two arguments, and use the value of @code{environ}.
98
99@menu
100* Argument Syntax::             By convention, options start with a hyphen.
101* Parsing Program Arguments::   Ways to parse program options and arguments.
102@end menu
103
104@node Argument Syntax, Parsing Program Arguments, , Program Arguments
105@subsection Program Argument Syntax Conventions
106@cindex program argument syntax
107@cindex syntax, for program arguments
108@cindex command argument syntax
109
110POSIX recommends these conventions for command line arguments.
111@code{getopt} (@pxref{Getopt}) and @code{argp_parse} (@pxref{Argp}) make
112it easy to implement them.
113
114@itemize @bullet
115@item
116Arguments are options if they begin with a hyphen delimiter (@samp{-}).
117
118@item
119Multiple options may follow a hyphen delimiter in a single token if
120the options do not take arguments.  Thus, @samp{-abc} is equivalent to
121@samp{-a -b -c}.
122
123@item
124Option names are single alphanumeric characters (as for @code{isalnum};
125@pxref{Classification of Characters}).
126
127@item
128Certain options require an argument.  For example, the @option{-o} option
129of the @command{ld} command requires an argument---an output file name.
130
131@item
132An option and its argument may or may not appear as separate tokens.  (In
133other words, the whitespace separating them is optional.)  Thus,
134@w{@option{-o foo}} and @option{-ofoo} are equivalent.
135
136@item
137Options typically precede other non-option arguments.
138
139The implementations of @code{getopt} and @code{argp_parse} in @theglibc{}
140normally make it appear as if all the option arguments were
141specified before all the non-option arguments for the purposes of
142parsing, even if the user of your program intermixed option and
143non-option arguments.  They do this by reordering the elements of the
144@var{argv} array.  This behavior is nonstandard; if you want to suppress
145it, define the @code{_POSIX_OPTION_ORDER} environment variable.
146@xref{Standard Environment}.
147
148@item
149The argument @option{--} terminates all options; any following arguments
150are treated as non-option arguments, even if they begin with a hyphen.
151
152@item
153A token consisting of a single hyphen character is interpreted as an
154ordinary non-option argument.  By convention, it is used to specify
155input from or output to the standard input and output streams.
156
157@item
158Options may be supplied in any order, or appear multiple times.  The
159interpretation is left up to the particular application program.
160@end itemize
161
162@cindex long-named options
163GNU adds @dfn{long options} to these conventions.  Long options consist
164of @option{--} followed by a name made of alphanumeric characters and
165dashes.  Option names are typically one to three words long, with
166hyphens to separate words.  Users can abbreviate the option names as
167long as the abbreviations are unique.
168
169To specify an argument for a long option, write
170@option{--@var{name}=@var{value}}.  This syntax enables a long option to
171accept an argument that is itself optional.
172
173Eventually, @gnusystems{} will provide completion for long option names
174in the shell.
175
176@node Parsing Program Arguments, , Argument Syntax, Program Arguments
177@subsection Parsing Program Arguments
178
179@cindex program arguments, parsing
180@cindex command arguments, parsing
181@cindex parsing program arguments
182If the syntax for the command line arguments to your program is simple
183enough, you can simply pick the arguments off from @var{argv} by hand.
184But unless your program takes a fixed number of arguments, or all of the
185arguments are interpreted in the same way (as file names, for example),
186you are usually better off using @code{getopt} (@pxref{Getopt}) or
187@code{argp_parse} (@pxref{Argp}) to do the parsing.
188
189@code{getopt} is more standard (the short-option only version of it is a
190part of the POSIX standard), but using @code{argp_parse} is often
191easier, both for very simple and very complex option structures, because
192it does more of the dirty work for you.
193
194@menu
195* Getopt::                      Parsing program options using @code{getopt}.
196* Argp::                        Parsing program options using @code{argp_parse}.
197* Suboptions::                  Some programs need more detailed options.
198* Suboptions Example::          This shows how it could be done for @code{mount}.
199@end menu
200
201@c Getopt and argp start at the @section level so that there's
202@c enough room for their internal hierarchy (mostly a problem with
203@c argp).         -Miles
204
205@include getopt.texi
206@include argp.texi
207
208@node Suboptions, Suboptions Example, Argp, Parsing Program Arguments
209@c This is a @section so that it's at the same level as getopt and argp
210@subsubsection Parsing of Suboptions
211
212Having a single level of options is sometimes not enough.  There might
213be too many options which have to be available or a set of options is
214closely related.
215
216For this case some programs use suboptions.  One of the most prominent
217programs is certainly @code{mount}(8).  The @code{-o} option take one
218argument which itself is a comma separated list of options.  To ease the
219programming of code like this the function @code{getsubopt} is
220available.
221
222@deftypefun int getsubopt (char **@var{optionp}, char *const *@var{tokens}, char **@var{valuep})
223@standards{???, stdlib.h}
224@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
225@c getsubopt ok
226@c  strchrnul dup ok
227@c  memchr dup ok
228@c  strncmp dup ok
229
230The @var{optionp} parameter must be a pointer to a variable containing
231the address of the string to process.  When the function returns, the
232reference is updated to point to the next suboption or to the
233terminating @samp{\0} character if there are no more suboptions available.
234
235The @var{tokens} parameter references an array of strings containing the
236known suboptions.  All strings must be @samp{\0} terminated and to mark
237the end a null pointer must be stored.  When @code{getsubopt} finds a
238possible legal suboption it compares it with all strings available in
239the @var{tokens} array and returns the index in the string as the
240indicator.
241
242In case the suboption has an associated value introduced by a @samp{=}
243character, a pointer to the value is returned in @var{valuep}.  The
244string is @samp{\0} terminated.  If no argument is available
245@var{valuep} is set to the null pointer.  By doing this the caller can
246check whether a necessary value is given or whether no unexpected value
247is present.
248
249In case the next suboption in the string is not mentioned in the
250@var{tokens} array the starting address of the suboption including a
251possible value is returned in @var{valuep} and the return value of the
252function is @samp{-1}.
253@end deftypefun
254
255@node Suboptions Example, , Suboptions, Parsing Program Arguments
256@subsection Parsing of Suboptions Example
257
258The code which might appear in the @code{mount}(8) program is a perfect
259example of the use of @code{getsubopt}:
260
261@smallexample
262@include subopt.c.texi
263@end smallexample
264
265
266@node Environment Variables, Auxiliary Vector, Program Arguments, Program Basics
267@section Environment Variables
268
269@cindex environment variable
270When a program is executed, it receives information about the context in
271which it was invoked in two ways.  The first mechanism uses the
272@var{argv} and @var{argc} arguments to its @code{main} function, and is
273discussed in @ref{Program Arguments}.  The second mechanism uses
274@dfn{environment variables} and is discussed in this section.
275
276The @var{argv} mechanism is typically used to pass command-line
277arguments specific to the particular program being invoked.  The
278environment, on the other hand, keeps track of information that is
279shared by many programs, changes infrequently, and that is less
280frequently used.
281
282The environment variables discussed in this section are the same
283environment variables that you set using assignments and the
284@code{export} command in the shell.  Programs executed from the shell
285inherit all of the environment variables from the shell.
286@c !!! xref to right part of bash manual when it exists
287
288@cindex environment
289Standard environment variables are used for information about the user's
290home directory, terminal type, current locale, and so on; you can define
291additional variables for other purposes.  The set of all environment
292variables that have values is collectively known as the
293@dfn{environment}.
294
295Names of environment variables are case-sensitive and must not contain
296the character @samp{=}.  System-defined environment variables are
297invariably uppercase.
298
299The values of environment variables can be anything that can be
300represented as a string.  A value must not contain an embedded null
301character, since this is assumed to terminate the string.
302
303
304@menu
305* Environment Access::          How to get and set the values of
306				 environment variables.
307* Standard Environment::        These environment variables have
308                		 standard interpretations.
309@end menu
310
311@node Environment Access
312@subsection Environment Access
313@cindex environment access
314@cindex environment representation
315
316The value of an environment variable can be accessed with the
317@code{getenv} function.  This is declared in the header file
318@file{stdlib.h}.
319@pindex stdlib.h
320
321Libraries should use @code{secure_getenv} instead of @code{getenv}, so
322that they do not accidentally use untrusted environment variables.
323Modifications of environment variables are not allowed in
324multi-threaded programs.  The @code{getenv} and @code{secure_getenv}
325functions can be safely used in multi-threaded programs.
326
327@deftypefun {char *} getenv (const char *@var{name})
328@standards{ISO, stdlib.h}
329@safety{@prelim{}@mtsafe{@mtsenv{}}@assafe{}@acsafe{}}
330@c Unguarded access to __environ.
331This function returns a string that is the value of the environment
332variable @var{name}.  You must not modify this string.  In some non-Unix
333systems not using @theglibc{}, it might be overwritten by subsequent
334calls to @code{getenv} (but not by any other library function).  If the
335environment variable @var{name} is not defined, the value is a null
336pointer.
337@end deftypefun
338
339@deftypefun {char *} secure_getenv (const char *@var{name})
340@standards{GNU, stdlib.h}
341@safety{@prelim{}@mtsafe{@mtsenv{}}@assafe{}@acsafe{}}
342@c Calls getenv unless secure mode is enabled.
343This function is similar to @code{getenv}, but it returns a null
344pointer if the environment is untrusted.  This happens when the
345program file has SUID or SGID bits set.  General-purpose libraries
346should always prefer this function over @code{getenv} to avoid
347vulnerabilities if the library is referenced from a SUID/SGID program.
348
349This function is a GNU extension.
350@end deftypefun
351
352
353@deftypefun int putenv (char *@var{string})
354@standards{SVID, stdlib.h}
355@safety{@prelim{}@mtunsafe{@mtasuconst{:@mtsenv{}}}@asunsafe{@ascuheap{} @asulock{}}@acunsafe{@acucorrupt{} @aculock{} @acsmem{}}}
356@c putenv @mtasuconst:@mtsenv @ascuheap @asulock @acucorrupt @aculock @acsmem
357@c  strchr dup ok
358@c  strndup dup @ascuheap @acsmem
359@c  add_to_environ dup @mtasuconst:@mtsenv @ascuheap @asulock @acucorrupt @aculock @acsmem
360@c  free dup @ascuheap @acsmem
361@c  unsetenv dup @mtasuconst:@mtsenv @asulock @aculock
362The @code{putenv} function adds or removes definitions from the environment.
363If the @var{string} is of the form @samp{@var{name}=@var{value}}, the
364definition is added to the environment.  Otherwise, the @var{string} is
365interpreted as the name of an environment variable, and any definition
366for this variable in the environment is removed.
367
368If the function is successful it returns @code{0}.  Otherwise the return
369value is nonzero and @code{errno} is set to indicate the error.
370
371The difference to the @code{setenv} function is that the exact string
372given as the parameter @var{string} is put into the environment.  If the
373user should change the string after the @code{putenv} call this will
374reflect automatically in the environment.  This also requires that
375@var{string} not be an automatic variable whose scope is left before the
376variable is removed from the environment.  The same applies of course to
377dynamically allocated variables which are freed later.
378
379This function is part of the extended Unix interface.  You should define
380@var{_XOPEN_SOURCE} before including any header.
381@end deftypefun
382
383
384@deftypefun int setenv (const char *@var{name}, const char *@var{value}, int @var{replace})
385@standards{BSD, stdlib.h}
386@safety{@prelim{}@mtunsafe{@mtasuconst{:@mtsenv{}}}@asunsafe{@ascuheap{} @asulock{}}@acunsafe{@acucorrupt{} @aculock{} @acsmem{}}}
387@c setenv @mtasuconst:@mtsenv @ascuheap @asulock @acucorrupt @aculock @acsmem
388@c  add_to_environ @mtasuconst:@mtsenv @ascuheap @asulock @acucorrupt @aculock @acsmem
389@c   strlen dup ok
390@c   libc_lock_lock @asulock @aculock
391@c   strncmp dup ok
392@c   realloc dup @ascuheap @acsmem
393@c   libc_lock_unlock @aculock
394@c   malloc dup @ascuheap @acsmem
395@c   free dup @ascuheap @acsmem
396@c   mempcpy dup ok
397@c   memcpy dup ok
398@c   KNOWN_VALUE ok
399@c    tfind(strcmp) [no @mtsrace guarded access]
400@c     strcmp dup ok
401@c   STORE_VALUE @ascuheap @acucorrupt @acsmem
402@c    tsearch(strcmp) @ascuheap @acucorrupt @acsmem [no @mtsrace or @asucorrupt guarded access makes for mtsafe and @asulock]
403@c     strcmp dup ok
404The @code{setenv} function can be used to add a new definition to the
405environment.  The entry with the name @var{name} is replaced by the
406value @samp{@var{name}=@var{value}}.  Please note that this is also true
407if @var{value} is the empty string.  To do this a new string is created
408and the strings @var{name} and @var{value} are copied.  A null pointer
409for the @var{value} parameter is illegal.  If the environment already
410contains an entry with key @var{name} the @var{replace} parameter
411controls the action.  If replace is zero, nothing happens.  Otherwise
412the old entry is replaced by the new one.
413
414Please note that you cannot remove an entry completely using this function.
415
416If the function is successful it returns @code{0}.  Otherwise the
417environment is unchanged and the return value is @code{-1} and
418@code{errno} is set.
419
420This function was originally part of the BSD library but is now part of
421the Unix standard.
422@end deftypefun
423
424@deftypefun int unsetenv (const char *@var{name})
425@standards{BSD, stdlib.h}
426@safety{@prelim{}@mtunsafe{@mtasuconst{:@mtsenv{}}}@asunsafe{@asulock{}}@acunsafe{@aculock{}}}
427@c unsetenv @mtasuconst:@mtsenv @asulock @aculock
428@c  strchr dup ok
429@c  strlen dup ok
430@c  libc_lock_lock @asulock @aculock
431@c  strncmp dup ok
432@c  libc_lock_unlock @aculock
433Using this function one can remove an entry completely from the
434environment.  If the environment contains an entry with the key
435@var{name} this whole entry is removed.  A call to this function is
436equivalent to a call to @code{putenv} when the @var{value} part of the
437string is empty.
438
439The function returns @code{-1} if @var{name} is a null pointer, points to
440an empty string, or points to a string containing a @code{=} character.
441It returns @code{0} if the call succeeded.
442
443This function was originally part of the BSD library but is now part of
444the Unix standard.  The BSD version had no return value, though.
445@end deftypefun
446
447There is one more function to modify the whole environment.  This
448function is said to be used in the POSIX.9 (POSIX bindings for Fortran
44977) and so one should expect it did made it into POSIX.1.  But this
450never happened.  But we still provide this function as a GNU extension
451to enable writing standard compliant Fortran environments.
452
453@deftypefun int clearenv (void)
454@standards{GNU, stdlib.h}
455@safety{@prelim{}@mtunsafe{@mtasuconst{:@mtsenv{}}}@asunsafe{@ascuheap{} @asulock{}}@acunsafe{@aculock{} @acsmem{}}}
456@c clearenv @mtasuconst:@mtsenv @ascuheap @asulock @aculock @acsmem
457@c  libc_lock_lock @asulock @aculock
458@c  free dup @ascuheap @acsmem
459@c  libc_lock_unlock @aculock
460The @code{clearenv} function removes all entries from the environment.
461Using @code{putenv} and @code{setenv} new entries can be added again
462later.
463
464If the function is successful it returns @code{0}.  Otherwise the return
465value is nonzero.
466@end deftypefun
467
468
469You can deal directly with the underlying representation of environment
470objects to add more variables to the environment (for example, to
471communicate with another program you are about to execute;
472@pxref{Executing a File}).
473
474@deftypevar {char **} environ
475@standards{POSIX.1, unistd.h}
476The environment is represented as an array of strings.  Each string is
477of the format @samp{@var{name}=@var{value}}.  The order in which
478strings appear in the environment is not significant, but the same
479@var{name} must not appear more than once.  The last element of the
480array is a null pointer.
481
482This variable is declared in the header file @file{unistd.h}.
483
484If you just want to get the value of an environment variable, use
485@code{getenv}.
486@end deftypevar
487
488Unix systems, and @gnusystems{}, pass the initial value of
489@code{environ} as the third argument to @code{main}.
490@xref{Program Arguments}.
491
492@node Standard Environment
493@subsection Standard Environment Variables
494@cindex standard environment variables
495
496These environment variables have standard meanings.  This doesn't mean
497that they are always present in the environment; but if these variables
498@emph{are} present, they have these meanings.  You shouldn't try to use
499these environment variable names for some other purpose.
500
501@comment Extra blank lines make it look better.
502@table @code
503@item HOME
504@cindex @code{HOME} environment variable
505@cindex home directory
506
507This is a string representing the user's @dfn{home directory}, or
508initial default working directory.
509
510The user can set @code{HOME} to any value.
511If you need to make sure to obtain the proper home directory
512for a particular user, you should not use @code{HOME}; instead,
513look up the user's name in the user database (@pxref{User Database}).
514
515For most purposes, it is better to use @code{HOME}, precisely because
516this lets the user specify the value.
517
518@c !!! also USER
519@item LOGNAME
520@cindex @code{LOGNAME} environment variable
521
522This is the name that the user used to log in.  Since the value in the
523environment can be tweaked arbitrarily, this is not a reliable way to
524identify the user who is running a program; a function like
525@code{getlogin} (@pxref{Who Logged In}) is better for that purpose.
526
527For most purposes, it is better to use @code{LOGNAME}, precisely because
528this lets the user specify the value.
529
530@item PATH
531@cindex @code{PATH} environment variable
532
533A @dfn{path} is a sequence of directory names which is used for
534searching for a file.  The variable @code{PATH} holds a path used
535for searching for programs to be run.
536
537The @code{execlp} and @code{execvp} functions (@pxref{Executing a File})
538use this environment variable, as do many shells and other utilities
539which are implemented in terms of those functions.
540
541The syntax of a path is a sequence of directory names separated by
542colons.  An empty string instead of a directory name stands for the
543current directory (@pxref{Working Directory}).
544
545A typical value for this environment variable might be a string like:
546
547@smallexample
548:/bin:/etc:/usr/bin:/usr/new/X11:/usr/new:/usr/local/bin
549@end smallexample
550
551This means that if the user tries to execute a program named @code{foo},
552the system will look for files named @file{foo}, @file{/bin/foo},
553@file{/etc/foo}, and so on.  The first of these files that exists is
554the one that is executed.
555
556@c !!! also TERMCAP
557@item TERM
558@cindex @code{TERM} environment variable
559
560This specifies the kind of terminal that is receiving program output.
561Some programs can make use of this information to take advantage of
562special escape sequences or terminal modes supported by particular kinds
563of terminals.  Many programs which use the termcap library
564(@pxref{Finding a Terminal Description,Find,,termcap,The Termcap Library
565Manual}) use the @code{TERM} environment variable, for example.
566
567@item TZ
568@cindex @code{TZ} environment variable
569
570This specifies the time zone.  @xref{TZ Variable}, for information about
571the format of this string and how it is used.
572
573@item LANG
574@cindex @code{LANG} environment variable
575
576This specifies the default locale to use for attribute categories where
577neither @code{LC_ALL} nor the specific environment variable for that
578category is set.  @xref{Locales}, for more information about
579locales.
580
581@ignore
582@c I doubt this really exists
583@item LC_ALL
584@cindex @code{LC_ALL} environment variable
585
586This is similar to the @code{LANG} environment variable.  However, its
587value takes precedence over any values provided for the individual
588attribute category environment variables, or for the @code{LANG}
589environment variable.
590@end ignore
591
592@item LC_ALL
593@cindex @code{LC_ALL} environment variable
594
595If this environment variable is set it overrides the selection for all
596the locales done using the other @code{LC_*} environment variables.  The
597value of the other @code{LC_*} environment variables is simply ignored
598in this case.
599
600@item LC_COLLATE
601@cindex @code{LC_COLLATE} environment variable
602
603This specifies what locale to use for string sorting.
604
605@item LC_CTYPE
606@cindex @code{LC_CTYPE} environment variable
607
608This specifies what locale to use for character sets and character
609classification.
610
611@item LC_MESSAGES
612@cindex @code{LC_MESSAGES} environment variable
613
614This specifies what locale to use for printing messages and to parse
615responses.
616
617@item LC_MONETARY
618@cindex @code{LC_MONETARY} environment variable
619
620This specifies what locale to use for formatting monetary values.
621
622@item LC_NUMERIC
623@cindex @code{LC_NUMERIC} environment variable
624
625This specifies what locale to use for formatting numbers.
626
627@item LC_TIME
628@cindex @code{LC_TIME} environment variable
629
630This specifies what locale to use for formatting date/time values.
631
632@item NLSPATH
633@cindex @code{NLSPATH} environment variable
634
635This specifies the directories in which the @code{catopen} function
636looks for message translation catalogs.
637
638@item _POSIX_OPTION_ORDER
639@cindex @code{_POSIX_OPTION_ORDER} environment variable.
640
641If this environment variable is defined, it suppresses the usual
642reordering of command line arguments by @code{getopt} and
643@code{argp_parse}.  @xref{Argument Syntax}.
644
645@c !!! GNU also has COREFILE, CORESERVER, EXECSERVERS
646@end table
647
648@node Auxiliary Vector
649@section Auxiliary Vector
650@cindex auxiliary vector
651
652When a program is executed, it receives information from the operating
653system about the environment in which it is operating.  The form of this
654information is a table of key-value pairs, where the keys are from the
655set of @samp{AT_} values in @file{elf.h}.  Some of the data is provided
656by the kernel for libc consumption, and may be obtained by ordinary
657interfaces, such as @code{sysconf}.  However, on a platform-by-platform
658basis there may be information that is not available any other way.
659
660@subsection Definition of @code{getauxval}
661@deftypefun {unsigned long int} getauxval (unsigned long int @var{type})
662@standards{???, sys/auxv.h}
663@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
664@c Reads from hwcap or iterates over constant auxv.
665This function is used to inquire about the entries in the auxiliary
666vector.  The @var{type} argument should be one of the @samp{AT_} symbols
667defined in @file{elf.h}.  If a matching entry is found, the value is
668returned; if the entry is not found, zero is returned and @code{errno} is
669set to @code{ENOENT}.
670@end deftypefun
671
672For some platforms, the key @code{AT_HWCAP} is the easiest way to inquire
673about any instruction set extensions available at runtime.  In this case,
674there will (of necessity) be a platform-specific set of @samp{HWCAP_}
675values masked together that describe the capabilities of the cpu on which
676the program is being executed.
677
678@node System Calls
679@section System Calls
680
681@cindex system call
682A system call is a request for service that a program makes of the
683kernel.  The service is generally something that only the kernel has
684the privilege to do, such as doing I/O.  Programmers don't normally
685need to be concerned with system calls because there are functions in
686@theglibc{} to do virtually everything that system calls do.
687These functions work by making system calls themselves.  For example,
688there is a system call that changes the permissions of a file, but
689you don't need to know about it because you can just use @theglibc{}'s
690@code{chmod} function.
691
692@cindex kernel call
693System calls are sometimes called kernel calls.
694
695However, there are times when you want to make a system call explicitly,
696and for that, @theglibc{} provides the @code{syscall} function.
697@code{syscall} is harder to use and less portable than functions like
698@code{chmod}, but easier and more portable than coding the system call
699in assembler instructions.
700
701@code{syscall} is most useful when you are working with a system call
702which is special to your system or is newer than @theglibc{} you
703are using.  @code{syscall} is implemented in an entirely generic way;
704the function does not know anything about what a particular system
705call does or even if it is valid.
706
707The description of @code{syscall} in this section assumes a certain
708protocol for system calls on the various platforms on which @theglibc{}
709runs.  That protocol is not defined by any strong authority, but
710we won't describe it here either because anyone who is coding
711@code{syscall} probably won't accept anything less than kernel and C
712library source code as a specification of the interface between them
713anyway.
714
715
716@code{syscall} is declared in @file{unistd.h}.
717
718@deftypefun {long int} syscall (long int @var{sysno}, @dots{})
719@standards{???, unistd.h}
720@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
721
722@code{syscall} performs a generic system call.
723
724@cindex system call number
725@var{sysno} is the system call number.  Each kind of system call is
726identified by a number.  Macros for all the possible system call numbers
727are defined in @file{sys/syscall.h}
728
729The remaining arguments are the arguments for the system call, in
730order, and their meanings depend on the kind of system call.  Each kind
731of system call has a definite number of arguments, from zero to five.
732If you code more arguments than the system call takes, the extra ones to
733the right are ignored.
734
735The return value is the return value from the system call, unless the
736system call failed.  In that case, @code{syscall} returns @code{-1} and
737sets @code{errno} to an error code that the system call returned.  Note
738that system calls do not return @code{-1} when they succeed.
739@cindex errno
740
741If you specify an invalid @var{sysno}, @code{syscall} returns @code{-1}
742with @code{errno} = @code{ENOSYS}.
743
744Example:
745
746@smallexample
747
748#include <unistd.h>
749#include <sys/syscall.h>
750#include <errno.h>
751
752@dots{}
753
754int rc;
755
756rc = syscall(SYS_chmod, "/etc/passwd", 0444);
757
758if (rc == -1)
759   fprintf(stderr, "chmod failed, errno = %d\n", errno);
760
761@end smallexample
762
763This, if all the compatibility stars are aligned, is equivalent to the
764following preferable code:
765
766@smallexample
767
768#include <sys/types.h>
769#include <sys/stat.h>
770#include <errno.h>
771
772@dots{}
773
774int rc;
775
776rc = chmod("/etc/passwd", 0444);
777if (rc == -1)
778   fprintf(stderr, "chmod failed, errno = %d\n", errno);
779
780@end smallexample
781
782@end deftypefun
783
784
785@node Program Termination
786@section Program Termination
787@cindex program termination
788@cindex process termination
789
790@cindex exit status value
791The usual way for a program to terminate is simply for its @code{main}
792function to return.  The @dfn{exit status value} returned from the
793@code{main} function is used to report information back to the process's
794parent process or shell.
795
796A program can also terminate normally by calling the @code{exit}
797function.
798
799In addition, programs can be terminated by signals; this is discussed in
800more detail in @ref{Signal Handling}.  The @code{abort} function causes
801a signal that kills the program.
802
803@menu
804* Normal Termination::          If a program calls @code{exit}, a
805                                 process terminates normally.
806* Exit Status::                 The @code{exit status} provides information
807                                 about why the process terminated.
808* Cleanups on Exit::            A process can run its own cleanup
809                                 functions upon normal termination.
810* Aborting a Program::          The @code{abort} function causes
811                                 abnormal program termination.
812* Termination Internals::       What happens when a process terminates.
813@end menu
814
815@node Normal Termination
816@subsection Normal Termination
817
818A process terminates normally when its program signals it is done by
819calling @code{exit}.  Returning from @code{main} is equivalent to
820calling @code{exit}, and the value that @code{main} returns is used as
821the argument to @code{exit}.
822
823@deftypefun void exit (int @var{status})
824@standards{ISO, stdlib.h}
825@safety{@prelim{}@mtunsafe{@mtasurace{:exit}}@asunsafe{@asucorrupt{}}@acunsafe{@acucorrupt{} @aculock{}}}
826@c Access to the atexit/on_exit list, the libc_atexit hook and tls dtors
827@c is not guarded.  Streams must be flushed, and that triggers the usual
828@c AS and AC issues with streams.
829The @code{exit} function tells the system that the program is done, which
830causes it to terminate the process.
831
832@var{status} is the program's exit status, which becomes part of the
833process' termination status.  This function does not return.
834@end deftypefun
835
836Normal termination causes the following actions:
837
838@enumerate
839@item
840Functions that were registered with the @code{atexit} or @code{on_exit}
841functions are called in the reverse order of their registration.  This
842mechanism allows your application to specify its own ``cleanup'' actions
843to be performed at program termination.  Typically, this is used to do
844things like saving program state information in a file, or unlocking
845locks in shared data bases.
846
847@item
848All open streams are closed, writing out any buffered output data.  See
849@ref{Closing Streams}.  In addition, temporary files opened
850with the @code{tmpfile} function are removed; see @ref{Temporary Files}.
851
852@item
853@code{_exit} is called, terminating the program.  @xref{Termination Internals}.
854@end enumerate
855
856@node Exit Status
857@subsection Exit Status
858@cindex exit status
859
860When a program exits, it can return to the parent process a small
861amount of information about the cause of termination, using the
862@dfn{exit status}.  This is a value between 0 and 255 that the exiting
863process passes as an argument to @code{exit}.
864
865Normally you should use the exit status to report very broad information
866about success or failure.  You can't provide a lot of detail about the
867reasons for the failure, and most parent processes would not want much
868detail anyway.
869
870There are conventions for what sorts of status values certain programs
871should return.  The most common convention is simply 0 for success and 1
872for failure.  Programs that perform comparison use a different
873convention: they use status 1 to indicate a mismatch, and status 2 to
874indicate an inability to compare.  Your program should follow an
875existing convention if an existing convention makes sense for it.
876
877A general convention reserves status values 128 and up for special
878purposes.  In particular, the value 128 is used to indicate failure to
879execute another program in a subprocess.  This convention is not
880universally obeyed, but it is a good idea to follow it in your programs.
881
882@strong{Warning:} Don't try to use the number of errors as the exit
883status.  This is actually not very useful; a parent process would
884generally not care how many errors occurred.  Worse than that, it does
885not work, because the status value is truncated to eight bits.
886Thus, if the program tried to report 256 errors, the parent would
887receive a report of 0 errors---that is, success.
888
889For the same reason, it does not work to use the value of @code{errno}
890as the exit status---these can exceed 255.
891
892@strong{Portability note:} Some non-POSIX systems use different
893conventions for exit status values.  For greater portability, you can
894use the macros @code{EXIT_SUCCESS} and @code{EXIT_FAILURE} for the
895conventional status value for success and failure, respectively.  They
896are declared in the file @file{stdlib.h}.
897@pindex stdlib.h
898
899@deftypevr Macro int EXIT_SUCCESS
900@standards{ISO, stdlib.h}
901This macro can be used with the @code{exit} function to indicate
902successful program completion.
903
904On POSIX systems, the value of this macro is @code{0}.  On other
905systems, the value might be some other (possibly non-constant) integer
906expression.
907@end deftypevr
908
909@deftypevr Macro int EXIT_FAILURE
910@standards{ISO, stdlib.h}
911This macro can be used with the @code{exit} function to indicate
912unsuccessful program completion in a general sense.
913
914On POSIX systems, the value of this macro is @code{1}.  On other
915systems, the value might be some other (possibly non-constant) integer
916expression.  Other nonzero status values also indicate failures.  Certain
917programs use different nonzero status values to indicate particular
918kinds of "non-success".  For example, @code{diff} uses status value
919@code{1} to mean that the files are different, and @code{2} or more to
920mean that there was difficulty in opening the files.
921@end deftypevr
922
923Don't confuse a program's exit status with a process' termination status.
924There are lots of ways a process can terminate besides having its program
925finish.  In the event that the process termination @emph{is} caused by program
926termination (i.e., @code{exit}), though, the program's exit status becomes
927part of the process' termination status.
928
929@node Cleanups on Exit
930@subsection Cleanups on Exit
931
932Your program can arrange to run its own cleanup functions if normal
933termination happens.  If you are writing a library for use in various
934application programs, then it is unreliable to insist that all
935applications call the library's cleanup functions explicitly before
936exiting.  It is much more robust to make the cleanup invisible to the
937application, by setting up a cleanup function in the library itself
938using @code{atexit} or @code{on_exit}.
939
940@deftypefun int atexit (void (*@var{function}) (void))
941@standards{ISO, stdlib.h}
942@safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{} @asulock{}}@acunsafe{@aculock{} @acsmem{}}}
943@c atexit @ascuheap @asulock @aculock @acsmem
944@c  cxa_atexit @ascuheap @asulock @aculock @acsmem
945@c   __internal_atexit @ascuheap @asulock @aculock @acsmem
946@c    __new_exitfn @ascuheap @asulock @aculock @acsmem
947@c     __libc_lock_lock @asulock @aculock
948@c     calloc dup @ascuheap @acsmem
949@c     __libc_lock_unlock @aculock
950@c    atomic_write_barrier dup ok
951The @code{atexit} function registers the function @var{function} to be
952called at normal program termination.  The @var{function} is called with
953no arguments.
954
955The return value from @code{atexit} is zero on success and nonzero if
956the function cannot be registered.
957@end deftypefun
958
959@deftypefun int on_exit (void (*@var{function})(int @var{status}, void *@var{arg}), void *@var{arg})
960@standards{SunOS, stdlib.h}
961@safety{@prelim{}@mtsafe{}@asunsafe{@ascuheap{} @asulock{}}@acunsafe{@aculock{} @acsmem{}}}
962@c on_exit @ascuheap @asulock @aculock @acsmem
963@c  new_exitfn dup @ascuheap @asulock @aculock @acsmem
964@c  atomic_write_barrier dup ok
965This function is a somewhat more powerful variant of @code{atexit}.  It
966accepts two arguments, a function @var{function} and an arbitrary
967pointer @var{arg}.  At normal program termination, the @var{function} is
968called with two arguments:  the @var{status} value passed to @code{exit},
969and the @var{arg}.
970
971This function is included in @theglibc{} only for compatibility
972for SunOS, and may not be supported by other implementations.
973@end deftypefun
974
975Here's a trivial program that illustrates the use of @code{exit} and
976@code{atexit}:
977
978@smallexample
979@include atexit.c.texi
980@end smallexample
981
982@noindent
983When this program is executed, it just prints the message and exits.
984
985@node Aborting a Program
986@subsection Aborting a Program
987@cindex aborting a program
988
989You can abort your program using the @code{abort} function.  The prototype
990for this function is in @file{stdlib.h}.
991@pindex stdlib.h
992
993@deftypefun void abort (void)
994@standards{ISO, stdlib.h}
995@safety{@prelim{}@mtsafe{}@asunsafe{@asucorrupt{}}@acunsafe{@aculock{} @acucorrupt{}}}
996@c The implementation takes a recursive lock and attempts to support
997@c calls from signal handlers, but if we're in the middle of flushing or
998@c using streams, we may encounter them in inconsistent states.
999The @code{abort} function causes abnormal program termination.  This
1000does not execute cleanup functions registered with @code{atexit} or
1001@code{on_exit}.
1002
1003This function actually terminates the process by raising a
1004@code{SIGABRT} signal, and your program can include a handler to
1005intercept this signal; see @ref{Signal Handling}.
1006@end deftypefun
1007
1008@node Termination Internals
1009@subsection Termination Internals
1010
1011The @code{_exit} function is the primitive used for process termination
1012by @code{exit}.  It is declared in the header file @file{unistd.h}.
1013@pindex unistd.h
1014
1015@deftypefun void _exit (int @var{status})
1016@standards{POSIX.1, unistd.h}
1017@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
1018@c Direct syscall (exit_group or exit); calls __task_terminate on hurd,
1019@c and abort in the generic posix implementation.
1020The @code{_exit} function is the primitive for causing a process to
1021terminate with status @var{status}.  Calling this function does not
1022execute cleanup functions registered with @code{atexit} or
1023@code{on_exit}.
1024@end deftypefun
1025
1026@deftypefun void _Exit (int @var{status})
1027@standards{ISO, stdlib.h}
1028@safety{@prelim{}@mtsafe{}@assafe{}@acsafe{}}
1029@c Alias for _exit.
1030The @code{_Exit} function is the @w{ISO C} equivalent to @code{_exit}.
1031The @w{ISO C} committee members were not sure whether the definitions of
1032@code{_exit} and @code{_Exit} were compatible so they have not used the
1033POSIX name.
1034
1035This function was introduced in @w{ISO C99} and is declared in
1036@file{stdlib.h}.
1037@end deftypefun
1038
1039When a process terminates for any reason---either because the program
1040terminates, or as a result of a signal---the
1041following things happen:
1042
1043@itemize @bullet
1044@item
1045All open file descriptors in the process are closed.  @xref{Low-Level I/O}.
1046Note that streams are not flushed automatically when the process
1047terminates; see @ref{I/O on Streams}.
1048
1049@item
1050A process exit status is saved to be reported back to the parent process
1051via @code{wait} or @code{waitpid}; see @ref{Process Completion}.  If the
1052program exited, this status includes as its low-order 8 bits the program
1053exit status.
1054
1055
1056@item
1057Any child processes of the process being terminated are assigned a new
1058parent process.  (On most systems, including GNU, this is the @code{init}
1059process, with process ID 1.)
1060
1061@item
1062A @code{SIGCHLD} signal is sent to the parent process.
1063
1064@item
1065If the process is a session leader that has a controlling terminal, then
1066a @code{SIGHUP} signal is sent to each process in the foreground job,
1067and the controlling terminal is disassociated from that session.
1068@xref{Job Control}.
1069
1070@item
1071If termination of a process causes a process group to become orphaned,
1072and any member of that process group is stopped, then a @code{SIGHUP}
1073signal and a @code{SIGCONT} signal are sent to each process in the
1074group.  @xref{Job Control}.
1075@end itemize
1076