Common Log Format
The Common Log Format,[1] also known as the NCSA Common log format,[2] (after NCSA_HTTPd) is a standardized text file format used by web servers when generating server log files. Because the format is standardized, the files can be readily analyzed by a variety of web analysis programs, for example Webalizer and Analog.
Each line in a file stored in the Common Log Format has the following syntax:
host ident authuser date request status bytes
The format is extended by the Combined Log Format with referrer and user-agent fields.
Example
127.0.0.1 user-identifier frank [10/Oct/2000:13:55:36 -0700] "GET /apache_pb.gif HTTP/1.0" 200 2326
A "-" in a field indicates missing data.
- 127.0.0.1 is the IP address of the client (remote host) which made the request to the server.
- user-identifier is the RFC 1413 identity of the client. Usually "-".
- frank is the userid of the person requesting the document. Usually "-" unless .htaccess has requested authentication.
- [10/Oct/2000:13:55:36 -0700] is the date, time, and time zone that the request was received, by default in strftime format %d/%b/%Y:%H:%M:%S %z.
- "GET /apache_pb.gif HTTP/1.0" is the request line from the client. The method GET, /apache_pb.gif the resource requested, and HTTP/1.0 the HTTP protocol.
- 200 is the HTTP status code returned to the client. 2xx is a successful response, 3xx a redirection, 4xx a client error, and 5xx a server error.
- 2326 is the size of the object returned to the client, measured in bytes.
Usage
Log files are a standard tool for computer systems developers and administrators. They record the "what happened when by whom" of the system. This information can record faults and help their diagnosis. It can identify security breaches and other computer misuse. It can be used for auditing. It can be used for accounting purposes.
The information stored is only available for later analysis if it is stored in a form that can be analysed. This data can be structured in many ways for analysis. For example, storing it in a relational database would force the data into a query-able format. However, it would also make it more difficult to retrieve if the computer crashed, and logging would not be available unless the database was available. A plain text format minimises dependencies on other system processes, and assists logging at all phases of computer operation, including start-up and shut-down, where such processes might be unavailable.
See also
References
- "Logging in W3C httpd". World Wide Web Consortium. 1995-10-12. Retrieved 2015-04-16.
- "Log File Formats: NCSA Common". IBM. 2004-05-19. Retrieved 2013-05-07.
External links
- "Logging Control In W3C httpd: The Common Logfile Format". W3C. July 1995. Retrieved 2013-05-07.
- "Common Logfile Format". Apache webserver. 2013. Retrieved 2013-05-07.
- "Extended Log File Format". W3C Working Draft WD-logfile-960323. W3C. 1996-03-23. Retrieved 2013-05-07.