Header record

The following header record fields are currently used by the Archie system:

Field Name Function

primary_hostname The primary hostname of the site to which the data belongs. These names are used internally by the Archie system.
preferred_hostname The name under which users see this site listed. It will be a valid Domain Name System canonical name (CNAME) for that site if one has been set.
generated_by The component of the Archie system that generated this header. Valid values are:

retrieve

Output from the parse phase

Output from the data acquisition phase

Generated by the data retrieval phase

Generated by an administrative procedure

Generated by the controlling routines (usually after an error)

Generated by normal update routines (seldom seen)

source_archie_hostname The name of the Archie server responsible for monitoring information at this Data Host.
primary_ipaddr The primary IP address of the Data Host, used internally by the Archie system.
access_methods The name of the Archie catalog to which this data belongs, e.g. anonftp (for anonymous FTP listings) or webindex (for WWW pages).
access_command The catalog-specific sequence of parameters used during the Data Acquisition phase to perform the acquisition of the raw data from the Data Host.
os_type The operating system type of the Data Host.
timezone The timezone of the Data Host.
retrieve_time The time of data acquisition from the data host. This is written as YYYYMMDDHHMMSS (year, month, day, hour, minute, second) and is always in UTC (GMT).
parse_time The time the data was parsed. Written in the same format as the retrieve_time field.
update_time The time the data was updated. Written in the same format as the retrieve_time field.
no_recs The number of “records” in this data. For example, the value for a file listing would be the number of files in the listing. This field is not used by all catalogs.
current_status The current status of the data host. This can be:

del_by_archie

available to be queried and inspected

temporarily disabled by the system

scheduled to be deleted. Usually means that the data in the system is out of date

scheduled to be deleted by the local administrator

deactivated by the local Archie administrator

catalog type is not supported at this data host

update_status Either fail or succeed. Used internally by the system to determine the result of the previous phase of the update.
prospero_host Either yes or no, indicating whether the Prospero system is in operation at that site.
data_name The name of the individual data item in the current file.
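
The typed fields above (the three YYYYMMDDHHMMSS timestamps, no_recs, update_status and prospero_host) can be interpreted along the following lines. This is only a minimal sketch in Python, not part of the Archie system: it assumes the header has already been read into a mapping of field name to raw string value, since the on-disk layout of the header record is not described here. The names parse_archie_time and interpret_header are hypothetical.

```python
from datetime import datetime, timezone

# Fields documented as YYYYMMDDHHMMSS, always in UTC.
TIME_FIELDS = ("retrieve_time", "parse_time", "update_time")


def parse_archie_time(value: str) -> datetime:
    """Parse a YYYYMMDDHHMMSS timestamp and mark it as UTC."""
    return datetime.strptime(value, "%Y%m%d%H%M%S").replace(tzinfo=timezone.utc)


def interpret_header(fields: dict[str, str]) -> dict[str, object]:
    """Convert the typed header fields; all other fields stay as strings.

    `fields` is an assumed, already-parsed mapping of field name to raw
    string value (hypothetical input format).
    """
    result: dict[str, object] = dict(fields)

    for name in TIME_FIELDS:
        if name in fields:
            result[name] = parse_archie_time(fields[name])

    # no_recs is not used by all catalogs, so it may be absent.
    if "no_recs" in fields:
        result["no_recs"] = int(fields["no_recs"])

    if "update_status" in fields and fields["update_status"] not in ("fail", "succeed"):
        raise ValueError(f"unexpected update_status: {fields['update_status']!r}")

    if "prospero_host" in fields:
        if fields["prospero_host"] not in ("yes", "no"):
            raise ValueError(f"unexpected prospero_host: {fields['prospero_host']!r}")
        result["prospero_host"] = fields["prospero_host"] == "yes"

    return result
```

For instance, a retrieve_time of 19950314120530 becomes a timezone-aware datetime for 14 March 1995, 12:05:30 UTC. The generated_by and current_status values are deliberately not validated in this sketch.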