J4 – Data set: general statistics

File: "/MZ/A/J04/X/DATASET/dataset.htm".
Text portion: January 2007 - G. Buccellati
Processed on 2022-09-23.

Introduction
Display version
     Text files
     Graphic files
     Files overall
     Text files for constituents
Archival version
     Text files

Introduction

     The data set includes two parallel and distinct versions. The display version is for use as a browser edition, so that all its component parts are interconnected through hyperlinks. The archival version contains the texts in plain ASCII format, with no hyperlinks, and the graphic files in sequential order.
     The number of basic text files is identical in both versions. However, (1) a variety of supplementary files (such as indices and frequencies) are generated by the various programs for the display version only, and (2) the file size is much larger for the display version, because of formatting and hyperlinking.
     The totals given below are all generated by the programs, and provide in tabular form a quantitative overview of the data set.

Display version

     The figures for text files cover only the files generated by the various programs to produce the full browser edition; files generated manually are excluded. All of these files are in HTML format.
     Graphic files are all in low resolution (JPEG and WMF formats). The high resolution version in TIFF format is available separately and is not included in the browser edition, hence not in this count either. The files in script format for AutoCAD are included in the text files.
     The totals for constituents give the file count for each individual constituent. Since there is one file per feature, item, etc., each number is at once the count of files and the count of constituents of that type.

Text files

total number of files 134
total number of records 130,656
total number of hyperlinks 886,382
cumulative file size (in bytes)


Graphic files

total number of graphic files overall 4,369
total number of photographs 3,092
total number of templates 458
total number of drawings 705
total number of plots 11
total number of sketches 77
cumulative file size (in bytes) 397,315,919
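
A tabulation like the one above can be cross-checked by summing the per-category counts and comparing the result with the reported overall total. The following is a minimal sketch in Python; the variable names are illustrative, and the source does not state whether the five listed categories are meant to be exhaustive, so a nonzero difference may simply reflect graphic files that fall outside them.

```python
# Counts copied from the "Graphic files" tabulation.
graphic_counts = {
    "photographs": 3092,
    "templates": 458,
    "drawings": 705,
    "plots": 11,
    "sketches": 77,
}
reported_total = 4369  # "total number of graphic files overall"

# Sum the listed categories and compare with the reported total.
category_sum = sum(graphic_counts.values())
difference = reported_total - category_sum

print(f"sum of listed categories: {category_sum:,}")
print(f"reported overall total:   {reported_total:,}")
print(f"difference:               {difference:,}")
```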


Overall totals for display files

total number of files 4,369
cumulative file size (in bytes) 397,315,919


Text files for constituents

Elements
features 295
aggregates 38
items 68
q-items 375
joins 0
negatives 0
composites 0
q-lots 517
sherds 5,615
animal bones 0

Referents
loci 26
relays 1,590
phases 19
strata 29
views 499
drawings 25
plots 11
sketches 0

Incidentals
incidentals 13


Archival version

     The archival data set contains the textual data in plain ASCII format. The data are structured but not formatted: they are not meant for interactive online use, but rather for import into other programs, such as Excel.
     In its raw format, the archival data set consists of separate files for each individual constituent (features, items, loci, etc.). These are the files that are counted (for file count and file size) in the tabulation below. The version that is available to the online user (through the Database function) is slightly different, in that all the individual files for a given constituent are combined into a single file; thus all the features are given in sequential order within the same file.
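
The step from individual constituent files to a single sequential file can be sketched as a simple concatenation. The following Python example is only an illustration under stated assumptions: the directory layout, the file-name pattern, and the function name `merge_constituent_files` are hypothetical, and sorting by file name is assumed to yield the intended sequential order. It is not the program actually used to produce the Database version.

```python
from pathlib import Path


def merge_constituent_files(source_dir, pattern, target):
    """Concatenate the files matching `pattern` into `target`.

    Files are taken in sorted file-name order (an assumption about what
    "sequential order" means). Returns the number of files merged.
    """
    # Materialize the list first so the target file is never picked up.
    paths = sorted(Path(source_dir).glob(pattern))
    with open(target, "w", encoding="ascii", errors="replace") as out:
        for path in paths:
            text = path.read_text(encoding="ascii", errors="replace")
            out.write(text)
            # Keep each constituent's data on its own lines.
            if not text.endswith("\n"):
                out.write("\n")
    return len(paths)
```

A call such as `merge_constituent_files("features", "*.txt", "features_all.txt")` would write every matching file from a hypothetical `features` directory into one file that a program such as Excel can then import in a single step.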
     The counts for text files differ from those given above for the display version for two reasons: the file size is greater in the display version because of the HTML formatting, and the file count is greater in the display version because it includes the tabulations.
     The small size of the archival database is significant, for it is this database that serves as the foundation for all subsequent elaboration by means of programs, and this addresses squarely the issue of portability and permanence.
     The counts for graphic files are identical to those given above, hence they are not repeated below. The only difference is that in the database version they are given sequentially, whereas in the display version they are called individually through hyperlinks.

Text files

total number of files
total number of records 130,656
cumulative file size (in bytes)