|
Goals:
- to create and improve a system for measuring Web space
- to regularly perform size measurement and evaluation of contents of the Web space on the .hr top level Internet domain
- to analyze available meta-data entries within the performed measurement
- to regularly publish measurment results and provide their further analysis and comparison
Conducted measuring:
The first measurement was conducted from March 29 to May 7, 2002, in the framework of cooperation with the National and University Library
Summary of results (CRO)
Full report (CRO)
Original data:
MIME data types
meta-data
The second (May 14 to July 22, 2003) and third (from September 8 to November 25, 2003) measurement were carried out in the framework of the project which was financed by the Ministry of Science and Technology of the Republic of Croatia under the code 2002-066 and the name "Measuring the Croatian Web space".
Summary of results (CRO)
Full report (CRO)
Original data (MWP-3): MIME data types
meta-data
Limitations:
This research relates only to the so-called surface Web, so it cannot include Web sites with protected access, dynamically generated Web sites (with dynamically generated addresses) or databases accessible through the Web. The analysis does not include contents, i.e. the context in which the Web sites appear and it will not offer, for example, the evaluation of the number of books or scientific papers published on the Web.
MWP gatherer (MWP 2.0):
MWP gatherer is a robotic program which we use in this research. The program follows the robotic exclusion protocol (robots.txt files) and META ROBOTS HTML tag.
Please send comments on the project, especially on the robotic performance,
to the address: mwp@srce.hr
|