Dataset Catalogs
Catalogs are collections of datasets available on the Internet.
Each catalog belongs to one of the catalog types based on its owner status, content, and other details.
A catalog, especially one that is professionally organized, is usually maintained using a data CMS. The choice of data CMS for a catalog depends on its type.
Catalogs in Dateno
The Registry of Catalogs
Dateno maintains a registry of dataset catalogs by indexing them. The search engine parses the indexed catalogs, extracts their metadata, and presents it in a searchable format. The inclusion of catalogs in the index is moderated to remove useless information and provide value to users.
Catalog Types
A dataset catalog always belongs to a singular catalog type.
The search engine recognizes the type of the catalog when including it in the registry. The following catalog types are recognized in the service:
Catalog type | Typical content | Details |
---|---|---|
General research repository | References to large hubs of scientifical data | Catalogs of this type point to large collections of datasets that cannot be parsed or accessed automatically because of technical or legal reasons. |
Geoportal | Data attached to geographical maps | Often provides links to public APIs or data represented in geoinformatics system formats |
Indicators catalog | Tabulated values of economic, social, ecological, etc. indicators | Usually provides CSV and spreadsheet files |
Machine learning catalog | Datasets created for training and testing machine learning solutions | May contain naturally-looking data not describing actual affairs |
Microdata catalog | Sociological survey data | Typically contains datasets from public opinion or household surveys. Access often requires authorization, but some catalogs—such as the World Bank Microdata Library—offer open access |
Open data portal | Wide range of data describing mostly society, business, politics, and matters significant for society | The most generic type of catalog, usually maintained by national governments or non-profits |
Scientific data repository | Data provided for scientific needs | Often provides archives of files in multiple formats |
If a subset of datasets is selected, either by entering a full-text query or by choosing options in other facets, the Catalog type facet displays only the catalog types related to the selected datasets.
Dataset Search Tips
-
Choose the data catalog type in the Catalog type facet that matches the area of your research. For example,
Indicators catalog
usually meets the needs of economic research. SelectGeoportal
if the data is supposed to be tied to settlements, territories, or other geographical entities. -
Alternatively, you can use the name of a data CMS as a criterion for selecting appropriate datasets. For example, choose data CMSs designed for geographical data if you are searching for geodata.