Dataset Catalogs

Catalogs are collections of datasets available on the Internet.

Each catalog belongs to one of the catalog types based on its owner status, content, and other details.

A catalog, especially a professionally organized one, is usually maintained with a data CMS (a content management system for data). The choice of data CMS for a catalog depends on the catalog's type.

Catalogs in Dateno

The Registry of Catalogs

Dateno maintains a registry of dataset catalogs by indexing them. The search engine parses the indexed catalogs, extracts their metadata, and presents it in a searchable format. Inclusion of catalogs in the index is moderated so that low-value entries are kept out and the registry stays useful.

Catalog Types

A dataset catalog always belongs to exactly one catalog type.

The search engine recognizes the type of the catalog when including it in the registry. The following catalog types are recognized in the service:

| Catalog type | Typical content | Details |
| --- | --- | --- |
| General research repository | References to large hubs of scientific data | Points to large collections of datasets that cannot be parsed or accessed automatically for technical or legal reasons |
| Geoportal | Data attached to geographical maps | Often provides links to public APIs or data in geographic information system (GIS) formats |
| Indicators catalog | Tabulated values of economic, social, ecological, and other indicators | Usually provides CSV and spreadsheet files |
| Machine learning catalog | Datasets created for training and testing machine learning solutions | May contain natural-looking data that does not describe real-world circumstances |
| Microdata catalog | Sociological survey data | Typically contains datasets from public opinion or household surveys. Access often requires authorization, but some catalogs, such as the World Bank Microdata Library, offer open access |
| Open data portal | A wide range of data, mostly about society, business, politics, and other matters of public significance | The most generic type of catalog, usually maintained by national governments or non-profits |
| Scientific data repository | Data provided for scientific needs | Often provides archives of files in multiple formats |

If a subset of datasets is selected, either by entering a full-text query or by choosing options in other facets, the Catalog type facet displays only the catalog types related to the selected datasets.
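The narrowing behavior of the facet can be illustrated with a minimal sketch. It does not reflect how Dateno is implemented: the records, the field names (`title`, `catalog_type`), and the substring matching are all assumptions made for illustration.

```python
from collections import Counter

# Hypothetical records; titles and field names are illustrative only.
DATASETS = [
    {"title": "GDP by region", "catalog_type": "Indicators catalog"},
    {"title": "Regional road network", "catalog_type": "Geoportal"},
    {"title": "Household survey 2020", "catalog_type": "Microdata catalog"},
    {"title": "Regional unemployment rate", "catalog_type": "Indicators catalog"},
]


def select(datasets, query):
    """Return datasets whose title contains the query, case-insensitively."""
    needle = query.lower()
    return [d for d in datasets if needle in d["title"].lower()]


def catalog_type_facet(datasets):
    """Count the catalog types that occur among the given datasets."""
    return Counter(d["catalog_type"] for d in datasets)


# The facet is computed from the selected subset, not from the whole index.
facet = catalog_type_facet(select(DATASETS, "region"))
```

Because the facet is derived from the selected subset, a catalog type with no matching datasets (here, Microdata catalog) does not appear among the options.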

Dataset Search Tips

  • In the Catalog type facet, choose the catalog type that matches your area of research. For example, the Indicators catalog type usually meets the needs of economic research. Select Geoportal if the data you need is tied to settlements, territories, or other geographical entities.

  • Alternatively, you can use the name of a data CMS as a criterion for selecting appropriate datasets. For example, choose data CMSs designed for geographical data if you are searching for geodata.
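The two tips can be compared with a small sketch over in-memory records. Everything here is hypothetical: the catalog names, the `catalog_type` and `cms` fields, and the choice of GeoNode and GeoNetwork as examples of geodata-oriented software are assumptions for illustration, not a description of Dateno's data model.

```python
# Software names picked as illustrative examples of geodata-oriented data CMSs.
GEODATA_CMS = {"GeoNode", "GeoNetwork"}

# Hypothetical catalog records; names and field names are illustrative only.
CATALOGS = [
    {"name": "Example A", "catalog_type": "Geoportal", "cms": "GeoNode"},
    {"name": "Example B", "catalog_type": "Open data portal", "cms": "CKAN"},
    {"name": "Example C", "catalog_type": "Open data portal", "cms": "GeoNetwork"},
]


def by_catalog_type(catalogs, catalog_type):
    """Keep catalogs of the given catalog type."""
    return [c for c in catalogs if c["catalog_type"] == catalog_type]


def by_cms(catalogs, cms_names):
    """Keep catalogs maintained with any of the given data CMSs."""
    return [c for c in catalogs if c["cms"] in cms_names]


geoportals = [c["name"] for c in by_catalog_type(CATALOGS, "Geoportal")]
geodata_cms_catalogs = [c["name"] for c in by_cms(CATALOGS, GEODATA_CMS)]
```

In this made-up data, filtering by data CMS also returns "Example C", a catalog classified as an open data portal but maintained with geodata software, which is why the second tip can surface datasets that the catalog type filter alone would miss.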