Home > About
Metadata refers to data that provides information about other data. In publishing, metadata refers to various details about an article or a book, such as the title, author(s), DOI, publication year, abstract, keywords, and sources. It serves as a key component, complementing the content itself. Properly curated metadata is essential for facilitating data discovery, comprehension, and organization of collected journal articles. Additionally, it simplifies communication with authors or affiliated institutions, thereby enhancing the research process for scholars.
In many cases, researchers collect files containing published articles but may lack essential metadata fields such as all the authors, publication years, abstracts, or keywords. To fill in these gaps typically, one might resort to manually searching for each article on various publishing platforms to gather the missing information. However, this process becomes tedious and error-prone, especially with large datasets.
An alternative method involves leveraging public APIs offered by many publishing companies to programmatically fetch the missing metadata. This approach automates and streamlines the process of retrieving metadata for multiple articles. Inspired by this streamlined approach, Metafetch specifically automates the retrieval of metadata for journal articles from multiple sources.
Currently the tool uses the APIs to retrieve the metadata from Elservier’s Scopus, PubMed and CrossRef.
Given a file of journal articles which may contain some gaps in metadata, the tool uses APIs from journal sources to retrieve the missing data. The MetaFetch interface is easy to navigate and has instructions the user can follow. Here is an overview of the steps you can take to use the tool.
While acknowledging the capabilities of the tool, it's important to consider more questions it could address. For instance, can we improve its comprehensiveness by integrating additional APIs from various sources? Furthermore, is it possible to extend its functionality beyond mere metadata retrieval to include information extraction capabilities?
These inquiries serve as the foundation for advancing the tool. By incorporating more APIs, we can enrich the tool's capabilities with a wider range of data sources. Additionally, by including information extraction functionalities, we can empower users to get more insights from the content itself.