TechTalk26: Techniques and Software Framework for Extracting Metadata
Wed, 24 Apr 2024 04:00:00 GMT → Wed, 24 Apr 2024 05:00:00 GMT (d=1 hours, 0 seconds)
Like many other kinds of government and research institutions, Australia's geological institutions house datasets for which there is a broad variation in the quality of metadata. This creates challenges for data providers and aggregators trying to maintain a certain standard of FAIR compliance across all their offerings.
For example, when the offering is very poor, more research and manual data entry may be required. For aggregators there is the problem of extracting metadata of a consistent standard from a wide variety of catalogue systems.
This talk will outline a software architecture and techniques that attempt to alleviate the burden of metadata collation and curation. The software is designed to homogenize metadata harvested from a variety of common metadata catalogue applications. For metadata-poor sources, extraction of metadata from associated technical reports using textual analysis and machine learning models is utilised. The limitations and viability of such techniques are discussed. At the end of the transformation process, ISO-compliant metadata records are created which are suitable for importing into a geonetwork geospatial catalogue.
Who would benefit from attending?
- research software engineers, academics, coders and other interested parties
Who will be speaking?
Vincent Fazio is a software developer employed by CSIRO Mineral Resources to implement AuScope's future vision for Australian geoscience research. He has worked in a broad range of industries and research areas over the past 20 years including defence, telecommunications, hand-held devices and protein crystallography. His current interests include: implementing metadata standards, geospatial information systems, displaying 3D geospatial datasets, website development and open source software.
Further ARDC resources
Tech Talks series is a forum for sharing technical experience and expertise in digital research. To access presentation slides and free resources from previous talks visit http://eresear.ch/techtalk
Will the session be recorded?
Yes, the session will be recorded and provided to all registrants. Please register even you are unable to attend the live session.
Have questions? Email [email protected]
Subscribe to the ARDC Connect newsletter to keep up-to-date on latest digital research news and events.
Please note that this event may be recorded and published by the ARDC. This may include your contributions during the session. ARDC respects the privacy of individuals. Information collected is in accordance with the ARDC Privacy Policy.