Utility to download and extract document metadata from an organization. This technique can be used to identify: domains, usernames, software/version numbers and naming conventions.
-
Updated
Jun 19, 2024 - Python
Utility to download and extract document metadata from an organization. This technique can be used to identify: domains, usernames, software/version numbers and naming conventions.
A collection of tools for forensic analysis
R Interface to Apache Tika
A dart library for extracting metadata in web pages. Supports Open Graph, Meta, Twitter Cards, and Structured Data (Json-LD)
Radare2 Metadata Extraction to Elasticsearch
Azure Function & supporting framework to take PDF files, extract metadata using regular expressions, store the results in DocumentDB to be indexed and searchable by Azure Search.
Scrape web pages and effortlessly extract the data you need. Easy, robust, efficient, and intuitively user-friendly.
Add a description, image, and links to the extract-metadata topic page so that developers can more easily learn about it.
To associate your repository with the extract-metadata topic, visit your repo's landing page and select "manage topics."