Search, find, save with searchit
Efficient search in all Microsoft Office file formats, Outlook archives in PST format, PDF files, TXT files, TIFF/TIF files, PNG files, AutoCAD and DWG formats, ZIP, RAR and 7z archives, XML formats and many more!
The search of almost any file format is one of the greatest strengths of the enterprise search solution searchit. In contrast to the search in File Explorer, the file content including metadata of all indexed files can be searched, even in formats for scans, images or CAD files. Find out exactly how searchit makes the unsearchable searchable and scroll through the full list of supported file formats.
How are files searched in searchit?
As an enterprise search solution, searchit enables comprehensive file searches through intelligent indexing and categorization. Users can quickly and efficiently search for content in various formats such as documents, emails, presentations, and more to find relevant information and increase productivity.
What are MIME types?
MIME types (Multipurpose Internet Mail Extensions) are labels that define the media type of files on the Internet. They enable the correct interpretation and processing of content by telling the servers and browsers the file type.
Supported categories of file formats
Every day, the lawyer rummages through e-mail archives, the forewoman through CAD files – the most frequently used file format depends on both the industry and the job. searchit’sever-growing number of parsers makes it possible to search in almost all file categories.
HTML (Hypertext Markup Language)
The lingua franca of the web – Almost every HTML format found on the web is supported with the searchit search function:
- Valid XHTML code and XML
- Microsoft Office document formats
- OpenDocument
- iWorks
- Portable Document Formats
- EPUB
- RTF
- Compression and packaging formats
- Audio, image and video formats
- And other scientific, language-processing, object-recognizing, and database-based formats
XML and Derived Formats
The Extensible Markup Language (XML) format is used both for hierarchically structured data and for a platform-independent exchange of data between computer systems. XML languages supported by searchit include:
- XHTML (Extensible Hypertext Markup Language)
- OOXML (Office Open XML)
- ODF (Open Document Format)
Microsoft Office document formats
Text and metadata extraction from Microsoft Office and some related applications can be searched in the following formats:
- OLE 2 Compound Document Format
- OOXML (Office Open XML)
- Temporary Office Lock Files (Owner Files)
OpenDocument Format
searchit searches the OpenDocument format (ODF) for:
- All files in the OpenOffice.org office suite
- Older files in OpenOffice 1.0 format, the predecessor of ODF
iWorks Document Formats
Both text and metadata are supported in iWorks, including:
- Numbers
- Pages
- Keynotes
WordPerfect document formats
searchit searches all formats related to:
- Corel WordPerfect Office Suite
- WordPerfect WP6+ Files
- QuattroPro QPW v9+ Files
Portable Document Format
Digitally created and non-searchable scans are made searchable in searchit using the ORC functionality. More about PDF search with searchit.
Electronic Publication Format
searchit searches eBooks, digital books, and papers in the following formats:
- Electronic Publication Format (EPUB)
- Fiction Book Publishing Format
Rich Text Format
Full search functionality for documents in Rich Text Format (RTF).
Compression and packaging formats
Enterprise search software searchit enables you to search even in compressed data. Various compression and packaging formats are supported:
- Tar
- ARE
- ARJ
- CPIO
- Dump
- Zip
- 7Zip
- Gzip
- BZip2
- XZ
- LZMA
- Z
- Pack200
- RARE
- AppleSingle and
- AppleDouble Files
Text Formats
Extracting text content from plain text files seems like an easy task until you start thinking about all the possible character encodings. searchit is able to automatically recognize the character encoding of a text document .
Feed and syndication formats
Updates of websites, podcasts or news articles – searchit supports syndication formats that keep users up to date:
- RSS Feed
- Atom Feed
- IPTC ANPA News Wire Feed Format
Help Formats
searchit searches the Microsoft Help files:
- CHM Help Format ( called Compiled HTML Help, also Compressed HTML Help or Compiled Help Module(s))
Video Formats
Video recordings in the most common formats are searched with serachit with a focus on metadata:
- Flash Video Format
- MP4 family of video formats including MP4, Quicktime, 3GPP and many more
- Ogg family of video formats
Java Class Files and Archives
Class names and method signatures are searched in searchit in the following formats:
- Java Class Files
- jar Archives
Source Code
searchit searches source code for content and metadata itself:
- Java
- C
- C++ Groovy
- and more!
Email formats
Searchit makes it possible to search e-mails and even e-mail archives in the following formats:
- PST email format, used in Microsoft Outlook archives
- MSG e-mail format, used for individually downloaded Outlook e-mails
- Microsoft TNEF (Transport Neutral Encoding Format, also known as Winmail.dat), used by most Microsoft email clients for email attachments
- mbox format, widely used in email archives and Unix-like mailboxes
- RFC 822 format: Used by many email clients in archives and exports
CAD formats
searchit searches data from files in DWG CAD format.
Font Formats
Search for metadata even in font files – searchit supports:
- TrueType font format
- Adobe Font Metrics Files
Scientific formats
Many of the programs that are specifically used in science can be searched for metadata and content with searchit :
- GCMD Directory Interchange Format (DIF)
- GDAL
- ISO 19139 file format for geographic information
- Grib
- HDF
- Family of file formats ISA-Tab (ISA Tools)
- Netcdf
- Matlab
Executable programs and libraries
Searchit extracts and searches metadata information about platforms, architectures, and types from a range of executable formats and libraries:
- Windows Executables
- Linux/BSD programs and libraries
- and many more!
Crypto Formats
Using secure access controls and special parsers, searchit even searches encrypted messages:
- PKCS7-signed messages, without information from the outer PKCS7 wrapper
- Metadata from Time Stamped Data Envelope (TSD) Files
- Saved Content from the TSD Wrapper
Database formats
Several types of databases can be searched quickly and easily in searchit :
- SQLite3 files
- Microsoft Access database files
- dBase files (dbf) including dBase, FoxBASE, FoxPRO, and shapefile format from ESRI
Natural Language Processing
Artificial intelligence is used in searchit , for example, by means of natural language processing and named entity recognition frameworks. This enables:
- Classification of the mood and emotional tone of a document
- Extract metadata from full-text journal publications.
Image and video object recognition
Several object detection frameworks are supported to analyze the content of images and videos. searchit instances are trained with large training datasets for specific areas of application of customers.
Know what's in it - regardless of the file format
Thanks to searchit, you can search in hundreds of file formats at the same time on a central platformFull list of searchable MIME types
Over three hundred formats for text files, images and scans, PDFs and much more are supported in searchit :
AppleSingleFileParse
- application/applefile
PListParser
- application/x-plist
- application/x-bplist-itunes
- application/x-bplist
- application/x-bplist-memgraph
- application/x-bplist-webarchive
ClassParser
- application/java-vm
AudioParser
- audio/vnd.wave
- Audio/X-WAV
- audio/basic
- Audio/X-AIFF
MidiParser
- application/x-midi
- audio/midi
SourceCodeParser
- text/x-c++src
- text/x-groovy
- text/x-java-source
Pkcs7Parser
- application/pkcs7-signature
- application/pkcs7-mime
TSDParser
- application/timestamped-data
TextAndCSVParser
- text/csv
- Text/TSV
- text/plain
DBFParser
- application/x-dbf
DGN8Parser
- image/vnd.dgn; version=8
DIFParser
- application/dif+xml
DWGParser
EpubParser
- application/x-ibooks+zip
- application/epub+zip
ExecutableParser
- application/x-msdownload
- application/x-sharedlib
- application/x-elf
- application/x-object
- application/x-executable
- application/x-coredump
ExternalParser
- Video/AVI
- Video/MPEG
- Video/X-MSvideo
- Video/MP4
FeedParser
- application/atom+xml
- application/rss+xml
AdobeFontMetricParser
- application/x-font-adobe-metric
TrueTypeParser
- application/x-font-ttf
HtmlParser
- text/html
- application/vnd.wap.xhtml+xml
- application/x-asp
- application/xhtml+xml
HttpParser
- application/x-httpresponse
HwpV5Parser
- application/x-hwp-v5
BPGParser
- image/bpg
- image/x-bpg
HeifParser
- image/heic-sequence
- image/heif
- image/heic
- image/heif-sequence
ICNSParser
- image/icns
ImageParser
- image/png
- image/vnd.wap.wbmp
- image/x-jbig2
- image/bmp
- image/x-xcf
- image/gif
- image/x-icon
- image/x-ms-bmp
JXLParser
- image/jxl
JpegParser
- image/jpeg
PSDParser
- image/vnd.adobe.photoshop
TiffParser
WebPParser
- image/webp
IDMLParser
- application/vnd.adobe.indesign-idml-package
IptcAnpaParser
- text/vnd.iptc.anpa
IWorkPackageParser
- application/vnd.apple.keynote
- application/vnd.apple.iwork
- application/vnd.apple.numbers
- application/vnd.apple.pages
IWork13PackageParser
- application/vnd.apple.numbers.13
- application/vnd.apple.unknown.13
- application/vnd.apple.pages.13
- application/vnd.apple.keynote.13
IWork18PackageParser
- application/vnd.apple.pages.18
- application/vnd.apple.keynote.18
- application/vnd.apple.numbers.18
RFC822Parser
- message/rfc822
MatParser
- application/x-matlab-data
MboxParser
- application/mbox
EMFParser
- image/emf
JackcessParser
- application/x-msaccess
MSOwnerFileParser
OfficeParser
- application/x-tika-msoffice-embedded; format=ole10_native
- application/msword
- application/vnd.visio
- application/x-tika-ole-drm-encrypted
- application/vnd.ms-project
- application/x-tika-msworks-spreadsheet
- application/x-mspublisher
- application/vnd.ms-powerpoint
- application/x-tika-msoffice
- application/sldworks
- application/x-tika-ooxml-protected
- application/vnd.ms-excel
- application/vnd.ms-outlook
OldExcelParser
- application/vnd.ms-excel.workspace.3
- application/vnd.ms-excel.workspace.4
- application/vnd.ms-excel.sheet.2
- application/vnd.ms-excel.sheet.3
- application/vnd.ms-excel.sheet.4
TNEFParser
- application/vnd.ms-tnef
- application/x-tnef
- application/ms-tnef
WMFParser
- image/wmf
ActiveMimeParser
- application/x-activemime
ChmParser
- application/vnd.ms-htmlhelp
- application/x-chm
- application/chm
OneNoteParser
- application/onenote; format=one
OOXMLParser
- application/vnd.ms-powerpoint.template.macroenabled.12
- application/vnd.ms-excel.addin.macroenabled.12
- application/vnd.openxmlformats-officedocument.wordprocessingml.template
- application/vnd.ms-excel.sheet.binary.macroenabled.12
- application/vnd.openxmlformats-officedocument.wordprocessingml.document
- application/vnd.ms-powerpoint.slide.macroenabled.12
- application/vnd.ms-visio.drawing
- application/vnd.ms-powerpoint.slideshow.macroenabled.12
- application/vnd.ms-powerpoint.presentation.macroenabled.12
- application/vnd.openxmlformats-officedocument.presentationml.slide
- application/vnd.ms-excel.sheet.macroenabled.12
- application/vnd.ms-word.template.macroenabled.12
- application/vnd.ms-word.document.macroenabled.12
- application/vnd.ms-powerpoint.addin.macroenabled.12
- application/vnd.openxmlformats-officedocument.spreadsheetml.template
- application/vnd.ms-xpsdocument
- application/vnd.ms-visio.drawing.macroenabled.12
- application/vnd.ms-visio.template.macroenabled.12
- model/vnd.dwfx+xps
- application/vnd.openxmlformats-officedocument.presentationml.template
- application/vnd.openxmlformats-officedocument.presentationml.presentation
- application/vnd.openxmlformats-officedocument.spreadsheetml.sheet
- application/vnd.ms-visio.stencil
- application/vnd.ms-visio.template
- application/vnd.openxmlformats-officedocument.presentationml.slideshow
- application/vnd.ms-visio.stencil.macroenabled.12
- application/vnd.ms-excel.template.macroenabled.12
Word2006MLParser
shh. OutlookPSTParser
Rtf. RTFParser
- application/rtf
xml.SpreadsheetMLParser
- application/vnd.ms-spreadsheetml
xml.WordMLParser
- application/vnd.ms-wordml
MIFParser
- application/x-mif
- application/vnd.mif
- application/x-maker
Mp3Parser
- Audio/MPEG
MP4Parser
- Video/X-M4V
- application/mp4
- Video/3GPP
- Video/3GPP2
- video/quicktime
- Audio/MP4
- Video/MP4
TesseractOCRParser
- image/ocr-x-portable-pixmap
- image/ocr-jpx
- image/x-portable-pixmap
- image/OCR-JPEG
- image/OCR-JP2
- image/jpx
- image/ocr-png
- image/OCR-TIFF
- image/ocr-gif
- image/ocr-bmp
- image/jp2
FlatOpenDocumentParser
- application/vnd.oasis.opendocument.tika.flat.document
- application/vnd.oasis.opendocument.flat.presentation
- application/vnd.oasis.opendocument.flat.spreadsheet
- application/vnd.oasis.opendocument.flat.text
OpenDocumentParser
- application/x-vnd.oasis.opendocument.presentation
- application/vnd.oasis.opendocument.chart
- application/x-vnd.oasis.opendocument.text-web
- application/x-vnd.oasis.opendocument.image
- application/vnd.oasis.opendocument.graphics-template
- application/vnd.oasis.opendocument.text-web
- application/x-vnd.oasis.opendocument.spreadsheet-template
- application/vnd.oasis.opendocument.spreadsheet-template
- application/vnd.sun.xml.writer
- application/x-vnd.oasis.opendocument.graphics-template
- application/vnd.oasis.opendocument.graphics
- application/vnd.oasis.opendocument.spreadsheet
- application/x-vnd.oasis.opendocument.chart
- application/x-vnd.oasis.opendocument.spreadsheet
- application/vnd.oasis.opendocument.image
- application/x-vnd.oasis.opendocument.text
- application/x-vnd.oasis.opendocument.text-template
- application/vnd.oasis.opendocument.formula-template
- application/x-vnd.oasis.opendocument.formula
- application/vnd.oasis.opendocument.image-template
- application/x-vnd.oasis.opendocument.image-template
- application/x-vnd.oasis.opendocument.presentation-template
- application/vnd.oasis.opendocument.presentation-template
- application/vnd.oasis.opendocument.text
- application/vnd.oasis.opendocument.text-template
- application/vnd.oasis.opendocument.chart-template
- application/x-vnd.oasis.opendocument.chart-template
- application/x-vnd.oasis.opendocument.formula-template
- application/x-vnd.oasis.opendocument.text-master
- application/vnd.oasis.opendocument.presentation
- application/x-vnd.oasis.opendocument.graphics
- application/vnd.oasis.opendocument.formula
- application/vnd.oasis.opendocument.text-master
PDFParser
CompressorParser
- application/zlib
- application/x-gzip
- application/x-bzip2
- application/x-compress
- application/x-java-pack200
- application/x-lzma
- application/deflate64
- application/X-LZ4
- application/x-snappy
- application/x-brötli
- application/gzip
- application/x-bzip
- application/x-xz
PackageParser
- application/x-tar
- application/java-archive
- application/x-arj
- application/x-archive
- application/zip
- application/x-cpio
- application/x-tika-unix-dump
- application/x-7z-compressed
RarParser
- application/x-rar-compressed
PRTParser
- application/x-prt
SAS7BDATParser
- application/x-sas-data
TMXParser
- application/x-tmx
FLVParser
- Video/X-FLV
WACZParser
- application/x-wacz
WARCParser
- application/warc
- application/warc+gz
QuattroProParser
- application/x-quattro-pro; version=9
WordPerfectParser
- application/vnd.wordperfect; version=5.1
- application/vnd.wordperfect; version=5.0
- application/vnd.wordperfect; version=6.x
XLIFF12Parser
- application/x-xliff+xml
XLZParser
- application/x-xliff+zip
DcXMLParser
- application/xml
- image/svg+xml
FictionBookParser
- application/x-fictionbook+xml
FlacParser
- Audio/X-Oggflac
- Audio/X-FLAC
OggParser
- Audio/OGG
- application/kate
- application/ogg
- Video/Daala
- video/x-ogguvs
- Video/X-OGM
- audio/x-oggpcm
- video/ogg
- video/x-dirac
- video/x-oggrgb
- Video/X-Oggyuv
OpusParser
- Audio/Opus
- Audio/OGG; codecs=opus
SpeexParser
- Audio/OGG; codecs=speex
- audio/speex
TheoraParser
- video/theora
VorbisParser
- Audio/Vorbis
Contact us
We focus on holistic service & a high-end enterprise search engine. Please contact us.