The Data X-Ray team is adding new datasources and file types all the time. This article provides the current and complete list of the technologies that the Data X-Ray supports.
If you see a datasource that we don't have but you need, hit the intercom button and let us know! We prioritize according to client demand and it usually takes us only a few days to build and deploy new connectors.
The complete list of file-sharing datasources are:
- Windows Network Drives (SMB)
- Ubuntu Network Drives (SMB)
- MacOS Network Drives (SMB)
- Single Google Drives
- Office365 / OneDrive
- G Suite (every user's Google Drive in the organization)
And the SQL and no-SQL databases:
- Microsoft SQL
- Mongo DB
- Amazon Aurora
Email and email attachments:
- Gmail (IMAP)
- Outlook / Exchange Server (IMAP)
- Yahoo Mail (IMAP)
API support is also provided via the NLP Classifier API.
A Java SDK is available for connecting to any type of datasource you may have.
New datasource support is driven by client demand, so please reach out if you would like to request a new native connector. Our current roadmap (in order of priority) is:
- Azure Blobs (partially supported already)
- DB2 (partially supported already)
Our parsers currently handle 1000s of file types from normal Microsoft Office files to PDFs to ZIP files and even more esoteric file types like Word Perfect. Contact us if you have a particular concern about a file type that you would like to know more about.
OCR (Optical Character Recognition)
The Data X-Ray supports OCR and is now able to identify text in over 100 languages within scanned documents and most images.
Specialized Networking Connections
When not using the on premise installation, Data X-Ray uses secure and specialized networking connections to connect to your virtual servers, including:
- AWS (VPC connection)
- Google Cloud (VPN connection)
- Heroku (PostgreSQL databases)
This list is largely driven by our customer needs, so if you think something is missing, let us know!