The Data X-Ray team is adding new datasources and file types all the time. This article provides the current and complete list of the technologies that the Data X-Ray supports. 

If you need a datasource that we don't yet support, hit the Intercom button and let us know! We prioritize according to client demand, and it usually takes us only a few days to build and deploy a new connector.

Datasources

Current

The complete list of file-sharing datasources is:

  • Windows Network Drives (SMB)
  • Ubuntu Network Drives (SMB)
  • macOS Network Drives (SMB)
  • Box
  • Individual Google Drive accounts
  • Office 365 / OneDrive
  • G Suite (every user's Google Drive in the organization)

SQL and NoSQL databases:

  • MySQL
  • Microsoft SQL Server
  • PostgreSQL
  • MongoDB
  • Amazon Aurora

Email and email attachments:

  • Gmail (IMAP)
  • Outlook / Exchange Server (IMAP)
  • Yahoo Mail (IMAP)

API support is also provided via the NLP Classifier API.

Java SDK

A Java SDK is available for connecting to any type of datasource you may have.

Roadmap

New datasource support is driven by client demand, so please reach out if you would like to request a new native connector. Our current roadmap (in order of priority) is:

  1. SharePoint
  2. Salesforce/Quip
  3. Slack
  4. SAP
  5. HDFS
  6. Azure Blob Storage (partially supported already)
  7. DB2 (partially supported already)

File types

Current

Our parsers currently handle thousands of file types, from standard Microsoft Office files to PDFs and ZIP archives, as well as more esoteric formats such as WordPerfect. Contact us if you have questions about support for a particular file type.

OCR (Optical Character Recognition)

The Data X-Ray supports OCR and is now able to identify text in over 100 languages within scanned documents and most images.

Specialized Networking Connections

When you are not using the on-premises installation, Data X-Ray uses secure, specialized networking connections to reach your virtual servers, including:

  • AWS (VPC connection)
  • Azure
  • Google Cloud (VPN connection)
  • Heroku (PostgreSQL databases)


This list is largely driven by our customers' needs, so if you think something is missing, let us know!
