The Data X-Ray team is adding new datasources and file types all the time. This article provides the current, complete list of the technologies that Data X-Ray supports.

If you need a datasource that isn't listed here, hit the Intercom button and let us know! We prioritize according to client demand, and it usually takes us only a few days to build and deploy a new connector.

Datasources

Current

The complete list of file-sharing datasources is:

  • Windows Network Drives (SMB)
  • Ubuntu Network Drives (SMB)
  • macOS Network Drives (SMB)
  • Box
  • Google Drive
  • G Suite (every user's Google Drive in the organization)

SQL and NoSQL databases:

  • MySQL
  • Microsoft SQL Server
  • PostgreSQL
  • MongoDB
  • Amazon Aurora

Email and email attachments:

  • Gmail (IMAP)
  • Outlook / Exchange Server (IMAP)
  • Yahoo Mail (IMAP)

API support is also provided via the NLP Classifier API.

Roadmap

New datasource support is driven by client demand, so please reach out if you would like to request a new native connector. Our current roadmap (in order of priority) is:

  1. Office 365
  2. SharePoint
  3. Salesforce/Quip
  4. Slack
  5. SAP
  6. HDFS
  7. Azure Blobs (partially supported already)
  8. DB2 (partially supported already)

File types

Current

Our parsers and classifiers currently handle:

  • PDF (.pdf)
  • Plain text (.txt and .md)
  • Microsoft Word documents (.doc and .docx)
  • Microsoft Excel documents (.xls and .xlsx)
  • Comma Separated Values (.csv); note that only the comma delimiter is currently supported
  • Emails and email attachments (through IMAP)
  • Google Sheets
  • Google Docs
  • SQL database values
  • NoSQL database text values
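
Because only comma-delimited .csv files are currently supported, a file exported with semicolons or tabs may need converting before it is scanned. The following is a minimal sketch of one way to do that with Python's standard csv module. The function name to_comma_delimited is hypothetical and is not part of Data X-Ray; it assumes you already know which delimiter the file uses.

```python
import csv
import io


def to_comma_delimited(text: str, delimiter: str) -> str:
    """Rewrite delimited text so that fields are separated by commas.

    Hypothetical helper for illustration only; not part of Data X-Ray.
    Fields that themselves contain a comma are quoted by csv.writer.
    """
    # newline="" lets the csv module handle line endings itself.
    reader = csv.reader(io.StringIO(text, newline=""), delimiter=delimiter)
    out = io.StringIO(newline="")
    writer = csv.writer(out, lineterminator="\n")
    writer.writerows(reader)
    return out.getvalue()


if __name__ == "__main__":
    semicolon_text = "name;note\nAda;a, b\n"
    print(to_comma_delimited(semicolon_text, ";"))
```

For a file on disk, you could read its contents, pass them through this function with the file's actual delimiter, and save the result with a .csv extension.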

Having trouble parsing one of these file types? Make sure that it has the correct file extension. Often our parser can figure out what the file type is, but sometimes it's difficult without the correct file extension.
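
If you want to check a batch of files before scanning them, a short script can flag names whose extension is missing or not in the list above. This is only a sketch: the extension set is copied from the file-based entries in this article, and the helper names (has_supported_extension, unsupported_files) are hypothetical rather than part of Data X-Ray.

```python
from pathlib import Path

# File extensions taken from the "File types" list in this article.
SUPPORTED_EXTENSIONS = {
    ".pdf", ".txt", ".md", ".doc", ".docx", ".xls", ".xlsx", ".csv",
}


def has_supported_extension(filename: str) -> bool:
    """Return True if the file name ends in one of the listed extensions."""
    # Compare case-insensitively so "Report.PDF" is treated like "report.pdf".
    return Path(filename).suffix.lower() in SUPPORTED_EXTENSIONS


def unsupported_files(filenames):
    """Return the names whose extension is missing or not in the list."""
    return [name for name in filenames if not has_supported_extension(name)]


if __name__ == "__main__":
    for name in unsupported_files(["budget.xlsx", "export.json", "notes"]):
        print(f"Check the extension of: {name}")
```

Files flagged this way are not necessarily unparseable, since the parser can often work out the type on its own; they are just the ones most worth double-checking.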

Roadmap

New file type support is driven by client demand, so please reach out if you would like to request a file type that we don't currently support. We will soon support (in order of priority):

  1. JavaScript Object Notation (.json)
  2. Scanned PDFs and images, via Optical Character Recognition (OCR)

Specialized Networking Connections

When you are not using the on-premises installation, Data X-Ray uses secure, specialized networking connections to connect to your virtual servers, including:

  • AWS (VPC connection)
  • Google Cloud (VPN connection)
  • Heroku (PostgreSQL databases)


This list is largely driven by our customer needs, so if you think something is missing, let us know!
