HPCC Systems incorporates a software architecture implemented on commodity shared-nothing computing clusters to provide high-performance, data-parallel processing and delivery for applications utilizing big data. The HPCC Systems platform includes system configurations to support both parallel batch data processing (Thor) and high-performance data delivery applications using indexed data files (ROXIE). It also includes a high level and implicitly parallel data-centric declarative programming language for parallel data processing, called Enterprise Control Language (ECL).
Extract Transform and Load your data using a powerful scripting language (ECL) specifically developed to work with data.
Data Profiling, Data Cleansing, Snapshot Delta Updates and consolidation, Job Scheduling and automation are some of the key features.
An index based search engine to perform real-time queries. SOAP, XML, REST, and SQL are all supported interfaces.
In place (supporting distributed linear algebra) predictive modeling functionality to perform Linear Regression, Logistic Regression, Decision Trees, and Random Forests.