Data Management from Edge to Enterprise

Connecting Edge to Big Data by Democratizing Processors

icon-io
icon-ai
icon-hd
icon-h

technology

The value of Data Recency, according to Harvard Business Review

big-frame-screen
image-grafic

Effectiveness for Model Time Period Relative to Best Model Predicting Given Relative Time Period (100MB)

small-frame-screen
frame-3-sircle

Sigmax    Software Creates the Most Direct Value When it Intersects:

01
Data Recency

The highest-value data in the Enterprise is that which has the highest recency. Fresh data is more valuable.

02
Data Dissemination

A more highly-connected system contributes non-linearly to the value of an organization.

03
Ubiquity is key

Cloud computing contributes to agility and, under certain conditions, to economics. But contributes even more to ubiquitous access.

why choose us

in-Flight Data Formatting Results in Data Processing at Breakthrough Speeds

5G MEC, Autonomous Vehicle, Predictive Maintenance and similar edge compute and transport platforms need to make decisions and react at real-world speeds. Our extended heterogeneous processing support results in JSON to Apache Arrow Ingest at 100X lower latency vs. scaling out Xeon processors alone.

100x

Lower Latency

Our extended heterogeneous processing support results in JSON to Apache Arrow Ingest at 100X lower latencies

80%

Wall-Clock Savings

Wait less for data bound for notebooks and Data Science tools

20x

Data Ingest Rates

Our product moves data to AI+ML at 20X better troughput than competing technologies

Features

Featured Stack Elements

Slider Image
slide-icon

Apache Arrow

Is an in-memory data format that is ideal for streaming analytics and big data systems. Arrow is the first piece in building an efficient, flexible and highly performant dataflow. Its columnar in-memory structure is cache efficient and allows for extremely fast query and processing in AI+ML and analytic environments.
Slider Image
slide-icon

Apache Arrow Flight

This is a new client-server framework built for high performance transport of large datasets over network interfaces. Apache Arrow Flight is architected to be general purpose (can talk with any gRPC capable client with or without knowledge of Apache Arrow format), however, when paired with Apache Arrow data you get extreme efficiency in data movement
Slider Image
Frame-wave

Apache Pulsar

Apache Pulsar is a cloud-native, distributed messaging and streaming platform. It offers significant benefits when compared to other messaging platforms for example vs. Apache KAFKA. Pulsar is well suited for both Latency sensitive requirements of new 5G wireless data infrastructures as well as high bandwidth requirements put forward by high complexity schemas.
Slider Image
Frame-arrow

Apache Presto

Apache Presto is a distributed SQL query engine designed to query large data sets distributed over one or more heterogeneous data sources. Presto is not a database however – It understands SQL which provide the features of a standard database but can operate against a modern streaming analytics distributed compute environment. Apache Presto is architected to perform well in big data scale environments and offers users of the Sigmax stack a simple SQL interface to query Pulsar data.

why choose us

SigmaX's Commitment to Apache Open Source Software

World Firsts @ SigmaX.ai:

Producer/Consumer: FPGA->Pulsar
Producer/Consumer: Nvidia GPU->Pulsar
Design for 5G-enabled Distributed Collect/Query

Multitude of Apache Core Improvements

Apache Pulsar, Presto, Arrow

Page-Aligned Buffers for FPGA/GPU processing

Type Handlers

PR’s for Pulsar, Arrow, Arrow Flight

Improvements to Apache Fletcher for expanded Datasets

Apache New Project Leadership

Apache Bolson Messaging Saturation Test Harness

Apache Illex Random At-Scale Data Generator

Apache Dive defining low-power device pre-processing and query of data

afwerx
mercury
liquid
evlos
intel
dell
zettaset
rancher
server image

Mercury Sigmax   Appliance

Unified Acquisition cost HW+SW, Linear Scaling

Single Contract HW + SW. 2 Week engagement to productivity

Integrated platform with defined point-releases. Highly customizable perproject.

Arrow Flight returns 80% of wall clock time to data scientists allowing for agile development.

Director

“My Acquisition cost is too high and it takes 9 months to deploy a new system”

Finance

“Takes months to align contracts for hardware then software – then labor costs to perform integration”

IT

“We do need to deliver a custom solution, but do we really have to maintain dozens of point-releases?”

Data Scientist

“Why am I waiting all this time for data to return to my Jupyter/Python notebook from a database?”

Technology

FPGA coprocessor accelerated Data Engineering

01
Deterministic processing guarantees record-order preservation
02
Fastest translation functions (any format in, more optimized format out)
03
Data rates exceed PCI and eclipse general-purpose processors' performance on latency dimension
technology-image-1
technology-image-2

Technology

Xeon/i86 family of processors Agility contributions

01
Multitude of Big Data methods and projects supported
02
Rapid horizontal scaling
03
High clock rate vs high parallelism

Technology

Nvidia GPU AI/ML and High Parallelism

01
Best in class AI/ML offloading
02
Well-known libraries, friendly to Data Scientists for algorithm development
03
High consistency with modern Data Engineering techniques
technology-image-3

TRY it now

Get Started with Sigmax    Today

Predictive Maintenance and similar edge compute and transport platforms need to make decisions and react at real-world speeds. Our extended heterogeneous processing support results in JSON