Bodo can be translated into Run or Accelerate , and our technology does just that.
For developers and data scientists who regularly perform large-scale analytics and related applications, Bodo offers a parallel data compute platform providing extreme scale and speed, but with the simplicity and flexibility of native Python. In contrast to using Python libraries and frameworks, Bodo is anew type of inferential compiler offering true parallelism and high efficiency surpassing 10,000+ cores.
Broad access to such inexpensive near-real-time analytics enables new datacentric revenue opportunities, faster competitive responses, and radically more efficient overall data operations.
Bodo is the first extreme-performance Python data analytics platform. It is the culmination of years of research in academia and industry, from the creators of the first auto-parallelization compiler technology. This technology has long been a “holy grail” for researchers working on solving the gap between simplicity and performance in computing. By democratizing access to parallel computing, we aim to accelerate data insights, ML and AI in ways the industry can’t yet imagine.
Most authorities agree that Moore’s law is being challenged. Data sets continue to grow, driven principally by AI and ML. In order to keep pace with compute demands, we at Bodo believe that taking advantage of parallelization, combined with today’s abundant cloud CPU resources, is the best answer. With the advent of mega-cloud vendors bringing cost-effective access to this processing power, combining with HPC-style parallelization to this infrastructure will again accelerate computing performance and access to computing speed.
Today’s data scientists and developers face daunting learning curves, glue-code, and special-purpose hardware/software combinations. Achieving extreme performance has therefore meant giving up programming simplicity and velocity. Bodo instead believes that extreme performance can be achieved and made accessible to everyday developers through the use of inferential, auto-parallelization compilation which is far more accessible to the developer.
Open Source communities, and the software industry overall, has generated a plethora of add-in libraries and frameworks to enable scale-out programming for developers and data scientists. However, none of these solutions has brought true linear scaling, and all require some new APIs and/or steep learning curves. They have all failed to consider a deeper look at parallel programming fundamentals, and have overlooked considering JIT compilation as a solution. The inefficiencies of the underlying approach drive the complexity of the systems and features that most solutions offer today. In contrast, Bodo adheres to the most advanced parallel computing principles, while making it accessible to programmers so they can focus on solving the problems that matter.
Python has been viewed by the industry as just a scripting prototype language. But for production, code would be rewritten in C++ or Java/Scala for performance and reliability. Automatic parallelization, code optimization, full type checking, and native code generation will now bring C++/MPI levels of speed, reliability and scalability. This paradigm shift will finally enable rapid development velocity that was hard to imagine previously.
Behzad is a global business leader with years of executive leadership experience and deep subject matter expertise in business strategy, development, strategic planning, and general management.
Ehsan is an entrepreneur, computer science researcher, and software engineer working on democratization of High-Performance Computing (HPC) for data analytics/AI/ML