In Autonomous Truck Technology, TuSimple Selecting WekaIO Matrix HPC Storage System
For autonomous fleet vehicle machine learning training
This is a Press Release edited by StorageNewsletter.com on June 22, 2018 at 2:13 pmWekaIO, Inc., in scalable file storage for AI and technical compute applications, announced that TuSimple, Inc., in autonomous truck technology, has selected the WekaIO Matrix HPC storage system to provide flash-based parallel file storage capabilities to accelerate its deep neural network (DNN) machine learning training.
TuSimple’s goal is to develop a level 4 autonomous truck driving solution, for the dock-to-dock delivery of commercial goods. The company, which was founded by entrepreneurs from the California Institute of Technology, has facilities in San Diego, CA and Tucson, AZ, and to date its technology has been road-tested for some 15,000 miles. TuSimple chose WekaIO Matrix after comparisons with other scale-out file systems demonstrated that Matrix has the ability to meet its demanding performance requirements.
“WekaIO Matrix was the clear choice for our on-premises DNN training in the US. It was understood from the outset that a standard NAS solution would not be able to scale to the extent we would need it to, and apart from Matrix being the most performant of all the parallel file systems we evaluated, we really liked the fact that it is hardware-independent, allowing us better control over our infrastructure costs. We are also taking full advantage of WekaIO’s object storage capability, which is much more economical than an all-flash system, and allows us to efficiently scale our data catalog in a single namespace,” said Dr. Xiaodi Hou, co-founder and CTO, TuSimple.
Implementing WekaIO Matrix positions TuSimple to extract the maximum value from its training systems for autonomous fleet vehicles. Extensive training enables TuSimple’s L4 system to recognize and safely respond, in real-time, to the range of objects and conditions a class 8 truck might encounter while driving autonomously. With Matrix software, both data and metadata are distributed across the entire storage infrastructure to ensure massively parallel access. The software has an optimized network stack, which will deliver low latency and high bandwidth performance, resulting in a solution that can handle the demanding data and metadata intensive operations.
“We don’t rely on LiDAR as our primary sensor, we do a lot of camera-based analysis,” added Hou. “The data sets that train our AI models are comprised of millions of image files which need to be read at high bandwidth. Matrix provides the low latency, high bandwidth we need to meet our data ingest demands.“
“TuSimple is a visionary company whose AV technology is unlike anything else currently available,” said Liran Zvibel, co-founder and CEO, WekaIO. “We have deep expertise in architecting AI storage infrastructures similar to the TuSimple use case at other AV locations, which has moreover contributed to our understanding of how to handle AI at massive scale. I’m also pleased to say that we are delivering on our promise to keep TuSimple’s GPU cluster fully saturated with data and accelerating its training workloads.“