• About
  • FAQ
  • Earn Bitcoin while Surfing the net
  • Buy & Sell Crypto on Paxful
Newsletter
Approx Foundation
  • Home
    • Home – Layout 1
  • Bitcoin
  • Ethereum
  • Regulation
  • Market
  • Blockchain
  • Business
  • Guide
  • Contact Us
No Result
View All Result
  • Home
    • Home – Layout 1
  • Bitcoin
  • Ethereum
  • Regulation
  • Market
  • Blockchain
  • Business
  • Guide
  • Contact Us
No Result
View All Result
Approx Foundation
No Result
View All Result
Home Blockchain

IBM Research data loader enhances AI model training for open-source community

Moussa by Moussa
October 4, 2024
in Blockchain
0
189
SHARES
1.5k
VIEWS
Share on FacebookShare on Twitter


How do you overcome bottlenecks when you’re training AI models on massive quantities of data? At this year’s PyTorch conference, IBM Research showcased a groundbreaking data loader for large-scale LLM training. The tool, now available to PyTorch users, aims to simplify large-scale training for as broad an audience as possible.

The origins of the research

The idea for the high-throughput data loader stemmed from practical issues research scientists observed during model training, as their work required a tool that could process large amounts of data across multiple devices—all while keeping up with progressively efficient GPUs. As IBM Research notes in its blog about the release, “It’s all thanks to a team of researchers who were simply building the tools they needed to get a job done.”

Related articles

Real-World Use Cases of Security Token Offerings (STOs)

Real-World Use Cases of Security Token Offerings (STOs)

April 3, 2026
Success Story: Ola Osode’s Learning Journey with 101 Blockchains

Success Story: Ola Osode’s Learning Journey with 101 Blockchains

April 1, 2026

Davis Wertheimer of IBM Research explains some of the challenges that can emerge during large-scale training: “There’s something of an 80/20 rule when it comes to large-scale training. Eighty percent of all the published literature is looking at algorithmic tradeoffs between GPU memory and communication and computation. But when you actually try to build something, 80% of the time, you can depend on a very long tail of all these other practical issues because the pipeline runs at the speed of the narrowest bottleneck.”

As the IBM team developed their training platform, they continued encountering bottlenecks. “As we get better and better at using our GPUs, more and more often the bottleneck is the data loader,” observes Wertheimer.

This realization led to a dual development process. “There’s been a parallel journey of, on the one hand, evolving our training platform, and, on the other hand, constantly evolving our data loader to keep up with the speed demands from our training platform to avoid bottlenecking it,” he explains.

Key features of the world-class data loader

IBM Research’s Linsong Chu outlines the essential features of the data loader:

Stateful and checkpointable: “Whenever you save a model, your data loader state is also saved, and whenever you recover from a checkpoint, both the model state and data loader states need to be recovered at the same time,” says Chu.

Auto-rescaling of checkpoints: The data loader automatically adjusts to workload changes during extended training sessions. “Training could easily take weeks or months, and there are tons of reasons why you might have to rescale your workload in the middle,” notes Chu.

Efficient data streaming: The system supports data streaming with zero build overhead for shuffling.

Asynchronous distributed operation: “We want the data loader to be non-blocking,” Chu explains. “While saving the data loader state, we want the saving to be distributed in a form where zero communication is involved.”

Dynamic data mixing: The data loader can adapt to different data mixing ratios, which is useful for evolving training needs.

Efficient global shuffling: The tool addresses memory bottlenecks when handling large datasets, making shuffling efficient even as data grows.

PyTorch native, modular and extensive: Designed for adaptability and scalability, the data loader is prepared for future growth. “What if next year we have to deal with 30 trillion, 50 trillion or 100 trillion tokens?” asks Chu. “The world is changing fast, so we need to build the data loader so it can not only survive today, but also survive for tomorrow.”

Real-world performance

The IBM Research team rigorously tested their data loader over several months, running hundreds of small and large jobs. They observed stable and smooth code numbers. Moreover, the entire data loader operates asynchronously and is non-blocking.

“We leveraged a lot of built-in PyTorch capabilities in order to make all this happen,” says Wertheimer. “That’s why we’re contributing, contributing it back.”

eBook: How to choose the right foundation model

Was this article helpful?

YesNo



Source link

Share76Tweet47

Related Posts

Real-World Use Cases of Security Token Offerings (STOs)

Real-World Use Cases of Security Token Offerings (STOs)

by Moussa
April 3, 2026
0

Blockchain technology has introduced a paradigm shift in the ways we think about ownership, investments and value transfer. The growing...

Success Story: Ola Osode’s Learning Journey with 101 Blockchains

Success Story: Ola Osode’s Learning Journey with 101 Blockchains

by Moussa
April 1, 2026
0

About Ola Osode Full Name: Ola Osode Designation: Founder and Team Lead Company: Gallinex AI Country: Serbia Which course or...

Announcement: 101 Blockchains Recognized as a Leader in the G2 Spring 2026 Reports

Announcement: 101 Blockchains Recognized as a Leader in the G2 Spring 2026 Reports

by Moussa
March 27, 2026
0

Our streak of excellence in the G2 reports continues in the latest spring report for 2026. At 101 Blockchains, we...

Success Story: Aaron Simon’s Learning Journey with 101 Blockchains

Success Story: Aaron Simon’s Learning Journey with 101 Blockchains

by Moussa
March 24, 2026
0

About Aaron Simon Full Name: Aaron Simon Designation: Lawyer | Privacy, Cybersecurity & Compliance Company: ECIJA Country: Spain Aaron’s Learning...

101 Blockchains Rejoins Paris Blockchain Week 2026 as an Official Partner

101 Blockchains Rejoins Paris Blockchain Week 2026 as an Official Partner

by Moussa
March 19, 2026
0

Paris Blockchain Week, one of the biggest community events in the web3 space, is set to return in 2026. 101...

Load More

youssufi.com

sephina.com

[vc_row full_width="stretch_row" parallax="content-moving" vc_row_background="" background_repeat="no-repeat" background_position="center center" footer_scheme="dark" css=".vc_custom_1517813231908{padding-top: 60px !important;padding-bottom: 30px !important;background-color: #191818 !important;background-position: center;background-repeat: no-repeat !important;background-size: cover !important;}" footer_widget_title_color="#fcbf46" footer_button_bg="#fcb11e"][vc_column width="1/4"]

We bring you the latest in Crypto News

[/vc_column][vc_column width="1/4"][vc_wp_categories]
[/vc_column][vc_column width="1/4"][vc_wp_tagcloud taxonomy="post_tag"][/vc_column][vc_column width="1/4"]

Newsletter

[vc_raw_html]JTNDcCUzRSUzQ2RpdiUyMGNsYXNzJTNEJTIydG5wJTIwdG5wLXN1YnNjcmlwdGlvbiUyMiUzRSUwQSUzQ2Zvcm0lMjBtZXRob2QlM0QlMjJwb3N0JTIyJTIwYWN0aW9uJTNEJTIyaHR0cHMlM0ElMkYlMkZhcHByb3gub3JnJTJGJTNGbmElM0RzJTIyJTNFJTBBJTBBJTNDaW5wdXQlMjB0eXBlJTNEJTIyaGlkZGVuJTIyJTIwbmFtZSUzRCUyMm5sYW5nJTIyJTIwdmFsdWUlM0QlMjIlMjIlM0UlM0NkaXYlMjBjbGFzcyUzRCUyMnRucC1maWVsZCUyMHRucC1maWVsZC1maXJzdG5hbWUlMjIlM0UlM0NsYWJlbCUyMGZvciUzRCUyMnRucC0xJTIyJTNFRmlyc3QlMjBuYW1lJTIwb3IlMjBmdWxsJTIwbmFtZSUzQyUyRmxhYmVsJTNFJTBBJTNDaW5wdXQlMjBjbGFzcyUzRCUyMnRucC1uYW1lJTIyJTIwdHlwZSUzRCUyMnRleHQlMjIlMjBuYW1lJTNEJTIybm4lMjIlMjBpZCUzRCUyMnRucC0xJTIyJTIwdmFsdWUlM0QlMjIlMjIlM0UlM0MlMkZkaXYlM0UlMEElM0NkaXYlMjBjbGFzcyUzRCUyMnRucC1maWVsZCUyMHRucC1maWVsZC1lbWFpbCUyMiUzRSUzQ2xhYmVsJTIwZm9yJTNEJTIydG5wLTIlMjIlM0VFbWFpbCUzQyUyRmxhYmVsJTNFJTBBJTNDaW5wdXQlMjBjbGFzcyUzRCUyMnRucC1lbWFpbCUyMiUyMHR5cGUlM0QlMjJlbWFpbCUyMiUyMG5hbWUlM0QlMjJuZSUyMiUyMGlkJTNEJTIydG5wLTIlMjIlMjB2YWx1ZSUzRCUyMiUyMiUyMHJlcXVpcmVkJTNFJTNDJTJGZGl2JTNFJTBBJTNDZGl2JTIwY2xhc3MlM0QlMjJ0bnAtZmllbGQlMjB0bnAtcHJpdmFjeS1maWVsZCUyMiUzRSUzQ2xhYmVsJTNFJTNDaW5wdXQlMjB0eXBlJTNEJTIyY2hlY2tib3glMjIlMjBuYW1lJTNEJTIybnklMjIlMjByZXF1aXJlZCUyMGNsYXNzJTNEJTIydG5wLXByaXZhY3klMjIlM0UlQzIlQTBCeSUyMGNvbnRpbnVpbmclMkMlMjB5b3UlMjBhY2NlcHQlMjB0aGUlMjBwcml2YWN5JTIwcG9saWN5JTNDJTJGbGFiZWwlM0UlM0MlMkZkaXYlM0UlM0NkaXYlMjBjbGFzcyUzRCUyMnRucC1maWVsZCUyMHRucC1maWVsZC1idXR0b24lMjIlM0UlM0NpbnB1dCUyMGNsYXNzJTNEJTIydG5wLXN1Ym1pdCUyMiUyMHR5cGUlM0QlMjJzdWJtaXQlMjIlMjB2YWx1ZSUzRCUyMlN1YnNjcmliZSUyMiUyMCUzRSUwQSUzQyUyRmRpdiUzRSUwQSUzQyUyRmZvcm0lM0UlMEElM0MlMkZkaXYlM0UlM0NiciUyRiUzRSUzQyUyRnAlM0U=[/vc_raw_html][/vc_column][/vc_row]
No Result
View All Result
  • Contact Us
  • Homepages
  • Business
  • Guide

© 2024 APPROX FOUNDATION - The Crypto Currency News