Big Data

News Feed

Open-source training framework increases the speed of large language model pre-training when failures arise

Pipeline templates strike a balance between speed and effectiveness in resilient distributed computing.

H.V. Jagadish named Edgar F. Codd Distinguished University Professor of EECS

Professor Jagadish is being recognized for his work as one of the nation’s most visible and influential researchers in the interdisciplinary field of data science.

Prof. Danai Koutra recognized as rising star with ACM SIGKDD Award

The Rising Star Award is based on an individual’s whole body of work in the first five years after the PhD.

How predictive modeling could help us reopen more safely

Graphical online simulation could spur more targeted COVID-19 protection measures.

Analytical model predicts exactly how much a piece of hardware will speed up data centers

The analytical model, called Accelerometer, can be applied in the early stages of an accelerator’s design to predict its effectiveness before it is ever installed.

Big data, small footprint

How changing the rules of computing could lighten Big Data’s impact on the internet.

Generating realistic stock market data for deeper financial research

A team at Michigan proposed an approach to generating realistic and high-fidelity stock market data to enable broader study of financial markets.

Three faculty earn MIDAS grants to broaden the frontiers of data science

This round of funding strongly encourages pioneering work with the potential for major expansion.

$2M NSF grant to explore data equity systems

Researchers plan to establish a framework for a national institute that would enable research using sensitive data, while preventing misuse and misinterpretation.

$1M NSF grant supports new system for gathering, structuring data with ease

The team's new tool will combine software and data to make gathering structured data dramatically easier.

Automated tool optimizes complex programs better than humans

Erie provided database repairs that were previously performed exclusively by human programmers.

‘Air traffic control’ for driverless cars could speed up deployment

Human-generated responses could remotely assist autonomous vehicles’ decisions during times of uncertainty.

Two papers announced among 10 most influential in healthcare and infection control

The papers provide data-driven solutions to hospital infection and the use of machine learning in healthcare.

Michigan Data Science Team wrangles big data

MDST brings together students from many fields to get their hands dirty with real data science problems and tools.

Tool for structuring data creates efficiency for data scientists

Foofah is a tool that can help minimize the effort and background knowledge needed to clean up data.

Undocumented immigrants’ privacy at risk online, on phones

When it comes to their smartphones, immigrants struggle to apply instinctive caution, according to a study by a team of University of Michigan researchers.

Cafarella Receives VLDB Test of Time Award for Structured Web Data Search

This award is given to the VLDB paper published ten years earlier that has had the most influence since its publication.

Study maps careers of CS PhDs using decades of data

The researchers identified movement between industry, academia, and government work, tracked the growth of important organizations, and built predictive models for career transitions and employer retention.

“Stitching” together a web user from scattered, messy data

Even though we interact with different web services in different ways, there are clues in the data that can indicate trends and identify a unique profile.

Michigan, Georgia Tech researchers funded to deter financial market manipulation

Increasingly, market manipulators are attacking market integrity through complex computer-controlled attacks.

“Learning database” speeds queries from hours to seconds

Verdict can make databases deliver answers more than 200 times faster while maintaining 99 percent accuracy.

Codeon is the intelligent assistant for software developers

With Codeon, developers can ask for help by speaking their requests aloud within the context of their Integrated Development Environment (IDE).

Kurator Will Help You Curate Your Personal Digital Content

Kurator is a hybrid intelligence system leveraging mixed-expertise crowds to help families curate their personal digital content, including videos and photos.

Movie design for specific target audiences

Researchers are working to design a successful movie that will attract the interest of a targeted demographic by leveraging user ratings, reviews, and product characteristics.

Social interaction patterns provide clues to real life changes

The identified changes in social media behavior may point to real events and changes, some of which can benefit from intervention.

COVE: a tool for advancing progress in computer vision

Centralizing available data in the intelligent systems community through a COmputer Vision Exchange for Data, Annotations and Tools, called COVE.

Shadows in the Dark Web

Secrets lurk in the dark web, the 95 percent of the internet that most of us can't see. One U-M professor is bringing some of those secrets to light, making the digital and the real world a little safer.

Summer Bootcamp prepares undergraduates for work with big data

The Big Data Summer Bootcamp is a six-week interdisciplinary training and research program at the University of Michigan.

Collecting data to better identify bipolar disorder

Prof. Emily Mower Provost is collaborating to develop new technologies that provide individuals with insight into how the disease changes over time.

Google, U-M to build digital tools for Flint water crisis

CSE students and faculty will collaborate as a part of a larger team to help respond to the crisis.

U-M researchers launch fight against C. difficile with $9.2M grant from NIH

Prof. Wiens will continue to use machine learning techniques to study the disease.

Machine learning proves useful for analyzing NBA ball screen defense

The team used machine learning to extract information from NBA sports data and automatically recognize common defensive strategies against ball screens.

Michael J. Cafarella selected for Sloan Research Fellowship

He has built software systems for information extraction, database integration, and feature engineering and applied these to problems in the social sciences.

Jenna Wiens receives NSF CAREER Award to increase the utility of machine learning in clinical care

Her primary research interests lie at the intersection of machine learning and healthcare.

Barzan Mozafari receives NSF CAREER Award to improve predictability of database systems

Prof. Mozafari is passionate about building large-scale data-intensive systems that are more scalable, more robust, and more predictable.

Censys enables fast searching of actionable internet data

The software enables users to ask questions about the hosts and networks that compose the Internet and get an immediate reply.

The Promise and Perils of Predictive Policing Based on Big Data

Such tactics, even if effective in reducing crime, raise civil liberty concerns.

Researchers Expose Security Flaws in Backscatter X-ray Scanners

Researchers demo hack to seize control of municipal traffic signal systems

Barzan Mozafari and collaborators chosen for best demo at ACM SIGMOD

Third Annual Data Mining Workshop Brings Together 100+ Researchers

100+ researchers from across the University of Michigan and from industry gathered on North Campus for the third U-M Workshop on Data Mining.