Big Data
News Feed
Open-source training framework increases the speed of large language model pre-training when failures arise
Pipeline templates strike a balance between speed and effectiveness in resilient distributed computing.H.V. Jagadish named Edgar F. Codd Distinguished University Professor of EECS
Professor Jagadish is being recognized for his work as one of the nation’s most visible and influential researchers in the interdisciplinary field of data scienceProf. Danai Koutra recognized as rising star with ACM SIGKDD Award
The Rising Star Award is based on an individual’s whole body of work in the first five years after the PhD.
How predictive modeling could help us reopen more safely
Graphical online simulation could spur more targeted COVID-19 protection measures.
Analytical model predicts exactly how much a piece of hardware will speed up data centers
The analytical model, called Accelerometer, can be applied in the early stages of an accelerator’s design to predict its effectiveness before ever being installed.
Big data, small footprint
How changing the rules of computing could lighten Big Data’s impact on the internet.
Generating realistic stock market data for deeper financial research
A team at Michigan proposed an approach to generating realistic and high-fidelity stock market data to enable broader study of financial markets.
Three faculty earn MIDAS grants to broaden the frontiers of data science
This round of funding strongly encourages pioneering work with the potential for major expansion.
$2M NSF grant to explore data equity systems
Researchers plan to establish a framework for a national institute that would enable research using sensitive data, while preventing misuse and misinterpretation.$1M NSF grant supports new system for gathering, structuring data with ease
The team's new tool will combine of software and data to make gathering structured data dramatically easier.
Automated tool optimizes complex programs better than humans
Erie provided database repairs that were previously performed exclusively by human programmers.
‘Air traffic control’ for driverless cars could speed up deployment
Human-generated responses could remotely assist autonomous vehicles decision’s during times of uncertainty.
Two papers announced among 10 most influential in healthcare and infection control
The papers provide data-driven solutions to hospital infection and the use of machine learning in healthcare.
Michigan Data Science Team wrangles big data
MDST brings together students from many fields to get their hands dirty with real data science problems and tools.
Tool for structuring data creates efficiency for data scientists
Foofah is a tool that can help to minimize the effort and required background knowledge needed to clean up data.
Undocumented immigrants’ privacy at risk online, on phones
When it comes to their smartphones, immigrants struggle to apply instinctive caution, according to a study by a team of University of Michigan researchers.
Cafarella Receives VLDB Test of Time Award for Structured Web Data Search
This award is given to the VLDB paper published ten years earlier that has had the most influence since its publication.
Study maps careers of CS PhDs using decades of data
The researchers identified movement between industry, academia, and government work, tracked the growth of important organizations, and built predictive models for career transitions and employer retention.
“Stitching” together a web user from scattered, messy data
Even though we interact with different web services in different ways, there are clues in the data that can indicate trends and identify a unique profile.
Michigan, Georgia Tech researchers funded to deter financial market manipulation
Increasingly, market manipulators are attacking market integrity through complex computer-controlled attacks.
“Learning database” speeds queries from hours to seconds
Verdict can make databases deliver answers more than 200 times faster while maintaining 99 percent accuracy.
Codeon is the intelligent assistant for software developers
With Codeon, developers can request help by speaking their requests aloud within the context of their Integrated Development Environment (IDE).
Kurator Will Help You Curate Your Personal Digital Content
Kurator is a hybrid intelligence system leveraging mixed-expertise crowds to help families curate their personal digital content, including videos and photos.
Movie design for specific target audiences
Researchers are working to design a successful movie that will attract the interest of a targeted demographic by leveraging user ratings, reviews, and product characteristics.
Social interaction patterns provide clues to real life changes
The identified changes in social media behavior may point to real events and changes, some of which can benefit from intervention.
COVE: a tool for advancing progress in computer vision
Centralizing available data in the intelligent systems community through a COmputer Vision Exchange for Data, Annotations and Tools, called COVE.
Shadows in the Dark Web
Secrets lurk in the dark web, the 95 percent of the internet that most of us can't see. One U-M professor is bringing some of those secrets to light, making the digital and the real world a little safer.Summer Bootcamp prepares undergraduates for work with big data
The Big Data Summer Bootcamp is a six-week interdisciplinary training and research program at the University of Michigan.
Collecting data to better identify bipolar disorder
Prof. Emily Mower Provost is collaborating to develop new technologies that provide individuals with insight into how the disease changes over time.
Google, U-M to build digital tools for Flint water crisis
CSE students and faculty will collaborate as a part of a larger team to help respond to the crisis.
U-M researchers launch fight against C. difficile with $9.2M grant from NIH
Prof. Wiens will continue to use machine learning techniques to study the disease.
Machine learning proves useful for analyzing NBA ball screen defense
The team used machine learning to extract information from NBA sports data for automatically recognizing common defense strategies to ball screens.
Michael J. Cafarella selected for Sloan Research Fellowship
He has built software systems for information extraction, database integration, and feature engineering and applied these to problems in the social sciences.
Jenna Wiens receives NSF CAREER Award to increase the utility of machine learning in clinical care
Her primary research interests lie at the intersection of machine learning and healthcare.
Barzan Mozafari receives NSF CAREER Award to improve predictability of database systems
Prof. Mozafari is passionate about building large-scale data-intensive systems that are more scalable, more robust, and more predictable.
Censys enables fast searching of actionable internet data
The software enables users to ask questions about the hosts and networks that compose the Internet and get an immediate reply.
The Promise and Perils of Predictive Policing Based on Big Data
Such tactics, even if effective in reducing crime, raise civil liberty concerns.