Anti-Vaccine Messaging Is Well-Connected on Social Media

Inside Science  May 13, 2020 Social media platforms have removed the video for violating misinformation policies, but the 26-minute video highlights one way that the anti-vaccine movement is feeding into the recent surge of misinformation and disinformation swirling around COVID-19. In a new analysis, researchers at Johns Hopkins University provide the first map of how three types of online communities interact: those who promote accurate information about vaccines, those who are against vaccines, and those who are interested in vaccines but don’t obviously lean in either direction, whom the researchers termed undecided. They found that pro-vaccine pages were not well […]

Big Data Helps Taiwan Fight Coronavirus

IEEE Spectrum  March 12, 2020 According to an international team of researchers (USA – Stanford University, UCLA, RAND Corporation, Taiwan), Taiwan owes its success largely to the emergency implementation of big data analytics and new technologies. In late January, as the novel coronavirus began spreading through China, computer scientists modeling the outbreak ranked Taiwan the region with the second-highest risk of importation of the virus. The island sits just 130 km off the coast of mainland China and shuttles thousands of passengers to and from the mainland daily. But so far Taiwan reports that it has largely mitigated the […]

Department of Energy announces $5 million for new computational approaches to biology

EurekAlert  December 6, 2019 According to the DOE, the rapid development of technologies for high-throughput screening of genomes, proteins, metabolites, and other biological features has generated, and continues to generate, mountains of data. The research will focus on data from plants and microbes relevant to DOE missions in energy and environment, including laying the scientific groundwork for cost-effective production of biofuels and bioproducts as well as enhancing understanding of the biological dimensions of environmental cleanup, among other topics. It is expected that many of the new software tools and approaches will ultimately be integrated into the DOE Systems Biology Knowledgebase an […]

Sniffing Out Errors

Inside Big Data  November 26, 2019 Error Analysis builds a model out of your existing model’s errors. From this it is possible to gain an understanding of where the model is succeeding and what can be amended to improve performance. Additionally, this process can be trivially integrated into the data science pipeline and run multiple times to iteratively improve model performance. Building a simple model using all your input features to explain the error will give an indication of which features are driving most of the error. A highly interpretable linear model in error analysis will yield much faster and […]
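
A minimal sketch of the error-analysis idea described above, not the article's own implementation: fit a primary model, then fit a second, interpretable linear model that explains the primary model's absolute error from the same input features. The data is synthetic, and the helper names (`fit_linear`, `error_drivers`) and feature names are hypothetical, chosen only for illustration.

```python
import numpy as np


def fit_linear(X, y):
    """Least-squares linear fit with an intercept; returns (weights, intercept)."""
    A = np.column_stack([X, np.ones(len(X))])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return coef[:-1], coef[-1]


def error_drivers(X, y, y_pred, feature_names):
    """Fit a linear model to the absolute error and rank features by |weight|."""
    abs_err = np.abs(y - y_pred)
    # Standardize features so the weights are comparable across features.
    Xs = (X - X.mean(axis=0)) / X.std(axis=0)
    w, _ = fit_linear(Xs, abs_err)
    order = np.argsort(-np.abs(w))
    return [(feature_names[i], float(w[i])) for i in order]


# Synthetic data (an assumption for this sketch): the noise grows with
# feature "b", so the primary model's errors are largest where "b" is large.
rng = np.random.default_rng(0)
X = rng.uniform(0, 1, size=(2000, 3))
noise = rng.normal(0, 1, size=2000) * (0.1 + 2.0 * X[:, 1])
y = 2 * X[:, 0] + X[:, 1] + noise

# Primary model: a plain linear regression on all features.
w, b = fit_linear(X, y)
y_pred = X @ w + b

# Error model: which features explain where the primary model is wrong?
ranking = error_drivers(X, y, y_pred, ["a", "b", "c"])
for name, weight in ranking:
    print(f"{name}: {weight:+.3f}")
```

Because the error model is linear on standardized inputs, its weights can be read directly as a rough ranking of which features drive the error; the step can be rerun after each change to the primary model.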

Four Big Factors Shaping the Future of Data Science

Inside Big Data  October 26, 2019 As the field grows, four trends are emerging that will shape data science over the next five years: Accelerate The Full Data Science Life-Cycle – The new concept of data science automation goes beyond machine learning automation, including data preparation, feature engineering, machine learning, and the production of full data science pipelines; Leverage Existing Resources for Democratization – The shortage of data scientists has created a challenge for anyone implementing AI and ML initiatives, forcing a closer look at how to build and leverage data science resources; Augment Insights for Greater Transparency – Increased regulatory oversight like […]

Free dataset archive helps researchers quickly find a needle in a haystack

EurekAlert  July 17, 2019 Researchers at UC Riverside have developed the UCR Spatio-temporal Active Repository (UCRSTAR), a free data repository that provides easy access to large spatio-temporal datasets through an interactive exploratory interface. It contains 102 datasets and 5 billion records. The datasets were mapped using Da Vinci. The map interface visualizes the data and lets users explore each dataset interactively. Once a dataset is selected, important details are displayed such as the original homepage, a link to the original download source, size in bytes, number of records, file format, and other useful information. The subset download feature allows […]

Big data is being reshaped thanks to 100-year-old ideas about geometry

The Conversation  February 20, 2019 Scientists in Switzerland recently produced the first-ever digital 3D brain cell atlas, a complete mapping of the brain of a mouse. While this is a colossal achievement, the great challenge now lies in learning to decipher the atlas. In more and more realms of science, researchers are finding themselves with more data than they can effectively make sense of. The response of modern mathematicians to meet the mathematical challenges of big data is still unfolding – and topology, a theory bound only by the imagination of its practitioners, is bound to help shape the future […]

Best of arXiv.org for AI, Machine Learning, and Deep Learning – January 2019

Inside Big Data  February 20, 2019 The articles are academic research papers, typically geared toward graduate students, postdocs, and seasoned professionals. Articles are listed in no particular order with a brief overview – Hard-Exploration Problems, Deep Neural Network Approximation for Custom Hardware: Where We’ve Been, Where We’re Going, Generating Textual Adversarial Examples for Deep Learning Models: A Survey, Revisiting Self-Supervised Visual Representation Learning, Self-Driving Cars: A Survey.

Help IARPA develop better activity detection algorithms for long video

Fedscoop  November 8, 2018 IARPA has issued Activities in Extended Video Prize Challenge, a project which seeks to encourage the development of robust automatic activity detection algorithms for a multi-camera streaming video environment. Challenge participants will develop activity detection and temporal localization algorithms for 18 activities, such as a person opening a car trunk or talking to another person or texting, and locate those activities in time and space. The challenge kicks off Dec. 4 and will run until May 15, 2019.

Big data used to predict the future

Science Daily  November 9, 2018 Using multi-output regression models, researchers in Spain have developed a technique to predict several variables simultaneously from the same set of input variables, thus reducing the amount of data needed for an accurate forecast. In tests they were able to reduce the amount of information needed for prediction by 80% without affecting predictive performance.
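
For readers unfamiliar with the term, the following is a minimal sketch of multi-output regression in general, not the researchers' technique: a single least-squares fit predicts two target variables from the same input variables. The data is synthetic, and the names `fit_multi_output`, `predict_multi_output`, and `true_W` are hypothetical, introduced only for this illustration.

```python
import numpy as np


def fit_multi_output(X, Y):
    """Fit one linear model per output column in a single least-squares solve.

    Returns a weight matrix of shape (n_features + 1, n_outputs); the last
    row holds the intercepts.
    """
    A = np.column_stack([X, np.ones(len(X))])
    W, *_ = np.linalg.lstsq(A, Y, rcond=None)
    return W


def predict_multi_output(W, X):
    """Predict all outputs at once for the rows of X."""
    A = np.column_stack([X, np.ones(len(X))])
    return A @ W


# Synthetic example (an assumption): 3 input variables, 2 output variables.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
true_W = np.array([
    [1.0, -2.0],
    [0.5, 0.0],
    [0.0, 3.0],
    [4.0, -1.0],  # intercepts
])
Y = np.column_stack([X, np.ones(200)]) @ true_W + rng.normal(0, 0.01, size=(200, 2))

W = fit_multi_output(X, Y)
Y_hat = predict_multi_output(W, X[:5])
print(Y_hat.shape)  # one row per sample, one column per predicted variable
```

The point of the sketch is only that all targets share one set of inputs and one fitting step; it makes no claim about the data reduction reported in the article.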