Investing in Penn’s Data Science Ecosystem

by Erica K. Brockmeier

As part of a major University-wide investment in science, engineering, and medicine, the Innovation in Data Engineering and Science Initiative aims to help Penn become a leader in developing data-driven approaches that can transform scientific discovery, engineering research, and technological innovation.

From smartphones and fitness trackers to social media posts and COVID-19 cases, the past few years have seen an explosion in the amount and types of data that are generated daily. To help make sense of these large, complex datasets, the field of data science has grown, providing methodologies, tools, and perspectives across a wide range of academic disciplines.

But the challenges that lie ahead for data scientists and engineers, from developing algorithms that don’t exacerbate biases to ensuring privacy protections, are equally complex and, in some instances, require entirely new ways of thinking.

As part of its $750 million investment in science, engineering, and medicine, the University has committed to supporting the future needs of this field. To this end, the Innovation in Data Engineering and Science (IDEAS) initiative will help Penn become a leader in developing data-driven approaches that can transform scientific discovery, engineering research, and technological innovation.

“The IDEAS initiative is game-changing for our University,” says President Amy Gutmann. “This new investment allows us to boost our interdisciplinary efforts across campus, recruit phenomenal additional team members, and generate an even more sound foundation for discovery, experimentation, and design. This initiative is a clear statement that Penn is committed to taking data science head-on.”

Building on a foundation of existing expertise

Led by the School of Engineering and Applied Science, the IDEAS initiative builds upon the steadily gathering momentum of its data-centric research. The Warren Center for Network and Data Sciences has been a major catalyst for this type of work, generating foundational research on ethical algorithms and data privacy, as well as collaborations that have drawn in faculty from the Wharton School, Law School, Perelman School of Medicine, and beyond. In addition, Wharton’s Department of Statistics and Data Science is an active partner in research and teaching initiatives that apply statistical modeling across a wide variety of fields.

“One of the unique things about data science and data engineering is that it’s a very horizontal technology, one that is going to be impacting every department on campus,” says George Pappas, Electrical and Systems Engineering Department chair. “When you have a horizontal technology in a competitive area, we have to figure out specific areas where Penn can become a worldwide leader.”

To do this, IDEAS aims to recruit new faculty across three research areas: artificial intelligence (AI) to transform scientific discovery, trustworthy AI for autonomous systems, and understanding connections between the human brain and AI.

Penn already has a strong foundation in using AI for scientific discovery thanks in part to investments in basic research facilities such as the Singh Center for Nanotechnology and the Laboratory for Research on the Structure of Matter. Additionally, there are centers focused on connecting researchers from different fields to address complex scientific questions, including the Center for Soft and Living Matter, Center for Engineering Mechanobiology, and Penn Institute for Computational Science.

Developing “trustworthy” algorithms, ones that work reliably outside of situations in which they are trained, is another key component of the IDEAS initiative. Ongoing research at the Penn Research in Embedded Computing and Integrated Systems Engineering (PRECISE) Center, the General Robotics, Automation, Sensing & Perception (GRASP) Lab, and DARPA-funded projects on the safety of AI-based aircraft control provide a starting point for furthering Penn’s research portfolio on safe, explainable, and trustworthy autonomous systems.

In the area of neuroscience and how the human brain is similar to AI and machine learning approaches, research from PIK Professor Konrad Kording and Dani Bassett’s Complex Systems lab exemplifies the types of cross-disciplinary efforts that are essential for addressing complex questions. By recruiting additional faculty in this area, IDEAS will help Penn make strides in bio-inspired computing and in future life-changing discoveries that could address cognitive disorders and nervous system diseases.

Read the full story in Penn Today.

Konrad Kording Receives Named University Professorship

Konrad Kording (Photo by Eric Sucar)

President Amy Gutmann has recently announced that two Penn Integrates Knowledge Professors, one of which is Penn Engineering’s own Konrad Kording, have received named University Professorships.  

Kording, who holds joint appointments in the Department of Neuroscience in the Perelman School of Medicine and the Department of Bioengineering in the School of Engineering and Applied Science, will become the Nathan Francis Mossell University Professor. 

When Nathan Francis Mossell graduated in 1882, he became the first African American to earn a medical degree from Penn. He soon became a prominent African American physician, the first to be elected to the Philadelphia County Medical Society. He helped found the Frederick Douglass Memorial Hospital and Training School, which treated Black patients and helped train the next generation of Black doctors and nurses.  

“Dr. Mossell was truly inspiring. He had to fight for everything, yet never reneged on his principles. He pretty much started a hospital and was a major champion for the advancement of equality for African Americans,” Kording said. “In my research, where I study how intelligence works, I am inspired by scholars like him who combine many different insights. He was a wonderful man, and I will be proud to carry his name.” 

Read more in Penn Today.

Penn Researchers Show ‘Encrypted’ Peptides Could be Wellspring of Natural Antibiotics

by Melissa Pappas

César de la Fuente, Ph.D.

While biologists and chemists race to develop new antibiotics to combat constantly mutating bacteria, predicted to lead to 10 million deaths by 2050, engineers are approaching the problem through a different lens: finding naturally occurring antibiotics in the human genome.

The billions of base pairs in the genome are essentially one long string of code that contains the instructions for making all of the molecules the body needs. The most basic of these molecules are amino acids, the building blocks for peptides, which in turn combine to form proteins. However, there is still much to learn about how — and where — a particular set of instructions are encoded.

Now, bringing a computer science approach to a life science problem, an interdisciplinary team of Penn researchers have used a carefully designed algorithm to discover a new suite of antimicrobial peptides, hiding deep within this code.

The study, published in Nature Biomedical Engineering, was led by César de la Fuente, Presidential Assistant Professor in Bioengineering, Microbiology, Psychiatry, and Chemical and Biomolecular Engineering, spanning both Penn Engineering and Penn Medicine, and his postdocs Marcelo Torres and Marcelo Melo. Collaborators Orlando Crescenzi and Eugenio Notomista of the University of Naples Federico II also contributed to this work.

“The human body is a treasure trove of information, a biological dataset. By using the right tools, we can mine for answers to some of the most challenging questions,” says de la Fuente. “We use the word ‘encrypted’ to describe the antimicrobial peptides we found because they are hidden within larger proteins that seem to have no connection to the immune system, the area where we expect to find this function.”

Read the full story in Penn Engineering Today.

Using Big Data to Measure Emotional Well-being in the Wake of George Floyd’s Murder

by Melissa Pappas

George Floyd’s murder had an undeniable emotional impact on people around the world, as evidenced by this memorial mural in Berlin, but quantifying that impact is challenging. Researchers from Penn Engineering and Stanford have used a computational approach on U.S. survey data to break down this emotional toll along racial and geographic lines. Their results show a significantly larger amount of self-reported anger and sadness among Black Americans than their White counterparts. (Photo: Leonhard Lenz)

The murder of George Floyd, an unarmed Black man who was killed by a White police officer, affected the mental well-being of many Americans. The effects were multifaceted as it was an act of police brutality and example of systemic racism that occurred during the uncertainty of a global pandemic, creating an even more complex dynamic and emotional response.

Because poor mental health can lead to a myriad of additional ailments, including poor physical health, inability to hold a job and an overall decrease in quality of life, it is important to understand how certain events affect it. This is especially critical when the emotional burden of these events  falls most on demographics affected by systemic racism. However, unlike physical health, mental health is challenging to characterize and measure, and thus, population-level data on mental health has been limited.

To better understand patterns of mental health on a population scale, Penn Engineers Lyle H. Ungar, Professor of Computer and Information Science (CIS), and Sharath Chandra Guntuku, Research Assistant Professor in CIS, take a computational approach to this challenge. Drawing on large-scale surveys as well as language analysis in social media through their work with the World Well-Being Project, they have developed visualizations of these patterns across the U.S.

Their latest study involves tracking changes in emotional and mental health following George Floyd’s murder. Combining polling data from the U.S. Census and Gallup, Guntuku, Ungar and colleagues have shown that Floyd’s murder spiked a wave of unprecedented sadness and anger across the U.S. population, the largest since relevant data began being recorded in 2009.

Read the full story in Penn Engineering Today.

N.B. Lyle Ungar is also a member of the Penn Bioengineering Graduate Group.

“This is What a Data Scientist Looks Like”

Speakers at the second annual Women in Data Science @ Penn Conference.

Last month, the second annual Women in Data Science (WiDS) @ Penn Conference virtually gathered nearly 500 registrants to participate in a week’s worth of academic and industry talks, live speaker Q&A sessions, and networking opportunities.

Hosted by Penn Engineering, Analytics at WhartonWharton Customer Analytics and Wharton’s Statistics Department, the conference’s theme — “This is What a Data Scientist Looks Like” – emphasized the depth, breadth, and diversity of data science, both in terms of the subjects the field covers and the people who enter it.

Following welcoming remarks from Erika James, Dean of the Wharton School, and Vijay Kumar, Nemirovsky Family Dean of Penn Engineering, the conference began with a keynote address from President of Microsoft US and Wharton alumna Kate Johnson.

Conference sessions continued throughout the week, featuring panels of academic data scientists from around Penn and beyond, industry leaders from IKEA Digital, Facebook and Poshmark, and lightning talks from students speakers who presented their data science research.

All of the conference’s sessions are now available on YouTube and the 2021 WiDS Conference Recap, including a talk titled “How Humans Build Models for the World” by Danielle Bassett, J. Peter Skirkanich Professor in Bioengineering and Electrical and Systems Engineering.

Read more about the conference at Wharton Stories: “How Women in Data Science Rise to the Top.

Originally posted in Penn Engineering Today.

Brain-machine interfaces: Villainous gadgets or tools for next-gen superheroes?

A Q&A with neuroscientist Konrad Kording on how connections between minds and machines are portrayed in popular culture, and what the future holds for this reality-defying technology.

Science fiction and superhero films portray brain-machine interfaces as malevolent robots that plug into human brains for fuel in The Matrix (top left) or as power-enhancing devices in X-Men (top right). In reality, they can help patients use artificial limbs or directly connect to computers. (Image credits, from top left to bottom right: Warner Brothers, 20th Century Fox, Intelligent Films, AFP Photo/Jean-Pierre Clatot)

For the many superheroes that use high-powered gadgets to save the day, there’s an equal number of villains who use technology nefariously. From robots that plug into human brains for fuel in “The Matrix” to the memory-warping devices seen in “Men in Black,” “Captain Marvel,” and “Total Recall,” technology that can control people’s minds is one of the most terrifying examples of technology gone wrong in science fiction and superhero films.

Now, progress made on brain-machine interfaces, technology that provides a direct communication link between a brain and an external device, is bringing us closer to a world that feels like science fiction. Elon Musk’s company NeuraLink is working on a device to let people control computers with their minds, while Facebook’s “mind-reading initiative” can decode speech from brain activity. Is this progress a glimpse into a dark future, or are there more empowering ways in which brain-machine interfaces could become a force for good?

Penn Today talked with Konrad Kording, a Penn Integrates Knowledge Professor of Neuroscience, Bioengineering, and Computer and Information Science whose group works at the interface of data science and neuroscience to better understand the human brain, to learn more about brain-machine interfaces and where real-world technologies and science fiction intersect.

Q: What are the main challenges in connecting brains to devices?

The key problem is that you need to get a lot of information out of brains. Today’s prosthetic devices are very slow, and if we want to go faster it’s a tradeoff: I can go slower and then I am more precise, or I can go faster and be more noisy. We need to get more data out of brains, and we want to do it electrically, meaning we need to get more electrodes into brains.

So what do you need? You need a way of getting electrodes into the brain without making your brain into a pulp, you want the electrodes to be flexible so they can stay in longer, and then you want the system to be wireless. You don’t want to have a big connector on the top of your head.

It’s primarily a hardware problem. We can get electrodes into brains, but they deteriorate quickly because they are too thick. We can have plugs on people’s heads, but it’s ruling out any real-world usage. All these factors hold us back at the moment.

That’s why the Neuralink announcement was very interesting. They get a rather large number of electrodes into brains using well-engineered approaches that make that possible. What makes the difference is that Neuralink takes the best ideas in all the different domains and puts them together.

Q: Most examples in pop culture of connecting brains to machines have villainous or nefarious ends. Does that match up with how brain-machine interfaces are currently being developed? 

Let’s say you’ve had a stroke, you can’t talk, but there’s a prosthetic device that allows you to talk again. Or if you lost your arm, and you get a new one that’s as good as the original—that’s absolutely a force for good.

It’s not a dark, ugly future thing, it’s a beautiful step forward for medicine. I want to make massive progress in these diseases. I want patients who had a stroke to talk again; I want vets to have prosthetic devices that are as good as the real thing. I think short-term this is what’s going to happen, but we are starting to worry about the dark sides.

Read the full interview at Penn Today.