Projects
Current Projects
Sequential Instance-wise Machine Learning
|
In a typical supervised machine learning setting, the predictions on all test instances are based on a common subset of features discovered during model training. However, in many real-world applications, a feature is typically acquired at a cost, which can relate to among others, the time and effort spent in generating it, and its discriminative power. For instance, in healthcare, certain tasks may be very informative (e.g., magnetic resonance imaging) during the diagnosis process, but can be intrusive. In ubiquitous computing, limited energy resources can prevent continuous collection of high-fidelity data, or users may not wish to reveal answers to certain questions. Such cases give rise to the very interesting phenomenon, where the full set of features can be accessed during the training process, while during the testing process, a complete set of features for all data instances cannot be obtained (i.e., features are different for each instance by choice or due to constraints). Most existing machine learning algorithms, however, ignore resource constraints and/or acquire a general solution for all cases, while not scaling in big data settings. The goal of this project is to devise mathematical frameworks, theory and algorithms for dynamic instance-wise feature acquistion and prediction in various settings.
|
Towards Optimized Operation of Cost-Constrained Complex Cyber-Physical-Human Systems
NSF CAREER award 1942330
|
Self-driving cars and home assistants provide just a small glimpse of the future cost-costrainted complex
cyber-physical-human systems (CPHS) that will integrate engineering systems with the natural word and humans. This
project will devise new mathematical tools and methods to systematically describe CPHS and optimize their
operation. The application focus is on wireless body area networks, a natural CPHS representative with humans in
the loop, heavily resource-constrained operation, and heterogeneous components that are intertwined with and
altered by human behavior. The end result will help understand important factors related to the operation of
CPHS and how to optimize their operation.
Project Website
|
Community on Multimodality: Participatory Action, Service, and Support (COMPASS)
NSF award 1737443
|
Those in need of help often do not know how to locate or access service providers. Likewise, service-providing
agencies often work in silos. The lack of communication also applies to volunteers; people do not know who to help
and how they can be resourceful. Response becomes even more problematic when a problem demands the coordination of
service providers, volunteers, and government structures, and after business hours, when the communication channels
that can aid people in need become sparse. The focus of this project is to (i) develop new data mining methods for
uncovering complex interdependencies within a dynamic sociotechnical system, (ii) devise novel information
processing, machine learning, and control methods to dynamically optimize delivery of human and physical services
under uncertainty with humans in the decision-making loop, and (iii) shed light on the ability of communities to
integrate emerging technologies to become more connected in human interactions.
Project Website
|
Cyberbullying Detection
|
Bullying, once limited to physical spaces (e.g., schools, workplaces or sports fields) and particular times of the
day (e.g., school hours), can now occur anytime, anywhere. Cyberbullying can take many forms, however, it typically
refers to repeated and hostile behavior (e.g., hurtful comments, videos and images) performed in an effort
to intentionally and repeatedly harass or harm individuals. The consequences can be devastating: learning
difficulties, psychological suffering and isolation, escalated physical confrontations, suicide. While
techniques to automatically detect cyberbullying incidents have been developed, the scalability and timeliness
of existing cyberbullying detection approaches have largely been ignored. The goal of this project is to derive
provably optimal, yet scalable online strategies to minimize the time-to-detection of cyberbullying incidents.
|
Towards Achieving Better Market Access for Smallholder Farmers
Google Research AI for Social Good Award
|
Access to appropriate innovation, information and advisory services by smallholders farmers is a vital element in transforming agriculture and food systems. In Ghana, a middle-income West African country with rich natural resources, agriculture is a key economy sector, accounting for 23% of its national GDP with > 50% of its total labor force employed in agriculture. However, smallholder farmers, who produce most of the agricultural commodities on local markets in Ghana, are unable to sustain their livelihoods. Over reliance on rainfed farming, influx of middle men, high postharvest loss and unsafe production practices have made this situation prevalent in Ghana and Africa to a larger extent. Working along AGRI-WEB, a nonprofit organization that connects a diversity of members and partners dedicated to creating financial freedom and growth opportunities through investing in agricultural activities, the goal of this project is to develop better prediction models for the crop yields of smallholder farmers in Ghana to optimize postharvest shortage/glut.
|
Past Projects
Machine Learning for Improving Classification and Detection in SSVEP-based Brain-computer Interfaces
NIH subcontract from the National Center for Adaptive Neurotechnologies
|
Brain-computer interfaces (BCIs) are devices that enable people to control computer systems using brain activity.
Since they require little to no voluntary motor control, they can help people with severe motor deficits (e.g., locked-in syndrome) to communicate, but can have applications for healthy individuals as well (e.g., multimedia and gaming). Steady-state visual evoked potential (SSVEP)-based BCIs are one common type of BCIs, where users are presented with a set of stimuli, each flashing at a unique frequency, and attention to one of these stimuli elicits changes in brain activity at the fundamental and higher harmonic frequencies of the flashing — an SSVEP — that can be measured using electroencephalography (EEG). In this project, we designed classifiers that meet various strict specification requirements for different SSVEP-based BCIs applications.
|
Real-time Accident Detection
|
In recent years, urban mobility demand has become highly variable over time challenging the sustainability of transportation networks of major cities. At the same time, various types of incidents such as accidents, construction zone closures and weather hazards exacerbate the
already congested transportation network. Timely detection of such events can offer an unprecedented opportunity to mitigate the consequences. In this project, we developed a mathematical framework for real-time accident detection in a road segment equipped with spatially distributed speed sensors of variable accuracy. Specifically, we designed fast and low-complexity algorithms that can quickly determine if a collision has taken place and where by appropriately selecting which sensors to query and when.
|
Context-Aware Human State Modeling and Monitoring
SUNY Faculty Research Award A
|
Fine–grained knowledge of human context information in terms of physical activity (e.g.,sit, walk), emotional state (e.g., happy, sad), surroundings (e.g., home, work) and mobile phone usage (e.g., web search, listen to music) offers an unprecedented opportunity to objectively and accurately understand when, where and what type of behaviors are exhibited. To this end, we have built an off-the-self body sensing platform that consists of an inertial measurement unit, an electromyograph, a galvanic skin response and a Motorola Nexus 6 mobile phone to
monitor human context information over time. We have also devised a mathematical framework that formally describe how sensor capabilities, biomedical, contextual and other variables (e.g., channel information) and their interactions can come together to enable accurate and cost-efficient monitoring of human context information over time.
|
Energy-Efficient Physical Activity Detection
|
Wireless Body Area Networks (WBANs) that consist of heterogeneous biometric sensors (e.g.,
heart–rate monitors, accelerometers) and an energy–constrained personal device (e.g., mobile phone, PDA), have the transformative
potential to influence a wide range of applications including health–care, sports, military
and emergency applications. However, the practical realization of a WBAN is hindered by a
number of unique challenges, including energy constraints that significantly impact its lifetime.
To address this issue, we have proposed novel resource allocation strategies that maximize the
network lifetime by minimizing energy spent at the energy–constrained fusion center. The
effectiveness of our proposed methods have been evaluated using extensive simulations on real
data collected from a real prototype WBAN, the KNOWME network,
used for preventing obesity. We have showed that we can improve energy gains by as much as 68%
compared to state–of–the–art schemes, without compromising detection accuracy.
|
Active Object Detection for Computer Vision
|
Object detection is a fundamental, yet very challenging task in image analysis. A typical object
detection algorithm first generates a set of region proposals. These can be either fixed and
class–independent or generated on the fly while searching for a particular object. Each such
proposal is then assigned a class label by running a set of detectors. Evaluating a large number of
region proposals will certainly lead to high detection accuracy, but will incur high computational
costs if each detector evaluation is computationally expensive. In this line of work, we proposed
an object detection algorithm that models image regions as vertices and overlap relationships as
edges in a directed weighted graph. Information is propagated from labeled vertices through graph
edges that operate as noisy channels via message passing over locally informative trees that are
extracted from the original graph using an information-theoretic criterion. Influential vertices are
determined by an appropriate centrality index. Our algorithm can be applied on top of any
state–of–the–art region proposal method as it treats it as a black box. We evaluated the performance
of our proposed algorithm on different scenarios and showed that in some cases only 0.45% of the
total regions is evaluated with maximum 21.45%.
|
Active State Tracking
|
Sensor heterogeneity complicates the inference process since observations generated by different
sensors are of different quality and may be collected at a different cost. Thus, it is necessary to consider
sensor heterogeneity when designing sensing algorithms, especially when the state of the system under
observation changes over time. In the context of this work, we devised a
general framework for active state tracking with heterogeneous observations, where the decision maker selects
which observations to consider at each step based on accuracy and cost criteria. Various criteria were considered
along with their effect on accuracy and were generalized appropriately to address the intertwined key problems of
information characterization and unified sensing and tracking. Based on this
framework, we derived fundamental theory that enabled us to characterize the structure of the
optimum sensing algorithm and devise low-complexity alternative algorithms.
|
|