This paper proposes a new approach for online UI adaptation that aims to overcome the limitations of the most commonly used UI optimization method involving multiple objectives: weighted sum optimization. Weighted sums are highly sensitive to objective formulation, limiting the effectiveness of UI adaptations. We propose ParetoAdapt, an adaptation approach that uses online multi-objective optimization with a posteriori articulated preferences, that is, preferences articulated only after the optimization has concluded, to make UI adaptation robust to incomplete and inaccurate objective formulations. It offers users a flexible way to control adaptations by selecting from a set of Pareto optimal adaptation proposals and adjusting them to fit their needs. We showcase the feasibility and flexibility of ParetoAdapt by implementing an online layout adaptation system in a state-of-the-art 3D UI adaptation framework. We further evaluate its robustness and run-time in simulation-based experiments that allow us to systematically vary the accuracy of the estimated user preferences. We conclude by discussing how our approach may impact the usability and practicality of online UI adaptations.
https://doi.org/10.1145/3586183.3606799
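A minimal sketch of the a posteriori idea described in the ParetoAdapt abstract above, under assumptions: candidate adaptations are first reduced to the Pareto front over a few hypothetical objectives (lower is better), and user preferences are applied only afterwards to rank the surviving proposals. The objective vectors and weights are illustrative, not ParetoAdapt's actual formulation.

```python
# Sketch: Pareto filtering first, preference articulation afterwards.
from typing import List, Sequence

def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """True if `a` is at least as good as `b` on every objective (lower is
    better) and strictly better on at least one."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))

def pareto_front(candidates: List[Sequence[float]]) -> List[int]:
    """Indices of candidates not dominated by any other candidate."""
    return [i for i, a in enumerate(candidates)
            if not any(dominates(b, a) for j, b in enumerate(candidates) if j != i)]

def rank_by_preference(candidates, front, weights):
    """Only after the front is known are user weights applied (a posteriori)."""
    return sorted(front, key=lambda i: sum(w * c for w, c in zip(weights, candidates[i])))

# Hypothetical objective vectors: (reach cost, occlusion, consistency penalty).
layouts = [(0.2, 0.8, 0.1), (0.5, 0.3, 0.4), (0.3, 0.9, 0.2), (0.6, 0.6, 0.6)]
front = pareto_front(layouts)                              # non-dominated proposals
ranked = rank_by_preference(layouts, front, (0.5, 0.3, 0.2))  # user picks/adjusts from these
```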
Virtual Reality (VR) has the potential to transform how we work: it enables flexible and personalized workspaces beyond what is possible in the physical world. However, while most VR applications are designed to operate in a single empty physical space, work environments are often populated with real-world objects and are increasingly diverse due to the growing amount of work in mobile scenarios. In this paper, we present InteractionAdapt, an optimization-based method for adapting VR workspaces for situated use in varying everyday physical environments, allowing VR users to transition between real-world settings while retaining most of their personalized VR environment, thereby ensuring efficient interaction, temporal consistency, and visibility. InteractionAdapt leverages physical affordances in the real world to optimize UI elements for the respectively most suitable input technique, including on-surface touch, mid-air touch and pinch, and cursor control. Our optimization term thereby models the trade-off across these interaction techniques based on experimental findings on 3D interaction in situated physical environments. Our two evaluations of InteractionAdapt in a selection task and a travel planning task established its capability to support efficient interaction, during which it produced adapted layouts that participants preferred to several baselines. We further showcase the versatility of our approach through applications that cover a wide range of use cases.
https://doi.org/10.1145/3586183.3606717
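As an illustration of the kind of optimization-based layout adaptation the InteractionAdapt abstract describes (not its actual objective), the sketch below assigns UI elements to physical anchors by combining a per-technique interaction cost with a consistency penalty and solving the resulting assignment problem. The element names, anchors, costs, and weights are all assumptions.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

elements = ["map", "notes", "timeline"]
anchors = ["desk_surface", "wall", "mid_air"]

# Hypothetical interaction cost for each (element, anchor) pair, reflecting
# which input technique the anchor affords (e.g., touch is cheap on the desk).
interaction_cost = np.array([
    [0.2, 0.6, 0.8],   # map
    [0.3, 0.5, 0.9],   # notes
    [0.7, 0.4, 0.5],   # timeline
])
# Penalty for moving an element away from its previous placement (consistency).
consistency_cost = np.array([
    [0.0, 0.5, 0.5],
    [0.5, 0.0, 0.5],
    [0.5, 0.5, 0.0],
])

w_interact, w_consist = 0.7, 0.3          # assumed trade-off weights
cost = w_interact * interaction_cost + w_consist * consistency_cost
rows, cols = linear_sum_assignment(cost)  # optimal one-to-one assignment
for e, a in zip(rows, cols):
    print(f"{elements[e]} -> {anchors[a]}")
```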
This paper presents LangAware, a collaborative approach for constructing personalized context for context-aware applications. The need for personalization arises from significant variations in context between individuals, depending on scenarios, devices, and preferences. However, there is often a notable gap between humans and machines in the understanding of how contexts are constructed, as observed in studies of trigger-action programming platforms such as IFTTT. LangAware enables end-users to participate in establishing contextual rules in-situ using natural language. The system leverages large language models (LLMs) to semantically connect low-level sensor detectors to high-level contexts and to provide understandable natural language feedback for effective user involvement. We conducted a user study with 16 participants in real-life settings, which revealed an average success rate of 87.50% for defining contextual rules across 12 varied campus scenarios, typically accomplished within just two modifications. Furthermore, users reported a better understanding of the machine's capabilities after interacting with LangAware.
https://doi.org/10.1145/3586183.3606741
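A minimal sketch of how an LLM might connect natural-language context descriptions to low-level sensor detectors, in the spirit of LangAware but not its implementation. `query_llm` is a hypothetical stand-in for any LLM call, and the detector names are illustrative.

```python
import json

AVAILABLE_DETECTORS = ["location", "ambient_noise_db", "time_of_day", "phone_in_use"]

def build_prompt(user_sentence: str) -> str:
    # Ask the model to ground the user's sentence in the available detectors
    # and to return both a machine-checkable rule and a human-readable summary.
    return (
        "Available detectors: " + ", ".join(AVAILABLE_DETECTORS) + "\n"
        + f'User context: "{user_sentence}"\n'
        + 'Return JSON with keys "conditions" (a list of detector/op/value '
          'objects) and "explanation" (one sentence of feedback for the user).'
    )

def query_llm(prompt: str) -> str:
    """Hypothetical placeholder for an actual LLM call."""
    raise NotImplementedError

def define_context_rule(user_sentence: str) -> dict:
    rule = json.loads(query_llm(build_prompt(user_sentence)))
    # The natural-language explanation is shown back to the user, who can then
    # revise the sentence and re-run (the in-situ modification loop).
    print(rule["explanation"])
    return rule
```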
Companies and organizations rely on behavioral analytics tools like Google Analytics to monitor their digital experiences. Making sense of the data these tools capture, however, requires manual event tagging and filtering---often a tedious process. Prior research approaches have trained machine learning models to automatically tag interaction data, but they draw from fixed digital experience vocabularies which cannot be easily augmented or customized. This paper introduces a novel machine learning feedback loop that generates customized tag predictions for organizations. The approach uses a general experience vocabulary to bootstrap initial tag predictions on interactive Sankey diagrams representing user navigation paths on a digital asset. By interacting with the path visualization, organizations can manually revise predictions. The system leverages this feedback to refine an organization's experience ontology, computing custom word embeddings for each of its terms via vector space refinement algorithms. The updates made to the custom experience ontology and its associated word embeddings result in better event tag predictions for that organization in the future. We conducted a needfinding interview with web analytics professionals to ground our design choices, and present a real-world deployment that demonstrates how, even with just a few training examples, custom tags can be predicted over new data.
https://doi.org/10.1145/3586183.3606715
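A minimal sketch, under assumptions, of the vector-space-refinement idea described above: when an analyst corrects a predicted tag, the organization's custom embedding for that term is pulled toward the confirmed tag's embedding, and future predictions use the refined vectors. The interpolation rule and learning rate are illustrative; the paper's exact refinement algorithm may differ.

```python
import numpy as np

def refine_embedding(term_vec: np.ndarray, confirmed_vec: np.ndarray,
                     alpha: float = 0.3) -> np.ndarray:
    """Pull a term's custom embedding toward the confirmed tag and re-normalize."""
    updated = (1 - alpha) * term_vec + alpha * confirmed_vec
    return updated / np.linalg.norm(updated)

def predict_tag(event_vec: np.ndarray, ontology: dict) -> str:
    """Return the ontology term whose custom embedding is most similar
    (dot product over unit-length vectors) to the interaction event."""
    return max(ontology, key=lambda term: float(event_vec @ ontology[term]))
```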
Human-computer symbiosis is a crucial direction for the development of artificial intelligence. As intelligent systems become increasingly prevalent in our work and personal lives, it is important to develop strategies that support users across physical and virtual environments. While technological advances in personal digital devices, such as personal computers and virtual reality devices, can provide immersive experiences, they can also disrupt users' awareness of their surroundings and heighten the frustration caused by disturbances. In this paper, we propose a joint observation strategy for artificial agents to support users across virtual and physical environments. We introduce a prototype system, neighbor-environment observer (NEO), that utilizes non-invasive sensors to assist users in dealing with disruptions to their immersive experience. System experiments evaluate NEO from different perspectives and demonstrate the effectiveness of the joint observation strategy. A user study is conducted to evaluate its usability. The results show that NEO could lessen users' workload by drawing on learned user preferences. We suggest that the proposed strategy can be applied to various smart home scenarios.
https://doi.org/10.1145/3586183.3606728
Machine learning models have been trained to predict semantic information about user interfaces (UIs) to make apps more accessible, easier to test, and to automate. Currently, most models rely on datasets that are collected and labeled by human crowd-workers, a process that is costly and surprisingly error-prone for certain tasks. For example, it is possible to guess if a UI element is “tappable” from a screenshot (i.e., based on visual signifiers) or from potentially unreliable metadata (e.g., a view hierarchy), but one way to know for certain is to programmatically tap the UI element and observe the effects. We built the Never-ending UI Learner, an app crawler that automatically installs real apps from a mobile app store and crawls them to discover new and challenging training examples to learn from. The Never-ending UI Learner has crawled for more than 5,000 device-hours, performing over half a million actions on 6,000 apps to train three computer vision models for i) tappability prediction, ii) draggability prediction, and iii) screen similarity.
https://doi.org/10.1145/3586183.3606824
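A minimal sketch of the crawling-for-labels idea above, not the paper's pipeline: tap an element programmatically, compare screenshots taken before and after, and derive a tappability label from the observed effect rather than from metadata. `device` and `element` are hypothetical driver objects, and the change threshold is an assumption.

```python
import numpy as np

CHANGE_THRESHOLD = 0.05  # assumed fraction of pixels that must change to count as an effect

def label_tappable(device, element) -> dict:
    """Tap one element on a (hypothetical) device driver and label it from the outcome."""
    before = np.asarray(device.screenshot(), dtype=np.float32) / 255.0
    device.tap(element.center_x, element.center_y)
    after = np.asarray(device.screenshot(), dtype=np.float32) / 255.0
    changed_fraction = float(np.mean(np.abs(after - before) > 0.1))
    return {
        "bounds": element.bounds,
        "tappable": changed_fraction > CHANGE_THRESHOLD,  # label comes from the
    }                                                     # observed effect, not metadata
```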
Mobile apps bring us many conveniences, such as online shopping and communication, but some use malicious designs, called dark patterns, to trick users into doing things that are not in their best interest. Much work has been done to summarize taxonomies of these patterns, and some efforts have tried to mitigate the problems through various techniques. However, these techniques are either time-consuming, not generalisable, or limited to specific patterns. To address these issues, we propose UIGuard, a knowledge-driven system that utilizes computer vision and natural language pattern matching to automatically detect a wide range of dark patterns in mobile UIs. Our system removes the need for manually creating rules for each new UI/app and covers more types of dark patterns with superior performance. In detail, we integrated existing taxonomies into a consistent one, conducted a characteristic analysis, and distilled knowledge from real-world examples and the taxonomy. UIGuard consists of two components: Property Extraction and a Knowledge-Driven Dark Pattern Checker. We collected the first dark pattern dataset, which contains 4,999 benign UIs and 1,353 malicious UIs containing 1,660 dark pattern instances spanning 1,023 mobile apps. Our system achieves superior performance in detecting dark patterns (micro averages: 0.82 precision, 0.77 recall, 0.79 F1 score). A user study involving 58 participants further showed that UIGuard significantly increases users' knowledge of dark patterns. We demonstrate potential use cases of our work, which can benefit different stakeholders and serve as a training tool for raising awareness of dark patterns.
https://doi.org/10.1145/3586183.3606783
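A minimal sketch of a knowledge-driven check in the spirit of UIGuard's Dark Pattern Checker, not the system itself: extracted UI properties are matched against hand-distilled rules. The two rules below (preselection and false hierarchy), along with all property names and thresholds, are illustrative; the real checker covers many more pattern types.

```python
def check_dark_patterns(elements: list[dict]) -> list[dict]:
    """Match extracted UI properties against simple, hand-distilled rules."""
    findings = []
    for el in elements:
        text = el.get("text", "").lower()
        # Preselection: an opt-in style checkbox that is already checked.
        if el.get("type") == "checkbox" and el.get("checked") \
                and any(k in text for k in ("subscribe", "offers", "newsletter")):
            findings.append({"pattern": "preselection", "element": el})
        # False hierarchy: the dismissive option is visually de-emphasized.
        if el.get("type") == "button" and el.get("visual_salience", 1.0) < 0.3 \
                and any(k in text for k in ("no thanks", "maybe later")):
            findings.append({"pattern": "false_hierarchy", "element": el})
    return findings
```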