3. Manipulating Text

会議の名前
UIST 2024
Beyond the Chat: Executable and Verifiable Text-Editing with LLMs
要旨

Conversational interfaces powered by Large Language Models (LLMs) have recently become a popular way to obtain feedback during document editing. However, standard chat-based conversational interfaces cannot explicitly surface the editing changes that they suggest. To give the author more control when editing with an LLM, we present InkSync, an editing interface that suggests executable edits directly within the document being edited. Because LLMs are known to introduce factual errors, Inksync also supports a 3-stage approach to mitigate this risk: Warn authors when a suggested edit introduces new information, help authors Verify the new information's accuracy through external search, and allow a third party to Audit with a-posteriori verification via a trace of all auto-generated content. Two usability studies confirm the effectiveness of InkSync's components when compared to standard LLM-based chat interfaces, leading to more accurate and more efficient editing, and improved user experience.

著者
Philippe Laban
Salesforce Research, New York, New York, United States
Jesse Vig
Salesforce Research, Palo Alto, California, United States
Marti Hearst
UC Berkeley, Berkeley, California, United States
Caiming Xiong
Salesforce, Palo Alto, California, United States
Chien-Sheng Wu
Salesforce AI, Palo Alto, California, United States
論文URL

https://doi.org/10.1145/3654777.3676419

動画
ScriptViz: A Visualization Tool to Aid Scriptwriting based on a Large Movie Database
要旨

Scriptwriters usually rely on their mental visualization to create a vivid story by using their imagination to see, feel, and experience the scenes they are writing. Besides mental visualization, they often refer to existing images or scenes in movies and analyze the visual elements to create a certain mood or atmosphere. In this paper, we develop a new tool, ScriptViz, to provide external visualization based on a large movie database for the screenwriting process. It retrieves reference visuals on the fly based on scripts’ text and dialogue from a large movie database. The tool provides two types of control on visual elements that enable writers to 1) see exactly what they want with fixed visual elements and 2) see variances in uncertain elements. User evaluation among 15 scriptwriters shows that ScriptViz is able to present scriptwriters with consistent yet diverse visual possibilities, aligning closely with their scripts and helping their creation.

著者
Anyi Rao
Stanford University, Stanford, California, United States
Jean-Peïc Chou
Stanford University, Stanford, California, United States
Maneesh Agrawala
Stanford University, Stanford, California, United States
論文URL

https://doi.org/10.1145/3654777.3676402

動画
SkipWriter: LLM-Powered Abbreviated Writing on Tablets
要旨

Large Language Models (LLMs) may offer transformative opportunities for text input, especially for physically demanding modalities like handwriting. We studied a form of abbreviated handwriting by designing, developing, and evaluating a prototype, named SkipWriter, that converts handwritten strokes of a variable-length prefix-based abbreviation (e.g. "ho a y" as handwritten strokes) into the intended full phrase (e.g., "how are you" in the digital format) based on the preceding context. SkipWriter consists of an in-production handwriting recognizer and an LLM fine-tuned on this task. With flexible pen input, SkipWriter allows the user to add and revise prefix strokes when predictions do not match the user's intent. An user evaluation demonstrated a 60% reduction in motor movements with an average speed of 25.78 WPM. We also showed that this reduction is close to the ceiling of our model in an offline simulation.

著者
Zheer Xu
Dartmouth College, Hanover, New Hampshire, United States
Shanqing Cai
Google, Mountain View, California, United States
Mukund Varma T
UC San Diego, La Jolla, California, United States
Subhashini Venugopalan
Google, Mountain View, California, United States
Shumin Zhai
Google, Mountain View, California, United States
論文URL

https://doi.org/10.1145/3654777.3676423

動画
Bluefish: Composing Diagrams with Declarative Relations
要旨

Diagrams are essential tools for problem-solving and communication as they externalize conceptual structures using spatial relationships. But when picking a diagramming framework, users are faced with a dilemma. They can either use a highly expressive but low-level toolkit, whose API does not match their domain-specific concepts, or select a high-level typology, which offers a recognizable vocabulary but supports a limited range of diagrams. To address this gap, we introduce Bluefish: a diagramming framework inspired by component-based user interface (UI) libraries. Bluefish lets users create diagrams using relations: declarative, composable, and extensible diagram fragments that relax the concept of a UI component. Unlike a component, a relation does not have sole ownership over its children nor does it need to fully specify their layout. To render diagrams, Bluefish extends a traditional tree-based scenegraph to a compound graph that captures both hierarchical and adjacent relationships between nodes. To evaluate our system, we construct a diverse example gallery covering many domains including mathematics, physics, computer science, and even cooking. We show that Bluefish's relations are effective declarative primitives for diagrams. Bluefish is open source, and we aim to shape it into both a usable tool and a research platform.

著者
Josh M.. Pollock
Massachusetts Institute of Technology, Cambridge, Massachusetts, United States
Catherine Mei
Massachusetts Institute of Technology, Cambridge, Massachusetts, United States
Grace Huang
Massachusetts Institute of Technology, Cambridge, Massachusetts, United States
Elliot Evans
N/A, Ottawa, Ontario, Canada
Daniel Jackson
MIT, Cambridge, Massachusetts, United States
Arvind Satyanarayan
MIT, Cambridge, Massachusetts, United States
論文URL

https://doi.org/10.1145/3654777.3676465

動画