fix: incorrect soft-timeout implementation & fix hard-timeout follow-up command #6280

xingyaoww · 2025-01-15T00:18:20Z

End-user friendly description of the problem this fixes or functionality that this introduces

Include this change in the Release Notes. If checked, you must provide an end-user friendly description for your change below

Give a summary of what the PR does, explaining any non-trivial design decisions

Previously, we set .blocking = True on every .timeout = assignment. This effectively gave EVERY command a hard timeout of 120 sec (the default value), and the soft timeout was never correctly enabled.

In this PR, we:

- Add two methods, add_default_timeout and add_hard_timeout, to set timeouts more precisely.
- Replace the existing .timeout assignments accordingly.
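A minimal sketch of the distinction, under stated assumptions: the `Action` dataclass below is a hypothetical stand-in rather than the real OpenHands class, the two method names are taken from the description above, and the exact effect of each method on `blocking` is an assumption made for illustration.

```python
from dataclasses import dataclass
from typing import Optional

DEFAULT_TIMEOUT_SEC = 120  # default value mentioned in the PR description


@dataclass
class Action:
    """Hypothetical stand-in for a runtime action; not the real OpenHands class."""

    command: str
    timeout: Optional[float] = None
    blocking: bool = False

    def add_default_timeout(self, value: float = DEFAULT_TIMEOUT_SEC) -> None:
        # Assumption: the default only records an upper bound and leaves
        # `blocking` off, so the soft timeout can still fire.
        if self.timeout is None:
            self.timeout = value

    def add_hard_timeout(self, value: float) -> None:
        # Assumption: an explicit hard timeout marks the action as blocking,
        # so only the hard limit applies.
        self.timeout = value
        self.blocking = True


default_action = Action("ls")
default_action.add_default_timeout()

hard_action = Action("make build")
hard_action.add_hard_timeout(600)
```

The point of the split is that assigning a timeout no longer implies `blocking = True`; only the explicit hard-timeout path does.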

Another major issue arose in the following sequence:

1. The agent runs a long command.
2. That command somehow gets stuck (and exceeds the 120 sec timeout).
3. The agent tries to run the next (unrelated) command (e.g., ls). Because the previous command is NOT killed, the follow-up command gets stuck in the shell and is never executed.

To fix this, the PR adds an error message reminding the agent to kill the previous command properly before continuing:

metadata.suffix = (
    f'\n[Your command "{command}" is NOT executed. '
    f'The previous command was timed out but still running. Above is the output of the previous command. '
    "You may wait longer to see additional output of the previous command by sending empty command '', "
    'send other commands to interact with the current process, '
    'or send keys ("C-c", "C-z", "C-d") to interrupt/kill the previous command before sending your new command.]'
)
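The decision behind that message can be sketched roughly as follows. This is an illustrative model only: `classify_follow_up` and the `is_input` flag are hypothetical names, and how the real runtime tells stdin input apart from a new command is not shown in this description.

```python
INTERRUPT_KEYS = {"C-c", "C-z", "C-d"}  # keys named in the message above


def classify_follow_up(
    command: str, previous_running: bool, is_input: bool = False
) -> str:
    """Decide what to do with a command sent after a hard timeout.

    Hypothetical helper: `is_input` stands in for whatever signal marks the
    text as input to the still-running process.
    """
    if not previous_running:
        return "execute"  # shell is free, run the command normally
    if command == "":
        return "wait"  # empty command: keep waiting for more output
    if is_input or command in INTERRUPT_KEYS:
        return "send_to_process"  # interact with or interrupt the old command
    return "reject"  # new command is NOT executed; attach the suffix message
```

For example, `classify_follow_up("ls", previous_running=True)` yields `"reject"`, which is the case where the suffix message is attached.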

We also add a new test that stress-tests the bash terminal in a loop for:

- Long command output
- A command that triggers the soft timeout
- A command that triggers the long timeout
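The last two cases differ in which limit fires first. A toy model, assuming the soft timeout means "no new output for `soft` seconds" and the hard timeout means "`hard` seconds in total" (both definitions are assumptions; this description does not spell them out). The function name and the timestamp-based interface are hypothetical, chosen so the example runs without a real shell.

```python
from typing import Iterable, Optional


def first_limit(
    output_times: Iterable[float],
    finished_at: Optional[float],
    soft: float,
    hard: float,
) -> str:
    """Return 'completed', 'soft', or 'hard' for a command started at t=0.

    output_times: timestamps (seconds) at which the command printed output.
    finished_at: timestamp at which it exited, or None if it never exits.
    """
    # The wait ends at the hard limit or at command exit, whichever is first.
    end = hard if finished_at is None else min(finished_at, hard)
    last_activity = 0.0
    for t in sorted(output_times):
        if t > end:
            break
        if t - last_activity > soft:
            return "soft"  # silent for longer than the soft limit
        last_activity = t
    if end - last_activity > soft:
        return "soft"
    if finished_at is not None and finished_at <= hard:
        return "completed"
    return "hard"


quick = first_limit([1, 2, 3], finished_at=4, soft=10, hard=120)
silent = first_limit([], finished_at=None, soft=10, hard=120)
chatty = first_limit(range(0, 200, 5), finished_at=None, soft=10, hard=120)
```

Under these assumptions a silent, stuck command trips the soft limit, while a command that keeps printing but never exits runs into the hard limit.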

Link of any specific issues this addresses

#6259

#6218

To run this PR locally, use the following command:

docker run -it --rm   -p 3000:3000   -v /var/run/docker.sock:/var/run/docker.sock   --add-host host.docker.internal:host-gateway   -e SANDBOX_RUNTIME_CONTAINER_IMAGE=docker.all-hands.dev/all-hands-ai/runtime:818bfde-nikolaik   --name openhands-app-818bfde   docker.all-hands.dev/all-hands-ai/openhands:818bfde

openhands/runtime/impl/action_execution/action_execution_client.py

rbren · 2025-01-15T21:01:05Z

openhands/runtime/utils/bash.py

+                    last_pane_output, _ps1_matches
+                )
+                metadata = CmdOutputMetadata()  # No metadata available
+                metadata.suffix = (


this is kind of a hack 🙈

Will the agent get confused if it really did enter y as stdin?

Maybe we need separate actions for running commands and sending stdin

if we are gonna ask the agent to kill commands when they want.. this is probably the easiest thing we can do..

We can also kill the command for the agent (like what we did before), but it kinda adds more constraints in the system

Yep! the agent is able to do other stuff (not get confused) when "y" is entered

enyst · 2025-01-15T23:10:35Z

openhands/runtime/utils/bash.py

+                    "You may wait longer to see additional output of the previous command by sending empty command '', "
+                    'send other commands to interact with the current process, '
+                    'or send keys ("C-c", "C-z", "C-d") to interrupt/kill the previous command before sending your new command.]'
+                )


In my understanding CmdOutputMetadata is a fairly complex BaseModel object that maps the output of ps1, but here we alter its structure and give it a different content, a rather large message for the LLM from us? (a prompt tweak)

Could we think about structuring this situation in some other way? Like, maybe don't save it in the action, and add an attribute to the CmdOutputObservation... 🤔 "instruction", or "error_detail" or "timeout_detail". Idk, but this is an Obs to the new action, and yet it contains deep buried info about the old action? If so, maybe we can surface it, make it super-clear in the obs

Yeah, I think this is really the info that we should show the user. @rbren had concerns earlier about displaying it directly in the terminal, so it should not go into .content, but maybe it makes sense to move the suffix and prefix to the CmdOutputObservation level.

Good point! I think maybe a slightly different perspective is from a client developer / agent developer point of view. How do we define metadata and how easy is it for people to work with it for their purposes?
(I'm not sure why we call it metadata, if it's terminal output, maybe it would be easier to understand if it was, dunno, terminal_output. 😅)

- Create new MetadataSection component for collapsible metadata display - Update Message type to include metadata field - Update chat-slice.ts to store metadata separately from content - Update ChatMessage and Messages components to handle metadata

add bash stress test to debug for #6259

2dd420e

xingyaoww changed the title ~~add bash stress test to debug for #6259~~ [WIP] fix: bash performance issue Jan 15, 2025

xingyaoww added 6 commits January 14, 2025 19:21

fix test

7681a53

add timer for iteration

e63d68f

update

7be2991

increase char per line

56770be

fix soft timeout and cleanup all the timeout set method in the repo

df5cad3

handle case for hard-timeout + unfinished process

81930f0

xingyaoww changed the title ~~[WIP] fix: bash performance issue~~ fix: incorrect soft-timeout implementation & fix hard-timeout follow-up command Jan 15, 2025

xingyaoww marked this pull request as ready for review January 15, 2025 20:54

Merge branch 'main' into xw/stress-bash

e4e992c

rbren reviewed Jan 15, 2025

openhands/runtime/impl/action_execution/action_execution_client.py (outdated, resolved)

rbren reviewed Jan 15, 2025

xingyaoww and others added 5 commits January 15, 2025 16:10

replace set_default_timeout with set_hard_timeout blocking=false

26b7ff1

show metadata table

5f30388

feat: make metadata section expandable in chat messages

3d08473

feat: add expandable metadata table component for terminal output

4cfd006

- Create new MetadataTable component for better metadata display - Update Terminal component to use MetadataTable for JSON metadata - Fix linting in chat-slice.ts

refactor: display metadata directly in terminal instead of using HTML

9e22e01

- Remove MetadataTable component - Update use-terminal hook to format metadata directly in terminal output - Clean up terminal component

enyst reviewed Jan 15, 2025

openhands-agent added 2 commits January 15, 2025 23:17

refactor: simplify metadata display using ExpandableMessage

818bfde

- Remove separate MetadataSection component - Update ExpandableMessage to handle metadata display - Clean up ChatMessage component - Improve metadata styling with border and spacing

All-Hands-AI deleted a comment from baxitfund Jan 16, 2025
