Studio: OS level sandbox for python/bash tool execution (macOS + Linux) by danielhanchen · Pull Request #83 · unslothai/unsloth-staging-1

danielhanchen · 2026-05-17T13:29:39Z

Staging mirror of unslothai#5468

Original PR: unslothai#5468
Author: NilayYadav

This is a staging copy for review and editing. Once finalized, changes will be pushed back to the original PR.

Original description

Summary

Wrap python_exec / bash_exec tool subprocesses in an OS-level sandbox: Seatbelt (sandbox-exec) on macOS, bubblewrap (bwrap) on Linux.
Deny network access outright; restrict reads to the OS surface, the Python runtime, and the per-session workdir; restrict writes to the workdir.
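
The Linux half of the policy above could be assembled roughly as follows. This is a minimal sketch, not the code in sandbox.py: the function name build_bwrap_command and the default list of read-only directories are assumptions made for illustration, and only well-known bwrap options are used.

```python
import os


def build_bwrap_command(workdir, argv,
                        read_only_dirs=("/usr", "/bin", "/lib", "/lib64", "/etc")):
    """Return a bwrap argv that runs `argv` without network access.

    Hypothetical helper: the name and the default directory list are
    illustrative and are not taken from the PR.
    """
    # No network namespace sharing; the child dies if the parent exits.
    cmd = ["bwrap", "--unshare-net", "--die-with-parent", "--new-session"]
    # Read-only view of the OS surface, skipping paths this host lacks.
    for path in read_only_dirs:
        if os.path.exists(path):
            cmd += ["--ro-bind", path, path]
    # Fresh /proc, a minimal /dev, and a private /tmp.
    cmd += ["--proc", "/proc", "--dev", "/dev", "--tmpfs", "/tmp"]
    # The per-session workdir is the only writable bind.
    cmd += ["--bind", workdir, workdir, "--chdir", workdir]
    return cmd + list(argv)
```

The returned list can be passed to subprocess.run directly. A real policy would also need to bind the Python runtime that the tool uses, which this sketch leaves out.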

Why

The python and bash tools previously ran as plain subprocesses, so a tool call could read anywhere the Studio user could read, write outside the workdir, and reach the network. Seatbelt ships with macOS, and bubblewrap is a one-package install on every major Linux distro.
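
Because the sandbox depends on a platform binary being present, availability has to be detected at runtime. One possible shape for that check is sketched below. The function name detect_sandbox and the "not found" labels are assumptions; only the fallback label is taken from the diff in this PR.

```python
import functools
import shutil
import sys


@functools.lru_cache(maxsize=1)
def detect_sandbox():
    """Return (available, label) for this platform; the result is cached.

    Hypothetical helper, not the PR's implementation.
    """
    if sys.platform == "darwin":
        found = shutil.which("sandbox-exec") is not None
        return found, "seatbelt (sandbox-exec)" if found else "sandbox-exec not found"
    if sys.platform.startswith("linux"):
        found = shutil.which("bwrap") is not None
        return found, "bubblewrap (bwrap)" if found else "bwrap not found"
    # Fallback label as it appears in the PR diff.
    return False, "no sandbox primitive for this platform"
```

A binary on PATH is not proof that it can actually create namespaces on a given host, so a stricter check might also run a trivial command through it.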

Testing

pytest -q studio/backend/tests/test_sandbox.py
Manual, macOS (python tool): read of ~/Desktop is denied, a urllib.request call raises, write inside the workdir succeeds
Manual, Linux (Ubuntu 24.04, apt install bubblewrap; python tool): ls / shows only the bound dirs (bin sbin usr lib lib64 etc proc dev tmp home), and DNS resolution fails
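
The manual checks above could be scripted as small probes that are run through the python tool. This is a sketch under assumptions: the helper names can_read, can_write, and can_reach are hypothetical and do not come from test_sandbox.py.

```python
import os
import tempfile
import urllib.error
import urllib.request


def can_read(path):
    """True if a directory can be listed or a file's first byte read."""
    try:
        if os.path.isdir(path):
            os.listdir(path)
        else:
            with open(path, "rb") as fh:
                fh.read(1)
        return True
    except OSError:
        return False


def can_write(directory):
    """True if a temporary file can be created and removed in `directory`."""
    try:
        fd, name = tempfile.mkstemp(dir=directory)
        os.close(fd)
        os.remove(name)
        return True
    except OSError:
        return False


def can_reach(url, timeout=3.0):
    """True if an HTTP request to `url` gets any response at all."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        return True  # an HTTP error status still means the network answered
    except OSError:
        return False  # URLError and socket errors are OSError subclasses
```

Inside a working sandbox, the expectation from the manual checks would be that can_read on a path such as ~/Desktop and can_reach both return False, while can_write on the workdir returns True.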

This PR tracks the moving review branch (pr-5468-head). Iteration fix commits land here directly. Review-added tests are in a separate PR.

Changed files:

.github/workflows/consolidated-tests-ci.yml
.github/workflows/lint-ci.yml
.github/workflows/mlx-ci.yml
.github/workflows/notebooks-ci.yml
.github/workflows/release-desktop.yml
.github/workflows/security-audit.yml
.github/workflows/stale.yml
.github/workflows/studio-api-smoke.yml
.github/workflows/studio-backend-ci.yml
.github/workflows/studio-frontend-ci.yml
.github/workflows/studio-inference-smoke.yml
.github/workflows/studio-mac-api-smoke.yml
.github/workflows/studio-mac-inference-smoke.yml
.github/workflows/studio-mac-ui-smoke.yml
.github/workflows/studio-mac-update-smoke.yml
.github/workflows/studio-tauri-smoke.yml
.github/workflows/studio-ui-smoke.yml
.github/workflows/studio-update-smoke.yml
.github/workflows/studio-windows-api-smoke.yml
.github/workflows/studio-windows-inference-smoke.yml
.github/workflows/studio-windows-ui-smoke.yml
.github/workflows/studio-windows-update-smoke.yml
.github/workflows/version-compat-ci.yml
.github/workflows/wheel-smoke.yml
studio/backend/core/inference/sandbox.py
studio/backend/core/inference/tools.py
studio/backend/run.py
studio/backend/startup_banner.py
studio/backend/tests/test_sandbox.py

danielhanchen · 2026-05-17T13:33:37Z

/gemini review

gemini-code-assist

Code Review

This pull request implements an OS-level sandbox for tool execution on macOS and Linux using Seatbelt and bubblewrap, integrating it into the Python and Bash execution paths. It also adds RTL language support to chat composers via the dir="auto" attribute, cleans up legacy CI environment variables, and introduces comprehensive tests for sandbox enforcement and multilingual input. Review feedback identified a potential AttributeError in sys.modules iteration and suggested a more robust approach for managing bound destinations in the Linux sandbox configuration.

gemini-code-assist · 2026-05-17T13:43:59Z

+    for name, mod in list(sys.modules.items()):
+        if not (name.startswith("__editable___") and name.endswith("_finder")):
+            continue


The sys.modules dictionary can contain None values (representing negative caching). Accessing attributes on a NoneType object will raise an AttributeError. It is safer to check if mod is not None before proceeding.

Suggested change

-    for name, mod in list(sys.modules.items()):
-        if not (name.startswith("__editable___") and name.endswith("_finder")):
-            continue
+    for name, mod in list(sys.modules.items()):
+        if mod is None or not (name.startswith("__editable___") and name.endswith("_finder")):
+            continue
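
The guard suggested above can be exercised in isolation. The sketch below takes the module table as a parameter so that it can be tried against a fake dictionary. The function name editable_finder_mappings and the MAPPING attribute it reads are assumptions for illustration, since the PR's loop body is not shown here.

```python
import types


def editable_finder_mappings(modules):
    """Collect mappings from editable-install finder modules.

    `modules` has the shape of sys.modules. Entries whose value is None are
    skipped, so no attribute is ever looked up on None.
    """
    mappings = {}
    for name, mod in list(modules.items()):
        if mod is None or not (name.startswith("__editable___") and name.endswith("_finder")):
            continue
        # MAPPING is an assumed attribute name; missing values are tolerated.
        mappings.update(getattr(mod, "MAPPING", None) or {})
    return mappings


# A fake module table with a None entry that matches the name pattern.
fake_modules = {
    "__editable___broken_finder": None,
    "__editable___pkg_finder": types.SimpleNamespace(MAPPING={"pkg": "/src/pkg"}),
    "json": types.SimpleNamespace(),
}
result = editable_finder_mappings(fake_modules)
```

In real use the function would be called with sys.modules. The list(...) copy is kept so that imports triggered during iteration cannot change the dictionary's size mid-loop.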

gemini-code-assist · 2026-05-17T13:43:59Z

+    bound_dests = [
+        args[i + 2]
+        for i, arg in enumerate(args)
+        if arg in bind_flags and i + 2 < len(args)
+    ]


This logic for extracting bound destinations assumes that every flag in bind_flags is followed by exactly two arguments (source and destination). While this is true for bwrap's bind flags, it makes the code fragile if the structure of args changes or if other flags are added. Consider a more robust way to track bound destinations as they are added to the args list.
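
One way to follow this suggestion is to record each destination at the moment its bind is appended, instead of recovering it later by position. The sketch below is an assumption about how that might look: the class BwrapArgs and its method names are hypothetical and are not part of the PR.

```python
class BwrapArgs:
    """Accumulates bwrap arguments and records bind destinations as they are added."""

    # bwrap flags that take a SRC and a DEST argument.
    BIND_FLAGS = ("--bind", "--ro-bind", "--dev-bind")

    def __init__(self):
        self.args = []
        self.bound_dests = []

    def bind(self, flag, src, dest):
        """Append a two-argument bind flag and remember its destination."""
        if flag not in self.BIND_FLAGS:
            raise ValueError(f"not a bind flag: {flag}")
        self.args += [flag, src, dest]
        self.bound_dests.append(dest)

    def option(self, *parts):
        """Append any other flag together with its arguments."""
        self.args += list(parts)
```

With this shape, adding a flag of a different arity later does not affect bound_dests, because nothing is inferred from offsets in args.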

+        ok = False
+        label = "no sandbox primitive for this platform"
+
+    _sandbox_available_cache = ok


+            )
+            try:
+                page.wait_for_load_state("networkidle", timeout = 30_000)
+            except Exception:


+    step("wait for composer to mount")
+    try:
+        page.wait_for_load_state("networkidle", timeout = 30_000)
+    except Exception:


+            )
+            try:
+                shoot(f"02-composer-wait-attempt-{_mount_attempt + 1}-fail")
+            except Exception:


+            composer.element_handle(),
+            timeout = 5_000,
+        )
+    except Exception:


+                if os.path.islink(prefix):
+                    seen_links.add(prefix)
+                    out.append(prefix)
+            except OSError:


+    """
+    secret = f"SECRET-{uuid.uuid4().hex}"
+    path = os.path.expanduser(f"~/.studio_sandbox_test_{uuid.uuid4().hex}.txt")
+    Path(path).write_text(secret)


NilayYadav and others added 7 commits May 16, 2026 02:42

sandbox for mac/linux

e4a7a89

refactor

c69b865

sandbox unavailable banner

f73f290

[pre-commit.ci] auto fixes from pre-commit.com hooks

10b1544

for more information, see https://pre-commit.ci

improvements and bwrap availability

9542c20

Merge branch 'main' into main

455ed0e

Scrub .github/workflows for staging push (matches staging base)

8a10797

gemini-code-assist Bot reviewed May 17, 2026

github-code-quality Bot found potential problems May 17, 2026

github-advanced-security AI found potential problems May 17, 2026

Comment thread studio/backend/tests/test_sandbox.py

    """
    secret = f"SECRET-{uuid.uuid4().hex}"
    path = os.path.expanduser(f"~/.studio_sandbox_test_{uuid.uuid4().hex}.txt")
    Path(path).write_text(secret)

danielhanchen force-pushed the main branch from e128c6f to 1555c15 Compare May 18, 2026 03:46

danielhanchen force-pushed the main branch from 9f47625 to b9dd7cf Compare June 7, 2026 10:11

danielhanchen wants to merge 7 commits into main from pr-5468-head

3 participants
