ServiceNow

company

Verified

https://www.servicenow.com/

AI & ML interests

None defined yet.

Recent Activity

aldro61 updated a dataset about 1 hour ago

ServiceNow/WorkArena-Instances

spandanagella authored a paper 2 days ago

VectorGym: A Multitask Benchmark for SVG Code Generation, Sketching, and Editing

spandanagella authored a paper 2 days ago

Mem-$π$: Adaptive Memory through Learning When and What to Generate

View all activity

Papers

Apriel-Reasoner: RL Post-Training for General-Purpose and Efficient Reasoning

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

View all Papers

Articles

MosaicLeaks: Can your research agent keep a secret?

PipelineRL

updated a dataset about 1 hour ago

ServiceNow/WorkArena-Instances

Viewer • Updated 8 minutes ago • 6 • 24.2k • 5

authored 4 papers 2 days ago

VectorGym: A Multitask Benchmark for SVG Code Generation, Sketching, and Editing

Paper • 2603.29852 • Published Feb 22 • 6

Mem-$π$: Adaptive Memory through Learning When and What to Generate

Paper • 2605.21463 • Published May 20 • 8

Toward Open Weight Models Without Risks: Separating Public and Private Capabilities in LLMs

Paper • 2606.21638 • Published 11 days ago • 7

PrivacyAlign: Contextual Privacy Alignment for LLM Agents

Paper • 2606.21710 • Published 11 days ago • 3

gdar

published a dataset 7 days ago

ServiceNow/PrivacyAlign

Viewer • Updated 12 days ago • 1.35k • 56 • 2

published an article 11 days ago

Article

MosaicLeaks: Can your research agent keep a secret?

ServiceNow

•

11 days ago

• 13

updated a dataset 12 days ago

ServiceNow/PrivacyAlign

Viewer • Updated 12 days ago • 1.35k • 56 • 2

authored a paper about 1 month ago

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Paper • 2605.13841 • Published May 13 • 75

authored 6 papers about 1 month ago

Developing Safe and Responsible Large Language Models -- A Comprehensive Framework

Paper • 2404.01399 • Published Apr 1, 2024 • 1

DNA Bench: When Silence is Smarter -- Benchmarking Over-Reasoning in Reasoning LLMs

Paper • 2503.15793 • Published Mar 20, 2025

Cats Confuse Reasoning LLM: Query Agnostic Adversarial Triggers for Reasoning Models

Paper • 2503.01781 • Published Mar 3, 2025 • 2

Apriel-Nemotron-15B-Thinker

Paper • 2508.10948 • Published Aug 13, 2025 • 6

AU-Harness: An Open-Source Toolkit for Holistic Evaluation of Audio LLMs

Paper • 2509.08031 • Published Sep 9, 2025 • 21

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Paper • 2605.13841 • Published May 13 • 75

authored a paper about 1 month ago

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Paper • 2605.13841 • Published May 13 • 75

submitted a paper to Daily Papers about 2 months ago

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Paper • 2605.13841 • Published May 13 • 75

authored a paper about 2 months ago

Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics

Paper • 2605.12178 • Published May 12 • 65

authored a paper about 2 months ago

Scope: Selective Cross-modal Orchestration of Visual Perception Experts

Paper • 2510.12974 • Published Oct 14, 2025

authored a paper about 2 months ago

Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics

Paper • 2605.12178 • Published May 12 • 65