ABOUT VAULTBOOK

We believe your notes
belong to you.

VaultBook is built by a team of 50 AI and data professionals in San Francisco who think the best software runs on your device, respects your privacy, and gets out of your way.

The story

VaultBook started with a frustration that most knowledge workers share: too many tools, too many tabs, and not a single place where notes, files, and research could live together — searchable, private, and fast. Every solution we tried either locked data in a cloud we didn't control, charged a recurring subscription for basic features, or collapsed under the weight of a growing library.

So we built the tool we wanted. A workspace that ships as a single HTML file, stores everything in a folder you choose, runs entirely offline, and packs in the intelligence — weighted search, OCR, related entries, and AI suggestions — that makes a large library usable instead of overwhelming. No servers, no accounts, no compromises.

We wanted a workspace smart enough to surface what matters, private enough to trust with everything, and simple enough to ship in one file.

— The VaultBook founding team, San Francisco 2020

Today VaultBook is used by researchers, engineers, analysts, legal teams, clinicians, journalists, and students — anyone who needs a serious workspace that respects their data and rewards organization. Our team of 50 designs, builds, and ships every feature from our headquarters in San Francisco.

What we believe

Six principles that shape every decision we make.

🔒

Privacy by architecture

Your data never leaves your device. We don't have servers to breach, accounts to compromise, or telemetry to leak. Privacy isn't a policy — it's the design.

⚡

Offline first, always

VaultBook works without Wi-Fi, without a login, and without a backend. If your browser can open an HTML file, VaultBook runs. Period.

🧠

Intelligence, locally

AI suggestions, weighted search, OCR, and vote-based learning all run in the browser. Smart doesn't have to mean connected.

🎯

Ship substance, not features

We don't chase feature counts. Every capability earns its place by solving a real problem for real workflows.

🤝

Respect the user

No dark patterns, no upsell pop-ups, no artificial limits on your data. We build software we'd want to use ourselves.

📦

Portable and transparent

Your notes are stored as plain JSON and Markdown files in a folder you control. No lock-in, no proprietary formats, no migration headaches.

VaultBook by the numbers

Team members

Headquarters — San Francisco

12+

Built-in tools

Cloud dependencies

The team behind VaultBook

50 specialists — here's what each one builds inside VaultBook.

Our team brings together deep expertise in machine learning, natural language processing, data engineering, information security, and product design. We hire people who care about craft, ship fast, and believe that great software should disappear into the workflow.

Principal ML Engineer

Architects the weighted scoring system behind QA search — titles at weight 8, labels at 6, OCR text at 5, body at 4.

NLP Research Lead

Designs the natural-language query pipeline that lets users ask questions across their entire library in plain English.

Data Platform Architect

Built the File System Access API layer — repository.json, sidecar .md files, and the attachment index manifest.

Sr. Search Engineer

Implements typeahead search, query history suggestions, and the warm-up system that pre-loads text for top 12 results.

Computer Vision Engineer

Owns inline OCR — automatic text extraction from images embedded in entries, cached per-item for instant search.

Sr. Data Scientist

Tunes the personalized relevance distribution in the AI Suggestions pager, learning patterns from reading history.

Information Security Lead

Designed AES-256-GCM per-entry encryption with PBKDF2 key derivation at 100,000 iterations and random salt/IV per encrypt.

ML Infrastructure Engineer

Runs all AI scoring and similarity calculations client-side in the browser — zero server calls, zero data leakage.

Cryptography Engineer

Implements the session password cache, lock screen blur overlay, and per-entry decryption held in memory only.

Data Engineering Lead

Built the deep indexing pipeline — text extraction from XLSX, PPTX, PDF, ZIP, and MSG files for search.

Recommendation Systems

Powers Related Entries — contextual similarity suggestions with Reddit-style upvote/downvote to train relevance over time.

Sr. Frontend Engineer

Builds the rich text editor — tables, code blocks, callout blocks, case transforms, headings H1–H6, inline images.

AI Product Manager

Shapes the 4-page AI Suggestions carousel: upcoming scheduled entries, weekday reading patterns, recent files, recent tools.

Applied AI Researcher

Develops vote-based learning — persistent upvote/downvote scores that influence search result and related entry ordering.

Knowledge Graph Engineer

Maps relationships between entries via labels, hashtags, and page paths to power Kanban Board auto-bucketing.

Data Quality Lead

Maintains the attachment reindex system and file integrity checks across the /attachments directory and index.txt manifest.

MLOps Engineer

Optimizes client-side model loading so AI Suggestions, QA search, and OCR run smoothly even on modest hardware.

OCR & Document AI

Extracts text from images inside DOCX, XLSX, ZIP archives, and renders scanned PDF pages for full searchability.

Embedding Systems Engineer

Generates and indexes content embeddings that drive similarity scoring for the Related Entries feature.

Ranking & Relevance

Tunes the multi-field scoring weights and vote-based reranking that float the best QA results to the top.

Sr. Data Analyst

Built the analytics dashboard — entry counts, storage metrics, label utilization charts, and 14-day activity graphs.

Backend Engineer

Implements the save system — autosave with dirty flags, debouncing, concurrent write guards, and sync status badges.

UX Research Lead

Runs usability studies that shaped the sidebar time tabs — Recent, Due, Expiring, and Tools quick-access panels.

Performance Engineer

Optimizes pagination, lazy attachment loading, and background OCR warm-up to keep large libraries responsive.

AI Ethics & Compliance

Ensures all AI features run locally with zero telemetry — no usage data, no model training on user content, ever.

Data Pipeline Engineer

Handles the background attachment text extraction pipeline that indexes content from text-based files and images.

Full-Stack Engineer

Ships the built-in tools — File Analyzer, PDF Merge & Split, MP3 Cutter, Password Generator, and Folder Analyzer.

Semantic Search Specialist

Develops smart label suggestions — content analysis that recommends relevant tags with pastel-styled suggestion chips.

Technical Writer

Writes the storage tutorial, update banners, FAQ, and every tooltip that guides first-time users through VaultBook.

QA & Test Automation

Tests across Chrome, Edge, Arc, and Brave on Mac, Windows, and Linux — every release, every platform, every time.

Product Designer

Designed the frosted-glass UI, sidebar toggle, floating action button, and the responsive layout that stacks at 900px.

Data Governance Analyst

Defines the data model — items, pages, sections, labels, pagePaths, and the userVotes schema for persistent learning.

Analytics Engineer

Builds the canvas-rendered charts — label pie charts, page utilization, monthly activity, and attachment type breakdowns.

AI Training Specialist

Curates the vote-based feedback loop where upvotes (+1M) and downvotes (-1M) persistently reshape search ordering.

DevOps Engineer

Manages the build pipeline that compiles VaultBook into a single deployable HTML file with zero external dependencies.

Annotation & Labeling Lead

Develops the color-coded label system, inline hashtag parsing, and the multi-select tag interface in the edit modal.

Deep Learning Engineer

Trains the models behind contextual Related Entries similarity — the engine that surfaces notes you didn't know you needed.

Privacy Engineer

Audits every code path to ensure zero network calls — no analytics beacons, no font CDNs, no external requests of any kind.

Data Visualization Lead

Designs the strength metric pills, expandable analytics details, and the file type breakdown chips in the analytics panel.

Interaction Designer

Crafts the micro-interactions — smooth transitions, hover states, close confirmation dialogs, and accordion section UX.

Feature Extraction Engineer

Parses structured content from PPTX slides via JSZip and XLSX sheets via SheetJS for deep searchable indexing.

ML Research Scientist

Researches and improves the personalized relevance distribution that powers the AI Suggestions reading patterns.

Cloud-to-Edge Specialist

Ensures every AI and search feature runs entirely in-browser — proving that cloud-grade intelligence works at the edge.

Accessibility Engineer

Makes VaultBook keyboard-navigable and screen-reader friendly — ARIA labels, focus states, and sr-only utility classes.

Content Strategist

Shapes the Threads tool, RSS Reader folder structure, and Save URL → Entry workflow that turns web pages into notes.

🏙️

Made in San Francisco

Our entire team works from a single office in San Francisco. We chose the city for its unmatched depth in AI and data science talent, the density of builders who obsess over craft, and a culture that rewards shipping real products over hype. From our windows, the Bay is always in view, and we like to think VaultBook reflects San Francisco at its best: curious, practical, relentlessly ambitious, and built to last.

Our journey

Key moments that shaped VaultBook.

2020

The first commit

VaultBook started as a personal tool — a single HTML file to organize research notes offline. The core concepts of local-first storage and single-file deployment were born here.

2021

Rich editing & encryption

The rich text editor, per-entry AES-256-GCM encryption, hierarchical pages, and the File System Access API storage layer turned VaultBook into a real workspace.

2022

AI search goes live

Weighted search scoring, QA mode, and vote-based reranking turned VaultBook from a note-taker into a searchable knowledge base. OCR followed shortly after.

2023

Team grows to 50

We hired across AI research, data engineering, security, and design. The San Francisco office became home to a team that ships features weekly.

2024

12+ built-in tools

Kanban, PDF tools, RSS reader, file analyzers, media explorer, and more — all built directly into VaultBook so users never need to leave the workspace.

2025

Deep indexing & version history

Pro gained deep file indexing for XLSX, PPTX, PDF, ZIP, and MSG files, OCR inside documents and archives, and 60-day version history with per-entry snapshots.

2026

VaultBook 3.0

The latest release brings the AI Suggestions pager, timetable views, analytics dashboards, version history, and a refined UI — our most ambitious update yet.

Come build with us

Try VaultBook, or reach out if you'd like to learn more about the team.

Get started Get in touch

We believe your notesbelong to you.

We believe your notes
belong to you.