Node SDK

The official @datablue/sdk package is a typed async SDK for Node.js, TypeScript, and modern JavaScript runtimes. It defaults to https://api.datablue.dev and sends your API key as a Bearer token.

Install

npm install @datablue/sdk

Scrape

import { DataBlue } from "@datablue/sdk";

const client = new DataBlue({ apiKey: "wh_your_api_key" });

const result = await client.scrape("https://example.com", {
  formats: ["markdown", "links"],
});

console.log(result.data?.markdown);

Extract with Gemini

const result = await client.scrape("https://example.com/pricing", {
  formats: ["markdown"],
  extract: {
    provider: "gemini",
    prompt: "Extract pricing tiers",
    schema: {
      type: "object",
      properties: {
        tiers: {
          type: "array",
          items: {
            type: "object",
            properties: {
              name: { type: "string" },
              price: { type: "string" },
            },
          },
        },
      },
    },
  },
});

console.log(result.data?.extract);

Crawl and Map

const crawl = await client.crawl("https://docs.example.com", {
  maxPages: 20,
  maxDepth: 2,
  scrapeOptions: { formats: ["markdown"] },
});

const urls = await client.mapUrl("https://docs.example.com", {
  limit: 100,
});

console.log(crawl.status, urls.links.length);

Configuration

Option Default Description
apiUrlhttps://api.datablue.devBase URL of the DataBlue API
apiKeyAPI key with wh_ prefix
timeout60000Request timeout in milliseconds
maxRetries3Max retries on transient network and server errors