Web Harvesting Services - Library of Congress

SOL #: 030ADV26R0010Combined Synopsis/Solicitation

Overview

Buyer

Library Of Congress
Library Of Congress
CONTRACTS SERVICES
Washington, DC, 20540, United States

Place of Performance

Washington, DC

NAICS

Computing Infrastructure Providers (518210)

PSC

Cloud Solutions Delivered As A Service. (DK10)

Set Aside

No set aside specified

Timeline

1
Posted
Jan 20, 2026
2
Last Updated
Feb 25, 2026
3
Submission Deadline
Mar 3, 2026, 10:00 PM

Qualification Details

Fit reasons
  • NAICS alignment with historical contract wins in similar service areas.
  • Scope strongly matches core technical capabilities and delivery model.
Risks
  • Past performance thresholds may require one additional teaming partner.
  • Potential clarification needed on staffing minimums before bid/no-bid.
Next steps

Validate eligibility requirements, assign capture owner, and schedule partner outreach to confirm teaming strategy before submission planning.

Quick Summary

The Library of Congress is seeking proposals for Web Harvesting Services under an Indefinite-Delivery, Indefinite-Quantity (IDIQ) contract with firm-fixed-price task orders. This opportunity aims to procure systematic, at-scale web content harvesting, including temporary access, crawl reports, and content transfer for preservation and public access. The contract has an estimated value between $300,000 and $15,000,000. Proposals are due March 3, 2026, at 5:00 PM Eastern Time.

Purpose & Scope

The Library of Congress requires contractor support to harvest web content based on staff instructions, provide temporary access for quality review, and enable content transfer to the Library for preservation. The scope includes capturing content from Library-provided seed lists at various frequencies (weekly, monthly, extended, and specific US Election 2026 crawls). Anticipated crawl volume ranges from 300-700 Terabytes (TB) per year. Deduplication is required, and robots.txt files are generally ignored.

Contract Details

  • Contract Type: Indefinite-Delivery, Indefinite-Quantity (IDIQ) with Firm-Fixed-Price Task Orders.
  • Period of Performance: Base period from June 1, 2026, to May 31, 2031 (5 years).
  • Estimated Value: Minimum order of $300,000.00; Maximum order of $15,000,000.00.
  • Place of Performance: Contractor's own facilities.
  • Set-Aside: Unrestricted.
  • Product/Service Code: DK10 (Cloud Solutions Delivered As A Service).

Key Requirements

Contractors must provide:

  • Web Content Harvesting: Perform crawls using Library specifications, seed lists, and scoping instructions, capturing various digital objects into valid WARC files (ISO 28500_2017).
  • Data Packaging & Transfer: Package captured content into WARC files with CDX indexes for transfer to the Library's S3 bucket via secure internet (HTTPS).
  • Quality Review & Reporting: Provide an access tool for Library staff review and generate detailed crawl reports (ASCII text and XML) within 5 days of completion. Develop and maintain a Quality Control Program (QCP).
  • Infrastructure & Security: Utilize US-based servers, maintain reliable and secure data storage, and adhere to strict information security policies, including restrictions on Generative AI use.
  • Key Personnel: Provide qualified Program Manager/Alternate, Crawl Engineer, and Quality Assurance Lead.

Submission & Evaluation

  • Proposal Submission: Electronically via email to both the Contracting Officer (jzwa@loc.gov) and Contract Specialist (cdaly@loc.gov). Total email attachment size not to exceed 20MB. Proposals must be valid through June 6, 2026.
  • Proposal Content: Must include four volumes: Technical Approach (including a sample web crawl), Corporate Experience and Capabilities, Past Performance (using Attachment J3), and Price (using Attachment J4).
  • Evaluation Criteria: Best-Value Trade-Off (BVTO) approach. Factors in descending order of importance: Technical Approach, Corporate Experience and Capabilities, Past Performance, and Price. Non-price factors combined are significantly more or equally important to price. The Library may award without discussions.

Important Dates

  • Questions Due Date: February 2, 2026, at 12:00 PM EST (Answers provided in Attachment J9).
  • Sample Web Crawl Transfer Information Due: February 17, 2026, at 5:00 PM Eastern Time.
  • Past Performance Questionnaires (PPQs) Due: February 24, 2026, at 12:00 PM Eastern Time (must be sent directly by references).
  • Proposal Due Date: March 3, 2026, at 5:00 PM Eastern Time.

Contact Information

People

Points of Contact

Colleen DalyPRIMARY
Jennifer ZwahlenSECONDARY

Files

Files

Download
Download
Download
Download
Download
Download
Download
Download
Download
Download
Download
Download
Download
Download
Download
Download
Download
Download
Download

Versions

Version 8
Combined Synopsis/Solicitation
Posted: Feb 25, 2026
View
Version 7
Combined Synopsis/Solicitation
Posted: Feb 19, 2026
View
Version 6Viewing
Combined Synopsis/Solicitation
Posted: Feb 18, 2026
Version 5
Combined Synopsis/Solicitation
Posted: Feb 13, 2026
View
Version 4
Combined Synopsis/Solicitation
Posted: Feb 3, 2026
View
Version 3
Combined Synopsis/Solicitation
Posted: Jan 28, 2026
View
Version 2
Combined Synopsis/Solicitation
Posted: Jan 23, 2026
View
Version 1
Combined Synopsis/Solicitation
Posted: Jan 20, 2026
View
Web Harvesting Services - Library of Congress | GovScope