bless-crawl plugin support #14
Pull Request Overview
This pull request adds support for a new distributed web scraping plugin called BlessCrawl. It introduces the BlessCrawl implementation in Rust, a JavaScript wrapper, runtime integration updates, documentation enhancements, and configuration changes.
- New Rust module and JavaScript binding for BlessCrawl.
- Runtime initialization and Cargo.toml updates for plugin integration.
- Documentation and example updates to illustrate usage.
Reviewed Changes
Copilot reviewed 7 out of 8 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| src/lib.rs | Initializes and exposes the BlessCrawl plugin in the global context. |
| src/crawl/mod.rs | Implements scraping, mapping, and crawling functionality. |
| src/crawl/crawl.js | Provides a JavaScript class wrapper for BlessCrawl. |
| examples/crawl.js | Demonstrates example usage of the BlessCrawl plugin. |
| README.md | Updates to documentation and architecture diagrams. |
| Cargo.toml | Adds the crawl feature to the default features and applies a dependency patch. |
| .github/workflows/release.yml | Updates the build matrix to include the crawl feature. |
Description
This pull request introduces a new plugin, BlessCrawl, for distributed web scraping. It includes changes to the codebase, configuration, and documentation to integrate and support this new functionality.
The most important changes are grouped into three themes: feature integration, configuration updates, and documentation enhancements.
Depends on: blocklessnetwork/sdk-rust#23
Feature Integration:
- Added the BlessCrawl plugin, including its implementation in Rust (src/crawl/mod.rs) and JavaScript wrapper (src/crawl/crawl.js). The plugin supports scraping webpages, mapping links, and crawling websites with customizable options. [1] [2]
- Added the BlessCrawl plugin to the runtime by modifying the initialize_runtime function in src/lib.rs. This includes binding the plugin to the global context and evaluating its JavaScript module. [1] [2]
- Added examples/crawl.js, demonstrating the usage of the BlessCrawl plugin with various operations such as scraping, mapping, and crawling.
Configuration Updates:
- Updated Cargo.toml to include the crawl feature in the default features and added a patch for the blockless-sdk crate.
- Updated the release workflow (.github/workflows/release.yml) to include the crawl feature in the build matrix.
Documentation Enhancements:
- Updated README.md to document the BlessCrawl plugin, including its purpose, integration into the architecture diagram, and a new example command for compiling JavaScript using the plugin. [1] [2] [3] [4] [5]
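For readers unfamiliar with the pattern, the runtime integration described here (a native binding on the global context plus a JavaScript class wrapper) might look roughly like the sketch below. It is illustrative only and is not the code in src/crawl/crawl.js: the binding name `__nativeCrawl`, the method names `scrape`, `map`, and `crawl`, the JSON string return format, and the `timeout` and `format` options are all assumptions made for this example. The native side is replaced by a stub so that the sketch runs on its own.

```javascript
// Hypothetical stand-in for the native binding that the Rust side would
// attach to the global context. The real binding's name and shape are not
// given in this PR description; this stub just echoes its arguments.
globalThis.__nativeCrawl = {
  scrape: (url, options) => JSON.stringify({ op: "scrape", url, options }),
  map: (url, options) => JSON.stringify({ op: "map", url, options }),
  crawl: (url, options) => JSON.stringify({ op: "crawl", url, options }),
};

// Illustrative wrapper class: validates the URL, merges per-call options
// over instance defaults, and delegates to the (assumed) native binding.
class BlessCrawl {
  constructor(defaults = {}) {
    this.defaults = defaults;
  }

  _call(op, url, options = {}) {
    if (typeof url !== "string" || url.length === 0) {
      throw new TypeError("url must be a non-empty string");
    }
    const merged = { ...this.defaults, ...options };
    // Assumption: the binding returns a JSON string that the wrapper parses.
    return JSON.parse(globalThis.__nativeCrawl[op](url, merged));
  }

  scrape(url, options) { return this._call("scrape", url, options); }
  map(url, options) { return this._call("map", url, options); }
  crawl(url, options) { return this._call("crawl", url, options); }
}

// Expose the class globally, mirroring "binding the plugin to the global context".
globalThis.BlessCrawl = BlessCrawl;

// Usage with hypothetical option names.
const crawler = new BlessCrawl({ timeout: 1000 });
const page = crawler.scrape("https://example.com", { format: "markdown" });
console.log(page.op, page.url);
```

Keeping validation and option merging in the JavaScript layer leaves the native surface small, which is one plausible reason to pair a Rust module with a thin class wrapper.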