@valentin-naboka valentin-naboka commented Dec 28, 2025

Description

Add a new LlamaIndex reader that uses the Massive proxy network with Playwright browser automation for web scraping with geotargeting support.

Features:

  • Country, city, ZIP code, and ASN geotargeting
  • Device type targeting (mobile, common, tv)
  • Sticky sessions with configurable TTL
  • Sync and async loading support
  • Raw HTML mode option
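
For reviewers who want a concrete picture of the option surface listed above, here is a minimal sketch of how those targeting settings could be grouped and validated. It is not taken from the PR's code: `TargetingOptions`, its field names, and `as_params` are hypothetical names chosen for illustration, and how the reader actually passes these values to the Massive proxy is not described here.

```python
from dataclasses import asdict, dataclass
from typing import Optional

# Device types named in the feature list above.
DEVICE_TYPES = {"mobile", "common", "tv"}


@dataclass
class TargetingOptions:
    """Hypothetical container for the targeting options in the feature list."""

    country: Optional[str] = None
    city: Optional[str] = None
    zipcode: Optional[str] = None
    asn: Optional[str] = None
    device_type: Optional[str] = None
    session_id: Optional[str] = None  # assumed key for a sticky session
    session_ttl_minutes: Optional[int] = None  # assumed unit; the PR only says "configurable TTL"

    def __post_init__(self) -> None:
        # Reject device types outside the three listed in the description.
        if self.device_type is not None and self.device_type not in DEVICE_TYPES:
            raise ValueError(f"device_type must be one of {sorted(DEVICE_TYPES)}")
        # Assumption: a TTL is only meaningful when a sticky session is requested.
        if self.session_ttl_minutes is not None and self.session_id is None:
            raise ValueError("session_ttl_minutes requires session_id")

    def as_params(self) -> dict:
        """Return only the options that were explicitly set."""
        return {key: value for key, value in asdict(self).items() if value is not None}
```

With this sketch, `TargetingOptions(country="US", device_type="mobile").as_params()` yields a dictionary containing only those two keys, so unset options never reach whatever layer builds the proxy request.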

Fixes # (issue)

New Package?

Did I fill in the tool.llamahub section in the pyproject.toml and provide a detailed README.md for my new integration or package?

  • Yes
  • No

Version Bump?

Did I bump the version in the pyproject.toml file of the package I am updating? (Except for the llama-index-core package)

  • Yes
  • No

Type of Change

Please delete options that are not relevant.

  • New feature (non-breaking change which adds functionality)

How Has This Been Tested?

Your pull-request will likely not be merged unless it is covered by some form of impactful unit testing.

  • I added new unit tests to cover this change
  • I believe this change is already covered by existing unit tests

Suggested Checklist:

  • I have performed a self-review of my own code
  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • I have added Google Colab support for the newly added notebooks.
  • My changes generate no new warnings
  • I have added tests that prove my fix is effective or that my feature works
  • New and existing unit tests pass locally with my changes
  • I ran uv run make format; uv run make lint to appease the lint gods

@dosubot dosubot bot added the size:XXL This PR changes 1000+ lines, ignoring generated files. label Dec 28, 2025