Searchable Document Collection - Add Website Document

This task adds a website's content as a document to a collection that can be queried.

Description

This task adds a website's content as a document to a collection. The collection can then be queried with the Searchable Document Collection - Query task.

Notes

Please note:

  • This task only works for websites, not for PDFs.
  • The document remains available for 6 months.
  • Currently, only the first 200,000 characters are used. We will increase this limit in the future.
  • Pricing scales linearly with the number of chunks we create. We may change the pricing in the future.
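
To get a feel for the character limit and the chunk-based pricing in the notes above, here is a minimal sketch. The 200,000-character limit comes from the notes; the fixed-size chunking and the `chunk_size` value are assumptions for illustration only, since the actual chunking method is not documented here.

```python
import math

MAX_INDEXED_CHARS = 200_000  # limit stated in the notes above


def indexed_portion(text: str, limit: int = MAX_INDEXED_CHARS) -> str:
    """Return the part of the text that falls within the character limit."""
    return text[:limit]


def estimate_chunks(text: str, chunk_size: int) -> int:
    """Estimate the chunk count, assuming fixed-size chunks (hypothetical)."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    return math.ceil(len(indexed_portion(text)) / chunk_size)


page_text = "x" * 250_000  # stand-in for text extracted from a website
print(len(indexed_portion(page_text)))    # 200000
print(estimate_chunks(page_text, 1_000))  # 200
```

Because pricing is linear in the number of chunks, a page beyond the limit should cost no more than a page exactly at the limit under these assumptions.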

Applications

  • Knowledge Base ChatBots
  • Semantic Information Retrieval from Document Collections

How to use the task

Enter the URL of the document you want to index. Add a collection name and press the "Run Task" button.

Privacy

We strongly advise against including sensitive or confidential information in your queries.

[1.0.0]:
First version
Version   AI Model   Created      Link
1.0.0     -          22.10.2023   -

API

The REST API lets you call the tool at the same cost as running the tool directly. Please generate a personal access token before using the REST API.

Parameters

  • url (URL): The URL(s) of the HTML page(s) to extract the text from.
  • document_collection (Document Collection): The name of the document collection.
  • Call the REST API by cURL
    curl -v -H "Authorization: Bearer PERSONAL_ACCESS_TOKEN" "https://api.anysolve.ai/rest/v1/intern-memory-search-collection-add-url/1.0.0?url=https%3A%2F%2Fdocs.anysolve.ai%2Ffaq%2Findex.html&document_collection=document-collection-name"
  • Install the package with pip
    python3 -m pip install anysolve
  • Run in Python 3
    import os
    from anysolve import AnySolve

    # Read your personal access token from the environment
    anysolve_token = os.environ.get('ANYSOLVE_PERSONAL_ACCESS_TOKEN')
    client = AnySolve(anysolve_token)
    # Add the website's content to the named collection
    res = client.run('intern-memory-search-collection-add-url','1.0.0', {'url': 'https://docs.anysolve.ai/faq/index.html', 'document_collection': 'document-collection-name'})
    print(res)
  • Coming soon: Within AnySolve ChatComplete prompts you can use the following command to execute the task:
    /run('intern-memory-search-collection-add-url','1.0.0', url='https://docs.anysolve.ai/faq/index.html', document_collection='document-collection-name')
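
If you prefer not to install the package, the cURL call above can be sketched with the Python standard library alone. The endpoint, header, and parameter names are taken from the cURL example; the helper name `build_request` is hypothetical, and the response format is not documented here, so the sketch simply prints the raw body.

```python
import os
import urllib.parse
import urllib.request

# Endpoint taken from the cURL example above
BASE_URL = "https://api.anysolve.ai/rest/v1/intern-memory-search-collection-add-url/1.0.0"


def build_request(page_url: str, collection: str, token: str) -> urllib.request.Request:
    """Build the GET request with percent-encoded query parameters."""
    query = urllib.parse.urlencode({"url": page_url, "document_collection": collection})
    return urllib.request.Request(
        f"{BASE_URL}?{query}",
        headers={"Authorization": f"Bearer {token}"},
    )


token = os.environ.get("ANYSOLVE_PERSONAL_ACCESS_TOKEN")
req = build_request(
    "https://docs.anysolve.ai/faq/index.html",
    "document-collection-name",
    token or "PERSONAL_ACCESS_TOKEN",  # placeholder when no token is configured
)
print(req.full_url)

if token:  # only send the request when a real token is available
    with urllib.request.urlopen(req) as resp:
        print(resp.status, resp.read().decode("utf-8"))
```

Using `urlencode` avoids hand-encoding the page URL: characters such as `:` and `/` are percent-encoded automatically, which matches the encoded form shown in the cURL call.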