Uncrawlable Resources
This endpoint returns a list of resources detected on the target website that could not be crawled due to a content type inconsistency.
Documentation Index
Fetch the complete documentation index at: https://aisa.one/docs/llms.txt
Use this file to discover all available pages before exploring further.
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Body
ID of the task required field you can get this ID in the response of the Task POST endpoint example: "07131248-1535-0216-1000-17384017ad04"
the maximum number of returned uncrawlable resources optional field default value: 100 maximum value: 1000
offset in the results array of returned uncrawlable resources optional field default value: 0 if you specify the 10 value, the first ten invalid resources in the results array will be omitted and the data will be provided for the successive invalid resources
results sorting rules optional field you can use the same values as in the filters array to sort the results possible sorting types: asc - results will be sorted in the ascending order desc - results will be sorted in the descending order you should use a comma to set up a sorting type example: ["meta.content_type,desc"] note that you can set no more than three sorting rules in a single request you should use a comma to separate several sorting rules example: ["meta.content_type,asc","fetch_time,desc"]
array of results filtering parameters optional field you can add several filters at once (8 filters maximum) you should set a logical operator and, or between the conditions the following operators are supported: regex, not_regex, , , >, >=, =, , in, not_in, like, not_like you can use the % operator with like and not_like to match any string of zero or more characters example: [["meta.content_type","=","image/jpeg"], "and", ["url","not_like","%/help-center/%"]]The full list of possible filters is available by this link.
Response
Successful response
the current version of the API
general status code you can find the full list of the response codes here Note: we strongly recommend designing a necessary system for handling related exceptional or error conditions
general informational message you can find the full list of general informational messages here
execution time, seconds
total tasks cost, USD
the number of tasks in the tasks array
the number of tasks in the tasks array returned with an error
array of tasks
task identifier unique task identifier in our system in the UUID format
status code of the task generated by DataForSEO; can be within the following range: 10000-60000 you can find the full list of the response codes here
informational message of the task you can find the full list of general informational messages here
execution time, seconds
cost of the task, USD
number of elements in the result array
URL path
contains the same parameters that you specified in the POST request
array of results
status of the crawling session possible values: in_progress, finished
details of the crawling session
maximum number of pages to crawl indicates the max_crawl_pages limit you specified when setting a task
number of pages that are currently in the crawling queue
number of crawled pages
total number of uncrawlable resources found total number of uncrawlable resources found during the crawl of the target domain
number of uncrawlable resources in the items array
array of uncrawlable resources
URL of the uncrawlable resource
reason the resource is uncrawlable can take the following values: content_type_inconsistency
HTTP response code returned by the uncrawlable resource possible values: 200
date and time when the resource was fetched in the UTC format: “yyyy-mm-dd hh-mm-ss +00:00” example: 2026-03-09 18:20:32 +00:00
metadata of the uncrawlable resource
actual content type of the resource
expected content types for the resource list of content types that were expected by the crawler based on how the resource is referenced on the page