Duplicate URL Sets
Priority: Low
Impact: Negative
What issues it may cause
If the duplicate URLs return similar content, they are likely to be deduplicated in search engines.
How do you fix it
Identify whether the URLs are created by malformed internal links. If they are, correct the links to the URL so that they use single encoding.
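As a rough illustration of the check described above, the following Python sketch flags links that still contain percent escapes after being decoded once, which usually indicates double encoding. The function names and the sample links are hypothetical and are not part of the Lumar API. The check is a heuristic only: a URL that legitimately contains an encoded percent sign followed by two hex digits would also be flagged.

```python
import re
from urllib.parse import unquote

# Matches a percent escape such as %20 or %2F.
_PERCENT_ESCAPE = re.compile(r"%[0-9A-Fa-f]{2}")


def looks_double_encoded(url: str) -> bool:
    """Return True if percent escapes remain after decoding the URL once.

    Heuristic (an assumption, not a Lumar rule): a singly encoded URL
    should contain no percent escapes once it has been decoded.
    """
    return bool(_PERCENT_ESCAPE.search(unquote(url)))


def find_malformed_links(links):
    """Return the subset of links that appear to be encoded more than once."""
    return [link for link in links if looks_double_encoded(link)]


if __name__ == "__main__":
    # Hypothetical internal links, for illustration only.
    sample_links = [
        "https://example.com/red%20shoes",    # single encoding
        "https://example.com/red%2520shoes",  # "%25" is an encoded "%"
    ]
    for link in find_malformed_links(sample_links):
        print("Possibly double-encoded:", link)
```

Links flagged this way are candidates for manual review before the internal links are corrected.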
What is the positive impact
Resolving duplicate URL sets reduces the following risks:
- Search engines may index multiple copies of the same page, resulting in dilution of authority.
- Crawling resources may be wasted, which could impact the discovery of other URLs.
- Search engines may not select the preferred URL format.
How to fetch the data for this report template
You will need to run a crawl for the report template to generate the report. Once the report has been generated and you have the crawl ID, you can fetch the data for the report using the following query:
Query:

    query GetReportStatForCrawl(
      $crawlId: ObjectID!
      $reportTemplateCode: String!
      $after: String
    ) {
      getReportStat(
        input: {crawlId: $crawlId, reportTemplateCode: $reportTemplateCode}
      ) {
        crawlDuplicateUrls(after: $after, reportType: Basic) {
          nodes {
            pageTitle
            description
            primaryUrl
            exampleDuplicate1
            exampleDuplicate2
            duplicateCount
            deeprank
            level
            duplicateType
          }
          totalCount
          pageInfo {
            endCursor
            hasNextPage
          }
        }
      }
    }

Variables:

    {"crawlId":"TjAwNUNyYXdsNDAwMA","reportTemplateCode":"duplicate_url_sets"}

cURL:

    curl -X POST \
      -H "Content-Type: application/json" \
      -H "apollographql-client-name: docs-example-client" \
      -H "apollographql-client-version: 1.0.0" \
      -H "x-auth-token: YOUR_API_SESSION_TOKEN" \
      --data '{"query":"query GetReportStatForCrawl( $crawlId: ObjectID! $reportTemplateCode: String! $after: String ) { getReportStat( input: {crawlId: $crawlId, reportTemplateCode: $reportTemplateCode} ) { crawlDuplicateUrls(after: $after, reportType: Basic) { nodes { pageTitle description primaryUrl exampleDuplicate1 exampleDuplicate2 duplicateCount deeprank level duplicateType } totalCount pageInfo { endCursor hasNextPage } } } }","variables":{"crawlId":"TjAwNUNyYXdsNDAwMA","reportTemplateCode":"duplicate_url_sets"}}' \
      https://api.lumar.io/graphql
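The query accepts an `after` cursor and returns `pageInfo.endCursor` and `pageInfo.hasNextPage`, so larger reports are presumably returned one page at a time. The Python sketch below shows one way to follow the cursor until no further pages remain. It uses only the standard library and selects a reduced set of fields. The helper names (`post_graphql`, `fetch_all_duplicate_sets`) are hypothetical, and the code assumes the response follows the usual GraphQL shape with the result under a top-level `data` key; error handling is omitted.

```python
import json
import urllib.request

API_URL = "https://api.lumar.io/graphql"

# Same operation as the query above, with fewer node fields selected.
QUERY = """
query GetReportStatForCrawl(
  $crawlId: ObjectID!
  $reportTemplateCode: String!
  $after: String
) {
  getReportStat(
    input: {crawlId: $crawlId, reportTemplateCode: $reportTemplateCode}
  ) {
    crawlDuplicateUrls(after: $after, reportType: Basic) {
      nodes { primaryUrl duplicateCount duplicateType }
      pageInfo { endCursor hasNextPage }
    }
  }
}
"""


def post_graphql(payload, token):
    """Send one GraphQL request using the x-auth-token header from the cURL example."""
    request = urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json", "x-auth-token": token},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def fetch_all_duplicate_sets(crawl_id, token, post=post_graphql):
    """Collect nodes from every page by following endCursor.

    Assumes the result sits under body["data"] (standard GraphQL response
    shape). `post` can be swapped out, for example in tests.
    """
    nodes = []
    after = None  # no cursor on the first request
    while True:
        variables = {
            "crawlId": crawl_id,
            "reportTemplateCode": "duplicate_url_sets",
            "after": after,
        }
        body = post({"query": QUERY, "variables": variables}, token)
        page = body["data"]["getReportStat"]["crawlDuplicateUrls"]
        nodes.extend(page["nodes"])
        if not page["pageInfo"]["hasNextPage"]:
            return nodes
        after = page["pageInfo"]["endCursor"]
```

A call such as `fetch_all_duplicate_sets("TjAwNUNyYXdsNDAwMA", "YOUR_API_SESSION_TOKEN")` would then return the combined list of nodes, assuming the token is valid and the crawl has finished.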