Thin Pages
Indexable pages with less content than the Thin Page Threshold but more than the Empty Page Threshold specified in Advanced settings > Report settings.
Priority: Medium
Impact: Negative
What issues it may cause
Pages with a small content size may be classified as 'thin content' and might not be indexed or devalue your site quality.
If these pages are indexed they may result in a poor user-experience for the visitors referred from a search engine results page.
How do you fix it
The pages can be reviewed to identify pages that can be updated to include more unique content or prevented from being crawled to improve the crawl efficiency.
What is the positive impact
Adding more unique and relevant content to these pages should improve the user-experience, increase their chances to rank for more search terms and drive more traffic.
If you decide to remove these unnecessary pages then this should result in an improvement to your crawl efficiency so other more important pages have chance to be indexed.
How to fetch the data for this report template
You will need to run a crawl for report template to generate report. When report has been generated and you have crawl id you can fetch data for the report using the following query:
- Query
- Variables
- cURL
query GetReportStatForCrawl(
$crawlId: ObjectID!
$reportTemplateCode: String!
$after: String
) {
getReportStat(
input: {crawlId: $crawlId, reportTemplateCode: $reportTemplateCode}
) {
crawlUrls(after: $after, reportType: Basic) {
nodes {
pageTitle
url
description
foundAtUrl
contentSize
deeprank
level
wordCount
foundInGoogleAnalytics
foundInGoogleSearchConsole
foundInBacklinks
foundInList
foundInLogSummary
foundInWebCrawl
foundInSitemap
}
totalCount
pageInfo {
endCursor
hasNextPage
}
}
}
}
{"crawlId":"TjAwNUNyYXdsNDAwMA","reportTemplateCode":"thin_pages"}
curl -X POST -H "Content-Type: application/json" -H "apollographql-client-name: docs-example-client" -H "apollographql-client-version: 1.0.0" -H "x-auth-token: YOUR_API_SESSION_TOKEN" --data '{"query":"query GetReportStatForCrawl( $crawlId: ObjectID! $reportTemplateCode: String! $after: String ) { getReportStat( input: {crawlId: $crawlId, reportTemplateCode: $reportTemplateCode} ) { crawlUrls(after: $after, reportType: Basic) { nodes { pageTitle url description foundAtUrl contentSize deeprank level wordCount foundInGoogleAnalytics foundInGoogleSearchConsole foundInBacklinks foundInList foundInLogSummary foundInWebCrawl foundInSitemap } totalCount pageInfo { endCursor hasNextPage } } } }","variables":{"crawlId":"TjAwNUNyYXdsNDAwMA","reportTemplateCode":"thin_pages"}}' https://api.lumar.io/graphql