Embedding PDF previews and maintaining crawlability/link-equity.
-
One site I'm working on has had a great deal of success with its PDF preview content. The PDF previews are quite substantial and rank for many long-tail terms that drive a reasonable amount of traffic back to the site to purchase the full version of the product.
As part of a site redesign, the way the PDF previews are embedded/presented on the page is changing slightly.
In the proposed modal pop-up on the new site, the code looks like this:
<object data="my-pdf-preview.pdf" type="application/pdf" style="width:100%; min-height:600px; max-height:100%;"><embed src="my-pdf-preview.pdf" type="application/pdf"></object>
Whereas the old code looked like this:
<object data="mt-pdf-previewpreview.pdf#view=FitH,50&scrollbar=1&toolbar=0&statusbar=0&messages=0&navpanes=0"
type='application/pdf'
width='100%'
height='600'>It appears your Web browser is not configured to display PDF files.
No worries, you can download the PDF file here.</object>
Note how the old code contained a plain, standard link to the PDF document.
My worry is that without this link, search engines won't a) be able to discover/crawl the pdf content or b) pass any link-equity to these pdfs.
Does anyone have any experience/recommendations about this? I'd like to have some information before I request that they add a plain link to the pdf previews back onto the on-page content.
-
That's the route I'd push for as well I think.
Agreed on experimentation. Please report back if you get a chance to test this. Perhaps choose a small number of PDFs on this site redesign and leave the link off of them?
-
Thanks Kane - I've managed to make the case for a real-simple "download preview pdf" link so at least I feel comfortable that they won't lose too much of this "hidden" traffic.
It would still be nice to understand how <embed> is handled and whether any link-equity passes through the embed. Tight project deadlines mean there isn't always time to experiment.
-
I haven't seen any studies with <embed> the way I have with <iframe>. <embed> is also used for video and Flash, but neither would be indexed the same way as PDF, so it's hard to compare. The embed tag is pretty standardized, so I really doubt they wouldn't crawl this similarly.
IIRC in the ugly era of Flash, it was proper to have a <noscript> {crawlable content here} </noscript> section after the <embed>, so that's one comparable situation, but that's due to the Flash itself not being crawled well.
If it's not a hassle, I would add the text link to the PDF that says "download full PDF" or similar. If it is a hassle and takes longer than a couple hours, then it's a harder call.
Similar thread that could be helpful:
http://stackoverflow.com/questions/3686331/does-google-index-html-content-supplied-by-the-object-tag
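For reference, here is a rough sketch of what adding the text link back could look like with the new markup. The file name comes from the question above; the link wording, the <p> wrappers, and dropping the nested <embed> are assumptions for illustration, not the site's actual code. Whether search engines pass link-equity through <object> or <embed> is still the open question in this thread, so the sketch simply keeps a plain <a href> in the page markup in both places.

```html
<!-- Sketch only: link text and layout are hypothetical; file name is from the question. -->
<object data="my-pdf-preview.pdf" type="application/pdf"
        style="width:100%; min-height:600px; max-height:100%;">
  <!-- Fallback content: rendered only when the browser cannot display the PDF inline -->
  <p>It appears your Web browser is not configured to display PDF files.
     No worries, you can <a href="my-pdf-preview.pdf">download the PDF file here</a>.</p>
</object>

<!-- Plain link outside the object, so it is visible to every visitor as well as in the source -->
<p><a href="my-pdf-preview.pdf">Download preview PDF</a></p>
```

The link inside the <object> mirrors the old fallback behaviour, while the link outside it is the "download preview pdf" link discussed above and does not depend on how the embed is handled.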
-
Search engines will still be able to crawl the PDF. They crawl images, don't they?