Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Science Direct fetcher is broken #2574

Closed
stefan-kolb opened this issue Feb 20, 2017 · 0 comments
Closed

Science Direct fetcher is broken #2574

stefan-kolb opened this issue Feb 20, 2017 · 0 comments
Assignees
Labels
bug Confirmed bugs or reports that are very likely to be bugs fetcher

Comments

@stefan-kolb
Copy link
Member

stefan-kolb commented Feb 20, 2017

URL no longer available inside id=pdfLink
http://www.sciencedirect.com/science/article/pii/0370269377900156?via%3Dihub
Only new design? Some articles?
This one still works:
http://www.sciencedirect.com/science/article/pii/S1674775515001079

<div class="PdfDropDownMenu"><ul><li><a href="/science/article/pii/0370269377900156/pdf?md5=3db7bb02cd67f3d839b1e563e6f21b00&amp;pid=1-s2.0-0370269377900156-main.pdf" target="_blank" aria-label="Download single PDF. Opens in a new window." aria-expanded="true" aria-live="polite" rel="nofollow" data-hack="#"><span class="Icon Pdf" style="display: inline-block; height: 16px; width: 16px;"><svg fill="#fff" tabindex="-1" focusable="false" height="32" width="32" viewBox="0 0 32 32" y="40" style="width: 100%; height: 100%;"><defs><linearGradient id="a" gradientTransform="matrix(.054 0 0 .05 -39.302 -11.44)" gradientUnits="userSpaceOnUse" x1="1010" x2="1200" y1="686" y2="339"><stop offset="0" stop-color="#fff"></stop><stop offset="1" stop-color="#e1e1e1"></stop></linearGradient></defs><path d="M7 .362h17.875l6.763 6.1V31.64H6.948V16z" fill="url(#a)" stroke="#000" stroke-width=".703"></path><path d="M.167 2.592H22.39V9.72H.166z" fill="#da0000" stroke="#aaa" stroke-width=".315"></path><path d="M5.97 3.638h1.62c1.053 0 1.483.677 1.488 1.564.008.96-.6 1.564-1.492 1.564h-.644v1.66h-.977V3.64m.977.897v1.34h.542c.27 0 .596-.068.596-.673-.002-.6-.32-.667-.596-.667h-.542m3.8.036v2.92h.35c.933 0 1.223-.448 1.228-1.462.008-1.06-.316-1.45-1.23-1.45h-.347m-.977-.94h1.03c1.68 0 2.523.586 2.534 2.39.01 1.688-.607 2.4-2.534 2.4h-1.03V3.64m4.305 0h2.63v.934h-1.657v.894H16.6V6.4h-1.56v2.026h-.97V3.638" fill="#fff9f9"></path><path d="M19.462 13.46c.348 4.274-6.59 16.72-8.508 15.792-1.82-.85 1.53-3.317 2.92-4.366-2.864.894-5.394 3.252-3.837 3.93 2.113.895 7.048-9.25 9.41-15.394zM14.32 24.874c4.767-1.526 14.735-2.974 15.152-1.407.824-3.157-13.72-.37-15.153 1.407zm5.28-5.043c2.31 3.237 9.816 7.498 9.788 3.82-.306 2.046-6.66-1.097-8.925-4.164-4.087-5.534-2.39-8.772-1.682-8.732.917.047 1.074 1.307.67 2.442-.173-1.406-.58-2.44-1.224-2.415-1.835.067-1.905 4.46 1.37 9.065z" fill="#f91d0a"></path></svg></span><!-- react-text: 451 -->Article<!-- /react-text --></a></li><li><a href="#" aria-label="Download whole issue."><span class="Icon Pdf" style="display: inline-block; height: 16px; width: 16px;"><svg fill="#fff" tabindex="-1" focusable="false" height="32" width="32" viewBox="0 0 32 32" y="40" style="width: 100%; height: 100%;"><defs><linearGradient id="a" gradientTransform="matrix(.054 0 0 .05 -39.302 -11.44)" gradientUnits="userSpaceOnUse" x1="1010" x2="1200" y1="686" y2="339"><stop offset="0" stop-color="#fff"></stop><stop offset="1" stop-color="#e1e1e1"></stop></linearGradient></defs><path d="M7 .362h17.875l6.763 6.1V31.64H6.948V16z" fill="url(#a)" stroke="#000" stroke-width=".703"></path><path d="M.167 2.592H22.39V9.72H.166z" fill="#da0000" stroke="#aaa" stroke-width=".315"></path><path d="M5.97 3.638h1.62c1.053 0 1.483.677 1.488 1.564.008.96-.6 1.564-1.492 1.564h-.644v1.66h-.977V3.64m.977.897v1.34h.542c.27 0 .596-.068.596-.673-.002-.6-.32-.667-.596-.667h-.542m3.8.036v2.92h.35c.933 0 1.223-.448 1.228-1.462.008-1.06-.316-1.45-1.23-1.45h-.347m-.977-.94h1.03c1.68 0 2.523.586 2.534 2.39.01 1.688-.607 2.4-2.534 2.4h-1.03V3.64m4.305 0h2.63v.934h-1.657v.894H16.6V6.4h-1.56v2.026h-.97V3.638" fill="#fff9f9"></path><path d="M19.462 13.46c.348 4.274-6.59 16.72-8.508 15.792-1.82-.85 1.53-3.317 2.92-4.366-2.864.894-5.394 3.252-3.837 3.93 2.113.895 7.048-9.25 9.41-15.394zM14.32 24.874c4.767-1.526 14.735-2.974 15.152-1.407.824-3.157-13.72-.37-15.153 1.407zm5.28-5.043c2.31 3.237 9.816 7.498 9.788 3.82-.306 2.046-6.66-1.097-8.925-4.164-4.087-5.534-2.39-8.772-1.682-8.732.917.047 1.074 1.307.67 2.442-.173-1.406-.58-2.44-1.224-2.415-1.835.067-1.905 4.46 1.37 9.065z" fill="#f91d0a"></path></svg></span><!-- react-text: 464 -->Article + other articles in this issue<!-- /react-text --></a></li></ul></div>

We use the Elsevier API for getting the Sciencedirect web page link and then scrape the web page for the deep link.

@stefan-kolb stefan-kolb added bug Confirmed bugs or reports that are very likely to be bugs fetcher labels Feb 20, 2017
@stefan-kolb stefan-kolb self-assigned this Feb 20, 2017
stefan-kolb added a commit that referenced this issue Feb 21, 2017
* Fixes #2574 Add logic for new Sciencedirect pages
* Add changelog
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Confirmed bugs or reports that are very likely to be bugs fetcher
Projects
None yet
Development

No branches or pull requests

1 participant