hello,
I try to scrape a website, but I don't know how to do with the pagination (JS) as you can see below:

Ex: https://www.eterritoire.fr/evenements/france/visites-et-circuits
Can someone help me?
Thanks,
D
hello,
I try to scrape a website, but I don't know how to do with the pagination (JS) as you can see below:

Ex: https://www.eterritoire.fr/evenements/france/visites-et-circuits
Can someone help me?
Thanks,
D
Hello, if you would use link selector for pagination, it should be working fine.
Thanks for the reply, but I dont know how to configure the link selector with JS.
Can you help me?
Thanks,
David
.pagination a:contains("Suivant")
Hello,
Thanks for the help, I have this:
{"_id":"eterritoire","startUrl":["https://www.eterritoire.fr/evenements/france/visites-et-circuits"],"selectors":[{"id":"link1","type":"SelectorLink","parentSelectors":["_root"],"selector":"a[itemprop='url']","multiple":true,"delay":0},{"id":"pagination","type":"SelectorElementClick","parentSelectors":["_root","pagination"],"selector":".pagination a:contains("Suivant")","multiple":true,"delay":"50","clickElementSelector":".pagination a:contains("Suivant")","clickType":"clickOnce","discardInitialElements":"do-not-discard","clickElementUniquenessType":"uniqueText"}]}
it doesnt' seem to work 
it is invalid sitemap.
{"_id":"eterritoire","startUrl":["https://www.eterritoire.fr/evenements/france/visites-et-circuits"],"selectors":[{"id":"link1","type":"SelectorLink","parentSelectors":["_root"],"selector":"a[itemprop='url']","multiple":true,"delay":0},{"id":"pagination","type":"SelectorElementClick","parentSelectors":["_root","pagination"],"selector":".pagination a:contains("Suivant")","multiple":true,"delay":"50","clickElementSelector":".pagination a:contains("Suivant")","clickType":"clickOnce","discardInitialElements":"do-not-discard","clickElementUniquenessType":"uniqueText"}]}
{"_id":"agenda","startUrl":["https://www.eterritoire.fr/evenements/france/visites-et-circuits"],"selectors":[{"id":"link1","type":"SelectorLink","parentSelectors":["pagination"],"selector":"a[itemprop='url']","multiple":true,"delay":0},{"id":"pagination","type":"SelectorElementClick","parentSelectors":["_root"],"selector":".lstline li[itemscope]","multiple":true,"delay":"1000","clickElementSelector":".pagination a:contains("Suivant")","clickType":"clickOnce","discardInitialElements":"do-not-discard","clickElementUniquenessType":"uniqueCSSSelector"}]}
Thanks but I have invalid JSON
{
"_id": "agenda",
"startUrl": ["https://www.eterritoire.fr/evenements/france/visites-et-circuits"],
"selectors": [{
"id": "link1",
"type": "SelectorLink",
"parentSelectors": ["pagination"],
"selector": "a[itemprop='url']",
"multiple": true,
"delay": 0
}, {
"id": "pagination",
"type": "SelectorElementClick",
"parentSelectors": ["_root"],
"selector": ".lstline li[itemscope]",
"multiple": true,
"delay": "1000",
"clickElementSelector": ".pagination a:contains("Suivant")",
"clickType": "clickOnce",
"discardInitialElements": "do-not-discard",
"clickElementUniquenessType": "uniqueCSSSelector"
}]
}
Thanks for the help, but still invalid JSON... Sorry
Hello,
It works, thanks for your help!
The only issue is that the scrap stops after page 4, is it normal ?
D
Site url does seem to change with each new page, e.g. page 5 is https://www.eterritoire.fr/evenements/france/visites-et-circuits/5
so you don't really need a paginator and you can use the Specify multiple urls with ranges method.
Thanks for the help!