Hello,
I want to scrape detailed information from https://far.nive.hu/publikus-adatok/felnottkepzok-nyilvantartasa but I can't work out how. Can you help me?
1. Click on each page number and save the table. The rows are links, but I can't get the scraper to follow them. How should I set this up? Could you add one scraped detail field as an example (I will add the others myself)?
{"_id":"Felnottkepzes","startUrl":["https://far.nive.hu/publikus-adatok/felnottkepzok-nyilvantartasa"],"selectors":[{"id":"Oldalak","parentSelectors":["_root","Oldalak"],"paginationType":"auto","type":"SelectorPagination","selector":"button.p-paginator-page"},{"id":"Tabla","parentSelectors":["Oldalak"],"type":"SelectorTable","multiple":true,"selector":"table","tableDataRowSelector":"tbody","tableHeaderRowSelector":".p-datatable-thead tr","columns":[{"extract":true,"header":"Megnevezés","name":"Megnevezes"},{"extract":true,"header":"Székhely","name":"Szekhely"},{"extract":true,"header":"Nyilv. szám","name":"NyilvantartasiSzam"},{"extract":true,"header":"Engedélyszám","name":"Engedelyszam"}]}]}
2. The other way: clicking on a table row changes the URL, so the detail pages can be opened directly by number. This is very slow, because there are more than 22,000 rows and many of them (about 13,000) are deleted records. Can a page be skipped if there is no data on it? For example: https://far.nive.hu/nyilvantartasok/felnottkepzok/publikus/20175
{"_id":"Felnottkepzes_Reszletes","startUrl":["https://far.nive.hu/nyilvantartasok/felnottkepzok/publikus/[3-30]"],"selectors":[{"id":"nyilvSzam","parentSelectors":["_root"],"type":"SelectorText","selector":"div.osszesites-field.ng-star-inserted:nth-of-type(1) .horizontal-label > span","multiple":false,"regex":""},{"id":"NyilvVeve","parentSelectors":["_root"],"type":"SelectorText","selector":".mb-1 .horizontal-label > span","multiple":false,"regex":""},{"id":"EngedelySzam","parentSelectors":["_root"],"type":"SelectorText","selector":"div.ng-star-inserted:nth-of-type(3) .horizontal-label > span","multiple":false,"regex":""},{"id":"Engedelyezve","parentSelectors":["_root"],"type":"SelectorText","selector":"far-nyilvantartas-adatok > section div:nth-of-type(4) .horizontal-label > span","multiple":false,"regex":""},{"id":"Vallalkozas","parentSelectors":["_root"],"type":"SelectorText","selector":"far-beadvany-osszesites section:nth-of-type(1) div:nth-of-type(1) .horizontal-label > span","multiple":false,"regex":""},{"id":"Azonosito","parentSelectors":["_root"],"type":"SelectorText","selector":"far-beadvany-osszesites div:nth-of-type(4) .horizontal-label > span","multiple":false,"regex":""},{"id":"Adoszam","parentSelectors":["_root"],"type":"SelectorText","selector":"div:nth-of-type(5) .horizontal-label > span","multiple":false,"regex":""},{"id":"Szekhely","parentSelectors":["_root"],"type":"SelectorText","selector":"section:nth-of-type(2) div:nth-of-type(1) .horizontal-label > span","multiple":false,"regex":""},{"id":"Telephely","parentSelectors":["_root"],"type":"SelectorText","selector":"far-beadvany-osszesites div.ng-star-inserted:nth-of-type(2) .horizontal-label > span","multiple":false,"regex":""},{"id":"LevCim","parentSelectors":["_root"],"type":"SelectorText","selector":"section:nth-of-type(3) div:nth-of-type(1) .horizontal-label > span","multiple":false,"regex":""},{"id":"Telefon","parentSelectors":["_root"],"type":"SelectorText","selector":"section:nth-of-type(3) div:nth-of-type(2) .horizontal-label > span","multiple":false,"regex":""},{"id":"eMail","parentSelectors":["_root"],"type":"SelectorText","selector":"section:nth-of-type(3) div:nth-of-type(3) .horizontal-label > span","multiple":false,"regex":""},{"id":"Kepviselo","parentSelectors":["_root"],"type":"SelectorText","selector":"section:nth-of-type(4) div:nth-of-type(1) .horizontal-label > span","multiple":false,"regex":""},{"id":"KepvTisztsege","parentSelectors":["_root"],"type":"SelectorText","selector":"section:nth-of-type(4) div:nth-of-type(2) .horizontal-label > span","multiple":false,"regex":""}]}
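To make point 2 concrete, here is a rough Python sketch of what I mean by "skipping" empty pages. It is not part of Web Scraper: `expand_range_url` and `is_deleted` are hypothetical helper names, the sample records are made up, and I am assuming the `[3-30]` range is inclusive and that the export contains the `web-scraper-order` and `web-scraper-start-url` columns. It only drops empty records after scraping; it does not make the scrape itself faster.

```python
import re

# Start URL with the range pattern from the sitemap above.
START_URL = "https://far.nive.hu/nyilvantartasok/felnottkepzok/publikus/[3-30]"

# Export columns that are always filled in, so they say nothing about deletion (assumed names).
META_COLUMNS = ("web-scraper-order", "web-scraper-start-url")


def expand_range_url(url):
    """Expand the first [start-end] range in a URL into concrete URLs (inclusive)."""
    match = re.search(r"\[(\d+)-(\d+)\]", url)
    if match is None:
        return [url]
    start, end = int(match.group(1)), int(match.group(2))
    prefix, suffix = url[:match.start()], url[match.end():]
    return [f"{prefix}{n}{suffix}" for n in range(start, end + 1)]


def is_deleted(record):
    """Treat a record as deleted when every scraped (non-meta) field is empty."""
    return all(
        not (value or "").strip()
        for key, value in record.items()
        if key not in META_COLUMNS
    )


urls = expand_range_url(START_URL)

# Made-up records: the first has data, the second looks like a deleted entry.
records = [
    {"web-scraper-start-url": urls[0], "nyilvSzam": "EXAMPLE-1", "Vallalkozas": "Example Kft."},
    {"web-scraper-start-url": urls[1], "nyilvSzam": "", "Vallalkozas": None},
]
kept = [r for r in records if not is_deleted(r)]
print(len(urls), len(kept))
```

What I would really like is the same check inside the scraper, so that a deleted page is dropped without waiting for every selector.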
Thx, ZS