Extracting a json out of the main body of page source

Question

Extracting a json out of the main body of page source

asked Apr 26, 2022 in Education by JackTerrance

I am trying to scrape the data off the webpage below, using Selenium in Python 3: https://www.whoscored.com/Matches/1285051/Live/England-Premier-League-2018-2019-West-Ham-Huddersfield If this URL is viewed as the page source (for Chrome users: view-source: https://www.whoscored.com/Matches/1285051/Live/England-Premier-League-2018-2019-West-Ham-Huddersfield), there are some JSONs within the text. My aim is to scrape the first, and quite substantial JSON, which sits in the 'var matchCentreData' bit. A snippet is featured below: var matchCentreData = {"playerIdNameDictionary":{"14244":"Pablo Zabaleta", "89998":"Manuel Lanzini","34693":"Marko Arnautovic","93026":"Felipe Anderson", "300359":"Issa Diop","122980" I am able to scrape the entirety of the page source, however, I am struggling to extract only the JSON above. Any help would be much appreciated! JavaScript questions and answers, JavaScript questions pdf, JavaScript question bank, JavaScript questions and answers pdf, mcq on JavaScript pdf, JavaScript questions and solutions, JavaScript mcq Test , Interview JavaScript questions, JavaScript Questions for Interview, JavaScript MCQ (Multiple Choice Questions)

1 Answer

Related questions

0 votes

Q: Extracting a json out of the main body of page source

I am trying to scrape the data off the webpage below, using Selenium in Python 3: https://www. ... JavaScript Questions for Interview, JavaScript MCQ (Multiple Choice Questions)...

asked Apr 24, 2022 in Education by JackTerrance

0 votes

Q: How to scrape image urls of comments?

Following code scrapes comments and customer country from each product page for example this product from aliexpress ... for Interview, JavaScript MCQ (Multiple Choice Questions)...

asked Apr 7, 2022 in Education by JackTerrance

0 votes

Q: How to auto extracting nested json in which few json object nested json object but can be parsed with __dict__

object looks like as below { "id":1, "image":"path/to/image", "employee_data":, } again ... JavaScript Questions for Interview, JavaScript MCQ (Multiple Choice Questions)...

asked May 13, 2022 in Education by JackTerrance

0 votes

Q: How to auto extracting nested json in which few json object nested json object but can be parsed with __dict__

object looks like as below { "id":1, "image":"path/to/image", "employee_data":, } again ... JavaScript Questions for Interview, JavaScript MCQ (Multiple Choice Questions)...

asked May 7, 2022 in Education by JackTerrance

0 votes

Q: How do I send JSon as BODY In a POST request to server from an Android application?

I am working on my first Android Application. What I am trying to do is a POST request to a REST ... JavaScript Questions for Interview, JavaScript MCQ (Multiple Choice Questions)...

asked Feb 13, 2022 in Education by JackTerrance

0 votes

Q: which section is used for text and tags that are shown directly on your web page? A head B body. c title. d HTML

which section is used for text and tags that are shown directly on your web page? A head B body. c title. d HTML Select the correct answer from above options...

asked Dec 27, 2021 in Education by JackTerrance

0 votes

Q: which section is used for text and tags that are shown directly on your web page? A head B body. c title. d HTML

which section is used for text and tags that are shown directly on your web page? A head B body. c title. d HTML Select the correct answer from above options...

asked Dec 26, 2021 in Education by JackTerrance

0 votes

Q: The body section displays the main content on the browser window. True or false.

The body section displays the main content on the browser window. True or false. No spam Select the correct answer from above options...

asked Dec 25, 2021 in Education by JackTerrance

0 votes

Q: Analyse the given program segment and answer the following questionsi) write the output of the program segment ii)how many times does the body of the loop gets executefor(int m=5 ; m<=20 ;m+=5){if (m%3= = 0)break;elseif (m%5= = 0) System.out.println(m) continue;}

Analyse the given program segment and answer the following questions i) write the output of the program segment ii)how many times ... body of the loop gets execute for(int m=5 ; m...

asked Nov 29, 2021 in Education by JackTerrance

0 votes

Q: iframe src attribute vs the actual iframe page source returned by Selenium's getPageSource() method

After reading the switch-to-iframe documentation, I am still not sure what will driver.getPageSource() ... Questions for Interview, JavaScript MCQ (Multiple Choice Questions)...

asked Apr 5, 2022 in Education by JackTerrance

0 votes

Q: The (first) page of the website that opens is called? options 1 (Homepage) 2(webpage) 3(main page)4( non of the above)

The (first) page of the website that opens is called? options 1 (Homepage) 2(webpage) 3(main page)4( non of the above) Select the correct answer from above options...

asked Dec 13, 2021 in Education by JackTerrance

0 votes

Q: numbers.4. (a) When running a particular program with Nmemory accesses, a computer with a cacheand paged virtual memory generates atotal of M cache misses and F page faults.Ti is the time for a cache hit; T2 is thetime for a main memory hit and T3 is thetime to load a page into main memoryfrom the disk.(i) What is the cache hit ratio ?(ii) What is the main memory hit ratio ?That is, what percentage of mainmemory accesses do not generate apage fault?(iii) What is the overall effective accessOtime for the system ?

numbers. 4. (a) When running a particular program with N memory accesses, a computer with a cache and paged ... the system ? cribe Select the correct answer from above options...

asked Dec 6, 2021 in Education by JackTerrance

0 votes

Q: numbers.4. (a) When running a particular program with Nmemory accesses, a computer with a cacheand paged virtual memory generates atotal of M cache misses and F page faults.Ti is the time for a cache hit; T2 is thetime for a main memory hit and T3 is thetime to load a page into main memoryfrom the disk.(i) What is the cache hit ratio ?(ii) What is the main memory hit ratio ?That is, what percentage of mainmemory accesses do not generate apage fault?(iii) What is the overall effective accessOtime for the system ?

numbers. 4. (a) When running a particular program with N memory accesses, a computer with a cache and paged ... the system ? cribe Select the correct answer from above options...

asked Nov 26, 2021 in Education by JackTerrance

0 votes

Q: Getting black blank image when extracting a thumbnail from a video

I am trying to get a thumbnail image from a video from a URL using AVFoundation but I am getting ... JavaScript Questions for Interview, JavaScript MCQ (Multiple Choice Questions)...

asked May 13, 2022 in Education by JackTerrance

0 votes

Q: parsings strings: extracting words and phrases [JavaScript]

I need to support exact phrases (enclosed in quotes) in an otherwise space-separated list of terms. Thus ... Questions for Interview, JavaScript MCQ (Multiple Choice Questions)...

asked Mar 24, 2022 in Education by JackTerrance

JackTerrance · Answer 1 · 2022-04-26T02:31:16+0000

This is all you need. page_json = driver.execute_script("return JSON.stringify(matchCentreData)") # Do what you want with the json. Worked for me just now. And if you want both this specifically AND the page html, then do this step, along with your page source grabbing logic. No need to extract it specifically from the page source when you have this.