I am trying to scrape data from a Bengali (language) website. When I inspect element on that website, everything is as it should.
request = requests.get("https://corona.gov.bd/") soup = BeautifulSoup(request.content, "lxml") print(soup.prettify())
Part of the output:
<strong> à¦¸à¦à¦°à¦¾à¦à¦° à¦à¦¿à¦à§à¦à¦¾à¦¸à¦¾ </strong>
à¦¸à¦à¦°à¦¾à¦à¦° à¦à¦¿à¦à§à¦à¦¾à¦¸à¦¾ >> should be >>"সচরাচর জিজ্ঞাসা"
I am not sure if it is ASCII or not. I used https://onlineasciitools.com/convert-ascii-to-unicode to convert that text into Unicode. As per this website, it may be ASCII. But I checked an ASCII table online and none of those characters were in it. So now I need to convert those text into readable stuff. Any help?