Original poster: 654490929

[Q&A] How do I extract a dictionary embedded in a string?

#1 (original poster)
654490929 posted on 2017-11-2 21:53:21

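I've just started learning web scraping and have hit a snag. Any expert advice would be appreciated!

Problem: which method can I use to extract the dictionary I need (the one that follows `quote`) from the result string?

Part of my code: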
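    from bs4 import BeautifulSoup

    # Fragment from inside the loop over downloaded pages.
    if html == "":
        continue  # nothing was downloaded for this page, skip it
    infodict = {}
    soup = BeautifulSoup(html, "html.parser")
    # print(soup)
    stockinfosi = soup.find('I do not know what goes here')
    # print(type(stockinfosi))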

The output of print(soup) is:
  1. <stock-operate :quote='{"symbol":"SH600005","exchange":"SH","code":"600005","name":"武钢股份","current":"3.71","percentage":"0.0","change":"0.00","open":"0.00","high":"0.00","low":"0.00","close":"3.71","last_close":"3.71","high52week":"3.89","low52week":"2.69","volume":"-","lot_volume":"-","volumeAverage":"8673.27万","marketCapital":"374.48亿","eps":"0.04","pe_ttm":"-","pe_lyr":"-","beta":"-","totalShares":"100.94亿","time":"2017-02-14","afterHours":"0.00","afterHoursPct":"0.0","afterHoursChg":"0.00","afterHoursTime":"2011-10-20","updateAt":"1493004616997.00","dividend":"-","yield":"-","turnover_rate":"-","instOwn":"-","rise_stop":"4.08","fall_stop":"3.34","currency_unit":"CNY","amount":"-","net_assets":"2.85","hasexist":"","has_warrant":"-","type":"11","flag":"3","rest_day":"","amplitude":"1.87%","market_status":"已收盘","lot_size":"100.00","min_order_quantity":"-","max_order_quantity":"-","tick_size":"0.01","variable_tick_size":"-","kzz_stock_symbol":"-","kzz_stock_name":"-","kzz_stock_current":"-","kzz_convert_price":"-","kzz_covert_value":"-","kzz_cpr":"-","kzz_putback_price":"-","kzz_convert_time":"-","kzz_redempt_price":"-","kzz_straight_price":"-","kzz_stock_percent":"-","pb":"1.30","benefit_before_tax":"-","benefit_after_tax":"-","convert_bond_ratio":"-","totalissuescale":"-","outstandingamt":"-","maturitydate":"-","remain_year":"-","convertrate":"-","interestrtmemo":"-","release_date":"-","circulation":"-","par_value":"-","due_time":"-","value_date":"-","due_date":"-","publisher":"-","redeem_type":"F","issue_type":"","bond_type":"-","warrant":"-","sale_rrg":"-","rate":"-","after_hour_vol":"-","float_shares":"100.94亿","float_market_capital":"374.48亿","disnext_pay_date":"-","convert_rate":"-","volume_ratio":"-","percent5m":"-","pankou_ratio":"0.0%","psr":"0.72","moneySymbol":"¥","parsedTime":"02-14 15:14:49(北京时间)","changeStr":"0.00","percentStr":"0.00","stockColor":"","nav_date":"-","hasPankou":true,"flagStr":"退市","tableHtml":"<table 
class=\"quote-info\"><tr><td>今开:<span class=\"stock-fall\">0.00</span></td><td>最高:<span id=\"quote-high\" class=\"stock-fall\">0.00</span></td><td>52周最高:<span>3.89</span></td><td title=\"当日成交量,单位:股\">成交量:<span id=\"quote-volume\">-</span></td></tr><tr class=\"separateTop\"><td>昨收:<span>3.71</span></td><td>最低:<span id=\"quote-low\" class=\"stock-fall\">0.00</span></td><td>52周最低:<span>2.69</span></td><td>成交额:<span>-</span></td></tr><tr class=\"separateBottom\"><td>涨停价:<span>4.08</span></td><td>换手率:<span>-</span></td><td title=\"最近报告期每股收益,单位:undefined\">每股收益:<span>0.04</span></td><td>市盈率(静)/(动):<span>-/-</span></td></tr><tr><td>跌停价:<span>3.34</span></td><td title=\"单位:%\">振幅:<span>1.87%</span></td><td title=\"最近报告期每股净资产,单位:undefined\">每股净资产:<span>2.85</span></td><td>市净率(动):<span>1.30</span></td></tr><tr><td>总股本:<span>100.94亿</span></td><td>流通股本:<span>100.94亿</span></td><td>总市值:<span>374.48亿</span></td><td>流通市值:<span>374.48亿</span></td></tr></table>","isMF":false,"isFundChart":false,"isDanjuan":false,"isNormalStock":true,"isFund":false}'></stock-operate>
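One possible way to pull out that dictionary, offered as a minimal sketch rather than a definitive answer: the value of the `:quote` attribute is JSON text, so it can be read off the `<stock-operate>` tag and handed to `json.loads`. The sketch uses the standard library's `html.parser`, which is the backend that the `"html.parser"` argument selects in BeautifulSoup. `QuoteAttrParser`, `extract_quote`, and the shortened `sample_html` are hypothetical names made up for illustration. With the `soup` object from the code above, `soup.find("stock-operate")[":quote"]` should presumably return the same attribute text.

```python
import json
from html.parser import HTMLParser


class QuoteAttrParser(HTMLParser):
    """Collects the raw text of the :quote attribute on <stock-operate>."""

    def __init__(self):
        super().__init__()
        self.quote_text = None

    def handle_starttag(self, tag, attrs):
        # html.parser lowercases tag and attribute names before calling this.
        if tag == "stock-operate" and self.quote_text is None:
            self.quote_text = dict(attrs).get(":quote")


def extract_quote(html):
    """Return the :quote JSON as a dict, or None if the tag is missing."""
    parser = QuoteAttrParser()
    parser.feed(html)
    parser.close()
    if parser.quote_text is None:
        return None
    # strict=False tolerates raw line breaks inside string values, which the
    # printed output above appears to contain (assumption about the real page).
    return json.loads(parser.quote_text, strict=False)


# Shortened stand-in for the page printed above (only a few keys kept).
sample_html = r"""<stock-operate :quote='{"symbol":"SH600005","current":"3.71","hasPankou":true,"tableHtml":"<table class=\"quote-info\"></table>"}'></stock-operate>"""

infodict = extract_quote(sample_html)
print(infodict["symbol"], infodict["current"], infodict["hasPankou"])
```

The result is an ordinary Python dict, so individual fields such as `infodict["name"]` can be read directly once the full page is parsed. JSON `true`/`false` become Python `True`/`False`.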

#2
gjinwei posted on 2017-11-2 21:58:41
What URL are you requesting? The data may well be coming from an API endpoint.

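#3
654490929 posted on 2017-11-2 22:18:44, from mobile
Quoting gjinwei, 2017-11-2 21:58:
What URL are you requesting? The data may well be coming from an API endpoint.
Xueqiu. I've already extracted the URLs; I just don't know how to go one step further and extract the dictionary shown in my post.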

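#4
gjinwei posted on 2017-11-2 22:28:13
I don't know which page you are visiting. I picked a random page on Xueqiu, and there is no need to parse it with BeautifulSoup. You can take the JSON directly, or store it in a NoSQL database.
[Attached screenshot: QQ图片20171102222441.png]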

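#5
654490929 posted on 2017-11-2 22:38:34
Quoting gjinwei, 2017-11-2 22:28:
I don't know which page you are visiting. I picked a random page on Xueqiu, and there is no need to parse it with BeautifulSoup. You can take the JSON directly, or ...
I just picked the URL of an arbitrary stock, for example http://xueqiu.com/S/SZ300710
P.S. I'm a complete beginner right now. I spent a long time trying to make sense of this and still couldn't find the JSON or the NoSQL storage.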

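#6
gjinwei posted on 2017-11-2 22:43:50
Quoting 654490929, 2017-11-2 22:38:
I just picked the URL of an arbitrary stock, for example http://xueqiu.com/S/SZ300710
P.S. I'm a complete beginner right now. I spent a long time ...
https://xueqiu.com/v4/stock/quote.json?code=SZ300710&_=1509633641382 is the JSON for the URL you gave. I suggest you first learn to use the browser's F12 developer tools.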
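As a rough sketch of what "taking the JSON directly" could look like in Python: the code below builds the `quote.json` URL quoted in this reply and decodes the response with `json.loads`. The function names `build_quote_url` and `fetch_quote` are hypothetical. It is also an assumption that the `_` parameter is only a cache-busting millisecond timestamp and that a browser-like `User-Agent` header is enough; the endpoint dates from 2017 and may since have changed or started requiring cookies.

```python
import json
import time
import urllib.parse
import urllib.request

# Endpoint taken from the URL quoted in the reply above.
QUOTE_ENDPOINT = "https://xueqiu.com/v4/stock/quote.json"


def build_quote_url(code, timestamp_ms=None):
    """Build the quote.json URL for a stock code such as 'SZ300710'."""
    if timestamp_ms is None:
        timestamp_ms = int(time.time() * 1000)
    # Assumption: "_" is only a cache-busting millisecond timestamp.
    query = urllib.parse.urlencode({"code": code, "_": timestamp_ms})
    return f"{QUOTE_ENDPOINT}?{query}"


def fetch_quote(code, timeout=10):
    """Download and decode the JSON; may fail if the site needs cookies."""
    request = urllib.request.Request(
        build_quote_url(code),
        # Assumption: a browser-like User-Agent avoids an immediate rejection.
        headers={"User-Agent": "Mozilla/5.0"},
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Only the URL is built here; no network request is made.
print(build_quote_url("SZ300710", timestamp_ms=1509633641382))
```

Calling `fetch_quote("SZ300710")` would perform the actual request. The Network tab of the F12 tools shows which requests the page really makes, which is the way to confirm the current URL and any required headers.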

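#7
654490929 posted on 2017-11-2 22:48:46, from mobile
Quoting gjinwei, 2017-11-2 22:43:
https://xueqiu.com/v4/stock/quote.json?code=SZ300710&_=1509633641382 is the JSON for the URL you gave. I suggest you ...
OK, I'll go and learn F12 first, and I'll ask you again if I get stuck. Thanks!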
