<!-- markdown css tag --><div class="pinggu_markdown">
<div class="pinggu_markdown__html"><p>我有这个样子的数据</p>
<p>| DEVICE_NUMBER(电话号码)| MONTH(月份) | TOTAL_FEE(收入) |</p>
<p>| :–: | :--------: | :--------: |</p>
<p>| a13453543 | 202301 | 100 |</p>
<p>| a13453543| 202302 | 98 |</p>
<p>| a13453543 | 202303 | 97 |</p>
<p>| <span class="katex--inline"><span class="katex"><span class="katex-mathml"><math><semantics><mrow><mo>⋯</mo></mrow><annotation encoding="application/x-tex">\cdots</annotation></semantics></math></span><span class="katex-html" aria-hidden="true"><span class="base"><span class="strut" style="height: 0.31em; vertical-align: 0em;"></span><span class="minner">⋯</span></span></span></span></span> | <span class="katex--inline"><span class="katex"><span class="katex-mathml"><math><semantics><mrow><mo>⋯</mo></mrow><annotation encoding="application/x-tex">\cdots</annotation></semantics></math></span><span class="katex-html" aria-hidden="true"><span class="base"><span class="strut" style="height: 0.31em; vertical-align: 0em;"></span><span class="minner">⋯</span></span></span></span></span> | <span class="katex--inline"><span class="katex"><span class="katex-mathml"><math><semantics><mrow><mo>⋯</mo></mrow><annotation encoding="application/x-tex">\cdots</annotation></semantics></math></span><span class="katex-html" aria-hidden="true"><span class="base"><span class="strut" style="height: 0.31em; vertical-align: 0em;"></span><span class="minner">⋯</span></span></span></span></span> |</p>
<p>| a27546765 | 202301 | 80 |</p>
<p>| a27546765 | 202302 | 100 |</p>
<p>| a27546765 | 202303 | 98 |</p>
<p>用最小二乘法拟合:收入= <span class="katex--inline"><span class="katex"><span class="katex-mathml"><math><semantics><mrow><mi>a</mi><mo>∗</mo></mrow><annotation encoding="application/x-tex">a *</annotation></semantics></math></span><span class="katex-html" aria-hidden="true"><span class="base"><span class="strut" style="height: 0.46528em; vertical-align: 0em;"></span><span class="mord mathit">a</span><span class="mord">∗</span></span></span></span></span>月份<span class="katex--inline"><span class="katex"><span class="katex-mathml"><math><semantics><mrow><mo>+</mo><mi>b</mi></mrow><annotation encoding="application/x-tex">+b</annotation></semantics></math></span><span class="katex-html" aria-hidden="true"><span class="base"><span class="strut" style="height: 0.77777em; vertical-align: -0.08333em;"></span><span class="mord">+</span><span class="mord mathit">b</span></span></span></span></span>,用电话号码进行循环,使用sklearn中的linear_model.LinearRegression(fit_intercept=True)求<span class="katex--inline"><span class="katex"><span class="katex-mathml"><math><semantics><mrow><mi>a</mi></mrow><annotation encoding="application/x-tex">a</annotation></semantics></math></span><span class="katex-html" aria-hidden="true"><span class="base"><span class="strut" style="height: 0.43056em; vertical-align: 0em;"></span><span class="mord mathit">a</span></span></span></span></span>的值,<span class="katex--inline"><span class="katex"><span class="katex-mathml"><math><semantics><mrow><mn>4</mn></mrow><annotation encoding="application/x-tex">4</annotation></semantics></math></span><span class="katex-html" aria-hidden="true"><span class="base"><span class="strut" style="height: 0.64444em; vertical-align: 0em;"></span><span class="mord">4</span></span></span></span></span>万个电话号码,大概要执行<span class="katex--inline"><span class="katex"><span class="katex-mathml"><math><semantics><mrow><mn>10</mn></mrow><annotation encoding="application/x-tex">10</annotation></semantics></math></span><span class="katex-html" aria-hidden="true"><span class="base"><span class="strut" style="height: 0.64444em; vertical-align: 0em;"></span><span class="mord">1</span><span class="mord">0</span></span></span></span></span>分钟左右,效率很低。</p>
<p>想用pandas.Datafr ame中的groupby加上聚合函数进行计算(在R中也是最初用循环,后来用改group_by和summarize</p>
<pre class=" language-r"><code class="prism language-r">
t<span class="token percent-operator operator">%>%</span>group_by<span class="token punctuation">(</span>DEVICE_NUMBER<span class="token punctuation">)</span><span class="token percent-operator operator">%>%</span>summarize<span class="token punctuation">(</span>TOTAL_FEE_K<span class="token operator">=</span>lm<span class="token punctuation">(</span>formula<span class="token operator">=</span>TOTAL_FEE<span class="token operator">~</span>MONTH<span class="token punctuation">)</span><span class="token operator">$</span>coefficients<span class="token punctuation">[</span><span class="token number">2</span><span class="token punctuation">]</span><span class="token punctuation">)</span>
</code></pre>
<p>效率提升很多 ),只是不知道这个聚合函数该怎么写。</p>
<p>迫切希望大家指导。</p>
</div>
</div>


雷达卡



京公网安备 11010802022788号







