如题,我现在merge了两张表,一张是click表的数据,只记录user的clickdate和sku_id,另外一张表是order表,记录了user下单的时间(orderdate)和商品id(sku_id)和该商品的原价,折扣,最后价格等,现在我想要根据order表中的折扣数据(即direct_discount_per_unit)去填充click表中的折扣缺失值,由于折扣可能在不同的时候变化比较大,所以我现在希望做的是:先比较orderdate和clickdate是否相等,如果相等,则用当天order表中的折扣数据填充对应click表中sku的折扣,如果order表里一个sku在一天内有多个折扣则用均值mean填充click表中的折扣。现在主要的问题是我不太懂如何去表达orderdate和clickdate是否处于同一天这个条件命令,希望能得到各位老师的回答,谢谢!
另外,现在已经尝试过bys sku_id: egen discount=mean(direct_discount_per_unit) if orderdate==clickdate,但是该命令做出来只能够把同一个用户对应的clickdate和orderdate相同的填充出来,但我想要的结果是,假设有一个用户在3.20下单商品A,就有该日的discount数据,假设为10,那么不管是哪个用户,只要是在3.20号点击了A商品但没下单的折扣数据,全部都填充为10
- * Example generated by -dataex-. To install: ssc install dataex
- clear
- input str10(sku_id user_id) double(clickdate orderdate) float direct_discount_per_unit
- "00104dbcd7" "2d32e918d6" 21251 . .
- "00104dbcd7" "c340c63382" 21269 21269 20
- "00104dbcd7" "630699ed68" 21258 . .
- "00104dbcd7" "0020bc07df" 21245 . .
- "00104dbcd7" "f67628079a" 21258 . .
- "00104dbcd7" "51d3aef1d6" 21272 21250 20
- "00104dbcd7" "2db2516659" 21273 . .
- "00104dbcd7" "d7001d5ab5" 21267 . .
- "00104dbcd7" "c7f569a403" 21269 . .
- "00104dbcd7" "d1261c4f1a" 21256 . .
- "00104dbcd7" "dddd850d2f" 21245 . .
- "00104dbcd7" "951d5909dc" 21259 . .
- "00104dbcd7" "ab8606dfe1" 21255 . .
- "00104dbcd7" "2005aad556" 21263 . .
- "00104dbcd7" "221de94d82" 21264 21263 20
- "00104dbcd7" "92930db053" 21269 21271 20
- "00104dbcd7" "95108b339e" 21267 . .
- "00104dbcd7" "51f6ad1e2a" 21249 . .
- "00104dbcd7" "96b657568f" 21261 . .
- "00104dbcd7" "e9d97606e3" 21257 . .
- "00104dbcd7" "7f643c4494" 21253 . .
- "00104dbcd7" "9c95553c58" 21255 . .
- "00104dbcd7" "45383bcaf4" 21251 . .
- "00104dbcd7" "0d360a51a7" 21249 21251 20
- "00104dbcd7" "3118cdd0a8" 21258 21257 20
- "00104dbcd7" "d00acca3f3" 21270 21270 20
- "00104dbcd7" "58877da43e" 21264 . .
- "00104dbcd7" "bb7c828e74" 21251 . .
- "00104dbcd7" "f8f2c7a704" 21258 . .
- "00104dbcd7" "3157c3fe59" 21274 . .
- "00104dbcd7" "c958feca57" 21274 . .
- "00104dbcd7" "92930db053" 21271 21271 20
- "00104dbcd7" "29b38385d2" 21247 . .
- "00104dbcd7" "8005881b3f" 21249 21249 20
- "00104dbcd7" "d33aae81ed" 21253 21250 20
- "00104dbcd7" "34c624705f" 21250 . .
- "00104dbcd7" "6b24b41f01" 21257 21257 20
- "00104dbcd7" "7aedc11fb7" 21244 . .
- "00104dbcd7" "08410557e2" 21246 . .
- "00104dbcd7" "78afb804d5" 21251 . .
- "00104dbcd7" "49c7063c7b" 21274 21274 20
- "00104dbcd7" "ab4a91dff4" 21257 . .
- "00104dbcd7" "a6906b5d2c" 21264 . .
- "00104dbcd7" "42f1fe6e8a" 21257 21256 20
- "00104dbcd7" "fa80d3736a" 21261 . .
- "00104dbcd7" "ab2bc31c41" 21261 . .
- "00104dbcd7" "eb361c7ab3" 21259 . .
- "00104dbcd7" "546e95d5ac" 21274 . .
- "00104dbcd7" "b7489ac487" 21263 . .
- "00104dbcd7" "6418488832" 21253 . .
- "00104dbcd7" "627f2d92ab" 21250 . .
- end
- format %td clickdate
- format %td orderdate


雷达卡






京公网安备 11010802022788号







