论文:https://arxiv.org/pdf/2407.17490 https://github.com/YuxiangChai/AMEX-codebase/tree/main/data_utils https://huggingface.co/datasets/Yuxiang007/AMEX
AMEX数据集包括三个层次的注释:
下载数据后,合并并解压数据:
zip --fix screenshot.zip --out screenshot_merged.zip unzip screenshot_merged.zip # 解压 # 或者 sudo apt install -y p7zip-full 7z x -mmt=on screenshot.zip # 启用多线程解压
AMEX数据集包括三个层次的注释:
下载数据后,合并并解压数据: zip --fix screenshot.zip --out screenshot_merged.zip unzip screenshot_merged.zip 数据统计
Total JSON files processed: 3046 Total dictionaries (steps) found: 38709
PRESS_BACK: 135 PRESS_ENTER: 651 PRESS_HOME: 13 SWIPE: 7628 TAP: 24815 TASK_COMPLETE: 2828 TASK_IMPOSSIBLE: 220 TYPE: 2419
样本示例
总共有 8 种不同的动作类型:
暂时无法在飞书文档外展示此内容 让我为您解释这个样本的含义: 这是一个记录用户在宜家(IKEA)安卓应用上购买游戏椅的交互过程的数据样本。让我们从几个关键部分来分析:
Sample 1: { "step_id": 1, "action": "SWIPE", "touch_coord": [ 637, 2567 ], "lift_coord": [ 564, 1390 ], "device_dim": [ 1440, 3040 ], "package_name": "com.android.launcher3", "type_text": "", "need_human_check": false, "interest_region": [ [ 0, 0 ], [ 0, 0 ] ], "image_path": "2024_4_28_15_40_b31f339a2d6441bb994b4a30f184a267-1.png" }
Sample 1: { "step_id": 4, "action": "TYPE", "touch_coord": [ 0, 0 ], "lift_coord": [ 0, 0 ], "device_dim": [ 1440, 2960 ], "package_name": "com.amazon.mShop.android.shopping", "type_text": "Nike Basketball Shoes", "need_human_check": false, "interest_region": [ [ 0, 0 ], [ 0, 0 ] ], "image_path": "2024_5_14_15_40_ca5bacfe2a574da4ac05174b973cf321-4.png" }
Sample 1: { "step_id": 4, "action": "PRESS_ENTER", "touch_coord": [ 0, 0 ], "lift_coord": [ 0, 0 ], "device_dim": [ 1440, 3120 ], "package_name": "com.microsoft.office.outlook", "type_text": "", "need_human_check": false, "interest_region": [ [ 0, 0 ], [ 0, 0 ] ], "image_path": "2024_5_8_15_56_d950b959ebad431a84368d7b63da755d-4.png" }
Sample 1: { "step_id": 5, "action": "TASK_IMPOSSIBLE", "touch_coord": [ 0, 0 ], "lift_coord": [ 0, 0 ], "device_dim": [ 1440, 3120 ], "package_name": "com.espn.score_center", "type_text": "", "need_human_check": false, "interest_region": [ [ 0, 0 ], [ 0, 0 ] ], "image_path": "2024_4_26_0_42_87159d38462b4ffdb7fee96370eea8cf-5.png" }
Sample 2: { "step_id": 6, "action": "TASK_IMPOSSIBLE", "touch_coord": [ 0, 0 ], "lift_coord": [ 0, 0 ], "device_dim": [ 1440, 3040 ], "package_name": "musclebooster.workout.home.gym.abs.loseweight", "type_text": "", "need_human_check": false, "interest_region": [ [ 0, 0 ], [ 0, 0 ] ], "image_path": "2024_4_22_16_39_72e6b2de98254c44b9295e165567ac83-6.png" }
Sample 3: { "step_id": 10, "action": "TASK_IMPOSSIBLE", "touch_coord": [ 0, 0 ], "lift_coord": [ 0, 0 ], "device_dim": [ 1440, 3120 ], "package_name": "com.podcast.podcasts", "type_text": "", "need_human_check": false, "interest_region": [ [ 0, 0 ], [ 0, 0 ] ], "image_path": "2024_4_26_15_24_95f480ce21604f1792190e5bfef54549-10.png" }
Sample 4: { "step_id": 15, "action": "TASK_IMPOSSIBLE", "touch_coord": [ 0, 0 ], "lift_coord": [ 0, 0 ], "device_dim": [ 1440, 3040 ], "package_name": "com.agoda.mobile.consumer", "type_text": "", "need_human_check": false, "interest_region": [ [ 0, 0 ], [ 0, 0 ] ], "image_path": "2024_4_28_15_41_f7039f20e9944e92bdada777525a2268-15.png" }
Sample 5: { "step_id": 12, "action": "TASK_IMPOSSIBLE", "touch_coord": [ 0, 0 ], "lift_coord": [ 0, 0 ], "device_dim": [ 1440, 3040 ], "package_name": "musclebooster.workout.home.gym.abs.loseweight", "type_text": "", "need_human_check": false, "interest_region": [ [ 0, 0 ], [ 0, 0 ] ], "image_path": "2024_4_22_16_27_3d79d989a7fd4f22a7cf0ce44ec8d47e-12.png" }
Sample 1: { "step_id": 13, "action": "PRESS_HOME", "touch_coord": [ 0, 0 ], "lift_coord": [ 0, 0 ], "device_dim": [ 1440, 2960 ], "package_name": "com.seatgeek.android", "type_text": "", "need_human_check": false, "interest_region": [ [ 0, 0 ], [ 0, 0 ] ], "image_path": "2024_3_20_14_36_21b637d11bea46b8adb3c2efc9f03501-13.png" }
Sample 1: { "step_id": 18, "action": "TASK_COMPLETE", "touch_coord": [ 0, 0 ], "lift_coord": [ 0, 0 ], "device_dim": [ 1440, 3120 ], "package_name": "com.microsoft.teams", "type_text": "", "need_human_check": false, "interest_region": [ [ 0, 0 ], [ 0, 0 ] ], "image_path": "2024_4_24_18_25_f100d55037f04908900699457c6f5676-18.png" }
Sample 1: { "step_id": 5, "action": "TASK_IMPOSSIBLE", "touch_coord": [ 0, 0 ], "lift_coord": [ 0, 0 ], "device_dim": [ 1440, 3120 ], "package_name": "com.espn.score_center", "type_text": "", "need_human_check": false, "interest_region": [ [ 0, 0 ], [ 0, 0 ] ], "image_path": "2024_4_26_0_42_87159d38462b4ffdb7fee96370eea8cf-5.png" }
本文作者:Dong
本文链接:
版权声明:本博客所有文章除特别声明外,均采用 CC BY-NC。本作品采用《知识共享署名-非商业性使用 4.0 国际许可协议》进行许可。您可以在非商业用途下自由转载和修改,但必须注明出处并提供原作者链接。 许可协议。转载请注明出处!