Dynamic Web Crawler & Data Collection Platform
Built a reusable crawler for dynamic pages with Selenium + BeautifulSoup, supporting infinite scroll and complex DOM structures. Designed a standardized ETL pipeline that cleans and parses the scraped data and loads it into MySQL/PostgreSQL. Created a configurable extraction framework supporting XPath, CSS selector, and regex rules. Implemented error handling and retry mechanisms to keep long-running jobs stable. Developed a Streamlit UI for task configuration, batch runs, monitoring, and data export.
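A minimal sketch of the infinite-scroll crawl step, assuming a local Chrome/chromedriver setup with Selenium 4; the scroll limits and pauses are illustrative defaults, not the project's actual configuration.

```python
import time

from bs4 import BeautifulSoup
from selenium import webdriver
from selenium.webdriver.chrome.options import Options


def fetch_dynamic_page(url: str, max_scrolls: int = 10, pause: float = 1.5) -> BeautifulSoup:
    """Render a dynamic page, scrolling until no new content loads, then parse it."""
    options = Options()
    options.add_argument("--headless=new")  # run without a visible browser window
    driver = webdriver.Chrome(options=options)
    try:
        driver.get(url)
        last_height = driver.execute_script("return document.body.scrollHeight")
        for _ in range(max_scrolls):
            # Scroll to the bottom and wait for lazily loaded content to render.
            driver.execute_script("window.scrollTo(0, document.body.scrollHeight);")
            time.sleep(pause)
            new_height = driver.execute_script("return document.body.scrollHeight")
            if new_height == last_height:  # no new content appeared; stop scrolling
                break
            last_height = new_height
        # Hand the fully rendered DOM to BeautifulSoup for parsing.
        return BeautifulSoup(driver.page_source, "html.parser")
    finally:
        driver.quit()
```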
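One way the configurable XPath/CSS/regex extraction framework could look: each field is described by a rule type and an expression, so new targets only need new rules rather than new code. The field names and selectors below are hypothetical examples; XPath is handled via lxml since BeautifulSoup itself does not evaluate XPath.

```python
import re

from bs4 import BeautifulSoup
from lxml import etree


def extract_fields(html: str, rules: dict[str, dict]) -> dict[str, list[str]]:
    """Apply a rule set of the form {field: {"type": ..., "expr": ...}} to raw HTML."""
    soup = BeautifulSoup(html, "html.parser")
    tree = etree.HTML(html)  # lxml tree for XPath rules
    results: dict[str, list[str]] = {}
    for field, rule in rules.items():
        kind, expr = rule["type"], rule["expr"]
        if kind == "css":
            results[field] = [el.get_text(strip=True) for el in soup.select(expr)]
        elif kind == "xpath":
            results[field] = [str(value).strip() for value in tree.xpath(expr)]
        elif kind == "regex":
            results[field] = re.findall(expr, html)
        else:
            raise ValueError(f"unknown rule type: {kind}")
    return results


# Example rule set (illustrative selectors, not taken from the project):
rules = {
    "title": {"type": "css", "expr": "h1.article-title"},
    "author": {"type": "xpath", "expr": "//span[@class='author']/text()"},
    "price": {"type": "regex", "expr": r"\$\d+\.\d{2}"},
}
```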
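The retry mechanism could be as simple as a decorator with exponential backoff around each fetch/extract call; the attempt count, backoff factor, and broad `Exception` catch below are assumptions for illustration.

```python
import functools
import logging
import time


def with_retries(max_attempts: int = 3, backoff: float = 2.0):
    """Retry the wrapped function with exponential backoff before giving up."""
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            for attempt in range(1, max_attempts + 1):
                try:
                    return func(*args, **kwargs)
                except Exception as exc:
                    if attempt == max_attempts:
                        raise  # exhausted all attempts; surface the error
                    wait = backoff ** attempt
                    logging.warning("attempt %d failed (%s); retrying in %.1fs", attempt, exc, wait)
                    time.sleep(wait)
        return wrapper
    return decorator
```

Usage would simply be decorating the page fetcher, e.g. `@with_retries(max_attempts=3)` above `fetch_dynamic_page`, so transient network or rendering failures do not abort a batch run.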
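For the load step of the ETL pipeline, a pandas + SQLAlchemy sketch is shown below. The connection string, table name, cleaning rules, and placeholder records are all assumptions; a MySQL target would use a `mysql+pymysql://...` URL instead.

```python
import pandas as pd
from sqlalchemy import create_engine

# Placeholder rows standing in for the extracted records.
records = [
    {"title": "Example article", "author": "Jane Doe", "price": "$9.99"},
]

# Placeholder PostgreSQL connection string.
engine = create_engine("postgresql+psycopg2://user:password@localhost:5432/crawler_db")

df = pd.DataFrame(records)
df = df.drop_duplicates().dropna(subset=["title"])        # example cleaning rules
df["price"] = df["price"].str.lstrip("$").astype(float)   # example normalization
df.to_sql("articles", engine, if_exists="append", index=False)
```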
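A rough sketch of the Streamlit task-configuration page, run with `streamlit run app.py`. The widget labels, rule options, and the placeholder DataFrame standing in for crawl results are illustrative; in the real app the button would trigger the crawl, extraction, and ETL steps above.

```python
import pandas as pd
import streamlit as st

st.title("Crawler Task Configuration")
url = st.text_input("Start URL", "https://example.com/list")
max_scrolls = st.slider("Max scrolls", 1, 50, 10)
rule_type = st.selectbox("Extraction rule type", ["css", "xpath", "regex"])
expression = st.text_input("Extraction expression", "h1.article-title")

if st.button("Run task"):
    # Placeholder result; the real app would run the configured crawl task here.
    results = pd.DataFrame([{"url": url, "rule": f"{rule_type}:{expression}"}])
    st.dataframe(results)
    st.download_button("Export CSV", results.to_csv(index=False), "results.csv")
```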