合肥生活安徽新聞合肥交通合肥房產生活服務合肥教育合肥招聘合肥旅游文化藝術合肥美食合肥地圖合肥社保合肥醫院企業服務合肥法律

        297.201代做、代寫python編程語言
        297.201代做、代寫python編程語言

        時間:2025-03-25  來源:合肥網hfw.cc  作者:hfw.cc 我要糾錯



        297.201-2025 Semester 1 Massey University
        1
        Project 1
        Deadline: Hand in by midnight March 25th 2025
        Evaluation: 33% of your final course grade.
        Late Submission: Refer to the course guide.
        Work This assignment is to be done individually.
        Purpose: Gain experience in performing data wrangling, data visualization and introductory data 
        analysis using Python with suitable libraries. Begin developing skills in formulating a 
        problem from data in a given domain, asking questions of the data, and extracting 
        insights from a real-world dataset. Learning outcomes 1, 3 and 4 from the course 
        outline.
        Please note that all data manipulation must be written in python code in the Jupyter Notebook environment. No marks 
        will be awarded for any data wrangling that is completed in excel. 
        Also, demonstrating your own skills, critical thinking and understanding is still the most important aspect of assessment, 
        so you must keep a record of all your AI prompts and outputs, and submit these as an appendix to your completed 
        assessment. Failure to keep an accurate record of your AI use for an assessment may be a breach of the Use of Artificial 
        Intelligence in Assessment Policy, and of the Academic Integrity Policy. Refer to Stream as to the level of Generative AI 
        use that is permitted and what this means. This particular assignment is designated as permitting ‘AI Planning’.
        Download this Word document to guide you in creating your AI use statement.
        In addition, do not copy the work of others – there are many ways to solve the problems below, and we expect that no 
        two answers will produce the same code. Copying the work of others (even if object/variable names are changed) will be 
        considered plagiarism.
        The dataset problem domain: Analysis of Professional Tennis Match Results (ATP – men, or WTA - women)
        You are asked to download a curated dataset on the topic of professional tennis and use your data wrangling and 
        visualisation skills covered I the first few weeks, to answer a series of analytics questions. You do not have to be an 
        expert in tennis to answer these questions and solve this assignment, but you will need to cover some basics. Once you 
        enter the workforce as a data scientist, you will need to quickly learn about domains previously unknown to you in order 
        to perform your job, so this is an exercise in practicing how to do this. Some helpful information on tennis and various 
        tournaments can be found here:
        https://www.olympics.com/en/news/tennis-rules-regulations-how-to-play-basics
        https://www.tennisleo.com/basic-tennis-rules/
        https://thetennisbros.com/tennis-tips/what-are-the-major-tournaments-in-tennis/
        Keep in mind that some questions can be interpreted in slightly different ways and so depending on your interpretation 
        and assumptions, you might come up with slightly different answers – this is perfectly acceptable. The purpose of this 
        assignment is not to answer all questions in the ‘right’ way, but to develop your technical and problem solving skills. 
        Therefore, you will not be marked down for having slightly different answers as long as you have stated your 
        assumptions clearly and have gone about in a technically sound and reasonable manner in answering the questions.
        The datasets we’re after can be found below.
        Dataset source: http://tennis-data.co.uk/alldata.php
        You will need to download this dataset from home since Massey’s filter restricts access to this website due to its 
        categorization as a gambling site. 
        297.201-2025 Semester 1 Massey University
        2
        If for some reason, you’re unable to download it from home, please let us know. As a an alternatively, you can download 
        the similar data from a GitHub source here, but there are some columns not present in this data, so we would prefer 
        that you use the tennis-data.co.uk source instead:
        For men: https://github.com/JeffSackmann/tennis_atp (use only: atp_matches_<xxxx>.csv)
        For women: https://github.com/JeffSackmann/tennis_wta (use only: wta_matches_<xxxx>.csv)
        Task 1: Wrangling, reshaping, EDA (20 marks)
        - Collect data covering 10 years (2015 - 2024) from the above website. Read each excel dataset using Python and 
        combine into a single dataset.
        - Check that all the data has been read. Check that all the data in the combined dataset is in order based on the 
        date column. 
        - What other data-checking operations could you perform to make sure that the data is ready for analysis? Use 
        various approaches to perform sanity checking on the data, including some plotting and discuss.
        - Create EDA 6-8 visualisations of the dataset and explain each one. Be curious and creative. Ensure that the plots 
        are clean and interpreted.
        Task 2: Analysis questions and plotting (20 marks)
        - Who are the top 10 players by total wins in the dataset, and how many wins do they have? Plot and discuss this.
        - Who are the top 10 players according to the largest number of First Round tournament losses across all 10 
        years? Plot and discuss this.
        297.201-2025 Semester 1 Massey University
        3
        - Identify the 5 biggest upsets for each year in the dataset based on ranking differentials. List player names, 
        rankings, winner/loser, score, and tournament name and what the difference in the rankings was at the time – a 
        table is fine. 
        - Who were the top 10 players at year-end in 2019? How have their rankings changed over the period of 2015 to 
        2024? Plot and discuss this.
        Task 3: Advanced analysis questions (20 marks)
        - Which tournaments have had on average the most upsets (where a lower-ranked player defeated a higher ranked player)? List the top 10 and plot their averages.
        - Determine who the top 10 ranked players (by ranking) were at the end of 2024. Then calculate their head-to head win-loss record against each other for all the matches they played in 2024. Present this result and discuss.
        - List the top 5 players who had the longest winning streaks between 2015 – 2024. List their names, the lengths 
        of their winning streaks and the year(s) in which they occurred.
        - In tennis, each set is played first to 6, but sometimes it is played to 7. A tiebreak is a set that someone wins 7-6 
        and is different to someone winning a set 7-5. Tiebreaks are stressful and some players perform better than 
        others in tiebreaks. Count how many tiebreaks each player in the entire dataset has played. Then, calculate the 
        percentage of tiebreaks that each player has won. List the top 10 players according to the percentage of 
        tiebreaks won.
        Task 4: Open questions and analyses (30 marks)
        - Come up with 3 more questions of your own. 
        - Try to demonstrate the usage of more advanced data wrangling functionalities as you answer the questions like 
        group by, pivots etc…
        - Create several plots and discuss them.
        A Jupyter notebook template will be provided for you. Please use it for this assignment.
        Hand-in: Submit a single zipped file via the Stream assignment submission link. It should contain one notebook with all 
        the answers embedded, and an HTML version of your notebook also with its output showing as well in case we have 
        issues running your code. Also, you must submit the AI use statement.
        Use of Generative AI in This Assignment
        In industry, AI and online resources are commonly used to improve efficiency and productivity. However, at university, 
        the primary goal is to develop your understanding and ability to work through problems independently. We need you to 
        master these skills first, so that you will be able to use the AI tools more effectively and efficiently later on. This means 
        that while AI can be a helpful tool for learning, it should not replace your own thought process or problem-solving efforts
        as it will actually short-circuit your learning and development
        .
        Allowed Uses of AI for assignment 1
        You may use AI along the lines of the following prompts to:
        • Understand background knowledge related to professional tennis, tournament structures, and general 
        concepts about tennis matches. 
        o Example: “Explain the rules of a tennis match and how scoring works.”
        • Seek feedback on your problem-solving approach without directly generating code. 
        o Example: "I plan to find the top 10 players by total wins using pandas. Does this approach make sense?"
        • Clarify error messages or debugging hints, as long as you are the one writing the code. 
        o Example: “Why am I getting a KeyError in pandas when trying to merge two dataframes?”
        • Find alternative ways to visualize data for inspiration, but not for direct copying. 
        o Example: “What are common ways to visualize win-loss records in sports data?”
        297.201-2025 Semester 1 Massey University
        4
        Prohibited Uses of AI for assignment 1
        You must NOT:
        • Copy AI-generated code directly into your submission.
        • Input the assignment questions directly into AI and use its responses as your own.
        • Paraphrase AI-generated explanations/code and present them as original work.
        • Ask AI to write step-by-step solutions to any of the assignment tasks.
        • Academic Integrity & AI Use Statement

        請加QQ:99515681  郵箱:99515681@qq.com   WX:codinghelp

        掃一掃在手機打開當前頁
      1. 上一篇:代做Tiny Calculator parsing with YACC
      2. 下一篇:代寫COMP9021、代做Python程序設計
      3. 無相關信息
        合肥生活資訊

        合肥圖文信息
        急尋熱仿真分析?代做熱仿真服務+熱設計優化
        急尋熱仿真分析?代做熱仿真服務+熱設計優化
        出評 開團工具
        出評 開團工具
        挖掘機濾芯提升發動機性能
        挖掘機濾芯提升發動機性能
        海信羅馬假日洗衣機亮相AWE  復古美學與現代科技完美結合
        海信羅馬假日洗衣機亮相AWE 復古美學與現代
        合肥機場巴士4號線
        合肥機場巴士4號線
        合肥機場巴士3號線
        合肥機場巴士3號線
        合肥機場巴士2號線
        合肥機場巴士2號線
        合肥機場巴士1號線
        合肥機場巴士1號線
      4. 短信驗證碼 酒店vi設計 NBA直播 幣安下載

        關于我們 | 打賞支持 | 廣告服務 | 聯系我們 | 網站地圖 | 免責聲明 | 幫助中心 | 友情鏈接 |

        Copyright © 2025 hfw.cc Inc. All Rights Reserved. 合肥網 版權所有
        ICP備06013414號-3 公安備 42010502001045

        亚洲精品蜜桃久久久久久| 国产精品视频一区二区三区不卡 | 日韩精品电影一区| 久久久久久亚洲精品影院| 亚洲AV无码精品蜜桃| 久久99国内精品自在现线| 一本久久a久久精品综合夜夜| 亚洲国产精品一区二区第一页| 精品国产一区二区三区久久| 国产亚洲精品资源在线26u| 国产在线精品无码二区| 久久国产精品成人免费 | 亚洲AV无码精品国产成人| 成人精品一区二区户外勾搭野战| 国产成人无码精品久久久露脸 | 国产精品免费看久久久无码| 亚洲AV永久无码精品成人| 91麻豆精品国产| aaaaaa精品视频在线观看| 国产高清国内精品福利99久久| 日韩加勒比一本无码精品| 成人日韩熟女高清视频一区| 永久免费无码日韩视频| 在线日韩日本国产亚洲| 久久精品一区二区三区日韩| 日韩小视频在线观看| 日韩一区二区在线播放| 亚洲欧美日韩一区二区三区 | 国内精品伊人久久久久网站| 成人区人妻精品一区二区不卡视频 | 色噜噜精品视频在线观看| 2020亚洲男人天堂精品| 久久精品免费全国观看国产| 亚洲国产精品无码久久| 日韩精品久久一区二区三区| 99国产精品免费观看视频| 中文字幕国产精品| 99热这里只有精品9| 精品人妻少妇一区二区三区不卡| 精品久久久久久久免费加勒比| 国产精品久久久久久久伊一|