Data mining | Data mining | Bloomsburg University of Pennsylvania


You are just hired as a data analyst (Note1) in the newly formed analytics department of RiiiD, which is a leading AI startup company specializing in providing learning resources and adaptive practices to English learners in South Korea. On the first day of your job, you are invited to attend a meeting with the business operation team in which you are briefed on the company’s platform and then you are handed over a dataset (‘EdNet’)  that was collected from this platform over the last two years from over 700K users . This dataset logged detailed user activities while they were interacting with systems.  Intrigued by the sheer amount of data collected, your manager is interested in how the analytics department can help to support the company’s missions to reimagine the learner’s experience using AI/ML/data analytics techniques. You are asked to spend some time to look into the data and prepare a brief to your manager. Specifically, your manager is looking for answers to the following questions. 


  1. (25      points) Who are the users?  To answer this question,      you will need to compile a user profile table (Table 1) with information      about users including
    1. Overall       practice volume and performance (e.g. # of questions answered, % of       questions answered correctly)
    2. Learning       activity (e.g. # lectures watched, # explanation read)
    3. Add       three additional metrics you would like to compute to describe users

Create a few plots to illustrate the information in the tables. Feel free to choose the appropriate type of plots you think is appropriate. 


2. (25 points) What are the questions/items? To answer this question, you will need to compile a question profile table (Table 2)  with information including

a. Question ID

b. Question Type

c. Number of times being practiced

d. Number of times answered correctly

Create a few plots to illustrate the information in the tables. Feel free to choose the appropriate type of plots you think is appropriate. 


3. (10 points) Design a modified metric of “accuracy” to fairly describe users’ ability by taking into account the difficulty level as derived from Table 2. Describe the procedure to compute the metrics.  Be sure to be specific so that interns can use your pseudo code to implement the metrics without much trouble. 

BONUS (10 points), implement the proposed metrics and plot a histogram of the metrics across all users (or subset of users of your choices). 


4. (30 points) Pick a user with a reasonable amount of activity (you will define the “reasonableness” and specify the selection criteria) and create a dashboard that consists of a series of plots to tell a story of this user’s activity patterns. For inspiration, you may look at the user dashboard for fitness tracker such as Fitbit. 


5. (10 points) Propose two tasks for your interns to work on. The first task is of unsupervised/descriptive type and a second one is of supervised/predictive task. Please provide clear specification of the tasks so that your interns can start to work right away. You should try to propose tasks that are not attempted in the existing work with this dataset. 

BONUS (10 points): propose a reinforcement learning task 


  • A      google slide deck summarizing the above findings in the format of plots or      tables or other contents as requested by your manager. Please label      clearly on the slide which question you are answering. There are no      lower/upper limits of the number of slides. Always keep the message      concise and effective and keep your audience in mind. In this case, it is      your manager. Please change the slides permission to editable, we will      make comments on your slides. 
  • Two      summary tables, Table 1 and 2
  • Please      deposit the above items into your own google folder you are asked to      create, under this folder, create a folder named homework1. Please deposit      the files there. 
  • Please      don’t deposit your finalized product earlier than October 4th. After you      finish, please post a link to your homework google folder on blackboard      submission. 

Dataset and Background Readings:

Please download the dataset from the following link. For this homework, you may only use KT4 (uncompressed size 6.4GB). But you may need to download other small lookup tables for the purpose of this assignment. Note: it may take a few hours to download/unzip the data files, please make sure you plan ahead

Please refer to this paper for details of dataset (mainly Section 1 and 2, up to page 7)

Software and Tools

You are free to use any software tools you feel comfortable with, which include but are not limited to Python, R, WEKA or Tableau. 

Calculate the price of your order

Select your paper details and see how much our professional writing services will cost.

We`ll send you the first draft for approval by at
Price: $36
  • Freebies
  • Format
  • Formatting (MLA, APA, Chicago, custom, etc.)
  • Title page & bibliography
  • 24/7 customer support
  • Amendments to your paper when they are needed
  • Chat with your writer
  • 275 word/double-spaced page
  • 12 point Arial/Times New Roman
  • Double, single, and custom spacing
  • We care about originality

    Our custom human-written papers from top essay writers are always free from plagiarism.

  • We protect your privacy

    Your data and payment info stay secured every time you get our help from an essay writer.

  • You control your money

    Your money is safe with us. If your plans change, you can get it sent back to your card.

How it works

  1. 1
    You give us the details
    Complete a brief order form to tell us what kind of paper you need.
  2. 2
    We find you a top writer
    One of the best experts in your discipline starts working on your essay.
  3. 3
    You get the paper done
    Enjoy writing that meets your demands and high academic standards!

Samples from our advanced writers

Check out some essay pieces from our best essay writers before your place an order. They will help you better understand what our service can do for you.

Get your own paper from top experts

Order now

Perks of our essay writing service

We offer more than just hand-crafted papers customized for you. Here are more of our greatest perks.

  • Swift delivery
    Our writing service can deliver your short and urgent papers in just 4 hours!
  • Professional touch
    We find you a pro writer who knows all the ins and outs of your subject.
  • Easy order placing/tracking
    Create a new order and check on its progress at any time in your dashboard.
  • Help with any kind of paper
    Need a PhD thesis, research project, or a two-page essay? For you, we can do it all.
  • Experts in 80+ subjects
    Our pro writers can help you with anything, from nursing to business studies.
  • Calculations and code
    We also do math, write code, and solve problems in 30+ STEM disciplines.

Frequently asked questions

Get instant answers to the questions that students ask most often.

See full FAQ
    See full FAQ

    Take your studies to the next level with our experienced specialists

    Hello, we are online 24/7. Leave a message we will reply instantly