ECS 271,
Machine Learning: Homework #1
Due: 8 April 2004
Hand-in your assignments in the class.
Instructor: Prof. Rao Vemuri, rvemuri@ucdavis.edu
1. (10 points) Please look at the list of potential projects given elsewhere on this web site. Either pick one of those or come up with a suggestion of your own. Prepare your proposal along the lines described on the "Projects" page of this site. Your project has to do with one of the ML topics discussed in the text. You can pick any application domain.
When I grade your term project, I will consider both the quality of your work and the inherent difficulty (or potential) of the problem. The challenge is to pick a small enough topic that you can do within 9 weeks and a topic about which you lnow something.
2. (20 points) Read Chapter 1 and do Problem 1.3, page 18 of the text book
3. (20 points) You are given the following "user profile" problem for web browsing.
Domain |
Platform |
Browser |
Day |
Screen |
Continent |
Click? |
edu |
Mac |
Mozilla |
Mon |
XVGA |
America |
yes |
com |
Mac |
NetCom |
Tue |
XVGA |
America |
yes |
com |
PC |
IE |
Sat |
VGA |
Eur. |
no |
org |
Unix |
Net5 |
Wed |
XVGA |
America |
yes |
The attributes of the instances x (x
is an element of the instance space X) have the following possibiliities
- Domain: edy, com, org
-Platform: Mac, PC, Unix
-Browser: Mozilla, Netscape 5, Netscape Communicator, Microsoft IE
-Day: Monday - Sunday
-Screen: VGA, XVGA
-Continent: America, Europe, Africa, Asia, Australia
SKIP PROBLEM 3 BELOW FOR NOW. We will do it later
3. (20
points) Suppose you are a doctor treating a patient who occasionally suffers from
an allergic reaction. Your intuition tells you that the allergy is a direct
consequence of certain foods the patient eat, place where it is eaten, time of
day, day of week and the amount spent on food. So we gathered some data and the
data is summarized in the table below.
Number |
Restaurant |
Meal |
Day |
Cost |
Reaction |
1 |
Sam's |
breakfast |
Friday |
cheap |
yes |
2 |
Hilton |
lunch |
Friday |
expensive |
no |
3 |
Sam's |
lunch |
Saturday |
cheap |
yes |
4 |
Denny's |
breakfast |
Sunday |
cheap |
no |
5 |
Sam's |
breakfast |
Sunday |
expensive |
no |
Use the notation [?, ?, ?, ?] for the most general model, where the ? indicates a don't care condition (i.e., any attribute value in the corresponding position is allowed.)
What is the hypothesis that fits this data using (a) Find-S algorithm described in Section 2.4 and the Version Space method described in the text. Do not simply give me an answer; show me a trace of the steps that led you to the result. Add any commentary to clarify the steps.