Loading
0 Follower
0 Following
demarsylvain

Location

CA

Badges

4
2
2

Activity

Nov
Dec
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Mon
Wed
Fri

Ratings Progression

Loading...

Challenge Categories

Loading...

Challenges Entered

What data should you label to get the most value for your money?

Latest submissions

No submissions made in this challenge.

Machine Learning for detection of early onset of Alzheimers

Latest submissions

See All
graded 145471
graded 145458
graded 145420

Play in a realistic insurance market, compete for profit!

Latest submissions

See All
graded 125508
graded 125497
graded 125493

5 Problems 21 Days. Can you solve it all?

Latest submissions

See All
graded 124827
graded 124826
graded 124825

5 Puzzles 21 Days. Can you solve it all?

Latest submissions

No submissions made in this challenge.

5 Puzzles, 3 Weeks | Can you solve them all?

Latest submissions

No submissions made in this challenge.

Predict the winner through prior chess moves

Latest submissions

See All
graded 124827
graded 124826
graded 124825
Participant Rating
Participant Rating
demarsylvain has not joined any teams yet...

ADDI Alzheimers Detection Challenge

Sharing Best submissions

Over 3 years ago

Hi all,

This competition was great :sparkles::+1:, and even it’s over, i’m still curious about what other participants have done :thinking::grey_question:. Is your best submission (kept in the private LB) the one you expected? Do you think you find a feature others didn’t think about? Did you test ensemble models or other approaches?

Mine (20th position) is a quite simple R xgboost with:

  • pretty high learning rate: 0.1
  • low depth: 3
  • iterations: 750
  • features: 110

and weightings giving the following distribution (using all observations of both training and test datasets):

  • normal: 68.2%
  • post: 24.8%
  • pre: 7%

About feature engineering :hammer_and_wrench::link:, I gather several variables with mean and sum, like most of you probably. Maybe one feature (with good ranking) we never talk about it, is the ratio between the minute hand and the average distance from centre of the digits:
ratio_hand_Xi = minute_hand_length / mean_dist_from_cen__

Kudos to the organizers, and congrats to all winners and contributors! :trophy:

Do you trust your Leaderboard Score?

Over 3 years ago

Hi @moto ,

Just curious, your best submission (the one kept in the private LB) is it Python or R? :wink:

Motivation from the submission history

Over 3 years ago

Hi @Johnowhitaker,

Thank you for the graph. Did you use 3-digits submissions logloss? My feeling is that the shape will be quite different with no rounded results.

And it made me think: what would happen if more than 10 participants reach the top? (as there are 10 prizes).

Fictive example:

  • 1 β€” 0.590
  • 2 β€” 0.594
  • 3 β€” 0.598
  • 4 to 8 β€” 0.600
  • 9 to 12 β€” 0.601

Random variable(s)

Over 3 years ago

Hi,

I edited my notebook by adding new sections, and one of them is β€œRandom” :game_die:.

When I build a model, I always add a random variable (from uniform distribution) to detect useless variables. I consider that all variables ranked under the random one are explaining noise. So I remove them :scissors:, and run a simpler and faster model :dash:. As these variables were rarely used by the model, predictions and performances are very similar (but always a little lower :angry:). But the model is easier to implement.

Here, the rank is 39 !

image

It mean I should remove a lot of variables :astonished:. I tested it: 2 successive models, one with all variables (LB logloss: 0.6136) and one with only 38 variables (LB logloss: 0.6147). Very similar …

My feeling, and it was already discuss in a different post, is that random play a strong part in our submissions. Results and ranks will (randomly?) change on the 40% hidden dataset.

What do you think ? image

Not able to edit an existing notebook

Over 3 years ago

I just tested with β€œupload” and it worked!
Thank you.

I could now edit and add some new stuffs in the notebook.

Not able to edit an existing notebook

Over 3 years ago

I tested both.
Initially, using colab link didn’t work, that’s why I tested to download the file and upload it (and it worked). But now that I want to edit it, even the upload way doesn’t work. I will try today again.

Do you trust your Leaderboard Score?

Over 3 years ago

I hope you’ll invite me for a game if you win the playstation :wink:

Do you trust your Leaderboard Score?

Over 3 years ago

Thank’s. Don’t hesitate to like it :upside_down_face:, this will increase my chance for the Community Price, even if I participated lately.

For the R notebook, you have to install R in order to see β€œR kernel” at the top right of your notebook.

Not able to edit an existing notebook

Over 3 years ago

Hi,

I shared a notebook yesterday, by uploading a .ipynb file downloaded from Google Colab. It edited this Colab, download it again, and try to edit the notebook, but I got the following error:

β€œUnable to process the notebook. Check the file and try again”.

I tried with the file that worked yesterday, and I got the same error. I tried several time, on Edge and Chrome.

Do you know what could happen? Thanks in advance.

Add a 4th class?

Over 3 years ago

You probably noticed in the dataset some β€œnormal” observations with very strange clocks :alarm_clock: (less than 4 digits, no hands, etc.). In the training dataset, it’s almost 50% of β€œnormal” observations.

One assumption is that these patients are affected by a disease :microbe: which is not Alzheimer (Parkinson, Lewy-body, …), so considered as β€œnormal” in our classification puzzle.

Sharing this particularity to the model :handshake: could be helpful … I tried to split the main class in 2:

  • β€œtrue normal”
  • β€œanormal”

What do you think of that? Thinking Face on JoyPixels

Do you trust your Leaderboard Score?

Over 3 years ago

Interesting discussion!

I was working on a notebook that shares some throughs similar of yours @etnemelc . I am convinced that most of the time I managed to improve my leaderboard score (from 0.610 to 0.606), I was overfitting this specific dataset. The difference are so little, it could clearly be noise, especially on β€œsmall” datasets like those.

I agree with you @michael_bordeleau, I wonder if observations came from the same source. I find the split in number of observations (33,000 for the train, 362 for test and 1,500 for leaderboard) really intriguing … is it to make the competition harder, or is there a reason behind? I’m curious to see.

πŸ‘‹ Welcome to the ADDI Alzheimer's Detection Challenge!

Over 3 years ago

change your username for β€œbordeleau_michael” … :wink: :stuck_out_tongue:

FastAI Starter Notebook and Class Balance

Over 3 years ago

Excellent notebook, thank you for sharing!

One point that makes me think, is the size of test dataset (only 362 obs.), meaning significant variability of the criteria. Some logloss of your notebook are so closed, they could be considered as similar, with no guarantee order will stay the same on leaderboard. I experimented it, with a model that decrease test dataset logloss (while never seen it), but increase leaderboard one … :slight_smile:

My feeling is that current top 10 are the ones, as @no_name_no_data said, that slight overfit leaderboard dataset.

I am curious to see our this will evolve, I hope some participants will found tricks that generally reduce logloss.

Did anyone get R to work?

Over 3 years ago

(post withdrawn by author, will be automatically deleted in 24 hours unless flagged)

Not able to get starter notebook

Over 3 years ago

Hi,
I followed the instructions of several discussions, and the first lines of code seem to work (no warning nor error). I managed to download aicrowd.R file, to download and load R libraries, load the datasets and made some explorations.

But I got an error when I tried to linked with my Aicrowd account.

aicrowd_login(AICROWD_API_KEY)

running command … had status 1

I copy-paste directly from my aicrowd page, so I don’t think I made an error. Do you know what could be the problem?

Load model

Over 3 years ago

Hi,

Just to be sure I correctly understand: we don’t have access to the test dataset (the one used for Leaderboard, that contains 1,473 rows), even its features, right? Our model will be loaded and applied on it?

Thank you.

Did anyone get R to work?

Over 3 years ago

Hi,
I followed the instructions shared by @michael_bordeleau, and the first lines of code seem to work (no warning nor error). I managed to download aicrowd.R file.
I can’t reach CRAN url to download a library, but I can load already installed ones, load the datasets and made some explorations.

But I got an error when I tried to linked with my Aicrowd account.

aicrowd_login(AICROWD_API_KEY)

running command … had status 1

I copy-paste directly from my aicrowd page, so I don’t think I made an error. Do you know what could be the problem?

Did anyone get R to work?

Over 3 years ago

(post withdrawn by author, will be automatically deleted in 24 hours unless flagged)

Common issues for Windows users

Over 3 years ago

Hi,

I managed to open the R starter notebook (in Jupyter, on Windows VM), but I got a warning message at the first line of code, which is:

cat(system('curl -sL …

I didn’t edit the code, just click on β€œRun”.

The warning is:

aicrowd.R had status 6

is it normal? it generates an error at the next chunk, when trying to source this aicrowd.R file.

demarsylvain has not provided any information yet.

Notebooks

Create Notebook