Public
Authored by SIMONIN Matthieu

netflix.db (sqlite + python)

Initial tests to compute the similarity. Findings:

  • importing the data in the db is fine (approx 400000 users and 1/4 films ~ 4000 films)
  • calculating the joint is reasonnablee (at least for 1/4 of the films)
  • BUT iterating on the results will take too much time (400000^2 rows=(4.10^5)^2=16.10^10= 160 billions)
Edited
sim.py 6.31 KB
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment