recently i started an analysis project, using #lichess data. these screenshots were just downloading this year so far and uncompressing a month 🥵
now i have to preprocess such a BIG data 😅
recently i started an analysis project, using #lichess data. these screenshots were just downloading this year so far and uncompressing a month 🥵
now i have to preprocess such a BIG data 😅
6h 😶 fortunately it was done in one night
such a simple "tweak" (much simpler than multiprocessing.Pool and anything else i've tried these days) make a decisive improvement in performance ✌️
from this (1% each ~3h):
found a solution. #grep to the rescue! 🦸
https://github.com/jartigag/chess-blunders/blob/master/data/raw/pre_preprocess.sh
@jartigag let me tell you something: one of the best ways to reduce the processing time is to remove the prints :
@ekaitz_zarraga yeah, i could remove them now that i have an acceptable preprocessing method 👌
but look at the times in that case: i need at least one print each 3 hours to know that it keeps working! 😂
tiflolinux.org - GNU Social is a social network, courtesy of tiflolinux.org. It runs on GNU social, version 2.0.1-beta0, available under the GNU Affero General Public License.
All tiflolinux.org - GNU Social content and data are available under the Creative Commons Attribution 3.0 license.