1 exabyte (EB) = 1,000,000 terabyte (TB).↩︎
no C++ or FORTRAN API calls.↩︎
and sometimes milliseconds.↩︎
numba
, or even Rcpp
or cython
?↩︎
have a look at some deep learning libraries in GitHub and you’ll be surprised that Python is only 25%-33% of the codebase.↩︎
this is mostly a Python ecosystem problem, and while R doesn’t suffer heavily from this, it’s not blue skies either.↩︎
or with little effort necessary.↩︎
sometimes even faster than C.↩︎
a petaflop is one thousand trillion, or one quadrillion, operations per second.↩︎
LLVM stands for Low Level Virtual Machine, you can find more at the LLVM website (http://llvm.org).↩︎
if you like to learn more about how Julia is designed you should definitely check Bezanson et al. (2017).↩︎
please note that the Julia results depicted above do not include compile time.↩︎
or, that the memory address pointers to the elements in the column are stored next to each other.↩︎
it is easier because first
and last
also work on many other collections, so you need to remember less.↩︎
According to Bogumił Kamiński (lead developer and maintainer of DataFrames.jl
) on Discourse (https://discourse.julialang.org/t/pull-dataframes-columns-to-the-front/60327/5).↩︎
thanks to Sudete on Discourse (https://discourse.julialang.org/t/pull-dataframes-columns-to-the-front/60327/4) for this suggestion.↩︎
also notice that regular data (up to 10 000 rows) is not big data (more than 100 000 rows). So, if you are dealing primarily with big data please exercise caution in capping your categorical values.↩︎
we are using the LinearAlgebra
module from Julia’s standard library.↩︎