profile
viewpoint
James Lamb jameslamb @saturncloud Chicago, IL Engineer @ Saturn Cloud. Economist at heart.

jameslamb/doppel-cli 7

Test framework for comparing the consistency of library APIs

jameslamb/bigforecast 4

Forecasting of globally important economic time series using lots of data and cool statistical stuff. Final project for "W251: Scaling Up!" in UC-Berkeley MIDS program

jameslamb/ci-tools 1

Miscellaneous scripts for use in continuous integration (CI) pipelines

jameslamb/add_static_search_to_pkgdown 0

WIP - experimental project aiming to add static search to a pkgdown site for an R package

jameslamb/avr6 0

R package to auto-gen R6 classes from AVRO schema files

jameslamb/bedazzle 0

Experimenting with Bazel to build a C library shared between R and Python packages

jameslamb/blinkdb 0

BlinkDB: Sub-Second Approximate Queries on Very Large Data.

PR opened microsoft/LightGBM

v3.2.0 release

This is a stable release. The next release may is 4.0.0, for many breaking changes. And it may take a long time to finish. So it better to fix the critical bugs in 3.2.0.

Please list the PR need to be merged in this release.

+1 -1

0 comment

1 changed file

pr created time in an hour

create barnchmicrosoft/LightGBM

branch : guolinke-patch-1

created branch time in an hour

pull request commentmicrosoft/LightGBM

CTR and Count values for categorical features

Hi @StrikerRUS , yeah, we may have 4.0.0 release, there are sereval on-going works need to in 4.0.0 as well. I think we can release a 3.2.0 stable release recently, then merge this PR.

shiyu1994

comment created time in an hour

pull request commentmicrosoft/LightGBM

CTR and Count values for categorical features

Please let me take a look at Python part.

I see this PR breaks bakward compatibility with a lot of C API functions for Dataset creation. Does it mean next release will be 4.0.0?

shiyu1994

comment created time in an hour

pull request commentmicrosoft/LightGBM

CTR and Count values for categorical features

@jameslamb can you check the R's part?

shiyu1994

comment created time in 2 hours

pull request commentmicrosoft/LightGBM

CTR and Count values for categorical features

Thank you @shiyu1994 , finally, we can merge this PR. The next step is to remove the old categorical features supports and update the documentations accordingly.

shiyu1994

comment created time in 2 hours

startedsuperhighfives/pika

started time in 2 hours

push eventsaturncloud/examples

hugo

commit sha 0e12f182d003a50f9114629f694335d7b8df72f5

recording user progress

view details

push time in 3 hours

startedgadenbuie/rstudio-global-2021-calendar

started time in 4 hours

fork yihui/cpp11

cpp11 is a header-only R package that helps R package developers handle R objects with C++ code.

https://cpp11.r-lib.org/

fork in 4 hours

pull request commentmicrosoft/LightGBM

[dask] Add type hints in Dask package

I'm going to try disabling this by setting autodoc_typehints = "none" in conf.py

Don't forget to bump minimum required Sphinx version then

https://www.sphinx-doc.org/en/master/usage/extensions/autodoc.html#confval-autodoc_typehints

https://github.com/microsoft/LightGBM/blob/b4b1b75ac582e99202062ec784269bd5b77f2018/docs/conf.py#L77

jameslamb

comment created time in 4 hours

startedbenbjohnson/litestream

started time in 4 hours

startedIntel-bigdata/oap-raydp

started time in 4 hours

fork jeroen/gdal

GDAL is an open source X/MIT licensed translator library for raster and vector geospatial data formats.

https://gdal.org

fork in 4 hours

push eventsaturncloud/examples

hugo

commit sha 79f017731a2869b855663b1d5f55a8492fd75182

removing quotes from links

view details

push time in 4 hours

push eventsaturncloud/images

saturn-automation

commit sha ab148bad4d0080a3627046bc90d3342e17aacd75

[release-images] Update .env_deps

view details

saturn-automation

commit sha 9c524a8203cc1c205b4f6bea265f50dea8ad220b

Merge branch 'release-2021.01.13' into env_deps/release-2021.01.13

view details

push time in 4 hours

issue commentmicrosoft/LightGBM

Error saving very large LightGBM models

Updated to python 3.8.7 and ran one job with keep_training_booster=True and one with the default. First one fails after trying to load the model after dumping it to disk with joblib. Second one fails after finishing boosting but before the code I have to save the model.

Forgot to add back all the print statements I had put into the lightgbm python so don't have any more details but I'm fairly confident this is the same issue as with 3.6.6. The pickle file is also 2,097,153KB (so very close to 2GB as before).

Also of note is this job took 5x longer than my 3.6.6 job so not sure if something changed. Looked through the params and they all look the same

CHDev93

comment created time in 4 hours

pull request commentmicrosoft/LightGBM

[dask] Add type hints in Dask package

@jameslamb Could you please replace all occurrences of dask array with dask Array fort the consistency with dask DataFrame, or dask Series?

jameslamb

comment created time in 5 hours

issue commentmicrosoft/LightGBM

[RFC] [dask] [docs] Override details in API docs that differ between dask and sklearn interfaces

I "like" how official Dask documentation deals with the problem! 😃

image

I think Option #1 is preferable for me. But how will it help to deal with different args? For example, client argument in Dask API.

jameslamb

comment created time in 5 hours

PR closed saturncloud/examples

Reviewers
Feat/quickstarts
+442 -1168

4 comments

5 changed files

hhuuggoo

pr closed time in 5 hours

pull request commentsaturncloud/examples

Feat/quickstarts

I'm not sure what the status of this branch is. If it's something we eventually hope to merge to main, I think the LightGBM example can be removed. The LightGBM quickstart is now readying from https://github.com/saturncloud/workshop-lightgbm-dask.

I'll remove the PR. this branch is where I'm messing around with stuff still. (hence the draft status)

hhuuggoo

comment created time in 5 hours

push eventsaturncloud/examples

hugo

commit sha 2ce6317a9baf253eece2ea1c94e0eb2e6bbb57fd

clearing outputs

view details

push time in 5 hours

push eventsaturncloud/examples

hugo

commit sha 59e1cae0e31cecc5e518c186f126e7de149a797f

hugo + julia experimenting on making shorters

view details

push time in 5 hours

issue commentdask/community

Dask Distributed Summit - speaker suggestions

I'd happily make intros to any of the people we've had on our live streams at Coiled. You can find them all here: https://coiled.io/videos/

Happy to chat about this also, @jsignell!

jsignell

comment created time in 5 hours

startedytoren/presentation-bsts

started time in 8 hours

issue commentmicrosoft/LightGBM

Issue with call to c_api Predict functions in multi threaded way from golang

@AlbertoEAF Sure, you can close this issue for now

tarunreddy1018

comment created time in 9 hours

pull request commentsaturncloud/examples

remove language about GPUs in CPU-only notebooks

Check out this pull request on  <a href="https://app.reviewnb.com/saturncloud/examples/pull/79"><img align="absmiddle" alt="ReviewNB" height="28" class="BotMessageButtonImage" src="https://raw.githubusercontent.com/ReviewNB/support/master/images/button_reviewnb.png"/></a>

See visual diffs & provide feedback on Jupyter Notebooks.


<i>Powered by <a href='https://www.reviewnb.com/?utm_source=gh'>ReviewNB</a></i>

jameslamb

comment created time in 10 hours

pull request commentsaturncloud/workshop-lightgbm-dask

remove language about GPUs

Check out this pull request on  <a href="https://app.reviewnb.com/saturncloud/workshop-lightgbm-dask/pull/2"><img align="absmiddle" alt="ReviewNB" height="28" class="BotMessageButtonImage" src="https://raw.githubusercontent.com/ReviewNB/support/master/images/button_reviewnb.png"/></a>

See visual diffs & provide feedback on Jupyter Notebooks.


<i>Powered by <a href='https://www.reviewnb.com/?utm_source=gh'>ReviewNB</a></i>

jameslamb

comment created time in 10 hours

issue commentmicrosoft/LightGBM

Issue with call to c_api Predict functions in multi threaded way from golang

@tarunreddy1018 can we close this issue?

If you want to see improvements regarding threading maybe open a new issue like a feature request regarding threading support with no locking and mention this issue and https://github.com/microsoft/LightGBM/issues/3675 so we know all the details we discussed.

tarunreddy1018

comment created time in 10 hours

issue commentmicrosoft/LightGBM

Getting random outputs when calling LightGBM Predict for single row function in a multi threaded environment (LGBM_BoosterPredictForMatSingleRowFast) method

@tarunreddy1018 can we close this issue?

If you want to see improvements regarding threading maybe open a new issue like a feature request and mention this issue and https://github.com/microsoft/LightGBM/issues/3751 so we know all the details we discussed.

tarunreddy1018

comment created time in 10 hours

more