profile
viewpoint
Arjen P. de Vries arjenpdevries Radboud University Nijmegen, The Netherlands http://www.cs.ru.nl/~arjen/ Computer scientist and entrepeneur specialized in information access & integration of IR and DB. Organisations: @rubigdata @informagi @WGI-NL

arjenpdevries/ru-bsc-thesis-template 2

A latex template for a RU BSc thesis

arjenpdevries/clueweb 1

Hadoop tools for manipulating ClueWeb collections

arjenpdevries/preface-book-alan-said 1

Contribution to book project

arjenpdevries/blockly-games 0

Games for tomorrow's programmers.

arjenpdevries/boot2docker 0

Lightweight Linux for Docker

arjenpdevries/capitalizer 0

A demonstration on a potentially misleading warning in Rust 1.33 stable

issue openedMonetDB/MonetDB

Improve error message regarding 3-level SQL names

Describe the bug When a query (seems to) contain a 3-level SQL name, i.e. foo.bar.baz, it is rejected, with a message saying:

TODO: column names of level >= 3

the line of code is in sql/server/rel_select.c:

		return sql_error(sql, 02, SQLSTATE(42000) "TODO: column names of level >= 3");

this bug is not about the missing feature, but about an error message. It should be clearer - as the query author may not realize what exactly they did wrong: They may not recognize the term "name level", or even not intended to have typed in one of the periods etc.

Please change it to something like:

Use of 3-level SQL names (database_name.table_name.column_name) is not supported yet. You may only refer to tables within the database to which you have connected.

created time in 2 hours

pull request commentcastorini/anserini

add dpr nq-dev topic & topic reader

Codecov Report

Merging #1470 (bc88dd8) into master (a31d386) will increase coverage by 0.09%. The diff coverage is 100.00%.

Impacted file tree graph

@@             Coverage Diff              @@
##             master    #1470      +/-   ##
============================================
+ Coverage     45.94%   46.03%   +0.09%     
- Complexity      812      815       +3     
============================================
  Files           194      195       +1     
  Lines         10167    10185      +18     
  Branches       1347     1348       +1     
============================================
+ Hits           4671     4689      +18     
  Misses         5103     5103              
  Partials        393      393              
Impacted Files Coverage Δ Complexity Δ
.../anserini/search/topicreader/DprNqTopicReader.java 100.00% <100.00%> (ø) 3.00 <3.00> (?)
...va/io/anserini/search/topicreader/TopicReader.java 79.78% <100.00%> (+0.43%) 11.00 <0.00> (ø)
...in/java/io/anserini/search/topicreader/Topics.java 100.00% <100.00%> (ø) 2.00 <0.00> (ø)

Continue to review full report at Codecov.

Legend - Click here to learn more Δ = absolute <relative> (impact), ø = not affected, ? = missing data Powered by Codecov. Last update a31d386...bc88dd8. Read the comment docs.

MXueguang

comment created time in 5 hours

push eventMonetDB/MonetDB

Pedro Ferreira

commit sha 5ba8e22d2fbd5f1c59c7e180a91ecca681c3a4d5

Ported Jun2020-mmt fixes into nospare

view details

Aris Koning

commit sha 5d10cb303c7a9a37cf0a219315c23fe47da6a82e

Make rollback respect safepoints.

view details

push time in 5 hours

pull request commentcastorini/anserini

add dpr nq-dev topic & topic reader

yes, we should. will add in this PR

MXueguang

comment created time in 7 hours

push eventMonetDB/MonetDB

Pedro Ferreira

commit sha f578b10b72f253a6a225adbd475603e079fe47e8

My fix was not completely right. Undoing some of the changes. I still have to check waht to do with the between predicate on projections

view details

Pedro Ferreira

commit sha 154f8de32920a21147f755cec611b000a84fa44a

Forgot the other case

view details

push time in 7 hours

push eventMonetDB/MonetDB

Pedro Ferreira

commit sha d820a3712463766ee9d1010d9df8ffcb30ce3b2f

Tackling tpcds q72 performance degradation. At rel_push_func_down, create projection only on the side where functions will be pushed down

view details

Pedro Ferreira

commit sha c1b2147fffe2f9a36f9a38e85b5da109fc1b4d6a

Undoing one of my changes. This check is needed (fix tpcds q8 crash)

view details

Pedro Ferreira

commit sha f578b10b72f253a6a225adbd475603e079fe47e8

My fix was not completely right. Undoing some of the changes. I still have to check waht to do with the between predicate on projections

view details

Pedro Ferreira

commit sha b58785d1396773af3a0afd1edfe29a147fcb6fa3

Merged with Oct2020

view details

push time in 7 hours

push eventMonetDB/MonetDB

Pedro Ferreira

commit sha d820a3712463766ee9d1010d9df8ffcb30ce3b2f

Tackling tpcds q72 performance degradation. At rel_push_func_down, create projection only on the side where functions will be pushed down

view details

Pedro Ferreira

commit sha c1b2147fffe2f9a36f9a38e85b5da109fc1b4d6a

Undoing one of my changes. This check is needed (fix tpcds q8 crash)

view details

Pedro Ferreira

commit sha f578b10b72f253a6a225adbd475603e079fe47e8

My fix was not completely right. Undoing some of the changes. I still have to check waht to do with the between predicate on projections

view details

Pedro Ferreira

commit sha b58785d1396773af3a0afd1edfe29a147fcb6fa3

Merged with Oct2020

view details

push time in 7 hours

PR opened castorini/anserini

add dpr nq-dev topic & topic reader
+8870 -2

0 comment

6 changed files

pr created time in 7 hours

push eventcwida/duckdb

Mark Raasveldt

commit sha 8f468d429ec11653202b2cb8dfe87bb01008f001

Add support for Lambda functions to parser

view details

Mark Raasveldt

commit sha f8a702564d54682401ef3805d577298348724bba

Remove redundant blank line

view details

Mark Raasveldt

commit sha d818a8b9abff27902bed93bfcedfb4d81d1e029e

Add support for lambda functions with multiple parameters

view details

Mark Raasveldt

commit sha 2c8b106da531b1dff76d5e90efc33eb1eeeb7c4d

Merge branch 'master' into lambdas

view details

Mark Raasveldt

commit sha 80d723ad404771c6dc9fa028af5a254b60165e43

Remove print and increase lambda operator precedence further so lambdas such as x -> x > 10 AND z < 20 are correctly parsed

view details

Mark Raasveldt

commit sha 132ac405e5d966c83da86bb7d12a4ec6beb8fa02

Fix for single file compilation

view details

Mark Raasveldt

commit sha e5a64a52f56083f989e2675a287d28f2f96adf69

Automatically replace generated calls to fprintf and exit in src_backend_parser_scan.cpp to avoid triggering R CRAN warnings

view details

Mark

commit sha f79660c66b8d97e598e30390f6b638dd5ffd6ad2

Merge pull request #1313 from Mytherin/lambdas Add support for Lambda functions to parser

view details

push time in 8 hours

PR merged cwida/duckdb

Add support for Lambda functions to parser

This PR adds basic support for lambda functions to the parser. They are not supported anywhere else yet and not bound yet, but the plan is to use them later on in functions that can apply to lists.

Lambda expressions look like this:

class LambdaExpression : public ParsedExpression {
public:
	string capture_name;
	unique_ptr<ParsedExpression> expression;
};

Example syntax:

SELECT map(i, x -> x + 1) FROM (VALUES (list_value(1, 2, 3))) tbl(i);
+13456 -13161

9 comments

30 changed files

Mytherin

pr closed time in 8 hours

push eventch007m/home-devices

Charles Moulliard

commit sha 8010fc43994eda7778ff0efada48c69e911dbe85

Add missing package to access read-write the partition HFSplus

view details

push time in 8 hours

push eventMonetDB/MonetDB

Niels Nes

commit sha 72985a84535f2ecf2fe79b7f706c74f682313c0d

add missing cleanup functions

view details

Aris Koning

commit sha bcf522dfdb5d3ebfbdc065ed82727b441d7f04e7

Introduce (soon to be) atomic getters and setters.

view details

push time in 8 hours

push eventch007m/home-devices

Charles Moulliard

commit sha eaf1f020db11eac4bb4de9f9494f5eb92d40e1bf

Add all in one steps

view details

push time in 8 hours

pull request commentcwida/duckdb

Parser clean up: no longer transform multi-node CASE and NULLIF in the transformer

Ha I think I may have been responsible for those in the first place. All that hard work 😭. Great this is cleaned up!

Mytherin

comment created time in 9 hours

push eventch007m/home-devices

Charles Moulliard

commit sha 21be93020db1cd286038b3878e31d93d06fbee0e

Add all in one steps

view details

push time in 9 hours

push eventMonetDB/MonetDB

Niels Nes

commit sha a3e936cdaf93978d96f467714df2d4465a8b6cb1

use ATOMIC_GET to retrieve atomic value

view details

Niels Nes

commit sha f74edd2fd2dcd6d2193b93bc4196f004a4a4f448

add some more destroy functions

view details

push time in 9 hours

PR opened cwida/duckdb

Parser clean up: no longer transform multi-node CASE and NULLIF in the transformer

Previously a CASE statement with multiple WHEN ... THEN ... nodes would be transformed into a chain of case statements in the transformer. In this PR we modify the case so that this step is only performed during binding. The case statement now looks like this after the transformer phase:

struct CaseCheck {
	unique_ptr<ParsedExpression> when_expr;
	unique_ptr<ParsedExpression> then_expr;
};

//! The CaseExpression represents a CASE expression in the query
class CaseExpression : public ParsedExpression {
	vector<CaseCheck> case_checks;
	unique_ptr<ParsedExpression> else_expr;
};

NULLIF(a, b) used to be transformed into CASE WHEN a=b THEN NULL ELSE a in the transformer phase. In this PR I have changed this to instead be transformed to a regular function call nullif(a, b) and have created a macro that does this transformation during binding instead.

+206 -95

0 comment

13 changed files

pr created time in 9 hours

issue openedfacebookresearch/wav2letter

Can I integrate it with my college website to transcribe the lecture videos online?

I am new to ASR (Automatic Speech Recognition), I am working on a college website where professors can upload their lectures. I want to integrate an automatic speech transcript generator so that users can turn on the captions while streaming. While searching for open source options, I came across wav2letter. Could anyone please tell me whether it is possible to do that and if Yes, could you please guide me?

created time in 9 hours

pull request commentcwida/duckdb

Add support for Lambda functions to parser

Nice. Thank you, Mark.

Mytherin

comment created time in 11 hours

push eventMonetDB/MonetDB

Sjoerd Mullender

commit sha 8abd06bc07eaf44a17a3e4edaa83f583b49a7056

Added missing test files.

view details

push time in 11 hours

push eventMonetDB/MonetDB

Sjoerd Mullender

commit sha 8abd06bc07eaf44a17a3e4edaa83f583b49a7056

Added missing test files.

view details

push time in 11 hours

push eventMonetDB/MonetDB

Niels Nes

commit sha c1a55e6727ae135548d3ce55c744ab45444fc188

use modern initialisation (needed for ATOMIC_VAR_INIT)

view details

Niels Nes

commit sha 46507a2995bbf643ca8064fc014776b99cf92c16

small compilation fix

view details

Sjoerd Mullender

commit sha 8abd06bc07eaf44a17a3e4edaa83f583b49a7056

Added missing test files.

view details

Niels Nes

commit sha 7c2334391cec3ed1f990a704247fa79727f92b5d

make sure temporary tables handle updates properly

view details

Niels Nes

commit sha 1da03f4bf9e336d0d157a0da74856bb997e82591

merged with default

view details

Aris Koning

commit sha 990ed38569cbf5effa74c0941e8f5b0d330158c4

Fix rollback.

view details

push time in 11 hours

push eventMonetDB/MonetDB

Pedro Ferreira

commit sha c1b2147fffe2f9a36f9a38e85b5da109fc1b4d6a

Undoing one of my changes. This check is needed (fix tpcds q8 crash)

view details

push time in 11 hours

pull request commentcwida/duckdb

Add support for Lambda functions to parser

All the changes are implemented now, lambda functions now look like this:

class LambdaExpression : public ParsedExpression {
	vector<string> parameters;
	unique_ptr<ParsedExpression> expression;
};

I also fixed several operator precedence rules so that lambda arrows take priority over other operators, which causes e.g. x -> x + 1 AND y + 1 to be correctly parsed as x + 1 AND y + 1 without requiring brackets.

select map(i, (x, y) -> x + y) from tbl;
-- lambda: parameters { x, y }, function: x + y
select map(i, x -> x + 1) from (values (list_value(1, 2, 3))) tbl(i);
-- lambda: parameters { x }, function: x + 1
select map(i, x -> x + 1 AND y + 1) from (values (list_value(1, 2, 3))) tbl(i);
-- lambda: parameters { x }, function: x -> x + 1 AND y + 1
Mytherin

comment created time in 11 hours

issue commentcwida/duckdb

Data conversion: Virtual table streaming and exporting with EXPORT DATABASE

Excellent. We missed that it is possible with COPY but have found that all the info we needed is here. We can probably close this issue.

dforsber

comment created time in 11 hours

push eventcastorini/anserini

Larry Li

commit sha a31d386e23d399da8d2841d45b9e500f71fe1c9b

Relicate MS MARCO passage & document ranking experiment with Anserini (#1469)

view details

push time in 11 hours

PR merged castorini/anserini

Relicate MS MARCO passage & document ranking experiment with Anserini

Environment

  • Google Colab
  • Java 11
  • Python 3.6
  • Maven 3.6.0

Challenges

  • Initially try to build Maven and replicate the experiment on Windows 10. However, the Java 11 JDK installer is not available for download from the link on Oracle's website.
  • Initial Maven build failed, then has to reset JAVA HOME as os.environ["JAVA_HOME"] = "/usr/lib/jvm/java-11-openjdk-amd64" and the build is successful.

Results

Passage rank Capture1

Doc rank Capture2

The replicated results are exactly the same as the documentation mentions.

+2 -0

1 comment

2 changed files

larryli1999

pr closed time in 11 hours

push eventMonetDB/MonetDB

Sjoerd Mullender

commit sha ffd9d07f2d1da6a0bbed7ce9a946ce85f3d3fd50

Remove more unused old test files.

view details

Sjoerd Mullender

commit sha 8389c01e4b7f8fa9dcdf51cff3720df0011b4842

Remove unused .sql files.

view details

push time in 12 hours

push eventMonetDB/MonetDB

Sjoerd Mullender

commit sha ffd9d07f2d1da6a0bbed7ce9a946ce85f3d3fd50

Remove more unused old test files.

view details

Sjoerd Mullender

commit sha 8389c01e4b7f8fa9dcdf51cff3720df0011b4842

Remove unused .sql files.

view details

push time in 12 hours

push eventMonetDB/MonetDB

Sjoerd Mullender

commit sha ffd9d07f2d1da6a0bbed7ce9a946ce85f3d3fd50

Remove more unused old test files.

view details

Niels Nes

commit sha 65a9bde32dd943ca71a270e20fef526e8fe995e7

output errors on write conflicts

view details

Sjoerd Mullender

commit sha 8389c01e4b7f8fa9dcdf51cff3720df0011b4842

Remove unused .sql files.

view details

Sjoerd Mullender

commit sha c9fa460fa6c02b7c402f96104e1e041e324a8f9b

Merge with default branch.

view details

push time in 12 hours

more