Lorenz Bühmann LorenzBuehmann University of Leipzig Leipzig

gerbsen/dbpedia-lucene-index 4

index for all resources with labels, uris, surface forms, images etc.

Giuseppe-Rizzo/SWMLAlgorithms 4

Various implementations of machine learning algorithms for Semantic Web knowledge bases

LorenzBuehmann/ontop 1

Ontop is a platform to query relational databases as Virtual RDF Knowledge Graphs using SPARQL

coderiot/webprotege 0

The webprotege code base

LorenzBuehmann/Archived-SANSA-ML 0

SANSA Machine Learning Layer

LorenzBuehmann/autoupdate 0

Autoupdate files for Protege Desktop

LorenzBuehmann/Bioportal-Protege-Plugin 0

A plugin for Protégé to import data from BioPortal

push event SmartDataAnalytics/ontop

Davide Lanti

commit sha 0455c00ea408a325fec43de417efe8f204626137

+OntologyFetcherController

view details

Francesco Corcoglioniti

commit sha 041d952e0bf7ffb650afaf229e69a150efc8dca8

Working version of reflection-based Ontop Protégé refactoring that allows using a newer version of Guava

view details

Francesco Corcoglioniti

commit sha 069a551880ec782b7ed16962be7e3bfb407b2224

Bug fix

view details

Francesco Corcoglioniti

commit sha 2baf93a17853ad3e82cd1769547ba31c163dcf09

ontology/owlapi/src/main/java/it/unibz/inf/ontop/spec/ontology/owlapi/OWLAPITranslatorOWL2QL.java

view details

Francesco Corcoglioniti

commit sha ad321e46515671e7970ee9de3e9e22188a6309da

Getting rid of OWLAPIAdapter in OntopReasoningCommandBase

view details

Francesco Corcoglioniti

commit sha 3e7100b6c196bd12fb31f2d2f9b7de8c240c5e70

Updated bundle plugin to compile on JDK 17, and guice dependency to run on JDK 17

view details

Francesco Corcoglioniti

commit sha 0309fe2e99b450f2e97d7bb6ba75f777a4e608af

Merge branch 'version4' into feature/protege-refactoring-reflection

view details

Francesco Corcoglioniti

commit sha eef330d274a2a592d9e2077d7446870513ab7991

Running on both Java 8 and 17 when compiled with both JDK 8 and 17

view details

Davide Lanti

commit sha f121d2c01268185b52321e8e39f07cc56065f788

owlapi

view details

Benjamin Cogrel

commit sha 4daad48cce9c1299acee445937a7301962b0ff5a

MirrorProfFKTest added.

view details

Benjamin Cogrel

commit sha a2dad92f427f3925299b384b18dd8db2e2b394d4

MetadataProvider.insertIntegrityConstraints(relations, ...) added.

view details

Benjamin Cogrel

commit sha 19aacfe338249e619f3e6ca0413c01744382562b

Revert "MetadataProvider.insertIntegrityConstraints(relations, ...) added." This reverts commit a2dad92f427f3925299b384b18dd8db2e2b394d4.

view details

Benjamin Cogrel

commit sha bcdf2833598965cb3cb99660c8ccb85f28565ded

View ordering for normalization and safety for inserting constraints.

view details

Benjamin Cogrel

commit sha 95de796d4029398341661e993d6c347fa4d38078

BasicOntopViewFKSaturator added.

view details

Benjamin Cogrel

commit sha 0807b4b76d56b37cad01e859ed3106aba83e53fb

Merged metadata lookup added for view dependencies.

view details

Benjamin Cogrel

commit sha c4d24697215178ddbfbacd5b7f58f9040a7ffbd0

FK derived from the base relations added.

view details

Benjamin Cogrel

commit sha b5178c82bf6c2a94072bae06912d1d2c0894b445

Bugfix: the wrong target relation was used in inferred FKs.

view details

Benjamin Cogrel

commit sha 858cc052be7deb1236a4a408caf44e4a71851382

Minor cleaning and test re-enabled.

view details

Francesco Corcoglioniti

commit sha 73acc7cc2d603f441327943d6fbabcbc5294d9ad

Merge branch 'version4' into feature/protege-refactoring-reflection

view details

Francesco Corcoglioniti

commit sha 00c5463b61ab983bc7d015781c984bb9f787dbd6

OWLAPIAdapter class moved to ontop-protege and reimplemented dropping unused public wrapper methods

view details

push time in 10 days

push event SmartDataAnalytics/ontop

Lukas sundqvist

commit sha 438571c593b061bccec21d0505e7e807aa246849

ValuesNode added

view details

Benjamin Cogrel

commit sha 9ccf290bf4dc004d08bd7cbacbedd630f361f00e

IQVisitor and mutable IQ visitor/transformer methods for ValuesNode.

view details

Benjamin Cogrel

commit sha 628f83e81cf100a838fa945415b22eccc0fcae0f

IQTreeVisitingTransformer now supports ValuesNode.

view details

Lukas sundqvist

commit sha 76cc324584370071631c492290d1c127d94eead0

Wrote test for normalization.

view details

Lukas sundqvist

commit sha f535a2a8a7cf1126f0a7e595bf87389d0f96491c

Merge branch 'feature/values-node-visitor' into feature/values-node

view details

Lukas sundqvist

commit sha cc2bf0f9fa37d0138d3f55c50d2a9568045563e7

Added ignore tags to normalization tests

view details

Lukas sundqvist

commit sha f62596483b3386b5443209de7425e5eca52f0c4b

Wrote Normalization function for ValuesNode

view details

Lukas sundqvist

commit sha e5ac601e8bd0f2da64d38a7792641365a9f6f9e1

Initial implementation of applyFreshRenaming for ValuesNode

view details

Lukas sundqvist

commit sha 1364eea46794cb9e92e94dbc984cb06220d9e702

Rewrote Normalize method of ValuesNode

view details

Lukas sundqvist

commit sha dffdd7dfe109ec9bf8d308ad660830b41483e992

Implemented support for descending substitution of GroundFunctionalTerms in ValuesNode

view details

Lukas sundqvist

commit sha ce8f6ed3486144bc3800715cfaa48c572990a314

Implemented applyDescendingSubstitutionWithoutOptimizing for ValuesNode

view details

Lukas sundqvist

commit sha cee6494f07f8ae022a1b2f82dcdedb8ad1ca5031

Implemented propogateDownConstraint in ValuesNode

view details

Lukas sundqvist

commit sha ff1378e3e567a5def7bcefa2f1129e1f6f9c3e4d

Fixed error in descending substitution of ValuesNode

view details

Benjamin Cogrel

commit sha c56dd26c69fc68ebc0e13518aafa220bd353e1df

FactExtractor added. ABoxFactIntoMappingConverter named.

view details

Benjamin Cogrel

commit sha b9e8fbb5c781371e4d31a07e811f4f2b48d61185

FactExtractorWithSaturatedTBox partially implemented.

view details

Lukas sundqvist

commit sha 4fa378ce76d928987b8dc30d906474962aaaa3ed

Started work on ABoxFactIntoMapping..

view details

Lukas sundqvist

commit sha b1a7907ee6d58f79773f2855a56edb60ba9c0fc8

Merge branch 'version4' into feature/values-node

view details

Lukas sundqvist

commit sha fa13b5d4579f2ee9d7984c3d2484eaf448e5abc8

Further work on ABoxFactConverter

view details

Lukas sundqvist

commit sha 8a46fcc0fc9d515561a53aeb4f2e45909e3697de

Had to do small changes to IQ2CQ.java

view details

Lukas sundqvist

commit sha 43083f606fc8e8778b914cdaa3fa53d30ac3ef94

Further work on ABoxFactConverter, encountered a bug I don't manage to fix right now.

view details

push time in 10 days

create branch SANSA-Stack/jena

branch : tdb2/parallel-sort-adapted

created branch time in a month

PR opened apache/jena

JENA-2225: Handle large datasets size in TDB stats serializer

This PR should fix the serialization of the dataset size in the TDB/TDB2 stats, which failed for large datasets whose size exceeds the int value range.
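
To illustrate the kind of limit described here, the following is a rough sketch in Python, not Jena's actual Java code. It assumes a hypothetical triple count of three billion and uses the standard `struct` module to show that such a value does not fit into a signed 32-bit field, while a signed 64-bit field holds it without loss. The function name `encode_count` and the sample count are illustrative assumptions only.

```python
import struct

INT32_MAX = 2**31 - 1  # largest value of a signed 32-bit int (Java's int)


def encode_count(count: int, fmt: str) -> bytes:
    """Pack a count into a fixed-width signed integer field."""
    return struct.pack(fmt, count)


# Hypothetical dataset size, chosen only because it exceeds INT32_MAX.
triples = 3_000_000_000

try:
    encode_count(triples, ">i")  # signed 32-bit field
    fits_32 = True
except struct.error:
    fits_32 = False

# A signed 64-bit field (comparable to Java's long) round-trips the value.
encoded_64 = encode_count(triples, ">q")
decoded = struct.unpack(">q", encoded_64)[0]
```

Here `fits_32` ends up `False` and `decoded` equals the original count, which is the general reason for moving such a counter from a 32-bit to a 64-bit type.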

+6 -2

0 comment

2 changed files

pr created time in a month

create branch SANSA-Stack/jena

branch : JENA-2225

created branch time in a month

push event SANSA-Stack/jena

Andy Seaborne

commit sha 0f77659b2bd26d8e7ed4c8ffca50e880aa879765

Add strStartsWithIgnoreCase, strEndsWithIgnoreCase Tidy. Add assertion in TestDatabaseOps.

view details

Andy Seaborne

commit sha 4bc0d40ee182dc9ca4d8379c9eb0df3da2e45021

JENA-2097: Bad URIs are parser warnings

view details

Andy Seaborne

commit sha c495b2bf52189e550a12a464953f555107026ef5

JENA-2101: Expression use of <<>>

view details

Brandon Sara

commit sha b10414354f8a16331e7c1a4598ca8279c0017f81

JENA-2096 - Added ability to delete old DB after compaction has completed successfully

view details

Andy Seaborne

commit sha 155af2a8eb82854b1f839b287bfb55eb11587020

JENA-2102: Explicitly initialize map for injected commands

view details

Andy Seaborne

commit sha 37c3697aa5d0528c21593dd528cbaa216d710860

Merge pull request #989 from afs/jena2097-iri JENA-2097: Bad URIs are parser warnings

view details

Andy Seaborne

commit sha c52f52f1cdb40271473b2146b1d049d553d3c2cb

Merge pull request #997 from afs/rdf-star-expr JENA-2101: Expression use of <<>>

view details

Andy Seaborne

commit sha d144861a279b3b4e64132a4493cf94314f946a2f

Merge pull request #988 from bsara/compact-delete-old [JENA-2096] Added ability to delete old DB after compaction has completed successfully

view details

Andy Seaborne

commit sha cb1def6ce03d54e6719a9bbe9d5d0d95de7daf8b

Merge pull request #998 from afs/jena2102-cmd-init JENA-2102: Explicitly initialize map for injected commands

view details

Andy Seaborne

commit sha ad24b3777c3515fbaabe81e8ec2ad9d95f012e4c

JENA-2102: Fix init check

view details

Andy Seaborne

commit sha 248731a0543d93901e70452da7a14ababc10770a

JENA-2096: Skip mmap file check on MS Windows

view details

Andy Seaborne

commit sha ffd6377ef7901488ec06a8dae71067210a5cc5f5

JENA-2103: Allow recusive <<>>

view details

Andy Seaborne

commit sha 6b31b692ccbe4ab517e1b053f8ebc43c50087ba2

Fix printing of triple term functions

view details

Andy Seaborne

commit sha c14f37a6d9fc57de83bddf0c224d3798bb9747d9

JENA-2103: RDF-star test suite

view details

Andy Seaborne

commit sha 3c49e27bdb649919ea97e2dc9dca837924a47ca9

JENA-2103: Fix decoding RDF-star embedded triples in TDB1

view details

Andy Seaborne

commit sha e727494b4a8ac4593facdf9d9d64893b2bc481a4

JENA-2103: Additional tests for ecoding RDF-star embedded triples

view details

Andy Seaborne

commit sha 7cefe42c5fb381caeb7defe5e5a1428becfd794f

Escape HTML chars

view details

Andy Seaborne

commit sha 79e1343153e596a2b33fa0847b9a87ae14f76a3b

Merge pull request #999 from afs/rdf-star JENA-2103: RDF-star updates

view details

Andy Seaborne

commit sha 0a9b1cfc9cea18c78ae915675ab8a76a6ee341ea

Merge pull request #1000 from afs/webapp Escape HTML chars

view details

Andy Seaborne

commit sha dcf09b62fd5a10e6ad916ef1d4df7770196fcf62

Escape HTML chars

view details

push time in a month

push event SANSA-Stack/SANSA-Stack

Lorenz Buehmann

commit sha 36021868b38d1f6549465aa91f6d11847f8e8bb8

extended example shows how to enable GeoSPARQL support in ontop

view details

push time in a month

push event SANSA-Stack/SANSA-Stack

Lorenz Buehmann

commit sha d5db8f83c63f5d35cd097e1eeede6ca18d2cdfc3

fixed typo in docs

view details

push time in a month

push event SmartDataAnalytics/DL-Learner

dependabot[bot]

commit sha a5baa2a100ad1f13eee8bb08fe11e9d921398ba6

Bump log4j-core from 2.13.3 to 2.16.0 Bumps log4j-core from 2.13.3 to 2.16.0. --- updated-dependencies: - dependency-name: org.apache.logging.log4j:log4j-core dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com>

view details

Lorenz Bühmann

commit sha bfb73e38f9ff05d1c79b86270becdcac8d88f6ec

Merge pull request #91 from SmartDataAnalytics/dependabot/maven/org.apache.logging.log4j-log4j-core-2.16.0 Bump log4j-core from 2.13.3 to 2.16.0

view details

push time in a month

PR merged SmartDataAnalytics/DL-Learner

Bump log4j-core from 2.13.3 to 2.16.0 dependencies

Bumps log4j-core from 2.13.3 to 2.16.0.

Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


<details> <summary>Dependabot commands and options</summary> <br />

You can trigger Dependabot actions by commenting on this PR:

  • @dependabot rebase will rebase this PR
  • @dependabot recreate will recreate this PR, overwriting any edits that have been made to it
  • @dependabot merge will merge this PR after your CI passes on it
  • @dependabot squash and merge will squash and merge this PR after your CI passes on it
  • @dependabot cancel merge will cancel a previously requested merge and block automerging
  • @dependabot reopen will reopen this PR if it is closed
  • @dependabot close will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • @dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • @dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
  • @dependabot use these labels will set the current labels as the default for future PRs for this repo and language
  • @dependabot use these reviewers will set the current reviewers as the default for future PRs for this repo and language
  • @dependabot use these assignees will set the current assignees as the default for future PRs for this repo and language
  • @dependabot use this milestone will set the current milestone as the default for future PRs for this repo and language

You can disable automated security fix PRs for this repo from the Security Alerts page.

</details>

+1 -1

0 comment

1 changed file

dependabot[bot]

pr closed time in a month

push event SANSA-Stack/SANSA-Stack

Claus Stadler

commit sha 3df1b976319a3ab249f003fa7b4ca6786f088a4b

Bump to jena 4.3.1 (probably there are or will be separate spark/hadoop releases with log4j 2.15.0)

view details

Lorenz Buehmann

commit sha c37eae8eb172b58c1c18a4e176e948b49c8b4fb5

Merge branch 'develop' of github.com:SANSA-Stack/SANSA-Stack into develop

view details

Lorenz Buehmann

commit sha 284ed25ee68618a829531835a2928a899dedd652

Merge branch 'develop' into geospatial

view details

push time in a month

push event SANSA-Stack/SANSA-Stack

Lorenz Buehmann

commit sha 7afd3b9f37a59accb24547e2f4ea09cb045d9c99

extended docs

view details

push time in a month

push event SANSA-Stack/SANSA-Stack

Lorenz Buehmann

commit sha 3c66bd377deb9e8b0f5d0310eab03e45ac8091b9

owl api versions fix

view details

Lorenz Buehmann

commit sha 5a6d3f0bc541443df9048be1bdf73ae4bdb2e79a

remove unused deps

view details

Lorenz Buehmann

commit sha 49b26f614bab83f4d1aebdbdc41083c3fbbb1a45

typos

view details

Gezim Sejdiu

commit sha cbeba7f4eee31cfec5761f3bfa1ae789fb1d5d53

Merge branch 'develop' into feature/add-badges-to-readme

view details

carstendraschner

commit sha edb53cfb9746716996ef14c07530ead6448369b1

update unclear sentence

view details

Lorenz Buehmann

commit sha d96836adc6d2f36f78014bd8d998be1a644b544c

SBT cleanup

view details

Lorenz Bühmann

commit sha 6d3cc99ca98ec14e23171baf947b8fbb9cb49c4f

Merge pull request #118 from SANSA-Stack/feature/add-badges-to-readme Add Badges and How to Contribute sections on README file

view details

carstendraschner

commit sha 63f9313ebb3e86859eee4c3e7c60f57e51e031ab

remove scala annotations for code to circumvent undesired code block parsing in markdown

view details

carstendraschner

commit sha 418f66e6c108201212a38f1d0270c8f687def96c

git ignore ds store as artifact from mac devices

view details

carstendraschner

commit sha 8bf8840f38e782198cda0e12b1cd79e42a5533c2

remove unused/unknown dependencies for cleanup of poms

view details

carstendraschner

commit sha ec841bdd8cf02d762684f95fbca6bc7e31fb6c6a

in readme references

view details

carstendraschner

commit sha ca802adf8945996717593dff87708df5c6abada4

only one hash

view details

carstendraschner

commit sha df2fcbe98ee51950b8ae390005d3ce7afd9b0c22

change link

view details

carstendraschner

commit sha 0460b816e495f5e66d14c1d6ce0c6724390abaab

update and clean links

view details

carstendraschner

commit sha cf3a14025706c20b8945260c50a51f3ded864640

edid header to make link reference possible

view details

carstendraschner

commit sha ad9091af5c25b3362078356aaabeea073d6b120a

Update README.md edit contributions

view details

Lorenz Bühmann

commit sha d2964be07786a18ca5bd59beb3076dba596b0328

Update README.md

view details

Lorenz Bühmann

commit sha 3155caf04144a7b65979a664ae162422bac62287

Update README.md

view details

carstendraschner

commit sha 1f42714a1df8aedae52ea888bddfef82fd780c75

link subsections in head bulletpoint list

view details

carstendraschner

commit sha ab20c66a0a6677bf23ee4228328267fd8a3a9350

update unclear sentence

view details

push time in a month

push event SANSA-Stack/SANSA-Stack

dependabot[bot]

commit sha 573ef20d722b1ca5fe8fe9eee799f44dcb325a2a

Bump commons-compress from 1.20 to 1.21 Bumps commons-compress from 1.20 to 1.21. --- updated-dependencies: - dependency-name: org.apache.commons:commons-compress dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com>

view details

Lorenz Bühmann

commit sha a0b26a67331ed83567aeff5f1c1dc44d9e1d9ae1

Merge branch 'develop' into dependabot/maven/org.apache.commons-commons-compress-1.21

view details

Lorenz Bühmann

commit sha 8f68e3fe2cc16a5d99c8a4eb50e8ae450d133f6a

Merge pull request #165 from SANSA-Stack/dependabot/maven/org.apache.commons-commons-compress-1.21 Bump commons-compress from 1.20 to 1.21

view details

push time in a month

PR merged SANSA-Stack/SANSA-Stack

Bump commons-compress from 1.20 to 1.21 dependencies

Bumps commons-compress from 1.20 to 1.21.

+1 -1

0 comment

1 changed file

dependabot[bot]

pr closed time in a month

push event SANSA-Stack/SANSA-Stack

dependabot[bot]

commit sha 8155e554330e77258cdea455cadc2b9c44654108

Bump httpclient from 4.5.12 to 4.5.13 Bumps httpclient from 4.5.12 to 4.5.13. --- updated-dependencies: - dependency-name: org.apache.httpcomponents:httpclient dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com>

view details

Lorenz Bühmann

commit sha f2bfc2b5307fe7902b5beebe51428c9f91fc88c5

Merge branch 'develop' into dependabot/maven/org.apache.httpcomponents-httpclient-4.5.13

view details

Lorenz Bühmann

commit sha 8342bcacef01537b89ddd653688b95f09cb8e961

Merge pull request #160 from SANSA-Stack/dependabot/maven/org.apache.httpcomponents-httpclient-4.5.13 Bump httpclient from 4.5.12 to 4.5.13

view details

push time in a month

PR merged SANSA-Stack/SANSA-Stack

Bump httpclient from 4.5.12 to 4.5.13 dependencies

Bumps httpclient from 4.5.12 to 4.5.13.

+1 -1

0 comment

1 changed file

dependabot[bot]

pr closed time in a month

push event SANSA-Stack/SANSA-Stack

carstendraschner

commit sha 21e646ed7d81ca48c277db090efdbc7b18141fd8

start drafting classes for new sim feature

view details

carstendraschner

commit sha 9a3efdcab90bbd7b9039d6f699cd695eac4ce453

Merge branch 'jena4' into feature/dasim

view details

carstendraschner

commit sha 47c43fc30c46ef87e8922290a968398fe5111683

first distsim on lmdb with minhash betweeen movies for promising candidates

view details

carstendraschner

commit sha a3be9d944c2e6aff25a643af5c839c6f933f4fbd

play around with different feature extractors and further use DistRDF2ML modules for feature extraction

view details

carstendraschner

commit sha 719b9c6d522937ab8b3ee1feb52776dc42df8a10

start wih pivot nbased feature extracting transformer

view details

carstendraschner

commit sha 12e057bf2ee5b493e7438738105ce3d6756b61eb

automatic cast dataframe to correpsonding litreal type and split features if needed by their respective datatype

view details

carstendraschner

commit sha 472e43da407d123045dba164f379be10be60e20b

bring all information to transform

view details

carstendraschner

commit sha f0a0a62ece15a082b03113d4e61e16756b706083

make components more compact and broader documentation

view details

carstendraschner

commit sha d7e402db6ce89bf1b67d49ec3515628a84e87275

align design to be transformer conform

view details

carstendraschner

commit sha f319fd788224d64b2a14641750c5ffb96a75eb38

integrate smart feature extrator into novel dasim pipeline

view details

carstendraschner

commit sha e8dca8be901e6d75251b95de45ea31a50274d658

cast numeric values to doubles also ensure that collect lists are only applied if we need lists

view details

carstendraschner

commit sha fb10899af55799369006baa883873b3798d5e79e

offer first unit test for smartFeatureExtractor

view details

carstendraschner

commit sha f376c692ffd8ed1bf65a945b31af04551c421ec2

play a bit around and structure

view details

carstendraschner

commit sha e1fddd9403a4a7e97028a481b2be214cca8978d4

also show schema

view details

carstendraschner

commit sha cbce9f1a14275245d158284ebd02319584d10ff2

started handling of different feature types

view details

carstendraschner

commit sha 87bf91a48730837d8416474d726dff44edb0b6a0

started soe playground for word2vec in spark

view details

carstendraschner

commit sha 847dfe81e080807f4bab4ff7463dbd28f119e81c

clean up

view details

carstendraschner

commit sha f51913a2cf2222aa3f60b40e9364adc89f54e43b

handling of categorical strings transformed over hashing and IDF (Information Content) weightning

view details

carstendraschner

commit sha 601e664a6d90414bed4d710c3eb128427d2016ae

string column hanfling by default pipleline as current fallback non implemented word2Vec

view details

carstendraschner

commit sha 8b330fad65a7ad4f3019271dc67e7377a5e24ad2

calculate similarity values and join those into one df s.t. we can later aggregate those

view details

push time in a month

issue comment ontop/ontop

Problem: SPARQL - Slow Queries when sharing properties with a same name/URI -

@bcogrel interesting approach with those constraints. Similar things do exist in PostgreSQL, but not in this form, I guess. Nevertheless, can't Ontop already exclude any triple map that maps to entities of an rdf:type that is disjoint with the type in the SPARQL query? If we assume logical consistency of the data, that should be a valid assumption, shouldn't it?

tulioqrxkyde

comment created time in a month

issue comment ontop/ontop

Problem: SPARQL - Slow Queries when sharing properties with a same name/URI -

Thanks, didn't know. But then the question would be, "why not"? Isn't that a very easy and efficient way to skip triple maps?

tulioqrxkyde

comment created time in a month

issue comment ontop/ontop

Problem: SPARQL - Slow Queries when sharing properties with a same name/URI -

Did you provide an OWL ontology? And is there an axiom in this ontology stating that foaf:Organization is disjointWith sefazma:Estabelecimento? Otherwise, because of the open world assumption, the query optimizer cannot simply ignore the other query.
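
The point about the disjointness axiom can be sketched in a few lines of Python. This is only an illustration of the idea and not how Ontop implements anything: the `TripleMap` class, the `prune_disjoint` function, and the mapping names are hypothetical. A triple map is skipped only if the ontology explicitly declares its target class disjoint with the class asked for in the query; without such an axiom, the open world assumption means every map has to be kept.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TripleMap:
    """Hypothetical stand-in for a mapping assertion and its target class."""
    name: str
    target_class: str


def prune_disjoint(triple_maps, query_class, disjoint_axioms):
    """Keep every map unless its class is explicitly declared disjoint
    with the class used in the query."""
    def declared_disjoint(a, b):
        return frozenset((a, b)) in disjoint_axioms

    return [
        tm for tm in triple_maps
        if not declared_disjoint(tm.target_class, query_class)
    ]


maps = [
    TripleMap("org_map", "foaf:Organization"),
    TripleMap("estab_map", "sefazma:Estabelecimento"),
]

# No axiom in the ontology: nothing may be pruned (open world assumption).
kept_without = prune_disjoint(maps, "sefazma:Estabelecimento", set())

# With an explicit disjointness axiom, the other map can be skipped.
axioms = {frozenset(("foaf:Organization", "sefazma:Estabelecimento"))}
kept_with = prune_disjoint(maps, "sefazma:Estabelecimento", axioms)
```

In this sketch `kept_without` still contains both maps, while `kept_with` contains only `estab_map`.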

tulioqrxkyde

comment created time in a month

issue comment deepset-ai/haystack

error when using other SQL backend than SQLite, e.g. Postgres

Just to give you an idea how it works:

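from sqlalchemy import Column, ForeignKeyConstraint, String
from sqlalchemy.orm import relationship

# ORMBase is the declarative base class used by Haystack's SQL document store.
class MetaDocumentORM(ORMBase):
    __tablename__ = "meta_document"
    __table_args__ = (
        # Composite foreign key covering both columns of document's primary key.
        ForeignKeyConstraint(
            ["document_id", "document_index"],
            ["document.id", "document.index"],
            ondelete="CASCADE", onupdate="CASCADE",
        ),
    )

    name = Column(String(100), index=True)
    value = Column(String(1000), index=True)
    document_id = Column(String(100), nullable=False, index=True)
    # Extra column needed so the foreign key can reference document.index too.
    document_index = Column(String(100), nullable=False, index=True)

    documents = relationship("DocumentORM", back_populates="meta")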

As you can see, (i) because the primary key is a composite one, (ii) we have to reference all of its columns, which (iii) means we also have to add another column for the index name. The same applies to the label vs. meta_label tables.

LorenzBuehmann

comment created time in a month

issue comment deepset-ai/haystack

error when using other SQL backend than SQLite, e.g. Postgres

Well, not sure how helpful my analysis is. At least it's strange that PostgreSQL behaves differently from SQLite here. Dropping the primary key on the index field in DocumentORM avoids the error, but leads to the next, identical error for the label vs. meta_label tables.
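
One possible reason for the difference, stated here as an assumption rather than a confirmed diagnosis of the Haystack case: SQLite does not validate a foreign key against the parent table's keys when the tables are created, and it enforces foreign keys only if `PRAGMA foreign_keys` is switched on. The sketch below uses Python's built-in `sqlite3` module with simplified, hypothetical versions of the document and meta_document tables (not Haystack's real schema). The helper name `try_insert` is made up for this example.

```python
import sqlite3

# Simplified schema: composite primary key on document, while the child
# table references only document.id (one part of that key).
SCHEMA = """
CREATE TABLE document (
    id TEXT NOT NULL,
    "index" TEXT NOT NULL,
    PRIMARY KEY (id, "index")
);
CREATE TABLE meta_document (
    name TEXT,
    value TEXT,
    document_id TEXT NOT NULL REFERENCES document (id)
);
"""


def try_insert(enforce_foreign_keys: bool) -> str:
    """Create the schema, insert one parent and one child row, and report
    whether SQLite accepted everything."""
    conn = sqlite3.connect(":memory:")
    try:
        pragma = "ON" if enforce_foreign_keys else "OFF"
        conn.execute(f"PRAGMA foreign_keys = {pragma}")
        conn.executescript(SCHEMA)  # table creation itself is not rejected
        conn.execute("INSERT INTO document VALUES ('d1', 'idx')")
        conn.execute("INSERT INTO meta_document VALUES ('k', 'v', 'd1')")
        return "ok"
    except sqlite3.Error as exc:
        return f"error: {exc}"
    finally:
        conn.close()


without_enforcement = try_insert(False)
with_enforcement = try_insert(True)
```

With enforcement off, both inserts go through and the mismatch stays invisible. With enforcement on, the schema is still created, but SQLite reports an error once data is written to the child table. PostgreSQL, in contrast, rejects the definition while creating the table.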

LorenzBuehmann

comment created time in 2 months

issue opened deepset-ai/haystack

error when using other SQL backend than SQLite, e.g. Postgres

Hello Haystack devs,

Using Postgres as the SQL backend for e.g. the Milvus document store leads to an error because of the primary key definition of the ORM objects. The ORMBase class defines id as primary key, and other classes such as DocumentORM declare an additional primary key field, such as index. By definition, this makes a composite primary key for the table document. The issue is that referencing tables such as meta_document define foreign keys (e.g. in MetaDocumentORM) that refer only to document.id, which is just one part of the composite primary key. This also affects other referencing tables such as label and meta_label.

I don't know why SQLite doesn't complain, as it should, but Postgres does for sure, see

File "/home/user/miniconda3/envs/haystack/lib/python3.7/site-packages/haystack/document_stores/sql.py", line 129, in __init__
    ORMBase.metadata.create_all(engine)
  File "/home/user/miniconda3/envs/haystack/lib/python3.7/site-packages/sqlalchemy/sql/schema.py", line 4780, in create_all
    ddl.SchemaGenerator, self, checkfirst=checkfirst, tables=tables
  File "/home/user/miniconda3/envs/haystack/lib/python3.7/site-packages/sqlalchemy/engine/base.py", line 3107, in _run_ddl_visitor
    conn._run_ddl_visitor(visitorcallable, element, **kwargs)
  File "/home/user/miniconda3/envs/haystack/lib/python3.7/site-packages/sqlalchemy/engine/base.py", line 2110, in _run_ddl_visitor
    visitorcallable(self.dialect, self, **kwargs).traverse_single(element)
  File "/home/user/miniconda3/envs/haystack/lib/python3.7/site-packages/sqlalchemy/sql/visitors.py", line 524, in traverse_single
    return meth(obj, **kw)
  File "/home/user/miniconda3/envs/haystack/lib/python3.7/site-packages/sqlalchemy/sql/ddl.py", line 850, in visit_metadata
    _is_metadata_operation=True,
  File "/home/user/miniconda3/envs/haystack/lib/python3.7/site-packages/sqlalchemy/sql/visitors.py", line 524, in traverse_single
    return meth(obj, **kw)
  File "/home/user/miniconda3/envs/haystack/lib/python3.7/site-packages/sqlalchemy/sql/ddl.py", line 895, in visit_table
    include_foreign_key_constraints,  # noqa
  File "/home/user/miniconda3/envs/haystack/lib/python3.7/site-packages/sqlalchemy/engine/base.py", line 1286, in execute
    return meth(self, multiparams, params, _EMPTY_EXECUTION_OPTS)
  File "/home/user/miniconda3/envs/haystack/lib/python3.7/site-packages/sqlalchemy/sql/ddl.py", line 78, in _execute_on_connection
    self, multiparams, params, execution_options
  File "/home/user/miniconda3/envs/haystack/lib/python3.7/site-packages/sqlalchemy/engine/base.py", line 1384, in _execute_ddl
    compiled,
  File "/home/user/miniconda3/envs/haystack/lib/python3.7/site-packages/sqlalchemy/engine/base.py", line 1843, in _execute_context
    e, statement, parameters, cursor, context
  File "/home/user/miniconda3/envs/haystack/lib/python3.7/site-packages/sqlalchemy/engine/base.py", line 2024, in _handle_dbapi_exception
    sqlalchemy_exception, with_traceback=exc_info[2], from_=e
  File "/home/user/miniconda3/envs/haystack/lib/python3.7/site-packages/sqlalchemy/util/compat.py", line 207, in raise_
    raise exception
  File "/home/user/miniconda3/envs/haystack/lib/python3.7/site-packages/sqlalchemy/engine/base.py", line 1800, in _execute_context
    cursor, statement, parameters, context
  File "/home/user/miniconda3/envs/haystack/lib/python3.7/site-packages/sqlalchemy/engine/default.py", line 717, in do_execute
    cursor.execute(statement, parameters)
sqlalchemy.exc.ProgrammingError: (psycopg2.errors.InvalidForeignKey) there is no unique constraint matching given keys for referenced table "document"

[SQL: 
CREATE TABLE meta_document (
        id VARCHAR(100) NOT NULL, 
        created_at TIMESTAMP WITHOUT TIME ZONE DEFAULT now(), 
        updated_at TIMESTAMP WITHOUT TIME ZONE DEFAULT now(), 
        name VARCHAR(100), 
        value VARCHAR(1000), 
        document_id VARCHAR(100) NOT NULL, 
        PRIMARY KEY (id), 
        FOREIGN KEY(document_id) REFERENCES document (id) ON DELETE CASCADE ON UPDATE CASCADE
)
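
One likely explanation for the difference, going by the SQLite documentation on foreign keys: SQLite does not enforce foreign keys unless PRAGMA foreign_keys is switched on, and even then a parent key that is not covered by a primary key or unique constraint is only reported when rows are modified, not when the table is created. The sketch below uses the standard sqlite3 module with a reduced version of the two tables; try_insert is a hypothetical helper for illustration, not part of Haystack.

```python
import sqlite3

# Reduced schema: composite primary key on document, but the child
# table references document (id) only, as in the failing DDL above.
SCHEMA = [
    """
    CREATE TABLE document (
        id VARCHAR(100) NOT NULL,
        "index" VARCHAR(100) NOT NULL,
        PRIMARY KEY (id, "index")
    )
    """,
    """
    CREATE TABLE meta_document (
        id VARCHAR(100) NOT NULL,
        document_id VARCHAR(100) NOT NULL,
        PRIMARY KEY (id),
        FOREIGN KEY(document_id) REFERENCES document (id)
            ON DELETE CASCADE ON UPDATE CASCADE
    )
    """,
]


def try_insert(foreign_keys_on: bool) -> str:
    """Create both tables, insert one child row, and report what happened."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute(f"PRAGMA foreign_keys = {'ON' if foreign_keys_on else 'OFF'}")
        for statement in SCHEMA:
            # Both CREATE TABLE statements are accepted in either mode.
            conn.execute(statement)
        try:
            conn.execute(
                "INSERT INTO meta_document (id, document_id) VALUES ('m-1', 'doc-1')"
            )
            return "accepted"
        except sqlite3.Error as exc:
            # With enforcement on, SQLite reports the schema problem here.
            return str(exc)
    finally:
        conn.close()


without_enforcement = try_insert(foreign_keys_on=False)
with_enforcement = try_insert(foreign_keys_on=True)
```

With enforcement off the insert goes through silently; with enforcement on, the insert fails with a "foreign key mismatch" error, which is the closest SQLite equivalent of the Postgres error shown above.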

System:

  • OS:
  • GPU/CPU:
  • Haystack version (commit or version number): latest from today, i.e. commit 99365e1d8e46a57104d787aca27ff42bbc8387fb
  • DocumentStore:
  • Reader:
  • Retriever:

Can you confirm this issue or is there any issue on my end?

created time in 2 months

issue comment deepset-ai/haystack

REST API fails when embeddings returned flag is enabled in document store

Hi,

yes, I forgot to say that I also tried this as a quick fix, but then the JSON encoder/decoder (Pydantic's, I think) got into trouble. Looks like a custom

Just to clarify why I enabled the embeddings: I added another pipeline with generative QA, and that one takes the existing embeddings of the retrieved documents into account, so the document store has to return them. Technically it should not be necessary, but for whatever reason the generator was not aware of the retriever. Is this not possible via the pipeline?

This is the pipeline definition. In particular, the pipeline named "generate" fails when return_embedding is not set to True, although a retriever is declared:

version: '0.9'

components:
  - name: DocumentStore
    type: MilvusDocumentStore
    params:
      milvus_url: 
      sql_url: 
      index: 
  - name: Retriever
    type: DensePassageRetriever
    params:
      document_store: DocumentStore 
      top_k: 5
  - name: Reader  
    type: FARMReader
    params:
      model_name_or_path: deepset/roberta-base-squad2
  - name: Generator
    type: RAGenerator
    params:
      model_name_or_path: facebook/rag-sequence-nq
      top_k: 1
pipelines:
  - name: query    # extractive QA Pipeline
    type: Query
    nodes:
      - name: Retriever
        inputs: [Query]
      - name: Reader
        inputs: [Retriever]
  - name: generate    # generative QA Pipeline
    type: Query
    nodes:
      - name: Retriever
        inputs: [Query]
      - name: Generator
        inputs: [Retriever]

it fails at https://github.com/deepset-ai/haystack/blob/master/haystack/nodes/answer_generator/transformers.py#L178-L194 with

AttributeError("_prepare_passage_embeddings need a DPR instance as self.retriever to embed document")

LorenzBuehmann

comment created time in 2 months

issue opened deepset-ai/haystack

REST API fails when embeddings returned flag is enabled in document store

Good morning Haystack devs. I'm not sure whether this is really a bug or I'm doing something wrong, so feel free to change the type of this issue to something else if it's not a bug.

Describe the bug

When the document store is forced to return the embeddings, the REST API request fails because the embedding is a NumPy array (dtype=float32) but the Pydantic dataclass DocumentSerialized expects a List[float], and I think there is no automatic conversion between the two types.

Error message

[2021-11-26 08:48:59 +0000] [10] [ERROR] Exception in ASGI application
haystack-api_1  | Traceback (most recent call last):
haystack-api_1  |   File "/usr/local/lib/python3.7/dist-packages/uvicorn/protocols/http/httptools_impl.py", line 375, in run_asgi
haystack-api_1  |     result = await app(self.scope, self.receive, self.send)
haystack-api_1  |   File "/usr/local/lib/python3.7/dist-packages/uvicorn/middleware/proxy_headers.py", line 75, in __call__
haystack-api_1  |     return await self.app(scope, receive, send)
haystack-api_1  |   File "/usr/local/lib/python3.7/dist-packages/fastapi/applications.py", line 208, in __call__
haystack-api_1  |     await super().__call__(scope, receive, send)
haystack-api_1  |   File "/usr/local/lib/python3.7/dist-packages/starlette/applications.py", line 112, in __call__
haystack-api_1  |     await self.middleware_stack(scope, receive, send)
haystack-api_1  |   File "/usr/local/lib/python3.7/dist-packages/starlette/middleware/errors.py", line 181, in __call__
haystack-api_1  |     raise exc
haystack-api_1  |   File "/usr/local/lib/python3.7/dist-packages/starlette/middleware/errors.py", line 159, in __call__
haystack-api_1  |     await self.app(scope, receive, _send)
haystack-api_1  |   File "/usr/local/lib/python3.7/dist-packages/starlette/middleware/cors.py", line 84, in __call__
haystack-api_1  |     await self.app(scope, receive, send)
haystack-api_1  |   File "/usr/local/lib/python3.7/dist-packages/starlette/exceptions.py", line 82, in __call__
haystack-api_1  |     raise exc
haystack-api_1  |   File "/usr/local/lib/python3.7/dist-packages/starlette/exceptions.py", line 71, in __call__
haystack-api_1  |     await self.app(scope, receive, sender)
haystack-api_1  |   File "/usr/local/lib/python3.7/dist-packages/starlette/routing.py", line 656, in __call__
haystack-api_1  |     await route.handle(scope, receive, send)
haystack-api_1  |   File "/usr/local/lib/python3.7/dist-packages/starlette/routing.py", line 259, in handle
haystack-api_1  |     await self.app(scope, receive, send)
haystack-api_1  |   File "/usr/local/lib/python3.7/dist-packages/starlette/routing.py", line 61, in app
haystack-api_1  |     response = await func(request)
haystack-api_1  |   File "/usr/local/lib/python3.7/dist-packages/fastapi/routing.py", line 243, in app
haystack-api_1  |     is_coroutine=is_coroutine,
haystack-api_1  |   File "/usr/local/lib/python3.7/dist-packages/fastapi/routing.py", line 137, in serialize_response
haystack-api_1  |     raise ValidationError(errors, field.type_)
haystack-api_1  | pydantic.error_wrappers.ValidationError: 5 validation errors for QueryResponse
haystack-api_1  | response -> documents -> 0 -> embedding
haystack-api_1  |   value is not a valid list (type=type_error.list)
haystack-api_1  | response -> documents -> 1 -> embedding
haystack-api_1  |   value is not a valid list (type=type_error.list)
haystack-api_1  | response -> documents -> 2 -> embedding
haystack-api_1  |   value is not a valid list (type=type_error.list)
haystack-api_1  | response -> documents -> 3 -> embedding
haystack-api_1  |   value is not a valid list (type=type_error.list)
haystack-api_1  | response -> documents -> 4 -> embedding
haystack-api_1  |   value is not a valid list (type=type_error.list)

To Reproduce

Just start the REST API with return_embedding: True set in the document store.
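
The type mismatch can be shown without Haystack. The following sketch assumes only NumPy and the standard json module; the three-element vector is made up, and whether the REST API layer should perform this conversion is a separate question. It shows that a float32 array is neither a list nor JSON serializable, that list(...) still leaves NumPy scalars inside, and that ndarray.tolist() yields plain Python floats.

```python
import json

import numpy as np

# Stand-in for an embedding returned by the document store (values made up).
embedding = np.array([0.1, 0.2, 0.3], dtype=np.float32)

# An ndarray is not a list, so a List[float] check has nothing to accept.
is_list = isinstance(embedding, list)

# The standard JSON encoder cannot serialize the array directly.
try:
    json.dumps({"embedding": embedding})
    raw_ok = True
except TypeError:
    raw_ok = False

# list(...) only unwraps the outer container; the items stay np.float32,
# which the JSON encoder rejects as well.
try:
    json.dumps({"embedding": list(embedding)})
    shallow_ok = True
except TypeError:
    shallow_ok = False

# tolist() converts the array and its items to built-in Python types.
as_list = embedding.tolist()
payload = json.dumps({"embedding": as_list})
```

Note that the float32 values are widened to Python floats, so the serialized numbers carry float32 rounding (for example 0.1 does not come back as exactly 0.1).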

System:

  • Haystack version (commit or version number): 1.0RC1 (latest version from Github)
  • DocumentStore: Milvus
  • Reader: Farm
  • Retriever: DPR

created time in 2 months

issue closed SANSA-Stack/SANSA-Stack

Create a jar including all dependencies in a submodule of sansa stack

Hello all, I would like to know whether it is possible to create a jar including all dependencies for a submodule, for example sansa-example spark? I tried adding these lines to the pom file of this submodule:

 <build>
    <plugins>
      <!-- any other plugins -->
      <plugin>
        <artifactId>maven-assembly-plugin</artifactId>
        <executions>
          <execution>
            <phase>package</phase>
            <goals>
              <goal>single</goal>
            </goals>
          </execution>
        </executions>
        <configuration>
          <descriptorRefs>
            <descriptorRef>jar-with-dependencies</descriptorRef>
          </descriptorRefs>
        </configuration>
      </plugin>
    </plugins>
  </build>

Then I ran this command in this submodule: mvn clean package

But no jar with the suffix "jar-with-dependencies" is created in the target directory. Does anybody have an idea?

closed time in 2 months

huangmatthieu

push event SANSA-Stack/SANSA-Stack

Lorenz Buehmann

commit sha 34a6231a7ccf799d87b0f1f64cfcf7ed9b987b3b

minor

Lorenz Buehmann

commit sha 70f485a66fa0fd41d44a121816791f4a758a6724

Merge branch 'develop' of github.com:SANSA-Stack/SANSA-Stack into develop

Lorenz Buehmann

commit sha d707afa7241494cec25381628d39e2aa5f643231

Merge branch 'develop' of github.com:SANSA-Stack/SANSA-Stack into develop

Lorenz Buehmann

commit sha 056bb57fe52cc5f5e0fe564a09a972732e47055d

minor

Lorenz Buehmann

commit sha 610a8a5c5f9ceb19be4c0fdc8ddda93009f14515

parse any RDF language

Lorenz Buehmann

commit sha 7d97df8ff09bfd13fe979a21cb694b51cd501ab8

Merge branch 'develop' of github.com:SANSA-Stack/SANSA-Stack into develop

Lorenz Buehmann

commit sha c60b8be5477f6cafefde35f2fe2e58b7ba916b07

fix test

Lorenz Buehmann

commit sha 474f2bb6f3229388ae56dc0ee27f8e802d05f882

Merge branch 'jena4' into develop

Lorenz Buehmann

commit sha 5f520d41e1617784f12c816f6394769d658f40f3

Merge branch 'develop' of github.com:SANSA-Stack/SANSA-Stack into develop

push time in 2 months
