Failed to load ingestion sources #13181
Comments
Hi, similar problem here. Logs from the datahub-gms pod:
Hi, me too. I'm using DataHub 1.0.0 (es_prefix = datahub_). I asked RunLLM in the DataHub Slack channel.
Here is my error log:
Hello! I figured out what causes this issue. If you remove the "bootstrap-ingestion-datahub-gc" ingestion job from your IngestionSource index in Elasticsearch, it works fine. (This is not the proper solution, BTW.)
@hyungryuk you're right. If you remove it, it works. List the documents in the index, then delete the datahub-gc source:
curl -XGET "http://localhost:9200/datahubingestionsourceindex_v2_1715768050160/_doc/_search?pretty"
curl -X DELETE "http://localhost:9200/datahubingestionsourceindex_v2_1715768050160/_doc/urn%253Ali%253AdataHubIngestionSource%253Adatahub-gc?pretty"
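A note on the DELETE URL above, since the index name will differ per deployment: the document ID in the path is the URN percent-encoded twice (`%253A` is an encoded `%3A`, which is an encoded `:`). Here is a minimal Python sketch that rebuilds the same URL. The helper name `delete_url` is made up for illustration, and the reading that the stored `_id` is the once-encoded URN is inferred from the curl command, not confirmed by the maintainers. It only builds the string and sends nothing.

```python
from urllib.parse import quote

# Values copied from the curl command in the comment above.
# The index name (including its timestamp suffix) differs per deployment.
ES_HOST = "http://localhost:9200"
INDEX = "datahubingestionsourceindex_v2_1715768050160"
URN = "urn:li:dataHubIngestionSource:datahub-gc"


def delete_url(host: str, index: str, urn: str) -> str:
    """Build the DELETE URL for one ingestion source document.

    Assumption (inferred from the curl command, not from DataHub docs):
    the stored _id is the percent-encoded URN, so it has to be encoded
    a second time to travel safely inside the URL path.
    """
    doc_id = quote(urn, safe="")         # urn%3Ali%3A...
    path_segment = quote(doc_id, safe="")  # urn%253Ali%253A...
    return f"{host}/{index}/_doc/{path_segment}?pretty"


if __name__ == "__main__":
    print(delete_url(ES_HOST, INDEX, URN))
```

Running the `_search` request first and checking the `_id` values it returns is the safer way to confirm what to delete. As noted above, this is a workaround and not a proper fix.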
Describe the bug
After updating to the recent 1.0.0 version, ingestion sources do not work anymore.
To Reproduce
Go to "Admin / Data Sources" and notice a red error message at the top and an empty list of sources.
Expected behavior
Both existing data sources and newly added ones should appear.
Additional context
The following errors show in container logs:
2025-04-11 06:58:26,277 [ForkJoinPool.commonPool-worker-1779] ERROR c.l.m.s.e.query.ESSearchDAO:165 - Search query failed
java.lang.NullPointerException: null
2025-04-11 06:58:26,277 [ForkJoinPool.commonPool-worker-1779] ERROR c.l.d.g.e.DataHubDataFetcherExceptionHandler:45 - Failed to execute
java.util.concurrent.CompletionException: java.lang.RuntimeException: Failed to list ingestion sources
    at java.base/java.util.concurrent.CompletableFuture.encodeThrowable(CompletableFuture.java:315)
    at java.base/java.util.concurrent.CompletableFuture.completeThrowable(CompletableFuture.java:320)
    at java.base/java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1770)
    at java.base/java.util.concurrent.CompletableFuture$AsyncSupply.exec(CompletableFuture.java:1760)
    at java.base/java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:373)
    at java.base/java.util.concurrent.ForkJoinPool$WorkQueue.topLevelExec(ForkJoinPool.java:1182)
    at java.base/java.util.concurrent.ForkJoinPool.scan(ForkJoinPool.java:1655)
    at java.base/java.util.concurrent.ForkJoinPool.runWorker(ForkJoinPool.java:1622)
    at java.base/java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:165)
Caused by: java.lang.RuntimeException: Failed to list ingestion sources
    at com.linkedin.datahub.graphql.resolvers.ingest.source.ListIngestionSourcesResolver.lambda$get$1(ListIngestionSourcesResolver.java:119)
    at java.base/java.util.concurrent.CompletableFuture$AsyncSupply.run(CompletableFuture.java:1768)
    ... 6 common frames omitted
Caused by: com.datahub.util.exception.ESQueryException: Search query failed:
    at com.linkedin.metadata.search.elasticsearch.query.ESSearchDAO.lambda$executeAndExtract$1(ESSearchDAO.java:166)
    at io.datahubproject.metadata.context.TraceContext.withSpan(TraceContext.java:110)
    at io.datahubproject.metadata.context.OperationContext.withSpan(OperationContext.java:391)
    at com.linkedin.metadata.search.elasticsearch.query.ESSearchDAO.executeAndExtract(ESSearchDAO.java:147)
    at com.linkedin.metadata.search.elasticsearch.query.ESSearchDAO.search(ESSearchDAO.java:338)
    at com.linkedin.metadata.search.elasticsearch.ElasticSearchService.search(ElasticSearchService.java:173)
    at com.linkedin.metadata.search.elasticsearch.ElasticSearchService.search(ElasticSearchService.java:155)
    at com.linkedin.metadata.client.JavaEntityClient.search(JavaEntityClient.java:458)
    at com.linkedin.datahub.graphql.resolvers.ingest.source.ListIngestionSourcesResolver.lambda$get$1(ListIngestionSourcesResolver.java:83)
    ... 7 common frames omitted
Caused by: java.lang.NullPointerException: null
2025-04-11 06:58:26,278 [ForkJoinPool.commonPool-worker-1773] ERROR c.datahub.graphql.GraphQLController:153 - Errors while executing query: query listIngestionSources($input: ListIngestionSourcesInput!) { listIngestionSources(input: $input) { start count total ingestionSources { urn name type config { recipe version executorId debugMode extraArgs { key value __typename } __typename } schedule { interval timezone __typename } platform { urn __typename ..., result: {errors=[{message=An unknown error occurred., locations=[{line=2, column=3}], path=[listIngestionSources], extensions={code=500, type=SERVER_ERROR, classification=DataFetchingException}}], data={listIngestionSources=null}, extensions={tracing={version=1, startTime=2025-04-11T06:58:26.265025322Z, endTime=2025-04-11T06:58:26.278222119Z, duration=13199038, parsing={startOffset=622316, duration=594755}, validation={startOffset=1183528, duration=514351}, execution={resolvers=[{path=[listIngestionSources], parentType=Query, returnType=ListIngestionSourcesResult, fieldName=listIngestionSources, startOffset=1879409, duration=10967547}]}}}}, errors: [DataHubGraphQLError{path=[listIngestionSources], code=SERVER_ERROR, locations=[SourceLocation{line=2, column=3}]}]