Commit Graph

122 Commits

Author SHA1 Message Date
JieguangZhou
11a354927d
[fix] [MLflow Task plugin] Sometimes errors are reported using preset algorithms (#10986) (#10987)
* Optimize the download method of preset project

* fix checkstyle
2022-07-15 15:38:13 +08:00
zhuxt2015
7196b125d9
[Feature][Datax Task]Datax task plugin support custom parameters (#10950)
* datax task plugin support custom parameters
2022-07-15 09:27:23 +08:00
Eric Gao
a38fa34d43
[Feature][Task Plugin] Enable users to switch endpoints in zeppelin tasks (#10925)
* [Feature][Task Plugin] Enable users to switch endpoints in zeppelin tasks (#9814)
2022-07-15 09:26:28 +08:00
chuxing
427c58546c
[fix][plugin]: fix spark sql use custom parma. (#10821)
* [fix][plugin]: fix spark sql use custom parma.

* [fix][plugin] roll back part of the code

Co-authored-by: huaangcg <huangcg@getui.com>
2022-07-13 11:44:13 +08:00
JieguangZhou
02b2de2a1e
[fix] mlflow project success status (#10897)
close: #10902
2022-07-12 20:58:53 +08:00
Tq
499e5b1307
[bug][task-plugins]fix switch class name and add param check to the switch task plugin (#10894)
* fix switch class name and add param check to the switch task plugin
2022-07-12 16:41:15 +08:00
ZhaoGuodong
8eaf5a2309
[Feature-10219][EMR] EMR supports use <add-Steps> to add steps to an existing cluster (#10657)
* Add the ProgramType parameter to distinguish task types
* EmrAddStepsTask supports Add-Steps
* UI supports Add-Steps
* EmrTask modify the name of the class to EmrJobFlowTask
* add ERM Task abstract base class AbstractEmrTask
* add testcase for EmrAddStepsTask
* Modifying help Documents
2022-07-10 10:15:25 +08:00
Eric Gao
71f0168510
[Feature] Enable users to create python env from requirements.txt (#10658) 2022-07-06 15:56:39 +08:00
Kerwin
bd1be6cf36
[Task] Added Apache SeaTunnel 2.x task node (#10480) 2022-07-05 19:21:30 +08:00
JieguangZhou
7d79a2165e
[Optimization]Optimize some details of MLFlow task plugin #10740 (#10739)
* Optimize some details of MLFlow task plugin

* Update dolphinscheduler-task-plugin/dolphinscheduler-task-mlflow/src/main/java/org/apache/dolphinscheduler/plugin/task/mlflow/MlflowParameters.java

* fix some nips

Co-authored-by: Jiajie Zhong <zhongjiajie955@gmail.com>
2022-07-05 11:05:20 +08:00
aiwenmo
5b0347e8da
[Task] Add Dinky task to better support the development and execution of FlinkSQL (#10640) 2022-07-05 10:07:24 +08:00
Paul Zhang
4a3c3e7797
[feature][task-flink] Support Flink application mode (#9577) 2022-07-01 12:20:38 +08:00
WangJPLeo
8f621ff98b
[Optimization] Calculate global parameter and local parameter at master. (#10704)
* Global parameter and local parameter calculation external expansion.

* k8s task ut fix.

* TimePlaceholderUtils import DateUtils fix

* follow the review comments to fix.

* follow the review comments to fix.

* e2e rerun
2022-06-30 22:45:25 +08:00
zhuxt2015
7e7df1d8ff
[Bug] [Task] SQL task no exception stack trace in task log (#10656) 2022-06-29 09:57:05 +08:00
Wenjun Ruan
66624c5c86
[Bug] [Master] Worker failover will cause task cannot be failover (#10631)
* fix worker failover may lose event
2022-06-28 16:08:35 +08:00
Wenjun Ruan
db595b3eff
Optimize master log, use MDC to inject workflow instance id and task instance id in log (#10516)
* Optimize master log, add workflow instance id and task instance id in log

* Use MDC to set the workflow info in log4j

* Add workflowInstanceId and taskInstanceId in MDC
2022-06-23 11:45:06 +08:00
JieguangZhou
36e20cdfc8
[Feature][Task Plugin] Add DVC task plugin for MLops scenario (#10372) (#10407) 2022-06-21 13:13:46 +08:00
Eric Gao
64cee03fed
[Feature] Enable users to switch and install conda env in jupyter task (#10337) 2022-06-20 14:47:44 +08:00
Eric Gao
4be3b877c3
[feature][task] Enable zeppelin schedule a whole zeppelin note (#10434) 2022-06-18 13:34:24 +08:00
guodong
2c227ab384
[Feature-10363][server] S3 Resource center supports bucket customization (#10433)
* [Feature-10363][server] S3 Resource center supports bucket customization

* Modify the configuration. Sort out HDFS and S3 configurations to distinguish them
* Modify the configuration comment to clarify the configuration meaning
* Modify official documents to add missing configuration descriptions and clarify configuration meanings and usage
* Modify the configuration file in e2E to the correct definition, dolphinscheduler-e2e/dolphinscheduler-e2e-case/src/test/resources/docker/file-manage/common.properties
* Modified code to support bucket customization
* Check that the bucket on personal permission, Otherwise, the startup fails
* S3 increase support resource. Storage. Upload. Base. The base path path
* Change S3Utils implementation style to be consistent with HadoopUtils

* [Feature-10363][server] S3 Resource center supports bucket customization

* Modify code review comments
2022-06-18 10:20:01 +08:00
Wenjun Ruan
ad2646ff1f
Fix TaskProcessorFactory#getTaskProcessor get common processor is not thread safe (#10479)
* Fix TaskProcessorFactory#getTaskProcessor get common processor is not thread safe
2022-06-16 21:46:18 +08:00
JieguangZhou
3258438f6e
[Feature] [MLOps] support mlflow deploy with docker compose (#10217)
* [Feature] [MLOps] support mlflow deploy with docker compose

fix doc

Update docs/docs/en/guide/task/mlflow.md

fix doc

Co-authored-by: Jiajie Zhong <zhongjiajie955@gmail.com>

revert cancel modification

fix ENV name and docker compose command

* fix doc image link

* fix testModelsDeployDockerCompose

* add docker compose container health check and fix mlflow bug

* update docker compose healthcheck timeout
2022-06-10 10:20:50 +08:00
xiangzihao
bbba20d39a
change to common lang3 and fix checkstyle (#10380) 2022-06-09 15:56:42 +08:00
xiangzihao
b1fb17119b
[Feature-6758][Task] Add limit resource usage for tasks base on cgroup (#10373) 2022-06-08 19:42:05 +08:00
He Zhao
da3c25dc67
[improve] Using create or replace function in sql task (#10170) 2022-06-06 10:33:24 +08:00
Wenjun Ruan
0bf4043d5f
Move datasource params to datasouce plugin module (#10353)
* Move datasource params to datasouce plugin module
2022-06-03 12:15:53 +08:00
kezhenxu94
d80cf21456
Clean up unused dependencies and packaging issues (#9944) 2022-05-31 15:22:41 +08:00
Eric Gao
a8a5da367a
[Feature][Task Plugin] Add support for dynamic form for zeppelin task plugin (#9977) (#10269) 2022-05-31 11:19:41 +08:00
HuangWei
c5961aed25
[feat][task] Add OpenMLDB task plugin (#10198) 2022-05-31 11:00:21 +08:00
JieguangZhou
54ab2651bd
[Feature][MLops] Add MLflow Projects (custom project) component (#10175)
* Add MLflow Projects (custom project) component

* fix something

* update document

* update document

* Supplementary test cases
2022-05-24 09:42:46 +08:00
JieguangZhou
e11373b963
[Feature][MLops] Support MLflow Models to deploy model service (MLflow models serve and Docker) (#10150)
* Add backend of MLflow Moldes component

* Add front end of MLflow Moldes component

* [DOC] update mlflow document

* revert unnecessary

* fix doc image url

* Update docs/docs/en/guide/task/mlflow.md

Discard the lr abbreviation

Co-authored-by: Jiajie Zhong <zhongjiajie955@gmail.com>

Co-authored-by: Jiajie Zhong <zhongjiajie955@gmail.com>
2022-05-20 19:41:22 +08:00
JieguangZhou
9ab3bebc90
[Feature][Task Plugin] Add mlflow task plugin for MLops scenario. (#10036)
* [Feature][Task Plugin] Add mlflow task plugin (#9725)

* [Feature][Task Plugin] Add mlflow task plugin UI (#9725)

* [Feature][Task Plugin] fix license header (#9725)

* [Feature][Task Plugin] fix license header (#9725)

* [Feature][Task Plugin] revert unnecessary

* [Feature][Task Plugin] add auto ml to mlflow task plugin

* [DOC] add mlflow document

* [DOC] fix mlflow docs imgs

* [DOC] fix dead link localhost:5000

* [DOC] fix dead link localhost:5000

* [DOC] remove dead link localhost:5000

* Update docs/docs/en/guide/task/mlflow.md

* Update docs/docs/zh/guide/task/mlflow.md

* [DOC] format ui code

* [DOC] remove dead link localhost:5000

* [Feature][Task Plugin] revert unnecessary

* fix some nits

* Move the personal repository to the public repository.

* Run the command directly instead of saving the file

* fix paramsMap initialize

* revert unnecessary

Co-authored-by: Jiajie Zhong <zhongjiajie955@gmail.com>
2022-05-19 09:40:31 +08:00
He Zhao
5bb1eb04fc
[Feature][plugin] Add k8s task in task plugin (#9425)
* [Feature][plugin] Add k8s task in task plugin

* [Feature][plugin] fix dos and code problems

* [Feature][plugin] refactor some code based on sonar

* [Feature][UI] front-end for k8s task plugin

* [Feature][plugin] delete some front files

* [Feature][plugin] update document

* Update docs/docs/zh/guide/task/k8s.md

* Update docs/docs/en/guide/task/k8s.md

* Update docs/docs/en/guide/task/k8s.md

* Update docs/docs/en/guide/task/k8s.md

* [Feature][UI] front-end change from review comments

* [Feature][UI] replace get namespace list api

* [Feature][plugin] change file name

* Add kubernetes to zh task list

Co-authored-by: hezhao2 <hezhao2@cisco.com>
Co-authored-by: William Tong <weitong@cisco.com>
Co-authored-by: Jiajie Zhong <zhongjiajie955@gmail.com>
2022-05-18 17:23:42 +08:00
Dannila
d643e1c1cf
[Fix-10039] Flink run command when perfecting Python jobs (#10042)
* [fix] flink task

* [fix] flink task
2022-05-16 15:06:21 +08:00
zixi0825
359cbe2a1e
[dataquality] Fix task commnd null bug (#9974) 2022-05-15 17:46:31 +08:00
chuxing
df04c4aef0
[fix-9991][worker]fix statement is closed before resultSet.getMetaData() (#10014) 2022-05-15 10:24:31 +08:00
Eric Gao
80369363c9
[feat][task plugin] Add Jupyter task plugin (#9872)
Co-authored-by: Amy0104 <97265214+Amy0104@users.noreply.github.com>
2022-05-12 19:55:49 +08:00
Dannila
2d36449444
[Feature- 9837][plugin/ui] support FlinkSQL Task (#9840)
* flink_sql

* '[refactor]flinksql'

* '[refactor]flinksql'

* '[refactor]flinksql'

* '[refactor]flinksql'

* '[refactor]flinksql'

* '[refactor]flinksql'

* '[refactor]flinksql'

* '[refactor]flinksql'

* '[refactor]flinksql'

* '[refactor]flinksql'

* '[refactor]flinksql'

* '[refactor]flinksql'

* [refactor] flinksql

* '[refactor]flinksql'

* [refactor] dolphinscheduler flinksql

* [refactor] flink sql

* [refactor] flink sql

* [refactor] flink sql

* [refactor] flink sql

* [refactor] flink sql

* [refactor] flink sql

* [refactor] flink sql

* [refactor] flink sql

* [refactor] flink sql

* [refactor] flink sql

* [refactor] flink sql

* [refactor] flink sql

* [refactor] flink sql

* [refactor] flink sql

* [refactor] flink sql

* [refactor] flink sql

* [refactor] flink sql

* [refactor] flink sql

* [refactor] flink sql

* [refactor] flink sql

* Update docs/docs/en/guide/task/flink.md

* Update docs/docs/zh/guide/task/flink.md

* prettier front-end code

Co-authored-by: Jiajie Zhong <zhongjiajie955@gmail.com>
Co-authored-by: devosend <devosend@gmail.com>
2022-05-09 18:51:55 +08:00
Kerwin
0e7c98d465
[Feature-9177][Task] The sql task supports configuring segmentation notation to provide execution of multiple statements (#9917)
* Support sql segmentation to execute multiple functions.
2022-05-08 15:04:35 +08:00
sq-q
ed425d2d2d
[Feature-9772][plugin/ui] support SparkSQL Task (#9790)
* [refactor] Dolphinscheduler sparkSQL

* [refactor] Dolphinscheduler SparkSQL

* [refactor] Dolphinscheduler SparkSQL

* [refactor] Dolphinscheduler SparkSQL

* [refactor] Dolphinscheduler SparkSQL

* [refactor] Dolphinscheduler SparkSQL

* [refactor] Dolphinscheduler SparkSQL

* [refactor] Refactor ui code and add sparksql test cases

* [refactor] refactor dolphinscheduler SparkSQL

* [refactor] refactor dolphinscheduler plugin-sparkSQL

* [refactor] refactor dolphinscheduler plugin-SparkSQL

* [refactor] dolphinscheduler plugin-SparkTaskTest

* [refactor] dolphinscheduler plugin-SparkTask

* [refactor] dolphinscheduler plugin-Spark

* [refactor] dolphinscheduler plugin-SparkTask-SparkSQL

* [refactor] dolphinscheduler plugin-spark-SparkTask

* [refactor] dolphinscheduler plugin-spark-SparkTask redefine code
2022-04-27 17:44:05 +08:00
zhuxt2015
206b7c1c51
[Fix-9352][datasource-api]data source client uses a cache with an expiration time (#9353)
* datasource client cache with expiration time

* remove unused import

* fixed unit test

* fix

* fix

* fix

* fix

* fix

* fix

* fix
2022-04-27 16:56:24 +08:00
exmy
36f01155b5
[Improvement][server] varPool support syntax #{setValue(key=value)} (#9586) 2022-04-22 14:10:21 +08:00
Tq
c5b7e5adff
[Bug] [API-9558]fix homepage task instance count method to use submit time to recount (#9559)
* fix homepage task instance count method to use submit time to recount

* fix homepage task instance count method to use submit time to recount

* fix homepage task instance count method to use submit time to recount

* fix homepage task instance count method JUNIT

* fix homepage task instance count method JUNIT

* fix homepage task instance count method JUNIT
2022-04-19 15:23:57 +08:00
Eric Gao
a5bbf7852d
[Feature][Task-Plugin]Add zeppelin task-plugin to support Apache Zeppelin (#9327) 2022-04-17 22:10:10 +08:00
xiangzihao
7f41a96fc1
[Fix-9525] [Worker] Environment did not work as expected (#9527)
* fix #9525

* change to ${PYTHON_HOME}

* remove import

* fix ut error
2022-04-16 18:57:33 +08:00
Tq
04d4e4e0c4
fix the OUT direct param could pass to the current script (#9463) 2022-04-13 10:43:12 +08:00
caishunfeng
b285ccf930
[Future-9396]Support output parameters transfer from parent workflow to child work flow (#9410)
* [Future-9396]Support output parameters transfer from parent workflow to child work flow

* fix note
2022-04-11 20:03:16 +08:00
Kerwin
923f3f38e3
[Fix-9316] [Task] Configure DB2 data source SQL script execution report ResultSet has been closed exception in SQL task (#9317)
* fix db2 error in the sql task

* update limit in sql task
2022-04-11 13:19:48 +08:00
nobolity
037692517a
[Fix-9251] [WORKER] reslove the sql task about of add the udf resource failed (#9319)
* feat(resource  manager): extend s3 to the storage of ds

1.fix some spell question
2.extend the type of storage
3.add the s3utils
to manager resource
4.automatic inject the storage in addition to your
config

* fix(resource  manager): update the dependency

* fix(resource  manager): extend s3 to the storage of ds

fix the constant of hadooputils

* fix(resource  manager): extend s3 to the storage of ds

1.fix some spell question
2.delete the import *

* fix(resource  manager):

merge  the unitTest:
1.TenantServiceImpl
2.ResourceServiceImpl
3.UserServiceImpl

* fix(resource  manager): extend s3 to the storage of ds

merge the resourceServiceTest

* fix(resource  manager): test  cancel the test method

createTenant verifyTenant

* fix(resource  manager): merge the code  follow the check-result of sonar

* fix(resource  manager): extend s3 to the storage of ds

fit the spell question

* fix(resource  manager): extend s3 to the storage of ds

revert the common.properties

* fix(resource  manager): extend s3 to the storage of ds

update the storageConfig with None

* fix(resource  manager): extend s3 to the storage of ds

fix the judge of resourceType

* fix(resource  manager): extend s3 to the storage of ds

undo the compile-mysql

* fix(resource  manager): extend s3 to the storage of ds

delete hadoop aws

* fix(resource  manager): extend s3 to the storage of ds

update the know-dependencies to delete aws 1.7.4
update the e2e
file-manager common.properties

* fix(resource  manager): extend s3 to the storage of ds

update the aws-region

* fix(resource  manager): extend s3 to the storage of ds

fix the storageconfig init

* fix(resource  manager): update e2e docker-compose

update e2e docker-compose

* fix(resource  manager): extend s3 to the storage of ds

revent the e2e common.proprites

print the resource type in propertyUtil

* fix(resource  manager): extend s3 to the storage of ds
1.println the properties

* fix(resource  manager): println the s3 info

* fix(resource  manager): extend s3 to the storage of ds

delete the info  and upgrade the s3 info to e2e

* fix(resource  manager): extend s3 to the storage of ds

add the bucket init

* fix(resource  manager): extend s3 to the storage of ds

1.fix some spell question
2.delete the import *

* fix(resource  manager): extend s3 to the storage of ds

upgrade the s3 endpoint

* fix(resource  manager): withPathStyleAccessEnabled(true)

* fix(resource  manager): extend s3 to the storage of ds

1.fix some spell question
2.delete the import *

* fix(resource  manager): upgrade the  s3client builder

* fix(resource  manager): correct  the s3 point to s3client

* fix(resource  manager): update the constant BUCKET_NAME

* fix(resource  manager): e2e  s3 endpoint -> s3:9000

* fix(resource  manager): extend s3 to the storage of ds

1.fix some spell question
2.delete the import *

* style(resource  manager): add info to createBucket

* style(resource  manager): debug the log

* ci(resource  manager): test

test s3

* ci(ci): add INSERT INTO dolphinscheduler.t_ds_tenant (id, tenant_code, description, queue_id, create_time, update_time) VALUES(1, 'root', NULL, 1, NULL, NULL); to h2.sql

* fix(resource  manager): update the h2 sql

* fix(resource  manager): solve to delete the tenant

* style(resource  manager): merge the style end delete the unuse s3 config

* fix(resource  manager): extend s3 to the storage of ds

UPDATE the rename resources when s3

* fix(resource  manager): extend s3 to the storage of ds

1.fix the code style of QuartzImpl

* fix(resource  manager): extend s3 to the storage of ds

1.impoort restore_type to CommonUtils

* fix(resource  manager): update the work thread

* fix(resource  manager): update  the baseTaskProcessor

* fix(resource  manager): upgrade dolphinscheduler-standalone-server.xml

* fix(resource  manager): add  user Info to dolphinscheduler_h2.sql

* fix(resource  manager): merge  the resourceType to NONE

* style(upgrade the log level to info):

* fix(resource  manager): sysnc the h2.sql

* fix(resource  manager): update the merge the user tenant

* fix(resource  manager): merge the resourcesServiceImpl

* fix(resource  manager):

when the storage is s3 ,that the directory can't be renamed

* fix(resource  manager): in s3 ,the directory cannot be renamed

* fix(resource  manager): delete the deleteRenameDirectory in E2E

* fix(resource  manager): check the style and  recoverd the test

* fix(resource  manager): delete the log.print(LoginUser)

* fix(server): fix the  udf serialize

* fix(master  task): update the udfTest to update the json string

* fix(test): update the udfFuncTest

* fix(common): syn the common.properties

* fix(udfTest): upgrade the udfTest

* fix(common): revent the common.properties
2022-04-11 10:49:46 +08:00
Tq
2a4fa9cdb1
[BUG][WORKER-9349]fix param priority (#9379)
* fix param priority

* fix params priority code logic
2022-04-07 20:09:53 +08:00