Class/Object

com.krux.hyperion.activity

SparkTaskActivity

Related Docs: object SparkTaskActivity | package activity

Permalink

case class SparkTaskActivity extends EmrTaskActivity[SparkCluster] with Product with Serializable

Runs a Spark job on a cluster. The cluster can be an EMR cluster managed by AWS Data Pipeline or another resource if you use TaskRunner. Use SparkActivity when you want to run work in parallel. This allows you to use the scheduling resources of the YARN framework or the MapReduce resource negotiator in Hadoop 1. If you would like to run work sequentially using the Amazon EMR Step action, you can still use SparkActivity.

Source
SparkTaskActivity.scala
Linear Supertypes
Serializable, Serializable, Product, Equals, EmrTaskActivity[SparkCluster], EmrActivity[SparkCluster], PipelineActivity[SparkCluster], NamedPipelineObject, PipelineObject, Ordered[PipelineObject], Comparable[PipelineObject], AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. SparkTaskActivity
  2. Serializable
  3. Serializable
  4. Product
  5. Equals
  6. EmrTaskActivity
  7. EmrActivity
  8. PipelineActivity
  9. NamedPipelineObject
  10. PipelineObject
  11. Ordered
  12. Comparable
  13. AnyRef
  14. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. def <(that: PipelineObject): Boolean

    Permalink
    Definition Classes
    Ordered
  4. def <=(that: PipelineObject): Boolean

    Permalink
    Definition Classes
    Ordered
  5. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  6. def >(that: PipelineObject): Boolean

    Permalink
    Definition Classes
    Ordered
  7. def >=(that: PipelineObject): Boolean

    Permalink
    Definition Classes
    Ordered
  8. val activityFields: ActivityFields[SparkCluster]

    Permalink
    Definition Classes
    SparkTaskActivityPipelineActivity
  9. val arguments: Seq[HString]

    Permalink
  10. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  11. def attemptTimeout: Option[HDuration]

    Permalink
    Definition Classes
    PipelineActivity
  12. val baseFields: BaseFields

    Permalink
    Definition Classes
    SparkTaskActivityNamedPipelineObject
  13. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  14. def compare(that: PipelineObject): Int

    Permalink
    Definition Classes
    PipelineObject → Ordered
  15. def compareTo(that: PipelineObject): Int

    Permalink
    Definition Classes
    Ordered → Comparable
  16. def dependsOn: Seq[PipelineActivity[_]]

    Permalink
    Definition Classes
    PipelineActivity
  17. val emrTaskActivityFields: EmrTaskActivityFields

    Permalink
    Definition Classes
    SparkTaskActivityEmrTaskActivity
  18. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  19. def failureAndRerunMode: Option[FailureAndRerunMode]

    Permalink
    Definition Classes
    PipelineActivity
  20. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  21. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  22. def groupedBy(group: String): Self

    Permalink

    Postfix the name field

    Postfix the name field

    Definition Classes
    NamedPipelineObject
  23. val hadoopQueue: Option[HString]

    Permalink
  24. def id: PipelineObjectId

    Permalink
    Definition Classes
    NamedPipelineObjectPipelineObject
  25. def idGroupedBy(group: String): Self

    Permalink

    Have a grouping postfix in the id field

    Have a grouping postfix in the id field

    Definition Classes
    NamedPipelineObject
    Note

    Id naming is more restrictive, it is recommended to not changing the id unleass you have a good reason

  26. def idNamed(namePrefix: String): Self

    Permalink

    Id field will be prefixed with name

    Id field will be prefixed with name

    Definition Classes
    NamedPipelineObject
    Note

    Id naming is more restrictive, it is recommended to not changing the id unless you have a good reason

  27. val inputs: Seq[S3DataNode]

    Permalink
  28. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  29. val jarUri: HString

    Permalink
  30. val jobRunner: HString

    Permalink
  31. def lateAfterTimeout: Option[HDuration]

    Permalink
    Definition Classes
    PipelineActivity
  32. val mainClass: MainClass

    Permalink
  33. def maxActiveInstances: Option[HInt]

    Permalink
    Definition Classes
    PipelineActivity
  34. def maximumRetries: Option[HInt]

    Permalink
    Definition Classes
    PipelineActivity
  35. def name: Option[String]

    Permalink

    Name of the pipeline object, if not set, it will defaults to

    Name of the pipeline object, if not set, it will defaults to

    Option(id)
    Definition Classes
    NamedPipelineObject
  36. def named(namePrefix: String): Self

    Permalink

    Give the object a name prefix

    Give the object a name prefix

    Definition Classes
    NamedPipelineObject
  37. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  38. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  39. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  40. def objects: Iterable[PipelineObject]

    Permalink
  41. def onFail(alarms: SnsAlarm*): Self

    Permalink
    Definition Classes
    PipelineActivity
  42. def onFailAlarms: Seq[SnsAlarm]

    Permalink
    Definition Classes
    PipelineActivity
  43. def onLateAction(alarms: SnsAlarm*): Self

    Permalink
    Definition Classes
    PipelineActivity
  44. def onLateActionAlarms: Seq[SnsAlarm]

    Permalink
    Definition Classes
    PipelineActivity
  45. def onSuccess(alarms: SnsAlarm*): Self

    Permalink
    Definition Classes
    PipelineActivity
  46. def onSuccessAlarms: Seq[SnsAlarm]

    Permalink
    Definition Classes
    PipelineActivity
  47. val outputs: Seq[S3DataNode]

    Permalink
  48. def postActivityTaskConfig: Option[ShellScriptConfig]

    Permalink
    Definition Classes
    EmrTaskActivity
  49. def preActivityTaskConfig: Option[ShellScriptConfig]

    Permalink
    Definition Classes
    EmrTaskActivity
  50. def preconditions: Seq[Precondition]

    Permalink
    Definition Classes
    PipelineActivity
  51. def ref: AdpRef[AdpActivity]

    Permalink
    Definition Classes
    PipelineActivityPipelineObject
  52. def retryDelay: Option[HDuration]

    Permalink
    Definition Classes
    PipelineActivity
  53. def runsOn: Resource[SparkCluster]

    Permalink
    Definition Classes
    PipelineActivity
  54. val scriptRunner: HString

    Permalink
  55. implicit def seq2Option[A](anySeq: Seq[A]): Option[Seq[A]]

    Permalink
    Definition Classes
    PipelineObject
  56. def seqToOption[A, B](anySeq: Seq[A])(transform: (A) ⇒ B): Option[Seq[B]]

    Permalink
    Definition Classes
    PipelineObject
  57. lazy val serialize: AdpHadoopActivity

    Permalink
  58. val sparkConfig: Map[HString, HString]

    Permalink
  59. val sparkOptions: Seq[HString]

    Permalink
  60. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  61. implicit def uniquePipelineId2String(id: PipelineObjectId): String

    Permalink
    Definition Classes
    PipelineObject
  62. def updateActivityFields(fields: ActivityFields[SparkCluster]): SparkTaskActivity

    Permalink
    Definition Classes
    SparkTaskActivityPipelineActivity
  63. def updateBaseFields(fields: BaseFields): SparkTaskActivity

    Permalink
    Definition Classes
    SparkTaskActivityNamedPipelineObject
  64. def updateEmrTaskActivityFields(fields: EmrTaskActivityFields): SparkTaskActivity

    Permalink
    Definition Classes
    SparkTaskActivityEmrTaskActivity
  65. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  66. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  67. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  68. def whenMet(conditions: Precondition*): Self

    Permalink
    Definition Classes
    PipelineActivity
  69. def withArguments(args: HString*): SparkTaskActivity

    Permalink
  70. def withAttemptTimeout(duration: HDuration): Self

    Permalink
    Definition Classes
    PipelineActivity
  71. def withDriverCores(n: HInt): SparkTaskActivity

    Permalink
  72. def withDriverMemory(memory: Memory): SparkTaskActivity

    Permalink
  73. def withExecutorCores(n: HInt): SparkTaskActivity

    Permalink
  74. def withExecutorMemory(memory: Memory): SparkTaskActivity

    Permalink
  75. def withFailureAndRerunMode(mode: FailureAndRerunMode): Self

    Permalink
    Definition Classes
    PipelineActivity
  76. def withFiles(files: HString*): SparkTaskActivity

    Permalink
  77. def withHadoopQueue(queue: HString): SparkTaskActivity

    Permalink
  78. def withInput(input: S3DataNode*): SparkTaskActivity

    Permalink
  79. def withLateAfterTimeout(duration: HDuration): Self

    Permalink
    Definition Classes
    PipelineActivity
  80. def withMaster(master: HString): SparkTaskActivity

    Permalink
  81. def withMaxActiveInstances(activeInstances: HInt): Self

    Permalink
    Definition Classes
    PipelineActivity
  82. def withMaximumRetries(retries: HInt): Self

    Permalink
    Definition Classes
    PipelineActivity
  83. def withNumExecutors(n: HInt): SparkTaskActivity

    Permalink
  84. def withOutput(output: S3DataNode*): SparkTaskActivity

    Permalink
  85. def withPostActivityTaskConfig(config: ShellScriptConfig): Self

    Permalink
    Definition Classes
    EmrTaskActivity
  86. def withPreActivityTaskConfig(config: ShellScriptConfig): Self

    Permalink
    Definition Classes
    EmrTaskActivity
  87. def withRetryDelay(duration: HDuration): Self

    Permalink
    Definition Classes
    PipelineActivity
  88. def withSparkConfig(key: HString, value: HString): SparkTaskActivity

    Permalink
  89. def withSparkOption(option: HString*): SparkTaskActivity

    Permalink
  90. def withTotalExecutorCores(n: HInt): SparkTaskActivity

    Permalink

Inherited from Serializable

Inherited from Serializable

Inherited from Product

Inherited from Equals

Inherited from EmrTaskActivity[SparkCluster]

Inherited from EmrActivity[SparkCluster]

Inherited from PipelineActivity[SparkCluster]

Inherited from NamedPipelineObject

Inherited from PipelineObject

Inherited from Ordered[PipelineObject]

Inherited from Comparable[PipelineObject]

Inherited from AnyRef

Inherited from Any

Ungrouped