Handle empty partition iterators #367
Conversation
Empty edge partitions sometimes appear in the output of zipPartitions for unknown reasons, causing calls to Iterator#next to fail. This commit checks for these cases, handles them by returning an empty iterator, and logs an error if this would cause GraphX to drop a corresponding non-empty partition. Resolves amplab/graphx#52.
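A minimal sketch of the guard described above, not the actual GraphX patch: `safeZip`, `process`, and the error reporting are illustrative placeholders. The idea is to check `hasNext` inside the zipPartitions closure before calling `next`, return an empty iterator when the edge partition is empty, and note when a non-empty vertex partition would be dropped as a result.

```scala
import scala.reflect.ClassTag
import org.apache.spark.rdd.RDD

// Illustrative only: zip per-partition edge data with vertex data, but guard
// Iterator#next with hasNext so an unexpectedly empty edge partition yields
// an empty output partition instead of throwing NoSuchElementException.
def safeZip[E, V: ClassTag, R: ClassTag](edges: RDD[E], vertices: RDD[V])
                                        (process: (E, Iterator[V]) => R): RDD[R] =
  edges.zipPartitions(vertices, preservesPartitioning = true) { (edgeIter, vertexIter) =>
    if (edgeIter.hasNext) {
      // Normal case: exactly one edge-partition object per partition.
      Iterator(process(edgeIter.next(), vertexIter))
    } else {
      // Empty edge partition: report if vertex data would be dropped with it.
      if (vertexIter.hasNext) {
        System.err.println(
          "Dropping a non-empty vertex partition zipped with an empty edge partition")
      }
      Iterator.empty
    }
  }
```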
I know you said "unknown", but any guesses on why they appear? Seems like they shouldn't.
Merged build triggered.
Merged build started.
I don't have any ideas. cc @jegonzal @dcrankshaw
Merged build finished.
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/13934/
I've looked into it briefly but I'm not sure either.
Jenkins, retest this please.
Merged build triggered.
Merged build started.
Merged build finished.
Refer to this link for build results: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/14191/
@ankurdave do we still need this if we merge #497?
No, #497 subsumes this. Closing.
Alright, great. I took a quick look through #497, but I also want to test it locally. I think the Jenkins failure may have been due to some methods with unspecified return types, breaking MIMA or scalastyle. But we'll find out when we rerun it.
GraphX: Unifying Graphs and Tables

GraphX extends Spark's distributed fault-tolerant collections API and interactive console with a new graph API which leverages recent advances in graph systems (e.g., [GraphLab](http://graphlab.org)) to enable users to easily and interactively build, transform, and reason about graph-structured data at scale. See http://amplab.github.io/graphx/. Thanks to @jegonzal, @rxin, @ankurdave, @dcrankshaw, @jianpingjwang, @amatsukawa, @kellrott, and @adamnovak.

Tasks left:
- [x] Graph-level uncache
- [x] Uncache previous iterations in Pregel
- [x] ~~Uncache previous iterations in GraphLab~~ (postponed to post-release)
  - [x] Describe GC issue with GraphLab
- [ ] Write `docs/graphx-programming-guide.md`
  - [x] Mention future Bagel support in docs
  - [ ] Section on caching/uncaching in docs: As with Spark, cache something that is used more than once. In an iterative algorithm, try to cache and force (i.e., materialize) something every iteration, then uncache the cached things that depended on the newly materialized RDD but that won't be referenced again (see the sketch after this list).
- [x] Undo modifications to core collections and instead copy them to org.apache.spark.graphx
- [x] Make Graph serializable to work around capture in Spark shell
- [x] Rename graph -> graphx in package name and subproject
- [x] Remove standalone PageRank
- [x] ~~Fix amplab/graphx#52 by checking `iter.hasNext`~~
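The caching/uncaching guidance in the task list above can be illustrated with a short sketch. This is a hedged example, not GraphX code: `iterate` and `step` are hypothetical names, and the per-iteration structure is a generic Spark pattern.

```scala
import org.apache.spark.rdd.RDD

// Sketch of per-iteration cache/force/uncache: cache the new RDD, force it
// with an action so it no longer depends on its parent being cached, then
// unpersist the previous iteration's RDD, which won't be referenced again.
def iterate[T](initial: RDD[T], numIters: Int)(step: RDD[T] => RDD[T]): RDD[T] = {
  var current = initial.cache()
  current.count()  // force materialization of the initial RDD
  for (_ <- 1 to numIters) {
    val next = step(current).cache()
    next.count()                          // force the new RDD
    current.unpersist(blocking = false)   // safe to drop the previous iteration now
    current = next
  }
  current
}
```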
* Set RestartPolicy=Never for the executor

In the current implementation the RestartPolicy of the executor pod is not set, so the default value "OnFailure" is in effect. This causes a problem: if an executor is terminated unexpectedly (for example, it exits with java.lang.OutOfMemoryError), it is restarted by Kubernetes with the same executor ID. When the new executor tries to fetch a block held by the previous executor, ShuffleBlockFetcherIterator.splitLocalRemoteBlocks() treats it as a **local** block and tries to read it from its local dir. But the executor's local dir has changed, because a randomly generated ID is part of the local dir. A FetchFailedException is raised and the stage fails. The recurring error message:

17/06/29 01:54:56 WARN KubernetesTaskSetManager: Lost task 0.1 in stage 2.0 (TID 7, 172.16.75.92, executor 1): FetchFailed(BlockManagerId(1, 172.16.75.92, 40539, None), shuffleId=2, mapId=0, reduceId=0, message=
org.apache.spark.shuffle.FetchFailedException: /data2/spark/blockmgr-0e228d3c-8727-422e-aa97-2841a877c42a/32/shuffle_2_0_0.index (No such file or directory)
at org.apache.spark.storage.ShuffleBlockFetcherIterator.throwFetchFailedException(ShuffleBlockFetcherIterator.scala:357)
at org.apache.spark.storage.ShuffleBlockFetcherIterator.next(ShuffleBlockFetcherIterator.scala:332)
at org.apache.spark.storage.ShuffleBlockFetcherIterator.next(ShuffleBlockFetcherIterator.scala:54)
at scala.collection.Iterator$$anon$11.next(Iterator.scala:409)

* Update KubernetesClusterSchedulerBackend.scala
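A rough sketch of how the restart policy could be set when building the executor pod with the fabric8 client. This is an assumption-laden illustration, not the actual KubernetesClusterSchedulerBackend change: `withNeverRestart` and `basePod` are hypothetical names.

```scala
import io.fabric8.kubernetes.api.model.{Pod, PodBuilder}

// Illustrative only: set restartPolicy to "Never" on the executor pod spec so
// a crashed executor is not restarted by Kubernetes under the same executor ID
// (whose new, randomly named local dir would break local shuffle-block reads).
def withNeverRestart(basePod: Pod): Pod =
  new PodBuilder(basePod)
    .editOrNewSpec()
      .withRestartPolicy("Never")
    .endSpec()
    .build()
```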
This reverts commit a3c2539.
Disable Telefonica Cloud related jobs
Update version to 2.3.2-pie1.0.3