Click here to Skip to main content
15,795,937 members

Questions

Questions

1
answer

How can I use the 'take(n)' function in spark 3.4.0 with pyspark to display the top 3 rows of a CSV file?

27-May-23 20:14pm - updated 27-May-23 21:56pm
0
answers

Change value in nested struct, array, struct in a spark dataframe using pyspark

17-Jan-23 21:00pm
1
answer

How to write code using spark api scala?

26-Aug-22 22:35pm - updated 29-Aug-22 18:11pm
0
answers

Iterate over pyspark array elemets and then within elements itself using loop.

21-Apr-22 0:17am - updated 21-Apr-22 6:43am
0
answers

So there is a match_id, batsman, and batsman_runs column, batsman_runs column consist of values where he scored a number of runs in a ball like 0, 1, 2

20-Jun-21 22:56pm
1
answer

How to write the spark programming to count letters

12-May-21 8:56am - updated 12-May-21 9:36am
0
answers

Spark is not able to connect to hive metastore

26-Apr-21 21:35pm
0
answers

Pyspark is not working in my macos

26-Apr-21 15:50pm
0
answers

Creating data frames columns from a list of dictionaries

16-Mar-21 3:07am
1
answer

Use value like ISO week but week starting from sunday

5-Dec-20 2:59am - updated 5-Dec-20 4:38am
1
answer

If I access data via spark, can I control database table access at column level with impala

2-May-19 22:46pm - updated 6-May-19 0:46am
1
answer

Spark scala-count even numbers from from file

16-Jun-18 21:59pm - updated 16-Jun-18 22:38pm
0
answers

How to merge two spark row

24-Apr-18 0:35am
0
answers

Why get wrong index when saving data in libsvm format by using saveaslibsvmfile

3-Apr-18 7:24am
0
answers

Executing commands on remote spark(EC2) using local R(sparkr) interface hangs

7-Jul-17 19:52pm - updated 9-Jul-17 18:45pm
0
answers

Splitting dataset for cross validation fpgrowth in spark

26-Mar-17 23:17pm

To narrow down your search try filtering by tags using the Filter box at the top right.