pyspark.sql.functions.zip_with

pyspark.sql.functions.zip_with(left: ColumnOrName, right: ColumnOrName, f: Callable[[pyspark.sql.column.Column, pyspark.sql.column.Column], pyspark.sql.column.Column]) → pyspark.sql.column.Column

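Merges two given arrays, element-wise, into a single array using a function. If one array is shorter, nulls are appended at the end to match the length of the longer array before the function is applied.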
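The padding rule can be illustrated without a Spark session. The following is a minimal plain-Python sketch of the semantics described above, not Spark's implementation: `zip_with_model` and `join_non_null` are hypothetical helper names introduced only for this illustration, and Python's `None` stands in for a SQL null.

```python
from itertools import zip_longest


def zip_with_model(left, right, f):
    """Plain-Python model of the padding rule (hypothetical helper, not a Spark API).

    The shorter list is padded with None up to the length of the longer
    one, and only then is f applied to each pair of elements.
    """
    return [f(x, y) for x, y in zip_longest(left, right, fillvalue=None)]


def join_non_null(x, y):
    """Join the non-None values with an underscore (hypothetical helper)."""
    return "_".join(str(v) for v in (x, y) if v is not None)


# The first list is one element short, so its last slot is padded with None.
result = zip_with_model(["foo", "bar"], [1, 2, 3], join_non_null)
print(result)
```

Because the padding happens before the function runs, the function passed in has to tolerate a null argument for the trailing elements. In this sketch `join_non_null` simply skips the missing value.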

Parameters

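left  Column or str: name of the first column or expression
right  Column or str: name of the second column or expression
f  function: a binary function (x1: Column, x2: Column) -> Column... It can use methods of Column, functions defined in pyspark.sql.functions, and Scala UserDefinedFunctions. Python UserDefinedFunctions are not supported (SPARK-27052).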

Returns

Column

Examples

>>> from pyspark.sql.functions import zip_with
>>> df = spark.createDataFrame([(1, [1, 3, 5, 8], [0, 2, 4, 6])], ("id", "x", "y"))
>>> df.select(zip_with("x", "y", lambda x, y: x ** y).alias("powers")).show(truncate=False)
+---------------------------+
|powers                     |
+---------------------------+
|[1.0, 9.0, 625.0, 262144.0]|
+---------------------------+
          

>>> from pyspark.sql.functions import concat_ws, zip_with
>>> df = spark.createDataFrame([(1, ["foo", "bar"], [1, 2, 3])], ("id", "x", "y"))
>>> df.select(zip_with("x", "y", lambda x, y: concat_ws("_", x, y)).alias("xs_ys")).show()
+-----------------+
|            xs_ys|
+-----------------+
|[foo_1, bar_2, 3]|
+-----------------+
          
