[Data Factory] Schema Drift and Unpivot transformation -> scala.MatchError

link之家

链接快照平台

输入网页链接，自动生成快照
标签化管理网页链接

相关文章推荐

爱搭讪的芒果 · 一代大师终迎谢幕时刻 ...· 1 年前 ·

傲视众生的拐杖 · Red Notice-《红色通缉令》影评 - 知乎· 1 年前 ·

有腹肌的熊猫 · 魔神的新娘终于要生了_魔神的新娘完结版小说_ ...· 1 年前 ·

讲道义的柳树 · 曹建明分别会见参加第二十二届国际检察官联合会 ...· 1 年前 ·

坚强的稀饭 · 世界上有哪些著名的马拉松比赛？各自最大的特点 ...· 2 年前 ·

I try to use " Unpivot " transformation on a Source that don't have schema pre-defined (Schema Drift is activated).
The "unpivot" transformation generates the following error:

Error: DF-SYS-01 at : scala.MatchError: 1 (of class java.lang.Integer) - RunId: 700ee826-2737-4a4a-b9bb-daf65b52fb31

Could you help me understand the meaning of this error and how to fix it?
Is "unpivot" unable to work with schema drift?

For information, I put more detail about the other component below.

The input file looks like this:

The file format will change regularly, a new column will be added every month, so I want to use Schema Drift.

The source looks like this:
As you can see there is no schema prepared.

My select transformation looks like this:
I retrieve the product code by using "name" and I retrieve all columns prefixed by 20 (ex: 2020-01)

The unpivot is configured as follow, using "byName" to retrieve the product code column:

I create a column year_month which will contain the column headers that have been unpivoted.

I create a column allocation which will contain the values that have been unpivoted.

If you have any other suggestion I'm happy to hear too.

Thank you

@Pestre Remi
Actually I reproduced it now. In last attempt I forgot to get rid of the schema.

@Kiran-MSFT the script is:

source(allowSchemaDrift: true,  
	validateSchema: false,  
	ignoreNoFilesFound: false) ~> source1  
source1 select(mapColumn(  
		each(match(name == "Product Code"),  
			"product_code" = $$),  
		each(match(left(name, 2) == "20"))  
	skipDuplicateMapInputs: true,  
	skipDuplicateMapOutputs: true) ~> Select1  
Select1 unpivot(output(  
		year_month as string,  
		allocation as string  
	ungroupBy({Product Code} = byName("product_code")),  
	lateral: false,  
	ignoreNullPivots: false) ~> Unpivot1  
			 This looks more complicated that it needs to be. You can do something as simple as
yearly derive(  

Product_Code = byName("Product_Code")  

) ~> yearlyDerived
yearlyDerived unpivot(output(  

year_month as string,  

allocation as string  

ungroupBy(Product_Code),  

lateral: false,  

ignoreNullPivots: false) ~> unpivotYearly
Unpivots do not allow expressions or assignments in ungroupBy expressions. The validation error will be made more user friendly
											To make things far simple and avoiding byName, you can open the script and type
source(  

output(  

Product_Code as string  

This will add Product_Code as a named column and rest of the columns will be drifted. The UI controls don't directly have this facility but you can edit the script and add this snippet.

Then you just need to unpivot nothing else.
yearlyDerived unpivot(output(  

year_month as string,  

allocation as string  

ungroupBy(Product_Code),  

lateral: false,  

ignoreNullPivots: false) ~> unpivotYearly
											Thank you all for your help.    

After some attempts, I have been able to achieve it by adding a "Derived Column" step before the Unpivot and by creating the Product_Code column. The unpivot can then use this column directly.    
I guess it's very similar to the approach that was suggested above.