Tuesday 22 December 2015

Interview Questions On Informatica Aggregator Transformation

1. What is aggregator transformation?

Aggregator transformation is an active transformation and its connected, mainly used to perform calculations on group of rows

2. What is the default behavior of aggregator transformation?

By default it gives last row as output in each group

3. What happens if you don't select group by ports in aggregator?

If you don't select any group by ports, by default it treats all rows as single group and returns last row

4. What happens if you select all group by ports in aggregator?

If you select all ports in aggregator, it applies grouping on all ports and eliminates duplicate records

5. What are different types of caches in aggregator transformation?

a. Index Cache ==> Contains Group by Ports Information
b. Data  Cache ==> Contains Aggregated Calculation Ports Information

Index Cache file has extension .idx
Data   Cache file has extension .dat

6. What are different types of aggregate functions in aggregator?

a. Sum()
b. Max()
c. Min()
d. Avg()
e. First()
f.  Last() etc

7. What are conditional statements in aggregator transformation?

While performing aggregate calculations, we can use conditional statements with the use of IIF

Eg:- SUM(IIF(CITY='HYDERABAD',SAL,0))

The above example calculates only HYDERABAD city employees total

8. What are nested aggregate functions in aggregator transformation?

If you place one aggregate function in another then it is called nesting of aggregation.

Eg:- MAX(SUM(SAL))

9. What happens if group by port contains null values?

 All rows containing null values will treat as separate group

10. Can you create a variable port in aggregator?

Yes we can create variable ports in aggregator

All above information you can see in below video with examples


https://www.youtube.com/watch?v=SS0-XdluF1Q&list=PLmgctEn1nRyXVj-cfFGpext8j2_qJtzCf



11. What are the optimization techniques in aggregator transformaion?

There are two optimization techniques in aggregator transformation

a. Sorted Input

b. Incremental Aggregation

12. How to set sorted input option in aggregator transformation?

Sorted Input option available under properties tab of aggregator transformation. When you select this option you need to send the sorted data to the aggregator based on group by ports. For example if you take DNO is the group by port then you need to sort on DNO.

13. How sorted input helps in increasing performance of aggregator transformation?

When we sort the data and send to aggregator, then immediately performs aggregate calcuations. It won't wait till all rows read from source. Informatica can't read all rows at a time, it will read block by block. If you don't select sorted input, aggregator will wait till all rows read from source

14. What is incremental aggregation?

Incremental aggregation means aggregating values based on existing aggregated values. Below example clarifies.

First time we received below rows
eno   name     sal      dno
100    ravi      2000   10
101    vinay    3000   10

Ouput :-  10  5000

Next time new employee added in same department

102  raju  4000

this time when you use incremental aggregation it peforms 5000+4000 instead of again adding all employee salaries 2000+3000+4000.

15. When cache files deleted for incremental aggregation what happens?

Session won't fail, it will recreate the cache automatically

16. Can we select both sorted input and incremental aggregation at a time?

No, we can't select. Reason is if you select sorted input it performs calculations in memory. When you select incremental aggregation it performs calculations on existing cache values. 

17. Where cache files stored?

The cache files by default stored under server/infa_shared/cache direcotry.

18. When cache files will be deleted?

Automatically cache files will be deleted after session succeeded

19. Cache files will be deleted if session fails?

If session fails, cache files won't delete, we need to manually delete

20. How to get first and last record using aggregator?

By default you can get last record, to get first record use First() function for each port. Source, SQ, two aggregators, by default one aggregator returns last row, another aggregator select first and connect to two target instances.


4 comments:


  1. I appreciate you sharing this article. Really thank you! Much obliged. Informatica Data Quality Online Training

    ReplyDelete
  2. Very appreciated for your great article

    ReplyDelete
  3. This comment has been removed by the author.

    ReplyDelete

  4. Nice information, this is will helpfull a lot, Thank for sharing, Keep do posting i like to follow this

    Interesting blog, here a lot of valuable information is available, it is very useful information Keep do posting i like to follow this informatica online training
    informatica online course
    informatica bdm training
    informatica developer training
    informatica training
    informatica course
    informatica axon training
    online informatica course
    informatica developer course

    ReplyDelete