Hello everyone,

I am using Pentaho 4.4 to dump data from MongoDB to Hadoop. Since the database contains large number of rows and I want to limit the number of rows to a small number (say 1000) in the transformation. My question is how can I achieve this either by writing a query in the query field of MongoDB input step or by some other method.

What I have already tried:

I have tried using the "maxscan" operator

{"$query": {...}, "$maxScan": 10}.

It works well when you want to limit the rows to 100 rows beyond that pentaho completely ignores the limit parameter and proceeds to transfer all the rows from MongoDB to Hadoop.

So If anyone could help with what query or approach should I use to limit the number of rows in the pentaho 4.4, I would be very grateful.

Thanks in advance.