Simulation of bibliographic data bases for studies of automatic document classification