Abstract: Extracting schema from massive information is a difficult problem in research on massive information integration in a network environment. This paper presents a new method for extracting and incrementally maintaining a local accurate schema. The algorithm keeps the scale of the extracted schema within the 'schema diameter' by examining the path distance of the target set and by using the Hash class and its path distance operation. The method is very efficient at restraining the schema from expanding.
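
To make the idea of bounding a schema by a diameter concrete, the following is a minimal sketch and not the algorithm of this paper. It assumes the data is a labeled directed graph, that "path distance" means the number of edge labels on a path starting from the target set, and that nodes reached by the same label path are grouped under one hash key. The function name `extract_local_schema`, the graph encoding, and the sample data are all hypothetical.

```python
from collections import deque


def extract_local_schema(graph, targets, diameter):
    """Collect label paths of length <= diameter reachable from the targets.

    graph    : dict mapping node -> list of (label, child) edges (assumed encoding)
    targets  : iterable of start nodes (the assumed "target set")
    diameter : maximum number of labels on any extracted path
    Returns a dict mapping each label path (a tuple) to the set of nodes it reaches.
    """
    schema = {(): set(targets)}
    queue = deque([((), frozenset(targets))])
    while queue:
        path, nodes = queue.popleft()
        # Stop expanding once the path distance reaches the diameter.
        if len(path) == diameter:
            continue
        # Group children by edge label; the dict plays the role of a hash table.
        grouped = {}
        for node in nodes:
            for label, child in graph.get(node, []):
                grouped.setdefault(label, set()).add(child)
        for label, children in grouped.items():
            new_path = path + (label,)
            schema[new_path] = children
            queue.append((new_path, frozenset(children)))
    return schema


# Hypothetical sample data: a tiny bibliography graph.
graph = {
    "root": [("paper", "p1"), ("paper", "p2")],
    "p1": [("title", "t1"), ("author", "a1")],
    "p2": [("title", "t2")],
    "a1": [("name", "n1")],
}
schema = extract_local_schema(graph, {"root"}, diameter=2)
```

With a diameter of 2, the path `paper/author/name` is left out of the result, so the extracted schema stops growing at the bound even when the underlying data is deeper or contains cycles.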