Dr. Richard S. Segall is Professor of Computer & Information Technology at Arkansas State University in Jonesboro, AR. He holds BS and MS in mathematics, MS in operations research and statistics from Rensselaer Polytechnic Institute in Troy, New York, and PhD in operations research form University of Massachusetts at Amherst. He has served on the faculty of Texas Tech University, University of Louisville, University of New Hampshire, University of Massachusetts-Lowell, and West Virginia University. His publications have appeared in journals including International Journal of Information Technology and Decision Making (IJITDM), International Journal of Information and Decision Sciences (IJIDS), Applied Mathematical Modelling, Kybernetes: International Journal of Systems and Cybernetics, and Journal of the Operational Research Society (JORS). He has book chapters in Encyclopedia of Data Warehousing and Mining, Handbook of Computational Intelligence in Manufacturing and Production Management, Handbook of Research on Text and Web Mining Technologies, and Encyclopedia of Business Analytics & Optimization.
His research interests include data mining, text mining, web mining, database management, and mathematical modeling, and supercomputing applications. His research has been funded by U.S. Air Force, U.S. National Aeronautics and Space Administration (NASA), Arkansas Biosciences Institute (ABI), and Arkansas Science & Technology Authority (ASTA). He is a member of the Editorial Board of the International Journal of Data Mining, Modelling and Management (IJDMMM), The Open Cybernetics and Systemics Journal, and The Open Medical Informatics Journal.
He is a member of the Arkansas Center for Plant-Powered Production (P3), recipient of Session Best Paper awards at the 2008, 2009, 2010 and 2011 World Multi-Conference on Systemics, Cybernetics and Informatics (WMSCI) conferences, and served as Local Arrangements Chair of the 2010 MidSouth Computational Biology & Bioinformatics Society (MCBIOS) Conference. He is co-editor of book titled Visual Analytics of Interactive Technologies: Applications to Data, Text and Web Mining published in 2011 by IGI Global, and is currently co-editing a book titled Research and Applications in Global Supercomputing to be published by IGI Global.
This talk will summarize research of Dr. Segall that pertains to the informatics study of the spectrum of dimensionalities for computation in both speed and magnitude of data as obtained upon the applications of supercomputing, data, text and web mining.
A supercomputer is a computer at the frontlines of current processing capacity and speed of calculations. First introduced in the 1960s, the supercomputers of the 1970s used only few processors, and in the 1990s machines with thousands of processors began to appear. By the end of the 20th century supercomputers were massively parallel computing systems composed of tens of thousands of processors. In contrast, supercomputers of the 21st century can use over 100,000 processors including those with graphic capabilities.
Supercomputers are used today for highly-intensive calculation tasks for projects ranging from quantum physics, weather forecasting, molecular modeling, and physical simulations. Supercomputers can be used for simulations of airplanes in wind tunnels, detonations of nuclear weapons, splitting electrons, and helping researchers study how drugs combat the swine flu virus. Supercomputing can be in the form of grid computing, in which the processing power of a large number of computers is distributed, or in the form of computer clusters, in which a large number of processors are used in close proximity to each other.
The IBM Sequoia, completed in 2012, is currently the fastest supercomputer at 16.32 petaflops consuming 7890 kilowatts of power. The Indian government has stated that it has committed about $940 million to develop what could become the world’s fastest supercomputer by 2017, one that would have a performance of 1 exaflop which is about 61 times faster than today’s fastest computers. [http://articles.timesofindia.indiatimes.com/2012-09-17/hardware/33901529_1_first-supercomputers-petaflop-fastest-supercomputer].
Data/Text/Web mining is the informatics methodology and systemics study of finding hidden patterns in large-scale sets of alphanumeric data/text/web respectively.
This presentation will also discuss the applications of data mining to the dimensionality of micro-array databases. Micro-arrays are huge collections of spots that contain massive amounts of compressed data. Micro-arrays are used by researchers in life sciences for genetics because DNA contains so much information on a micro-scale. For example, each spot of a micro-array thus could contain a unique DNA sequence. This research has also been extended to include statistical quality control of microarray gene expression data.
This presentation will also discuss the applications of text and web mining to linkage discovery of related documents in a repository and the use of semantic rules for identification of similar records. The results of using web mining technologies for customer and marketing surveys are also discussed.