Data Science & Spatial Analysis Archives - Page 4 of 8

Phasic Metropolitan Settlers: A Phase-Based Model for the Distribution of Households in US Metropolitan Regions

Estiri, Hossein; Krause, Andy; Heris, Mehdi P. (2015). Phasic Metropolitan Settlers: A Phase-Based Model for the Distribution of Households in US Metropolitan Regions. Urban Geography, 36(5), 777 – 794.

Abstract

In this article, we develop a model for explaining spatial patterns in the distribution of households across metropolitan regions in the United States. First, we use housing consumption and residential mobility theories to construct a hypothetical probability distribution function for the consumption of housing services across three phases of household life span. We then hypothesize a second probability distribution function for the offering of housing services based on the distance from city center(s) at the metropolitan scale. Intersecting the two hypothetical probability functions, we develop a phase-based model for the distribution of households in US metropolitan regions. We argue that phase one households (young adults) are more likely to reside in central city locations, whereas phase two and three households are more likely to select suburban locations, due to their respective housing consumption behaviors. We provide empirical validation of our theoretical model with the data from the 2010 US Census for 35 large metropolitan regions.

Keywords

Residential-mobility; Life-course; Housing Consumption; Family; Satisfaction; Migration; Geography; Context; Age; Distribution Patterns; US Metropolitan Regions; Household

Intersections and Non-Intersections: A Protocol for Identifying Pedestrian Crash Risk Locations in GIS

Kang, Mingyu; Moudon, Anne Vernez; Kim, Haena; Boyle, Linda Ng. (2019). Intersections and Non-Intersections: A Protocol for Identifying Pedestrian Crash Risk Locations in GIS. International Journal Of Environmental Research And Public Health, 16(19).

Abstract

Intersection and non-intersection locations are commonly used as spatial units of analysis for modeling pedestrian crashes. While both location types have been previously studied, comparing results is difficult given the different data and methods used to identify crash-risk locations. In this study, a systematic and replicable protocol was developed in GIS (Geographic Information System) to create a consistent spatial unit of analysis for use in pedestrian crash modeling. Four publicly accessible datasets were used to identify unique intersection and non-intersection locations: roadway intersection points, roadway lanes, legal speed limits, and pedestrian crash records. Two algorithms were developed and tested using five search radii (ranging from 20 to 100 m) to assess the protocol reliability. The algorithms, which were designed to identify crash-risk locations at intersection and non-intersection areas, detected 87.2% of the pedestrian crash locations (r: 20 m). Agreement rates between algorithm results and the crash data were 94.1% for intersection and 98.0% for non-intersection locations, respectively. The buffer size of 20 m generally showed the highest performance in the analyses. The present protocol offered an efficient and reliable method to create spatial analysis units for pedestrian crash modeling. It provided researchers a cost-effective method to identify unique intersection and non-intersection locations. Additional search radii should be tested in future studies to refine the capture of crash-risk locations.
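
The search-radius idea in this abstract can be illustrated with a minimal sketch. It is not the authors' protocol, which also draws on roadway lanes and speed limits. It only labels a crash as an intersection crash when an intersection point lies within the search radius. The function name `classify_crashes`, the coordinates, and the evenly spaced radii are assumptions for illustration; the abstract gives only the range (20 to 100 m) and the number of radii (five).

```python
import math

def classify_crashes(crashes, intersections, radius_m=20.0):
    """Label each crash 'intersection' if any intersection point lies
    within radius_m, otherwise 'non-intersection'.
    Coordinates are assumed to be planar (projected) metres."""
    labels = []
    for cx, cy in crashes:
        # Distance from this crash to its nearest intersection point.
        nearest = min(math.hypot(cx - ix, cy - iy) for ix, iy in intersections)
        labels.append("intersection" if nearest <= radius_m else "non-intersection")
    return labels

# Hypothetical coordinates, for illustration only.
intersections = [(0.0, 0.0), (200.0, 0.0)]
crashes = [(5.0, 5.0), (100.0, 0.0), (215.0, 10.0)]

# Evenly spaced radii are an assumption; the abstract states only
# "five search radii (ranging from 20 to 100 m)".
for radius in (20, 40, 60, 80, 100):
    print(radius, classify_crashes(crashes, intersections, radius))
```

Running the loop over several radii shows how the labels shift as the buffer grows, which is the kind of sensitivity the abstract reports testing.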

Keywords

Traffic Crash; Walking; Collisions; Accidents; Models; Pedestrian Safety; Spatial Autocorrelation; Algorithm

Split-Match-Aggregate (SMA) Algorithm: Integrating Sidewalk Data with Transportation Network Data in GIS

Kang, Bumjoon; Scully, Jason Y.; Stewart, Orion; Hurvitz, Philip M.; Moudon, Anne V. (2015). Split-Match-Aggregate (SMA) Algorithm: Integrating Sidewalk Data with Transportation Network Data in GIS. International Journal Of Geographical Information Science, 29(3), 440 – 453.

Abstract

Sidewalk geodata are essential to understand walking behavior. However, such geodata are scarce, available only at the local jurisdiction level and not at the regional level. If they exist, the data are stored in geometric representational formats without network characteristics such as sidewalk connectivity and completeness. This article presents the Split-Match-Aggregate (SMA) algorithm, which automatically conflates sidewalk information from secondary geometric sidewalk data to existing street network data. The algorithm uses three parameters to determine geometric relationships between sidewalk and street segments: the distance between streets and sidewalk segments; the angle between sidewalk and street segments; and the difference between the lengths of matched sidewalk and street segments. The SMA algorithm was applied in urban King County, WA, to 13 jurisdictions' secondary sidewalk geodata. Parameter values were determined based on agreement rates between results obtained from 72 pre-specified parameter combinations and those of a trained geographic information systems (GIS) analyst using a randomly selected 5% of the 79,928 street segments as a parameter-development sample. The algorithm performed best when the distances between sidewalk and street segments were 12 m or less, their angles were 25 degrees or less, and the tolerance was set to 18 m, showing an excellent agreement rate of 96.5%. The SMA algorithm was applied to classify sidewalks in the entire study area and it successfully updated sidewalk coverage information on the existing regional-level street network data. The algorithm can be applied for conflating attributes between associated but geometrically misaligned line data sets in GIS.
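
The three matching parameters named in this abstract can be sketched as a single predicate. This is an illustrative simplification, not the published SMA implementation: the split and aggregate steps are omitted, distance is measured from the sidewalk midpoint to the street segment (an assumption), and the 18 m "tolerance" is read as the length-difference limit (also an assumption). All function names are hypothetical.

```python
import math

def length(seg):
    (x1, y1), (x2, y2) = seg
    return math.hypot(x2 - x1, y2 - y1)

def undirected_angle_deg(a, b):
    """Angle between two segments in [0, 90] degrees, ignoring direction."""
    ang_a = math.atan2(a[1][1] - a[0][1], a[1][0] - a[0][0])
    ang_b = math.atan2(b[1][1] - b[0][1], b[1][0] - b[0][0])
    diff = abs(math.degrees(ang_a - ang_b)) % 180.0
    return min(diff, 180.0 - diff)

def point_segment_distance(p, seg):
    """Shortest planar distance from point p to segment seg."""
    (x1, y1), (x2, y2) = seg
    dx, dy = x2 - x1, y2 - y1
    if dx == 0 and dy == 0:
        return math.hypot(p[0] - x1, p[1] - y1)
    # Project p onto the segment and clamp to its endpoints.
    t = ((p[0] - x1) * dx + (p[1] - y1) * dy) / (dx * dx + dy * dy)
    t = max(0.0, min(1.0, t))
    return math.hypot(p[0] - (x1 + t * dx), p[1] - (y1 + t * dy))

def is_match(sidewalk, street, max_dist=12.0, max_angle=25.0, length_tol=18.0):
    """True when a sidewalk segment passes all three thresholds (metres, degrees, metres)."""
    mid = ((sidewalk[0][0] + sidewalk[1][0]) / 2.0,
           (sidewalk[0][1] + sidewalk[1][1]) / 2.0)
    return (point_segment_distance(mid, street) <= max_dist
            and undirected_angle_deg(sidewalk, street) <= max_angle
            and abs(length(sidewalk) - length(street)) <= length_tol)

# Hypothetical segments in projected metres.
street = ((0.0, 0.0), (100.0, 0.0))
print(is_match(((2.0, 8.0), (98.0, 9.0)), street))    # roughly parallel, close, similar length
print(is_match(((50.0, 2.0), (50.0, 60.0)), street))  # perpendicular and far from the street
```

The default thresholds are the best-performing values reported in the abstract; a parameter search like the one described would call `is_match` with each candidate combination and compare the results against an analyst's labels.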

Keywords

Geodatabases; Sidewalks; Algorithms; Pedestrians; Digital Mapping; Algorithm; GIS; Pedestrian Network Data; Polyline Conflation; Sidewalk; Built Environment; Physical-activity; Mode Choice; Urban Form; Land-use; Travel; Generation; Walking

Quantifying Economic Effects of Transportation Investment Considering Spatiotemporal Heterogeneity in China: A Spatial Panel Data Model Perspective

Lin, Xiongbin; Maclachlan, Ian; Ren, Ting; Sun, Feiyang. (2019). Quantifying Economic Effects of Transportation Investment Considering Spatiotemporal Heterogeneity in China: A Spatial Panel Data Model Perspective. The Annals Of Regional Science, 63(3), 437 – 459.

Abstract

Transportation investment plays a significant role in promoting economic development. However, in what scenario and to what extent transportation investment can stimulate economic growth still remains debatable. For developing countries undergoing rapid urbanization, answering these questions is necessary for evaluating proposals and determining investment plans, especially considering the heterogeneity of spatiotemporal conditions. Current literature lacks systematic research that considers the impacts of panel data and spatial correlation issues in examining the economic effects of transportation investment. To fill this gap, this study collects provincial panel data in China from 1997 to 2015 to evaluate multi-level temporal and spatial effects of transportation investment on economic growth by using spatial panel data analysis. Results show that transportation investment leads to significant and positive effects on growth and spatial concentration of economic activities, but these results vary significantly depending on the temporal and spatial characteristics of each province. The economic impacts of transportation investment are quite positive even considering the time lag effects. This study suggests that both central and local governments should carefully evaluate the multifaceted economic effects of transportation investment, such as a balanced transportation investment and economic development between growing and lagging regions, and considering the spatiotemporal heterogeneity of the economic environment.

Keywords

High-speed Rail; Infrastructure Investment; Causal Relationship; Empirical-analysis; Growth; Impact; Productivity; Efficiency; Spillover; Agglomeration; C33; R40; R58; Spatial Analysis; Time Lag; Urbanization; Transportation; Heterogeneity; Economic Growth; Economic Models; Economic Impact; Data Analysis; Spatial Data; Panel Data; Economic Development; Developing Countries--LDCs; Investments; Economic Analysis; Investment; Local Government; China

Domain Knowledge-Based Information Retrieval for Engineering Technical Documents

Shang-hsien Hsieh; Ken-yu Lin; Nai-wen Chi; Hsien-tang Lin. (2015). Domain Knowledge-Based Information Retrieval for Engineering Technical Documents. Ontology In The AEC Industry. A Decade Of Research And Development In Architecture, Engineering And Construction, chapter 1.

Abstract

Technical documents with complicated structures are often produced in architecture/engineering/construction (AEC) projects and research. Information retrieval (IR) techniques provide a possible solution for managing the ever-growing volume and contexts of the knowledge embedded in these technical documents. However, applying a general-purpose search engine to a domain-specific technical document collection often produces unsatisfactory results. To address this problem, we research the development of a novel IR system based on passage retrieval techniques. The system employs domain knowledge to assist passage partitioning and supports an interactive concept-based expanded IR for technical documents in an engineering field. The engineering domain selected in this case is earthquake engineering, although the technologies developed and employed by the system should be generally applicable to many other engineering domains that use technical documents with similar characteristics. We carry out the research in a three-step process. In the first step, since the final output of this research is an IR system, as a prerequisite, we created a reference collection which includes 111 earthquake engineering technical documents from Taiwan's National Center for Research on Earthquake Engineering. With this collection, the effectiveness of the IR system can be further evaluated once it is developed. In the second step, the research focuses on creating a base domain ontology using an earthquake-engineering handbook to represent the domain knowledge and to support the target IR system with the knowledge. In step three, the research focuses on the semantic querying and retrieval mechanisms and develops the OntoPassage approach to help with the mechanisms. The OntoPassage approach partitions a document into smaller passages, each with around 300 terms, according to the main concepts in the document. This approach is then used to implement the target domain knowledge-based IR system that allows users to interact with the system and perform concept-based query expansions. The results show that the proposed domain knowledge-based IR system can achieve not only an effective IR but also inform search engine users with a clear knowledge representation.
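
The passage-partitioning step can be illustrated with a rough sketch. It is not the OntoPassage approach itself, which partitions by the main concepts of a domain ontology. This simplified version only groups consecutive sentences until a passage would exceed about 300 terms. The function name `partition_passages` and the whitespace-based term count are assumptions for illustration.

```python
import re

def partition_passages(text, target_terms=300):
    """Group consecutive sentences into passages of at most target_terms
    whitespace-separated terms (a single longer sentence stays whole)."""
    # Split after sentence-ending punctuation followed by whitespace.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    passages, current, count = [], [], 0
    for sentence in sentences:
        n_terms = len(sentence.split())
        # Close the current passage if adding this sentence would overshoot.
        if current and count + n_terms > target_terms:
            passages.append(" ".join(current))
            current, count = [], 0
        current.append(sentence)
        count += n_terms
    if current:
        passages.append(" ".join(current))
    return passages

# Hypothetical document: 100 sentences of seven terms each.
document = " ".join(f"Sentence number {i} has exactly seven words." for i in range(100))
for passage in partition_passages(document):
    print(len(passage.split()))
```

A concept-aware variant would replace the size check with a test for a change in the dominant ontology concept, which is where the domain knowledge described in the abstract would enter.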

Keywords

Architecture; Construction; Engineering; Knowledge Based Systems; Ontologies (Artificial Intelligence); Query Processing; Search Engines; Knowledge Representation; Concept-based Query Expansions; Base Domain Ontology; Earthquake Engineering; General-purpose Search Engine; AEC Projects; Architecture/engineering/construction Projects; Complicated Structures; Technical Documents; Domain Knowledge-based Information Retrieval

A Tutorial on Dynasearch: A Web-Based System for Collecting Process-Tracing Data in Dynamic Decision Tasks

Lindell, Michael K.; House, Donald H.; Gestring, Jordan; Wu, Hao-Che. (2019). A Tutorial on Dynasearch: A Web-Based System for Collecting Process-Tracing Data in Dynamic Decision Tasks. Behavior Research Methods, 51(6), 2646 – 2660.

Abstract

This tutorial describes DynaSearch, a Web-based system that supports process-tracing experiments on coupled-system dynamic decision-making tasks. A major need in these tasks is to examine the process by which decision makers search over a succession of situation reports for the information they need in order to make response decisions. DynaSearch provides researchers with the ability to construct and administer Web-based experiments containing both between- and within-subjects factors. Information search pages record participants' acquisition of verbal, numeric, and graphic information. Questionnaire pages query participants' recall of information, inferences from that information, and decisions about appropriate response actions. Experimenters can access this information in an online viewer to verify satisfactory task completion and can download the data in comma-separated text files that can be imported into statistical analysis packages.

Keywords

Downloading; Text Files; Tasks; Access To Information; Statistics; Dynamic Decision Making; Process Tracing; Web-based Experiments; Information Search; Human-behavior; Eye-tracking; Choice; Expectations; Strategies; Mousetrap; Software; Time

Push, Pull, and Spill: A Transdisciplinary Case Study in Municipal Open Government

Whittington, Jan; Calo, Ryan; Simon, Mike; Jesse Woo; Meg Young; Schmiedeskamp, Peter. (2015). Push, Pull, and Spill: A Transdisciplinary Case Study in Municipal Open Government. Berkeley Technology Law Journal, 30(3), 1899 – 1966.

Abstract

Municipal open data raises hopes and concerns. The activities of cities produce a wide array of data, data that is vastly enriched by ubiquitous computing. Municipal data is opened as it is pushed to, pulled by, and spilled to the public through online portals, requests for public records, and releases by cities and their vendors, contractors, and partners. By opening data, cities hope to raise public trust and prompt innovation. Municipal data, however, is often about the people who live, work, and travel in the city. By opening data, cities raise concern for privacy and social justice. This article presents the results of a broad empirical exploration of municipal data release in the City of Seattle. In this research, parties affected by municipal practices expressed their hopes and concerns for open data. City personnel from eight prominent departments described the reasoning, procedures, and controversies that have accompanied their release of data. All of the existing data from the online portal for the city were joined to assess the risk to privacy inherent in open data. Contracts with third parties involving sensitive or confidential data about residents of the city were examined for safeguards against the unauthorized release of data. Results suggest the need for more comprehensive measures to manage the risk latent in opening city data. Cities should maintain inventories of data assets, produce data management plans pertaining to the activities of departments, and develop governance structures to deal with issues as they arise--centrally and amongst the various departments--with ex ante and ex post protocols to govern the push, pull, and spill of data. In addition, cities should consider conditioned access to pushed data, conduct audits and training around public records requests, and develop standardized model contracts to protect against the spill of data by third parties. 

Keywords

Public Records; Open Data Movement; Acquisition Of Data; Ubiquitous Computing; Data Analysis; Social Justice

Probabilistic Walking Models Using Built Environment and Sociodemographic Predictors

Moudon, Anne Vernez; Huang, Ruizhu; Stewart, Orion T.; Cohen-Cline, Hannah; Noonan, Carolyn; Hurvitz, Philip M.; Duncan, Glen E. (2019). Probabilistic Walking Models Using Built Environment and Sociodemographic Predictors. Population Health Metrics, 17(1).

Abstract

Background: Individual sociodemographic and home neighborhood built environment (BE) factors influence the probability of engaging in health-enhancing levels of walking or moderate-to-vigorous physical activity (MVPA). Methods are needed to parsimoniously model the associations. Methods: Participants included 2392 adults drawn from a community-based twin registry living in the Seattle region. Objective BE measures from four domains (regional context, neighborhood composition, destinations, transportation) were taken for neighborhood sizes of 833 and 1666 road network meters from home. Hosmer and Lemeshow's methods served to fit logistic regression models of walking and MVPA outcomes using sociodemographic and BE predictors. Backward elimination identified variables included in final models, and comparison of receiver operating characteristic (ROC) curves determined model fit improvements. Results: Built environment variables associated with physical activity were reduced from 86 to 5 or fewer. Sociodemographic and BE variables from all four BE domains were associated with activity outcomes but differed by activity type and neighborhood size. For the study population, ROC comparisons indicated that adding BE variables to a base model of sociodemographic factors did not improve the ability to predict walking or MVPA. Conclusions: Using sociodemographic and built environment factors, the proposed approach can guide the estimation of activity prediction models for different activity types, neighborhood sizes, and discrete BE characteristics. Variables associated with walking and MVPA are population and neighborhood BE-specific.
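
The ROC comparison mentioned in this abstract can be illustrated with a small sketch. Summarizing each curve by its area under the curve (AUC) is an assumption here, since the abstract does not say how the curves were compared, and the labels and predicted probabilities below are made up for illustration. The function name `roc_auc` is hypothetical.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve, computed as the probability that a randomly
    chosen positive case scores above a randomly chosen negative case
    (ties count as one half)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Made-up outcomes (1 = meets the walking threshold) and predicted
# probabilities from two hypothetical models.
labels = [1, 0, 1, 0, 1, 0]
base_model = [0.8, 0.3, 0.6, 0.4, 0.5, 0.5]       # sociodemographic predictors only
extended_model = [0.8, 0.3, 0.7, 0.4, 0.5, 0.5]   # with built environment predictors added

print(roc_auc(labels, base_model))
print(roc_auc(labels, extended_model))
```

Equal or near-equal AUC values for the two models would correspond to the abstract's finding that adding BE variables did not improve prediction; a formal comparison would also need a test of the difference, which this sketch leaves out.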

Keywords

Walking; Confidence Intervals; Research Funding; Transportation; Logistic Regression Analysis; Built Environment; Socioeconomic Factors; Predictive Validity; Receiver Operating Characteristic Curves; Data Analysis Software; Descriptive Statistics; Psychology; Washington (state); Active Travel; Home Neighborhood Domains; Physical Activity; Physical-activity; United-states; Life Stage; Adults; Attributes; Health; Associations; Destination; Pitfalls

Spatial Energetics Integrating Data from GPS, Accelerometry, and GIS to Address Obesity and Inactivity

James, Peter; Jankowska, Marta; Marx, Christine; Hart, Jaime E.; Berrigan, David; Kerr, Jacqueline; Hurvitz, Philip M.; Hipp, J. Aaron; Laden, Francine. (2016). Spatial Energetics Integrating Data from GPS, Accelerometry, and GIS to Address Obesity and Inactivity. American Journal Of Preventive Medicine, 51(5), 792 – 800.

Abstract

To address the current obesity and inactivity epidemics, public health researchers have attempted to identify spatial factors that influence physical inactivity and obesity. Technologic and methodologic developments have led to a revolutionary ability to examine dynamic, high-resolution measures of temporally matched location and behavior data through GPS, accelerometry, and GIS. These advances allow the investigation of spatial energetics, high-spatiotemporal resolution data on location and time-matched energetics, to examine how environmental characteristics, space, and time are linked to activity-related health behaviors with far more robust and detailed data than in previous work. Although the transdisciplinary field of spatial energetics demonstrates promise to provide novel insights on how individuals and populations interact with their environment, there remain significant conceptual, technical, analytical, and ethical challenges stemming from the complex data streams that spatial energetics research generates. First, it is essential to better understand what spatial energetics data represent, the relevant spatial context of analysis for these data, and if spatial energetics can establish causality for development of spatially relevant interventions. Second, there are significant technical problems for analysis of voluminous and complex data that may require development of spatially aware scalable computational infrastructures. Third, the field must come to agreement on appropriate statistical methodologies to account for multiple observations per person. Finally, these challenges must be considered within the context of maintaining participant privacy and security. This article describes gaps in current practice and understanding and suggests solutions to move this promising area of research forward.

Keywords

Physical-activity Levels; Built Environment; Activity Monitors; Travel Behavior; Health Research; Neighborhood; Exposure; Validation; Children; Design

Urban Systems Design: A Conceptual Framework for Planning Smart Communities

Tobey, Michael B.; Binder, Robert B.; Chang, Soowon; Yoshida, Takahiro; Yamagata, Yoshiki; Yang, Perry P. J. (2019). Urban Systems Design: A Conceptual Framework for Planning Smart Communities. Smart Cities, 2(4), 522 – 537.

Abstract

Urban systems design arises from disparate current planning approaches (urban design, Planning Support Systems, and community engagement), compounded by the reemergence of rational planning methods from new technology (Internet of Things (IoT), metric based analysis, and big data). The proposed methods join social considerations (Human Well-Being), environmental needs (Sustainability), climate change and disaster mitigation (Resilience), and prosperity (Economics) as the four foundational pillars. Urban systems design integrates planning methodologies to systematically tackle urban challenges, using IoT and rational methods, while human beings form the core of all analysis and objectives. Our approach utilizes an iterative three-phase development loop to contextualize, evaluate, plan and design scenarios for the specific needs of communities. An equal emphasis is placed on feedback loops through analysis and design, to achieve the end goal of building smart communities.

Keywords

Urban Design; Planning Support System; Resilience; Sustainability; Economics; Human Factors; Big Data