Article citation information:

Makaremi-Sharifi, M., Rassafi, A. A. The preferences of choosing taxi-hailing mode attributes through the BWS-Case 1. Scientific Journal of Silesian University of Technology. Series Transport. 2024, 122, 199-219. ISSN: 0209-3324. DOI: https://doi.org/10.20858/sjsutst.2024.122.11.

 

 

Mohsen MAKAREMI-SHARIFI[1], Amir Abbas RASSAFI[2]

 

 

 

THE PREFERENCES OF CHOOSING TAXI-HAILING MODE ATTRIBUTES THROUGH THE BWS-CASE 1

 

Summary. With the widespread use of the Internet in everyday life, new businesses have emerged, causing significant changes in the market, while some traditional businesses were marginalized. One of the emerging businesses is taxi-hailing, which has gained popularity among the public. This study examines ten attributes of taxi-hailing and asks individuals about their preferences for these attributes through a questionnaire. Unlike the traditional approach of dealing with discrete choice models, which focuses on choosing the best (most important) alternative only, the role of the worst (least important) alternative is also considered in this type of modelling. The present study utilizes case 1 (out of the three available cases) of this scaling method, called “best-worst”, which focuses on attributes. Each questionnaire includes 12 questions about taxi-hailing attributes, where respondents have to state their preference in selecting the best and the worst ones. The results indicate that security and reassurance are the most crucial attributes when deciding this transportation mode, followed by accessibility. Compliance with health issues and social distancing ranked as the least significant attribute.

Keywords: taxi-hailing attributes, best worst scaling, discrete choice model, stated preference

 

1. INTRODUCTION

 

The impact of the Internet on people's lives has been immense, and one of the significant changes it has brought about is the emergence of new businesses, including taxi-hailing services. Uber, a US-based transportation company, has been highly successful in this field and has inspired similar companies in other countries. In Iran, for the first time in mid-2014, people were able to use their smartphones to book rides, from a company called “Snapp”, and avoid the hassle of either calling taxi companies, or finding taxis in the streets. The popularity of this new transportation mode led to the emergence of other taxi companies in the country.

This study aims to determine what aspects of taxi-hailing, from the users' standpoint, are more crucial and contribute to its growth and significance. Consequently, the primary focus of this study is to uncover answers to the following questions:

1.    What are the effective attributes in choosing a taxi-hailing?

2.    What will the effect of different attributes be in selecting a taxi-hailing?

 

Discrete choice models are highly beneficial in studying, simulating, and rationalizing passengers' preferences. They can estimate the likelihood of decision-makers selecting from various alternatives and their decision-making behavior. These models adopt a probabilistic structure to mathematically model the decision-makers’ behavior and their attempts to maximize the utility resulting from their choice. The probability model arises from the presence of an unknown or error term in the analyst's understanding. Depending on the distribution assumption made for this error, different models can be employed, among which, the logit model is the most commonly used.

The Best-Worst Scaling (BWS) Case 1 method has been utilized as a ground-breaking approach to studying discrete choice. This technique places emphasis on both the best and worst alternatives, thereby underscoring the importance of attributes in the selection process like never before.

The rest of this paper is organized as follows: Section 2 provides a concise overview of the research background, while section 3 discusses the methodology employed in this study. Then, the research findings and Conclusions are described in Section 4. and Section 5, respectively.  References are given in section 6.

 

 

2. LITERATURE REVIEW

 

This following review covers several studies focused on various aspects of the taxi-hailing. These studies explore topics such as the impact of technology on the industry, the design of a more efficient taxi management system, the influence of social media on travel behavior, and the factors that affect the adoption of on-demand ride-hailing services.

The taxi industry is being reformed due to advancements in network technology and operational thinking brought on by the new era of the Internet. The introduction of special cars has had an unprecedented impact on the industry. Li examined the market positioning and operation mode differences between taxis and special cars by analyzing their positioning in domestic and foreign cities, as well as industrial development mode experience. He explored the possibility of their harmonious development and suggested specific proposals for the development and management of the taxi industry. He believes that service quality should always be considered the most important evaluation criterion [1].

Taxis are a convenient mode of transportation for many people. However, the common ways of finding passengers, either by driving around or staying at designated spots, are inefficient and wasteful. They lead to low occupancy rates and various issues such as traffic congestion and environmental damage. Dow et al. proposed a taxi management system using Location-Based Services and regional queuing techniques on the Internet. In this proposed system, the service areas are defined based on the geogrid and enable drivers to hunt both in the streets and wait in a station. They conducted field research on actual taxi operations and used PRISM to simulate and compare their model with the waiting model. The results indicated that their model is more efficient than the waiting model. In the field research, they designed a questionnaire for taxi drivers and examined a taxi station in Taiwan to obtain logical, empirical data. Although the modelling results show the superiority of the proposed model over the traditional models, this modelling was only based on approximately 30 effective questionnaires [2].

The study of how travelers decide to move around has always been important in transportation research. With the increasing availability of ICT in everyday life, the context in which people make travel decisions has changed. Regardless of whether travelers are intentionally searching for information before their trip or casually browsing the internet, they can find user-created content on social media that could potentially influence the decisions they make about how to move around. Bou Mjahed et al. examined trip behavior in a highly connected environment and attempted to design tools based on ICT to influence behaviors. They studied how a platform called Yelp.com (a multinational company in the US that helps people find local businesses) can provide information for activities and trip planning in the pre-trip process. The work presented in this study could be valuable as a starting point for more profound research on social media platforms and their role in trip planning and trip behavior [3].

The study by Chen et al. used online Location-Based Services network data from Brighticket to examine the effect of social networks on passengers’ destination choices in the Chicago metropolitan area1. The study found that social connections have a significant impact on travelers’ destination selection1. The formation of destination choice sets is influenced not only by external factors but also by personal perceptions, attitudes, and acceptance. Consequently, for an accurate understanding and prediction of daily trip demand, it is important to consider this process dynamically. The study found that social connections have a significant impact on travelers’ destination selection, and that the quantity of virtual friends has a substantial impact on actual physical travel behavior. However, the dataset used in this study has some constraints such as no data available on socio-demographic characteristics like age, ethnicity, etc., not all entries are logged and home and work locations are not differentiated [4].

Alemi et al. investigated the factors that influence the adoption of on-demand ride-hailing services such as Uber and Lyft among Millennials (i.e., individuals born between 1981 and 1997) and Generation X (i.e., middle-aged adults between 1965 and 1980) in California. They found that educated and older millennials are more likely to use on-demand ride-hailing services than other groups. Additionally, mixed land use and regional access by car are linked to a higher probability of accepting on-demand ride-hailing services. Participants who reported a higher frequency of long-distance trips and a higher proportion of long-distance trips by plane, as well as regular users of transportation-related smartphone apps and those who had previously used taxi and car-sharing services, were more likely to use these services. These findings can serve as a foundation for predicting the adoption of on-demand services and their impact on general behavioral patterns across different regions and social and demographic variables [5].

Song studied the impact of the emergence of new modes of transportation in changing mode choice behavior. Considering that a new mode is usually associated with some new attributes that people may be less familiar with, she tried to investigate the mode choice behavior by collecting data through stated preference and discrete choice modelling at the individual level and uncovering travel demand through empirical analysis. Using best-worst scaling, she investigated people's behavior in choosing two new modes of HSR (high-speed rail)-air and air taxi service [6].

The level of security has a significant impact on people's mobility. Jing et al. discussed the challenges faced by service providers in ensuring security during taxi-hailing trips, particularly following incidents of sexual assault and homicide. Didi Taxi-hailing Company has implemented additional measures to enhance passenger security, but there are few scientific findings on the effect of these measures on personal perception of security. They identified the key underlying factors that impact individuals' willingness to use or reuse taxi-hailing services following modifications to their security measures by expanding and merging the Technology Acceptance Model (TAM) and the Theory of Planned Behavior. Notably, the level of security risks and perceived security have a significant impact on their behavioral intentions.  On the contrary, the impact of government credit is not immediate. The trust levels can be affected by government credit, which in turn indirectly impacts the intention to use. Ultimately, this research affirms that investigating the impact of latent factors on the utilization or reutilization of taxi-hailing services can enhance the evaluation and improvement of security measures [7].

In Iran, Akbari et al. built a model combining the TAM with the information and trust system success model to determine the factors impacting the level of acceptance of users of taxi-hailing services. This study involved approximately 500 individuals from Tehran, from which 466 acceptable answers were obtained. The data was analyzed using a mediating analysis in a structural equation model. The study revealed that the perceived ease of use and perceived usefulness are significantly influenced by the quality of information and services. As anticipated, trust was positively associated with both perceived usefulness and perceived ease of use, while behavioral intention was positively linked to perceived usefulness. However, the predicted positive relationship between perceived ease of use and the behavioral intention was rejected. In addition, the outcomes of this investigation demonstrate that trust plays a crucial part as a mediator in the model. By exploring the mediating function of trust, an area that has not been previously investigated, this study broadens the technology adoption literature. The research focused on how trust primarily enhances the inclination to utilize a taxi-hailing service, ultimately influencing the likelihood of selecting this mode[8].

Louviere and Woodworth were the first to propose a discrete choice framework in which, in addition to the traditional determination of the best alternative in a set of choices, a person is asked to indicate the least important alternative in that set. This data collection method is called BWS and is used in many fields [9].

Louviere and Flynn wrote a book for researchers and practitioners who have some background and basic knowledge of BWS. They showed that BWS is accessible to a practitioner with moderate application skills and, often, can be successfully implemented using spreadsheet software rather than statistical programs. In this book, they brought together theories and methods and demonstrated their application in various case studies in a useful reference guide [10].

Lancsar et al. revealed that the primary objective of Discrete Choice Experiments (DCEs) is to gather high-quality choice data for estimating choice models, which can be utilized to investigate health-related experiments. The study introduced a novel type of choice experiment called Best Worst Discrete Choice Experiments. The authors explained what the approach is, how and when to use it, and provided some analytical methods for modelling the available data. In an experimental program, the approach for preference extraction was tested by investigating the preferences of 898 individuals in Edmonton and Calgary, Canada, whose topic was the treatment of cardiac arrest that occurred in a public place, and showed that better results are obtained compared to traditional analysis [11].

He and Shen proposed a spatial equilibrium model that not only balances the supply and demand of taxi services but also captures both the taxi drivers’ and passengers’ possible adoption of the newly emerging e-hailing applications in a well-regulated taxi market. They then proved the existence of the proposed equilibrium, and further provided an algorithm to solve it.  They also suggested an extensive equilibrium model with elastic taxi-passenger demands. Lastly, they presented a numerical example to compare the taxi services with and without the e-hailing application and evaluated two types of e-hailing applications [12].

Lancsar et al. have also offered instructions for users on how to interpret data obtained from DCEs using the best-worst and best-best data approaches. This guide contains a theoretical overview of the major choice models, as well as practical tips on interpreting and using the results of the analysis. They also provide descriptions of standard software that can be used in these methods. In this guide, in addition to providing descriptions of choice modelling, they attempt to do so in a way that allows researchers to analyze the data. They argued that the choice of modelling method depends on the research questions, study design, and limitations in terms of data quality/quantity and that decisions made regarding the choice of data analysis are often mutually dependent instead of being sequential. Additionally, they hold the belief that the knowledge and application presented in this research can be advantageous for scholars not only in the field of health economics but also in other areas [13].

Echaniz et al. also showed that overall customer satisfaction with the public transportation system mainly depends on two factors: The degree of contentment a customer feels regarding various elements of the service, as well as the significance that each of those elements holds for the customer. Typically, researchers utilize revealed preference surveys along with logit/probit models to gauge the proportion of satisfaction associated with each service characteristic in the conventional approach. The study's objective was to explore the feasibility of replacing the conventional technique with BWS-case 1. Through a customer survey conducted in Santander, Spain, they demonstrated that the satisfaction level obtained from both methods is quite comparable. However, due to the distinct relative significance of each attribute obtained from these two approaches, they inferred that the best-worst method offers more insightful and reliable findings that align with the existing literature on public transportation customer satisfaction [14].

A common goal in psychological research is to measure subjective perceptions, such as the first perception of a face. These perceptions are usually measured using a Likert scale. Although these ratings are simple to implement, they come with responses that can limit validity. Burton et al. studied BWS as an alternative to the Likert scale to measure participants' first facial perceptions. They found that BWS scores were almost perfectly correlated with Likert scores at the group level, suggesting that the two methods have the same perceptions. However, at the level of individual participants, it outperforms Likert scale both in terms of its ability to predict preferences and in terms of validity test. These advantages make the power of BWS exceptional, especially for use in individual differences research [15].

Aizaki et al. have published several works about the application of BWS in the field of agriculture, instructions on how to perform different cases of BWS and how to use relevant packages in R software by mentioning illustrative examples. They have presented the latest version of a package called support.BWS that can be utilized for such studies and explained its functions [16-18].

In conclusion, by incorporating ICT, we can gather more accurate data and develop more effective transportation policies. As technology continues to evolve, it presents exciting opportunities for improving our transportation systems and enhancing the quality of life for individuals and communities. With the emergence of taxi-hailing platforms, such as Uber and Lyft, commuters have access to an affordable, convenient, and reliable mode of transportation. By integrating technology like ICT into daily travel, the benefits of the BWS method can be further amplified, resulting in more efficient and sustainable travel experiences for everyone. Tab. 1 has summarized the reviewed literature.

 

                                                                                                                                      Tab. 1

Summary of the literature review

 

Researchers

year

Dealing with the subject

Transition in traditional taxis

Internet, application

Taxi -hailing

Best-worst scaling

Stated preference

Revealed preference

Louviere and Woodworth

1990

 

 

 

ü

ü

 

Lancsar et al.

2013

 

 

 

ü

ü

 

Aizaki et al.

2014

 

 

 

ü

ü

 

He and Shen

2015

ü

ü

ü

 

 

ü

Lancsar et al.

2017

 

 

 

ü

ü

 

Li

2016

ü

ü

 

 

 

 

Dow et al.

2016

ü

ü

 

 

 

ü

Bou Mjahed et al.

2017

 

ü

 

 

 

ü

Chen et al.

2018

 

ü

 

 

 

ü

Alemi et al.

2018

 

ü

ü

 

 

ü

Echaniz et al.

2019

 

 

 

ü

ü

 

Burton et al.

2019

 

 

 

ü

ü

 

Aizaki and Fogarty

2019

 

 

 

ü

ü

 

Song

2019

ü

 

 

ü

ü

 

Akbari et al.

2020

 

 

ü

 

ü

 

Jing et al.

2021

 

 

ü

 

ü

 

Aizaki and Fogarty

2023

 

 

 

ü

ü

 

 

 

3. METHODOLOGY

 

The proposed methodology involves the following steps. Firstly, the variables representing taxi-hailing attributes are determined, encompassing factors such as time and cost. Secondly, the study area is identified. Then, sample size is determined, considering a random sampling of individuals with taxi-hailing experience in the chosen area. To ensure efficient data collection and analysis, an experimental design of Orthogonal Main Effects Design is employed. This design systematically varied attribute levels across choice tasks. The questionnaire is then designed, incorporating choice tasks that presented different attribute combinations. After analyzing the collected data, the collected data underwent preparation for modelling, including cleaning, coding, and formatting. Modelling techniques, i.e., logit models and counting approach are applied to estimate preference weights and identify significant factors. The research results are derived from the analysis, including estimated preference weights and their significance. Finally, the methodology is concluded. These steps are briefly shown in the flowchart of Fig. 1. and further explained in the subsections.

 

Determining variables (taxi-hailing՚s attributes)

Identifying and determining the study area

Discrete choice experiment based on best-worst scaling-Case 1

Identifying the sample size

Experimental Orthogonal Main-Effects Design (OMED)

Questionnaire design

Selection of sampling method and data collection

Modelling

Research results

 Data preparation for modelling

Conclusion

 

 

Fig. 1. The flowchart of the research methodology

 

3.1. DCEs

 

DCEs method is a preferred approach that involves generating and analyzing choice data. They are usually conducted in surveys. This survey presents the participants with multiple sets of choices, each comprising various questions, and the participants are required to select one alternative from each set.         

 

3.2. Discrete choice model

 

Discrete choice modelling is one of the important components of DCE that researchers face when analyzing DCE data in a situation where the study level is disaggregated. Certain decisions should be made regarding the model structure (binary vs. multiple choices; linear, quadratic, logarithmic, etc.; …) based on the nature of the problem at hand. Hence, it is not possible to suggest a single model that is suitable for all situations. Each model has its strengths and weaknesses depending on the particular research problem. It is crucial to bear in mind that the selection of a model is influenced by factors such as study goals, research questions study design, and data availability.

Multinomial Logit (MNL) and its associated theory (i.e., random utility developed by [19]) are typically a starting point for the discrete choice models. The utility shown in Equation (1) is received for respondent i who chooses alternative j in choice scenario s:

 

                                                      (1)

 

Where N decision makers choose J alternatives among S scenarios. Visj and εisj represent the systematic or predictable component of the overall utility of choosing alternative j in scenario s by decision maker i, and the potential disturbance (error) term represents attributes that are unobservable by the analyst, respectively. It is assumed that decision maker i chooses alternative j if it provides the highest utility compared to the utility associated with the other alternatives in the choice set. Therefore:

 

                                                                                      (2)

 

Where  is the probability of choosing alternative j in scenario s by individual i. In the well-known MNL format, the likelihood of selecting j can be expressed in the following:

 

                                                                                                                   (3)

 

Where λ represents the scale parameter, which is the inverse of the disturbance's standard deviation. However, in the standard MNL model, λ cannot be determined and is typically set to one. In the Conditional Logit (CL) model, the independent variables change based on the attributes of different alternatives. In this method, the analysis is done on a set of different alternatives for each person. At the same time, in the MNL model, the independent variables are the attributes of each person. In other words, to distinguish between conditional logit and MNL models, the question can be raised whether the independent variables change with choices. If the answer is negative, the multinomial logit model should be used, and if the answer is positive, the conditional logit model should be used.

One of the important requirements in using the conditional logit and multinomial logit models is that the choice of alternatives from a choice set should follow the attribute of independence from irrelevant alternatives. This implies that the likelihood ratio linked to the other alternatives remains unaffected by the existence or non-existence of an alternative.

 

3.3. Scaling of the BWS-Case 1

 

In examining the structure of this model, an attempt is made to improve the stated preference results by using techniques, among which is the BWS, which is based on the idea that a person, among a set of alternatives, identifies the best and worst alternatives of the set. BWS consists of three cases, which differ only in the complexity of the cases or alternatives considered. In Case 1 (object case), the respondents evaluate the list of attributes and then subsets of those attributes are presented to them as a choice set. They are asked to select the most and the least important cases from each subset. This process is repeated until all subsets have been evaluated. In Case 2 (profile case), different combinations of profile levels are created and respondents are asked to select the best and worst levels for each profile. Case 3 (multiprofile case) involves respondents selecting the best and worst profiles from a choice set of three or more profiles.

 

3.4. Determining the variables (Taxi-hailing attributes)

 

In the first step an attempt was made to determine the significant attributes in choosing taxi-hailing with the benefit of scientific studies. After a careful review, ten distinct attributes were identified in this category. These attributes include:

1.    Cost

2.    Convenience

3.    Safety against risks

4.    Security and confidence

5.    Honoring the customer

6.    Compliance with health cares and social distancing

7.    Being fast

8.    Accessibility

9.    Flexibility (in terms of time and choice of intermediary destination, etc.)

10.  Dependence on technology (cell phone and Internet)

 

3.5. Identification and determination of the study area

 

The study area is Qazvin city, the provincial capital of Qazvin province. Qazvin province, with an area equivalent to 15623 km2 in the central area of Iran, is placed between 48° 44' to 50° 51' longitude and 35° 24' to 36° 36' latitude (See Fig. 2.). Qazvin province is bordered by Guilan and Mazandaran provinces from the north, Hamadan and Zanjan provinces from the west, Markazi province from the south, and Alborz province from the east. This province is made up of 20 cities in the form of 5 counties and the National Statistics Centre of Iran has announced the population of Qazvin province as 1326400 people and the population of Qazvin County as 621800 people by the end of 2021.

 

 

Fig. 2. Map of Qazvin province

 

3.6. Sample size

 

Since best-worst scaling introduces a different choice task with distinct outcomes compared to conventional DCEs, the sample sizes necessary for estimating disaggregated utilities remain uncertain. Nevertheless, according to Flynn et al., if the focus lies on comparing the proportions of respondents selecting different attribute levels, it is possible to estimate the required sample sizes using equations for confidence intervals. In these cases, factors such as the number of times the best-worst pairs are chosen are used [20]. Although there are no specific methods for determining sample size for B-W scaling, according to Louviere et al., sample size determination techniques used for multinomial proportions data can be applied [21, 22].

Thompson established sample size requirements for multinomial proportions data, with deriving a formula that determines the necessary sample size based on the acceptable level of error and desired level of confidence for obtaining population proportions. He devised a method for determining the necessary sample size for multinomial proportions data. Similar to the binomial case, the sample size is dependent on the acceptable error level (α) and the desired confidence level (d) of the actual population proportions. Thompson found that the number of multinomial categories (j or alternatives) does not affect the required sample size. He created a table to aid in determining the appropriate sample size based on the desired values of α and d. As an illustration, Thompson gave the example of a biologist who wants to estimate the proportion of fish in each age class in a population. To achieve a probability of 0.95 that all estimates are within 0.05 of the population proportion, a sample size of 510 would suffice [22].

If the BWS considerations are not taken into account, it is proposed that 384 observations are needed using Cochran's formula [23]. This estimation is based on the latest population and housing census in 2016, which reported the population of Qazvin city to be approximately 600,000 people. In this study, 100 questionnaires were collected, in which 12 different scenarios were asked in each questionnaire, and a total of 1200 observations were obtained. Therefore, with this number of observations, the assurance of the required sample size was provided [22].

 

3.7. Orthogonal Main Effects Design (OMED)

 

A well-planned experimental design results in the most accurate estimations. The arrangement of variables that have a significant impact on the analysis is determined by the analyst, highlighting the crucial role of analysts in designing the experiment. For example, if there is an effect between attributes identified.

In this study, the OMED design was used. Since even a few number of factors and a few levels per factor lead to an unmanageable number of potential profiles, a representative subset, known as an orthogonal array, must be generated. Orthogonal design provides the possibility to examine the main and interaction effects by performing the least number of experiments. In an OMED, the levels of each factor are chosen so that they are orthogonal to each other. This means that the effect of each factor can be estimated independently of the other factors. Furthermore, each row corresponds to a question and each column corresponds to an item. Each element in the matrix is assigned one of two distinct numbers. One value represents the item being “absent” from the corresponding column, while the other value represents the item being “present.” This allows researchers to decide which items are assigned to each question [16]. To prepare the questionnaire based on this design, the desired experiment was designed by utilizing the oa.design function from the R software's DoE.base package.

 

3.8. Questionnaire design

 

Using the OMED design, a set of choices was made to ask the respondents; in each question, a set of several attributes was presented and simultaneously exposed to the choice of the most important and the least important alternative to pick. In this test design, each respondent was asked 12 questions, and in each question, the attributes were appeared according to Fig. 3.


 

3.9. Selection of sampling method and Data collection

 

In order to collect the required information, questionnaires were designed and prepared electronically, and its hyperlink was distributed in social networks whose members were from different age ranges. The questionnaire included questions about individual attributes (age, gender, education, job, etc.) and questions to discover the best-worst priorities among the alternatives and attributes.

After designing the questionnaire questions, the survey form was created as an online form, and its hyperlink was provided to 100 respondents. The reasons for choosing this number were discussed in section 3.6. After completing the survey, the response data was converted, which consists of the respondents' choices as the best and worst alternatives from each choice set, into data that can be used for modelling in R software. Tab. 2 displays a dataset whose first column includes the variable ID, the unique identification number of the participant. The following columns show pairs of response variables, where each pair indicates the participant's choice of the best and worst attributes for each question.

 

 

Fig. 3. Questionnaire design using OMED

 

3.10. Data preparation for modelling

 

The data is prepared for modelling according to Tab. 3. For example, in question 1 of the respondent 1, five attributes (ITEMs 2, 5, 7, 8, and 9) are available for selection. Therefore, there are a total of 20 best-worst combinations for this query. This respondent chose ITEM8 as the best, and ITEM9 as the worst attribute. In each row of this table each attribute takes the value of 1 if it is chosen as the best, value of -1 if it is chosen as the worst, and 0 otherwise.

 

                                                                                                                                           Tab. 2

Data entry for the attributes chosen in each question by the respondents

 

 

ID

B1

W1

B2

W2

B3

W3

B4

W4

B5

W5

B6

W6

B7

W7

B8

W8

B9

W9

B10

W10

B11

W11

B12

W12

1

1

4

5

1

3

5

2

5

2

5

3

4

1

8

4

2

4

4

1

5

1

4

3

4

1

2

2

4

3

5

1

4

1

5

1

5

1

3

2

9

1

3

1

3

4

4

1

4

1

3

1

3

3

3

4

3

1

1

5

4

1

4

1

2

4

4

1

4

1

4

1

3

1

3

2

3

1

4

4

4

2

1

4

2

3

2

4

3

4

3

4

4

10

1

4

1

4

3

5

3

4

1

2

5

5

4

5

1

5

5

4

5

3

1

3

2

3

1

6

1

3

4

3

3

2

4

2

1

2

6

6

2

4

1

5

2

5

4

5

3

4

3

2

1

2

1

4

1

4

1

4

2

4

1

4

7

7

1

5

1

5

1

5

1

4

1

5

1

4

1

10

1

4

4

3

3

4

1

4

1

4

8

8

4

5

1

5

1

3

1

3

1

5

1

4

1

6

1

4

1

3

1

4

1

4

1

4

9

9

4

5

1

5

1

4

1

3

1

3

2

3

1

6

1

3

4

3

1

2

3

4

1

4

10

10

2

3

3

5

2

5

2

4

3

4

1

2

5

7

2

4

1

4

1

5

3

1

2

1

 

3.11. Analysis approaches

 

The counting approach and the modelling one are two methods for analyzing answers to BWS inquiries. They are introduced in the following subsections.

 

                                                                                                                                         Tab. 3

Data preparation for modelling

 

 

ID

Q

PAIR

BEST

WORST

RES.B

RES.W

RES

Cost

 

(1)

(2)

(3)

(4)

(5)

(6)

(7)

(8)

(9)

(10)

1

1

1

1

2

5

8

9

FALSE

0

2

1

1

2

2

7

8

9

FALSE

0

3

1

1

3

2

8

8

9

FALSE

0

4

1

1

4

2

9

8

9

FALSE

0

5

1

1

5

5

2

8

9

FALSE

0

6

1

1

6

5

7

8

9

FALSE

0

7

1

1

7

5

8

8

9

FALSE

0

8

1

1

8

5

9

8

9

FALSE

0

9

1

1

9

7

2

8

9

FALSE

0

10

1

1

10

7

5

8

9

FALSE

0

11

1

1

11

7

8

8

9

FALSE

0

12

1

1

12

7

9

8

9

FALSE

0

13

1

1

13

8

2

8

9

FALSE

0

14

1

1

14

8

5

8

9

FALSE

0

15

1

1

15

8

8

8

9

FALSE

0

16

1

1

16

8

9

8

9

TRUE

0

17

1

1

17

9

2

8

9

FALSE

0

18

1

1

18

9

5

8

9

FALSE

0

19

1

1

19

9

7

8

9

FALSE

0

20

1

1

20

9

8

8

9

FALSE

0

 

The respondent’s identification number.

Sequential identifier for BWS questions, commencing at number 1.

For each question is the sequential identifier of possible pairs of best /worst attributes

Item number in possible pairs of best-worst attributes in each question, where the attribute is considered the best

Item number in possible pairs of best-worst attributes in each question, where the attribute is considered worst

Item number that was chosen as the best by the respondents

Item number that was chosen as the worst by the respondents

response to a BWS question: if a pair of best and worst items is selected by respondents, it is marked as TRUE, otherwise it is marked as FALSE.

 

Convenience

Safety against risks

Security and confidence

Honoring the customer

health cares

Being fast

Accessibility

Flexibility

Dependence on technology

STR

(11)

(12)

(13)

(14)

(15)

(16)

(17)

(18)

(19)

(20)

1

0

0

-1

0

0

0

0

0

101

1

0

0

0

0

-1

0

0

0

101

1

0

0

0

0

0

-1

0

0

101

1

0

0

0

0

0

0

-1

0

101

-1

0

0

1

0

0

0

0

0

101

0

0

0

1

0

-1

0

0

0

101

0

0

0

1

0

0

-1

0

0

101

0

0

0

1

0

0

0

-1

0

101

-1

0

0

0

0

1

0

0

0

101

0

0

0

-1

0

1

0

0

0

101

0

0

0

0

0

1

-1

0

0

101

0

0

0

0

0

1

0

-1

0

101

-1

0

0

0

0

0

1

0

0

101

0

0

0

-1

0

0

1

0

0

101

0

0

0

0

0

-1

1

0

0

101

0

0

0

0

0

0

1

-1

0

101

-1

0

0

0

0

0

0

1

0

101

0

0

0

-1

0

0

0

1

0

101

0

0

0

0

0

-1

0

1

0

101

0

0

0

0

0

0

-1

1

0

101

 

 

 

 

 

 

 

 

 

a categorical variable that identifies any combination of respondent and question

3.11.1. Counting approach

 

The approach involves computing the frequency with which item i is selected as the best (Bin), or the worst (Win) among all the questions asked to participant n; that is, to obtain the utility score, we can use the frequency of the best-worst choices, which is the total number of times that an alternative is chosen as the best or the worst. These functions show the perceived utility of that attribute and the sensitivity of the respondent's perception and preferences to changes in the attributes.

The scores can be classified into two main groups: disaggregated scores, which pertain to the individual level, and aggregated scores, which pertain to the total level. The computation details of these scores are presented in Tab. 4. It is deductible from the table that the disaggregated standardized BW score  falls within the range of -1 to +1 because the minimum and maximum values of BWin are -r and +r when respondent n selects item i as the worst and best among all questions containing item i, respectively. BWin equals zero if respondent n chooses item i as both the best and the worst equally often, or if respondent n does not choose item i as either the best or the worst. The value of SSQBWi provides with an understanding of the relative significance of various items. For instance, values of 0.5 and 0.24 for items i and j, respectively, indicate that item i is nearly two times more important than item j.

 

                                                                                                                                           Tab. 4

Different scores of the counting approach

 

Disaggregated scores

Aggregated scores

=                  (4)

=                (5)

= ,           (6)

=                                                                  (7)

=                                                              (8)

=                                                 (9)

r: Frequency of item i in all questions

N: Number of people who responded to the survey

 

3.11.2. Modelling approach

 

The method involves the use of discrete choice models to scrutinize the replies. The details of the method have been presented in the following studies, and the interested reader may refer to those works. Assume respondents choose item i as the best and item j as the worst based on their particular utilities (v). The probability of this selection is expressed as the following CL model:

 

                                                                                            (10)

 

In order to calculate the share of preference for a specific item i ( ), the estimated utility coefficients are converted using the CL model choice rule.

 

                                                                                                                     (11)

 

The clogit() function in the survival package can be utilized to conduct an analysis on responses to BWS questions in CL model. To analyze BWS questions where respondents evaluate j items, the model formula is usually structured in the following way:

 

                                RES ~ ITEM1 + ITEM2 + ... + ITEMj-1 + strata(STR)                          (12)

 

where the state variable, ITEMk, is associated with the potential best and worst item pairs.

 

When item k is considered as the best item in a given pair, ITEMk takes a value of 1, while it takes a value of -1 if item k is considered as the worst item in a given pair. When item k is not part of any potential best and worst item pairs, ITEMk takes a value of 0. The variable ITEMj (j-th item), has been left out of the equation because its coefficient should be set to zero to establish a reference point. strata(STR) is utilized to distinguish each respondent and BWS question combination. RES and STR have been defined earlier [16].

 

 

4. FINDINGS

 

4.1. Counting approach

 

The results presented in Fig. 4. and Fig. 5. are obtained based on the counting approach for the disaggregated and aggregated scores, respectively. The attributes of safety against risks, security and confidence, being fast, accessibility, and flexibility (in terms of time and choice of intermediary destination, etc.) have positive BW standard scores. It means that these attributes are more likely to be chosen as the most important than the least important, and the other attributes are conversely more likely to be chosen as the least important.

 

 

Fig. 4. The results of the disaggregated scores (at the individual level)

 

 

Fig. 5. The results of aggregated scores (at the total level)

 

Comparing std.sqrt.BW (aggregated) values shows that the most important roles belong to security and confidence (1.0000), and accessibility (0.8349), respectively. These attributes are approximately 4.2 and 3.5 times more significant than the least attribute of compliance with health cares and social distance with the value of std.sqrt.BW (0.2370), respectively. The order of attributes using the counting approach is as follows:

Security and confidence

Accessibility

Flexibility

Safety against risks

Being fast

Cost

Convenience

Honoring the customer

Dependence on technology (cell phone and Internet)

Compliance with health cares and social distancing

 

4.2. Modelling approach

 

The results of the CL model based on the data set created with the assumption that the coefficient of ITEM8 (accessibility) is normalized to zero are presented in Fig. 6.

 

 

Fig. 6. The results of the modelling approach

 

The column p value indicates that all attributes are significantly different from zero at the 1% level. Since the coefficient of ITEM8 is normalized to zero, the other coefficients show the difference in value from the coefficient of ITEM8. Therefore, since the coefficient of ITEM4 is positive, while the coefficients of the rest of the attributes are negative, it is concluded that ITEM4 is more important than ITEM8, and the rest are less important than ITEM8. The comparison of the estimated coefficients of the CL model with the available standardized BW (stdBW) score is shown in Fig. 7.

 

 

Fig. 7. Comparison of the estimated coefficients of the CL model with
the standardized BW score (stdBW)

 

Fig. 8 and Fig. 9 show the relationship between the two vectors clogit and stdBW. As expected, the correlation between the two vectors is significant. Share of preference is obtained cumulatively using the addmargins function (Fig. 10). It is an easy measure to interpret. For example, the most important attributes of security and confidence (0.204) and accessibility (0.154) are approximately 3.5 and 2.7 times more important than the lowest attribute of compliance with health cares and social distancing (0.058), respectively.

 

 

Fig. 8. The relationship between two vectors clogit and stdBW


 

The order of attributes based on the modelling approach and using the shares of preference is as follows:

Security and confidence

Accessibility

Flexibility

Safety against risks

Being fast

Convenience

Cost

Honoring the customer

Dependence on technology (cell phone and Internet)

Compliance with health care and social distancing

 

 

Fig. 9. Correlation between two vectors clogit and stdBW

 

 

Fig. 10. Shares of preferences of attributes

 

 

5. CONCLUSION

 

Taxi-hailing attributes can be ranked using the BWS-Case 1. Modelling can be done by using the counting approach based on the number of times (frequency) that attribute i is chosen as the best (Bin) or worst (Win) alternative among all questions of the n respondents or performed by using the conditional logit model in the modelling approach. As can be seen in Tab. 5 the results obtained from the two methods are very similar, and in this study, among the ten attributes examined for Taxi-hailing, there was a difference only in the attributes of the sixth rank and the seventh rank, where the compared values were very close to each other.

It can be seen in Fig. 11 that the attributes of security and confidence, accessibility, flexibility (in terms of time and choice of intermediary destination, etc.) and safety against the risks are above the mean line, and the rest of the attributes are below. Therefore, it is suggested to define levels for these attributes in future studies and to conduct a more detailed study on these attributes using the BWS-Cases 2 and 3.

                                                                                                                                         Tab. 5

Ranking comparison of the counting and modelling approaches

 

Rank

Counting approach

Modeling approach

1

Security and confidence

Security and confidence

2

Accessibility

Accessibility

3

Flexibility

Flexibility

4

Safety against risks

Safety against risks

5

Being fast

Being fast

6

Cost

Convenience

7

Convenience

Cost

8

Honoring the customer

Honoring the customer

9

Dependence on technology

(cell phone and Internet)

Dependence on technology

(cell phone and Internet)

10

Compliance with health care and social distancing

Compliance with health care and social distancing

 

Also, it can be concluded that among the attributes of the Taxi-hailing, mental and spiritual attributes such as security and confidence, and safety against risks that cause mental peace, as well as the attributes of accessibility and flexibility (in terms of time and choice of intermediary destination, etc.) that provide comfort to passengers, are more important than the attributes of being fast, cost, convenience, honoring the customer, dependence on technology (cell phone and Internet) and compliance with health cares and social distancing, which mostly go back to material attributes and physical health and well-being.

 

 

Fig. 11. Scores of attributes in counting approach and modelling approach
and comparison with a mean line

 

 

References

 

1.    Li Han-ru. 2016. "Taxi positioning in the new age of internet and industrial development research". Procedia Engineering 137: 811-16. ISSN 1877-7058. DOI: 10.1016/j.proeng.2016.01.320.

2.    Dow Chyi-Ren, Duc-Binh Nguyen, Shr-Chen Wang, Shiow-Fen Hwang, Ming Fong Tsai. 2016. "A geo-aware taxi carrying management system by using location based services and zone queuing techniques on internet of things". Mobile Information Systems 2016: 9817374. DOI: 10.1155/2016/9817374.

3.    Bou Mjahed Lama, Archak Mittal, Amr Elfar, Hani S. Mahmassani, Ying Chen. 2017. "Exploring the role of social media platforms in informing trip planning: Case of yelp.Com". Transportation Research Record 2666: 1-9. DOI: 10.3141/2666-01.

4.    Chen Ying, Hani S. Mahmassani, Andreas Frei. 2018. "Incorporating social media in travel and activity choice models: Conceptual framework and exploratory analysis". International Journal of Urban Sciences 22(2): 180-200. ISSN: 1226-5934. DOI: 10.1080/12265934.2017.1331749.

5.    Alemi F., G. Circella, S. Handy, P. Mokhtarian. 2018, "What influences travelers to use uber? Exploring the factors affecting the adoption of on-demand ride services in california". Travel Behaviour and Society 13: 88-104. DOI: 10.1016/j.tbs.2018.06.002.

6.    Song Fangqing. 2019. "Understanding mode choice behaviour when new modes come into play". Ph.D. Dissertation. University of Leeds.

7.    Jing Peng, Yuanyuan Chen, Xingyue Wang, Kewen Pan, Daibiao Yuan. 2021. "Evaluating the effectiveness of didi ride-hailing security measures: An integration model". Transportation Research Part F: Traffic Psychology and Behaviour 76: 139-66. ISSN: 1369-8478. DOI: 10.1016/j.trf.2020.11.004.

8.    Akbari Morteza, Nader Seyyed Amiri, Miguel Ángel Zúñiga, Hamid Padash, Hodjat Shakiba. 2020, "Evidence for acceptance of ride-hailing services in iran". Transportation Research Record 2674(11): 289-303. ISSN: 0361-1981. DOI: 10.1177/0361198120942224.

9.    Louviere Jordan J., George G. Woodworth. 1991. Best worst scaling: A model for largest difference judgments. University of Alberta, Working Paper.

10.  Louviere Jordan J., Terry N. Flynn, Anthony Alfred John Marley. 2015. Best-worst scaling: Theory, methods and applications. Cambridge University Press.

11.  Lancsar E., J. Louviere, C. Donaldson, G. Currie, L. Burgess. 2013, "Best worst discrete choice experiments in health: Methods and an application". Social Science and Medicine 76(1): 74-82. ISSN: 0277-9536. DOI: 10.1016/j.socscimed.2012.10.007.

12.  He F., Z.J.M. Shen. 2015. "Modeling taxi services with smartphone-based e-hailing applications". Transportation Research Part C: Emerging Technologies 58: 93-106. DOI: 10.1016/j.trc.2015.06.023.

13.  Lancsar Emily, Denzil G. Fiebig, Arne Risa Hole. 2017. "Discrete choice experiments: A guide to model specification, estimation and software". PharmacoEconomics 35(7): 697-716. DOI: 10.1007/s40273-017-0506-4.

14.  Echaniz Eneko, Chinh Q. Ho, Andres Rodriguez, Luigi dell'Olio. 2019, "Comparing best-worst and ordered logit approaches for user satisfaction in transit services". Transportation Research Part A: Policy and Practice 130: 752-69. ISSN: 0965-8564. DOI: 10.1016/j.tra.2019.10.012.

15.  Burton Nichola, Michael Burton, Dan Rigby, Clare A.M. Sutherland, Gillian Rhodes. 2019. "Best-worst scaling improves measurement of first impressions". Cognitive research: principles and implications 4: 1-10(36). ISSN: 2365-7464. DOI: 10.1186/s41235-019-0183-2.

16.  Aizaki Hideo, James Fogarty. 2019. "An r package and tutorial for case 2 best–worst scaling". Journal of Choice Modelling 32: 100171. ISSN: 1755-5345. DOI: 10.1016/j.jocm.2019.100171.

17.  Aizaki Hideo, James Fogarty. 2023. "R packages and tutorial for case 1 best–worst scaling". Journal of Choice Modelling 46: 100394. ISSN: 1755-5345. DOI: 10.1016/j.jocm.2022.100394.

18.  Aizaki Hideo, Tomoaki Nakatani, Kazuo Sato. 2014. Stated preference methods using r. CRC Press.

19.  McFadden Daniel. 1974. "Conditional logit analysis of qualitative choice behavior." In: Frontiers in econometrics. Paul Zarembka. New York: Academic Press, 105-42.

20.  Flynn Terry N., Jordan J. Louviere, Tim J. Peters and Joanna Coast. 2007. "Best–worst scaling: What it can do for health care research and how to do it". Journal of Health Economics 26(1): 171-89. ISSN: 0167-6296. DOI: 10.1016/j.jhealeco.2006.04.002.

21.  Louviere Jordan, Ian Lings, Towhidul Islam, Siegfried Gudergan, Terry Flynn. 2013, "An introduction to the application of (case 1) best–worst scaling in marketing research". International journal of research in marketing 30(3): 292-303. ISSN: 0167-8116. DOI: 10.1016/j.ijresmar.2012.10.002.

22.  Thompson Steven K. 1987. "Sample size for estimating multinomial proportions". The American Statistician 41(1): 42-46. ISSN: 00031305. DOI: 10.2307/2684318.

23.  Cochran William Gemmell. 1977. Sampling techniques. 3d. New York: Wiley.

 

 

Received 21.10.2023; accepted in revised form 05.01.2024

 

 

Scientific Journal of Silesian University of Technology. Series Transport is licensed under a Creative Commons Attribution 4.0 International License

 



[1] Faculty of Engineering, Imam Khomeini International University, Qazvin, Iran. Email: m.makaremi.sh@gmail.com. ORCID: https://orcid.org/0009-0002-3545-2069

[2] Faculty of Engineering, Imam Khomeini International University, Qazvin, Iran. Email: rasafi@eng.ikiu.ac.ir. ORCID: https://orcid.org/0000-0002-3419-0194