ABSTRAK Vita Nova Anwar (2024). Pengembangan Model Pedagogi Digital dalam Pembelajaran Matematika Terintegrasi Computational Thinking untuk Meningkatkan Kemampuan Problem Solving Siswa SMP Penggunaan teknologi yang melibatkan keterampilan computational thinking merupakan kualifikasi yang diperlukan pada abad 21 ini. Computational thinking erat kaitannya dengan pembelajaran matematika. Dalam memecahkan masalah matematika yang kompleks penting untuk mengikuti langkah-langkah penyelesaian masalah sesuai tahapan computational thinking. Sehingga diperlukan model pembelajaran yang sesuai untuk mendukung hal tersebut. Penelitian ini bertujuan untuk mengembangkan model pedagogi digital dalam pembelajaran matematika terintegrasi computational thinking untuk meningkatkan kemampuan problem solving siswa SMP. Penelitian ini adalah penelitian pengembangan yang mengikuti model Plomp meliputi preliminary research, prototyping phase, dan assesment phase yang diuraikan secara deskriptif. Penelitian dilaksanakan pada dua sekolah SMP di kota Padang yang melibatkan 56 orang siswa kelas VIII. Aktivitas computational thinking yang dikembangkan dalam pembelajaran matematika terdiri dari aktivitas langsung dan aktivitas digital yang berbantukan teknologi. Instrumen penelitian adalah lembar validasi, lembar penilaian kepraktisan oleh guru, dan soal tes kemampuan problem solving. Hasil uji validitas menunjukkan bahwa dari segi konten, bahasa, penyajian dan kegrafikan sudah memenuhi kriteria valid dan sangat valid. Hasil uji praktikalitas memenuhi kriteria sangat praktis. Hasil uji efektivitas menunjukkan bahwa terdapat peningkatan kemampuan problem solving siswa. ABSTRACT Vita Nova Anwar (2024). Development of a Digital Pedagogical Model on Integration of Computational Thinking in Mathematics Learning to Improve Students' Problem Solving Abilities The use of technology involving computational thinking skills is a required qualification today. Computational thinking is closely related to mathematics learning. In solving complex mathematical problems, it is important to follow the problem solving steps according to the stages of computational thinking. So an appropriate learning model is needed to support this. This research aims to develop a digital pedagogy model in mathematics learning integrated with computational thinking to improve problem solving abilities of junior high school students. This research is development research that follows the Plomp model including preliminary research, prototyping phase, and assessment phase which are described descriptively. The research was carried out at two schools in Padang, involving 56 students in grade VIII. The research instruments were a validation sheet, a practicality assessment sheet by the teacher, and problem solving ability test questions. The results of the validity test show that in terms of content, language, presentation and graphics it meets the valid and very valid criteria. The results of the practicality test carried out are very practical criteria. The results of the effectiveness test show that there is an increase in students' problem solving abilities.
Item Type: | Thesis (S3) |
---|---|
Additional Information: | https://scholar.google.com/citations?user=_fWHHi0AAAAJ&hl=id ID SINTA Dosen Pembimbing Darhim : 6166301 Suhendra : 6140435 Elah Nurlaelah : 6665327 |
Uncontrolled Keywords: | Pedagogi Digital, Computational Thinking, Problem Solving Digital Pedagogy, Computational Thinking, Problem Solving |
Subjects: | |
Divisions: | |
Depositing User: | Vita Nova Anwar |
Date Deposited: | 06 Sep 2024 08:26 |
Last Modified: | 06 Sep 2024 08:26 |
URI: |
View Item |
Yarong Chen
College of Mechanical and Electronic Engineering, Wenzhou University, Wenzhou, China
Contribution: Data curation, Funding acquisition, Resources, Supervision, Validation, Writing - original draft
Junjie Zhang
Contribution: Investigation, Methodology, Visualization, Writing - original draft
Corresponding Author
Mudassar Rauf
Correspondence
Mudassar Rauf, College of Mechanical and Electronic Engineering, Wenzhou University, Room A601b, Building No.6, Wenzhou, Zhejiang, China.
Email: [email protected]
Contribution: Conceptualization, Formal analysis, Resources, Supervision, Validation, Writing - review & editing
Jabir Mumtaz
Contribution: Formal analysis, Methodology, Resources, Visualization, Writing - review & editing
Shenquan Huang
Contribution: Data curation, Methodology, Project administration, Software, Visualization
A hybrid flow shop is pivotal in modern manufacturing systems, where various emergencies and disturbances occur within the smart manufacturing context. Efficiently solving the dynamic hybrid flow shop scheduling problem (HFSP), characterised by dynamic release times, uncertain job processing times, and flexible machine maintenance has become a significant research focus. A NeuroEvolution of Augmenting Topologies (NEAT) algorithm is proposed to minimise the maximum completion time. To improve the NEAT algorithm's efficiency and effectiveness, several features were integrated: a multi-agent system with autonomous interaction and centralised training to develop the parallel machine scheduling policy, a maintenance-related scheduling action for optimal maintenance decision learning, and a proactive scheduling action to avoid waiting for jobs at decision moments, thereby exploring a broader solution space. The performance of the trained NEAT model was experimentally compared with the Deep Q-Network (DQN) and five classical priority dispatching rules (PDRs) across various problem scales. The results show that the NEAT algorithm achieves better solutions and responds more quickly to dynamic changes than DQN and PDRs. Furthermore, generalisation test results demonstrate NEAT's rapid problem-solving ability on test instances different from the training set.
The hybrid flow shop scheduling problem (HFSP) is extensively applied in chemical, textile, steel, and semiconductor industries and is recognised as an NP-hard problem [ 1 ]. HFSP exhibits greater complexity compared to traditional flow shop scheduling problems, as it integrates parallel machines and flow shop scheduling issues [ 2 ]. In the context of smart manufacturing, the complexity and uncertainty of manufacturing systems have significantly escalated due to the demand for multi-variety, small-batch, and personalised product [ 3 ]. Addressing the HFSP effectively, especially considering dynamic events, has thus become a central focus of production scheduling research. The dynamic hybrid flow shop scheduling problem (DHFSP) is deemed more valuable compared to the classic job shop problem since it includes most of the scheduling problems associated with high-mix, low-volume production [ 4 ]. In addition, DHFSP tackles the uncertainties that naturally exist in the production environment, including dynamic demand, machine unavailability, and uncertain process time [ 5 ]. Consequently, this paper focuses on DHFSP with uncertain process times, dynamic job arrivals, and flexible maintenance. There are four major challenges in solving the presented DHFSP. First, uncertainty in job processing times makes it difficult to predict and plan the scheduling accurately, increasing the complexity of the scheduling problem. Second, jobs arriving at different times dynamically add another layer of complexity, requiring real-time adjustments to the schedule. Third, the need for flexible and often unpredictable machine maintenance schedules complicates the decision-making process. Thus, incorporating uncertain process times, dynamic job arrivals, and flexible maintenance significantly complicates decision-making in DHFSP. Fourth, rapid and efficient decision-making is crucial, especially with the advancements in Industry 4.0, where quick responses to changing conditions are essential.
In recent decades, dynamic machine scheduling has been the subject of extensive research [ 6 - 9 ]. Three primary dynamic scheduling methods identified in the literature are completely reactive scheduling (CRS) [ 10 ], predictive reactive scheduling (PRS), and robust scheduling (RS) [ 11 ]. CRS methods, exemplified by dispatching rules like the shortest processing time (SPT) rule [ 12 ] and the First in First out rule [ 13 ] offer rapid solution generation. However, their short-term focus often produces suboptimal solutions for complex, long-term scheduling problems. PRS, a rescheduling approach, initially creates a complete schedule using heuristic or meta-heuristic methods. Upon encountering dynamic disturbances, a new schedule based on the current state is generated to maintain the stability of the manufacturing system. Commonly used meta-heuristic algorithms include genetic algorithm (GA) [ 14 , 15 ], particle swarm optimisation [ 16 - 18 ] and ant colony optimisation [ 19 - 21 ]. While PRS can achieve near-optimal solutions, it may incur high preparatory costs due to potential underestimation of deviations between new and original scheduling solutions. RS anticipates disturbances, adopts preemptive measures, and generates solutions that consider current conditions and potential disruptions.
For HFSP with frequent dynamic events, the regular application of PRS-generated solutions may negatively impact production process stability. While the RS method ensures manufacturing robustness, it often compromises scheduling effectiveness. From the standpoint of real-time response to frequent dynamic changes, CRS is preferable. However, traditional dispatching rules, tailored for specific dynamic scenarios, lack the ability to self-adjust. With the advancement of machine learning, reinforcement learning (RL) methods have become increasingly popular for learning optimal scheduling policies based on the production state at the decision moment [ 22 ].
Over the past few decades, RL-based dynamic scheduling methods have garnered significant attention in research. These methods typically employ agents to select different actions, often scheduling rules, at various decision moments to minimise the impact of uncertainty on the objective value. Presently, there are two primary approaches: classical RL and deep reinforcement learning (DRL), both of which have been effectively applied in machine scheduling problems. Classical RL methods, utilising algorithms like Q-learning [ 23 , 24 ] and Sarsa [ 25 ], address Markov decision process problems but necessitate predefined Q-table sizes. Despite their notable achievements, classical RL methods encounter the challenge of state explosion, especially in manufacturing scenarios with discrete state spaces where the state space can be vast, making the maintenance of a large Q-table impractical. Moreover, classical RL struggles with unknown state spaces. To address the limitations of individual RL models, researchers have proposed ensemble RL methods which combine multiple learning algorithms to improve robustness and performance. Liu et al. [ 24 ] proposed a Q-learning based ensemble evolutionary algorithm for FSP which leverage the evolutionary algorithm's exploration capabilities and RL's policy optimisation. In contrast, DRL has significantly advanced in handling sequential decision-making in complex, discrete state spaces [ 26 ]. It leverages artificial neural networks (ANN) as approximators for value functions, allowing the direct input of state feature vectors into an ANN to evaluate the q -value of each potential action, thus addressing the state explosion issue. However, traditional DRL models, such as Actor-Critic (AC) and Deep Q-Network (DQN), often require extensive experimental validation and significant manual effort.
NeuroEvolution of Augmenting Topologies (NEAT) algorithm, proposed by Stanley et al. [ 27 ] is a GA used for evolving ANN. Unlike conventional neural network algorithms that only optimise the weights, the NEAT algorithm simultaneously optimises both the topology (structure) and the weights of the neural network [ 23 ]. This dual optimisation allows the algorithm to adapt better to dynamic hybrid flow shop scheduling complexities. Its ability to evolve and adjust the neural network structure makes it robust in handling unpredictable changes in job arrivals, process times, and maintenance schedules [ 28 ]. NEAT integrates the strengths of both RL (for learning optimal policies through interaction with the environment) and evolutionary algorithms (for exploring and optimising complex solution spaces). This hybrid approach leverages the benefits of both methodologies to effectively address the scheduling problem. This integration facilitates automated neural network design and reduces manual labour. Thus, NEAT requires less work than various problem-specific DRL approaches. Therefore, this paper emphasises the NEAT algorithm for addressing the HFSP, considering dynamic release times, uncertain job processing times, and flexible preventive maintenance. Table 1 summarises the distinctions between the proposed method and existing RL-based approaches for dynamic HFSP, covering aspects such as dynamic events, optimisation objectives, and algorithms.
Ref. | Objectives | Dynamic events | Agent | Algorithm | |||||
---|---|---|---|---|---|---|---|---|---|
Job arrival | Machine maintenance | Skill level | Worker fatigue | Process time | SA | MA | |||
[ ] | TT | √ | √ | NEAT | |||||
[ ] | OSP | √ | √ | Q-learning | |||||
[ ] | MS | √ | √ | √ | GA-RL | ||||
[ ] | MS | √ | √ | CMA | |||||
[ ] | MS & EC | √ | √ | DRL + GIN | |||||
[ ] | MS | √ | √ | DQND | |||||
[ ] | MS | √ | √ | √ | PFSPNet-AC | ||||
[ ] | MS & TT | √ | √ | PPO | |||||
[ ] | TT | √ | √ | DDPG | |||||
[ ] | MS | √ | √ | GCN | |||||
This paper | MS | √ | √ | √ | √ | NEAT |
It can be concluded from Table 1 that researchers have often considered only a single aspect of dynamic events, such as machine maintenance, uncertain processing times, or dynamic job release, with limited studies addressing multiple dynamic events simultaneously. Han et al. [ 30 ] first studied the RL-based HFSP scheduling method while considering the single dynamic event. Wang et al. [ 32 ] introduced a cooperative memetic algorithm with an RL strategy for energy-aware distributed hybrid flow shop scheduling problems. Dong et al. [ 33 ] explored the application of DRL and graph isomorphic networks in reducing the delay criteria for permutation flow shop scheduling issues. However, studies considering multiple dynamic events concurrently are scarce, with a notable example being Lang et al. [ 29 ] who developed the NEAT algorithm to address the two-stage HFSP incorporating product family set-up times. Consequently, this paper addresses the dynamic scheduling of hybrid flow shops, integrating flexible preventive maintenance for machines and multiple dynamic events such as dynamic job release and uncertain processing times.
Moreover, most existing studies concentrate on single-agent frameworks. For instance, Yan et al. [ 34 ] applied DRL to address the distributed permutation flow shop scheduling problem with flexible preventive maintenance, while Pan et al. [ 35 ] developed an efficient optimisation algorithm using DRL to tackle the permutation flow shop scheduling problem, aiming to minimise the maximum completion time. Conversely, research on multi-agent RL remains scarce. Zhang et al. [ 34 ] proposed a deep multi-agent graph method for flexible job shop scheduling problem (FJSP) with the objective of minimising completion time. Jing et al. [ 37 ] developed a multi-agent graph convolutional network-based algorithm for FJSP, aiming to reduce maximum completion time. Liu et al. [ 31 ] introduced a hybrid approach combining a GA with a multi-agent DRL system to manage HFSP, taking into account factors such as worker fatigue and skill levels. This study proposes a learning policy tailored for multi-agent systems, facilitating centralised training and distributed execution, which is particularly suited to the nature of parallel machines.
In summary, research on RL algorithms for HFSP has yielded noteworthy outcomes, laying a solid groundwork for this paper's investigations. Given the NEAT algorithm's adaptability in neural network topology and weight parameters, this study applies NEAT to the dynamic HFSP while considering flexible preventive maintenance of machines, dynamic job release, and uncertain job processing times. The aim of this paper is to provide real-time and intelligent scheduling solutions for the dynamic HFSP. The main contributions of this paper are as follows: (1) A learning strategy for multi-agent systems that integrates centralised training with distributed execution is proposed to enhance scheduling efficiency. (2) A maintenance decision-learning scheduling action, based on maintenance thresholds and aimed at leveraging the flexibility of machine preventive maintenance, is introduced. (3) A scheduling action for the dynamic release of jobs is proposed, which avoids selecting jobs at specific decision points to broaden the solution space.
The structure of the remainder of this paper is as follows: Section 2 details the problem addressed and its Mixed Integer Programming (MIP) model. Section 3 provides an in-depth discussion of the proposed NEAT algorithm. Section 4 presents an analysis of experiments and results from the application of the proposed algorithm, including a comparison with other algorithms. Section 5 concludes the paper and outlines future research directions.
The HFSP with dynamic events examined in this paper involves n dynamically release jobs J i ( i = 1, 2, …, n ) processed across K ( K ≥ 2) stages, where each stage k contains m k ( m k ≥ 1; k = 1, 2, …, K ) machines, with at least one stage featuring parallel machines. The model incorporates flexible preventive maintenance based on actual production needs, stipulating that a machine's continuous processing time cannot exceed the maintenance threshold ( UT ), with the maintenance duration denoted as t m . The objective of this study is to minimise the maximum completion time. The completion of a job at a stage or the release of a new job triggers a decision-making process aimed at optimising the objective value. This is achieved by strategically assigning the dynamically released jobs to the machines to ensure efficient scheduling. The decision of this problem includes that how to assign the jobs to the machine at each stage and determine the sequence of jobs in each machine. In addition, the decision of when to conduct preventive maintenance is also needed to determine.
The assumptions made in this study are as follows: (1) Each machine can process only one job at a time. (2) The processing time for a job is consistent across all machines at stage k . (3) Once the processing sequence of jobs at stage 1 is established, the sequence on machines at subsequent stages follows accordingly. (4) The setup time for machines and the transportation time of jobs between adjacent stages are disregarded. To construct the MIP model for the problem under study, symbols and variables are defined to characterise the problem as presented in Table 2 .
Notation | Description |
---|---|
Indices: | |
Number of jobs, indexed by , = 1,2,…, | |
Number of the processing stage, indexed by , = 1,2,…, | |
Number of machines at stage , indexed by , = 1,2,…, | |
Position index of jobs on machine at stage , = 1,2,…, | |
Parameters: | |
UT | Maintenance threshold |
Maintenance time | |
B | A large positive integer |
The processing time of job at stage | |
The release time of job | |
Maximum completion time | |
, | The start processing time of job at stage |
, | The completion time of job at stage |
Continuous cumulative processing time at position of machine at stage | |
Decision variables: | |
If job is assigned to machine at stage , = 1, otherwise, = 0 | |
If job is processed at position , = 1, otherwise, = 0 | |
If maintenance is conducted immediately after the position of machine at stage , then = 1, otherwise, = 0 |
Equation ( 1 ) aims to minimise the maximum completion time. Equation ( 2 ) specifies how to calculate this maximum completion time. Equation ( 3 ) specifies that each job is assigned to one machine at one stage. Equations ( 4 ) and ( 5 ) ensure that each job is processed by only one position of one machine at any given stage. Equations ( 6 ) ensures that if there is a job in the next position on the same machine, then there must be a job in the previous position. Equations ( 7 ) and ( 8 ) establish the start and completion times for jobs at the first processing stage. Equation ( 10-11 ) determines that the cumulative continuous processing times of the first position of each machine at any given stage. Equations ( 10 ) and ( 11 ) determines that the cumulative continuous processing times on a machine do not exceed the maintenance threshold. Equations ( 12 ) determines that the cumulative continuous processing times on a machine do not exceed the maintenance threshold. Equations ( 13 ) and ( 14 ) determine the start and completion times for the job J i ${J}_{i}$ at stage k $k$ . Equation ( 15 ) ensures that the same machine is constrained in terms of backward and forward relationships.
To demonstrate the problem addressed in this paper, consider the following example: There are six jobs ( n = 6) to be processed through three stages ( K = 3), with the number of machines at each stage being { m 1 , m 2 , m 3 } = {2, 1, 2}. The release times of the jobs are R = {1, 1, 1, 3, 2, 1}, and the processing times at each stage are P = {[1, 1, 1], [3, 2, 3], [3, 5, 2], [5, 1, 5], [5, 3, 5]}. The maintenance threshold is set at UT = 10, and the maintenance time is t m = 2. In one of the solution scenarios illustrated by a Gantt chart in Figure 1 , the processing sequence at stage 1 is PS = [ J 1 , J 2 , J 5 , J 6 , J 4 , J 3 ]. At stage 2, jobs are processed based on the first-come-first-served (FCFS) rule. At decision moment 2, job J 1 finishes at stage 1 and immediately starts at stage 2. By decision moment 4, jobs J 2 and J 5 finish at stage 1; however, with only one machine ( M 21 ) is available at stage 2, job J 2 is processed first in line with the established sequence PS. Following this approach for subsequent stages leads to a solution with a maximum completion time of 23.
Gantt chart of a solution for an example.
The accuracy of the mathematical model was verified by CPLEX in python. A simple verification exercise was performed to ensure the correctness and credibility of the mathematical model. Several small test problems consisting of three machines, six jobs, and two stages were generated. The first stage comprised two machines, while the second stage had one machine. We programmed the mathematical model in the CPLEX in python to solve these test problems. The solution obtained by the mathematical model was compared with the enumeration method. The experimental results showed that the solutions from the model were consistent with those obtained by the enumeration method, thereby validating the mathematical model.
Stanley et al. first introduced the NEAT algorithm [ 27 , 38 ]. It is based on GA, which explores acceptable artificial neural network structures and optimal parameters for a specific machine-learning problem [ 26 ]. The standard GA employs a search method based on the principles of natural evolution. It starts by randomly creating an initial population of solution candidates, often known as genomes. Subsequently, the algorithm introduces mutations and crossovers to the random pairs of solutions to create new ones during multiple generations until a termination condition is satisfied, such as reaching a maximum number of generations [ 39 ]. GA-based NEAT has similarities with RL techniques due to its properties [ 23 ]. Both approaches do not rely on labelled training data. Instead, they create learning signals using fitness or reward functions. These functions assess the quality of the output produced by any artificial neural network solution. The NEAT algorithm enhances the weight parameters and the topology of neural networks, unlike traditional DRL algorithms such as DQN and AC [ 29 , 40 ]. NEAT utilises evolutionary algorithms to refine neural network structure and connection weights, addressing complex dynamic decision-making challenges. As a combination of GA and DRL, NEAT employs the population dynamics, crossover, and mutation processes of GA to evolve neural networks [ 41 ]. This research utilises the NEAT algorithm to tackle the DHFSP. The sorting of jobs at stage 1 is considered a dynamic decision-making process. A multi-agent system employing centralised training with distributed execution is developed to facilitate the learning of the scheduling policy, where the sorting results of jobs at stage 1 are used as selection constraints for subsequent stages. The NEAT algorithm and the scheduling model are implemented separately and are only linked through an input/output interface as shown in Figure 2 . Eight state features are extracted from the production environment as inputs, and six dispatching rules are defined as actions for the NEAT algorithm to minimise the maximum completion time.
NEAT algorithm and scheduling environment interactions.
The process begins with parameter initialisation, followed by the construction of a neural network that interacts with the environment to derive fitness values. The initial population consists of simple networks, usually including input and output nodes with randomly assigned weights and no hidden nodes. During the process of evolution, mutations introduce more nodes and connections, increasing the network structure's complexity. The fitness function is determined by simulating the scheduling process using each neural network. The most successful neural networks are chosen using an elitist method to generate the next generation. Offspring networks are produced via crossover and mutation, progressively evolving to yield an agent with the optimal scheduling policy. This methodology is illustrated in Figure 3 . Based on the references [ 2 , 29 ], an interaction module between the agent and the hybrid flow shop system environment is crafted, with the corresponding pseudocode shown in Algorithm 1 . The iteration process continues for several generations until convergence is achieved through evaluation, selection, crossover, and mutation.
The framework of the proposed NEAT algorithm.
For each neural network P:
Initialisation state S o , the number of stages K , processing sequence table PS = {∅}
For each decision moment t d
For k = 1 to K do
If current stage k = 1 and there are idle machines in stage k do
While sequentially apply agents of idle machines do
Update the instantaneous state S t d ${S}_{{t}_{d}}$ , and execute the action a t d ${a}_{{t}_{d}}$ , place the selected job into the machining sequence list PS
End While
If current stage k ≠ 1 and there are idle machines in stage k do
While sequentially select idle machines do
Assign jobs to the machines according to the sequence in the list PS
If all jobs have been completed do
Calculate the maximum completion time C max
Output fitness value Fitness = 1 C max $\text{Fitness}=\frac{1}{{C}_{\max }}$
The NEAT algorithm employs a unique genetic encoding method to represent the structure of the neural network. Genetic encoding involves modelling neural networks as genomes, which are essentially lists of genes. This encoding allows the NEAT algorithm to evolve both the structure (topology) and the weights of neural networks over successive generations. Each neural network is encoded as a genome consisting of two main parts: the node gene and the connection gene. Node genes represent the neurons in the neural network. Each node gene contains information about the node type and its unique identifier. Connection genes represent synaptic connections between neurons. Each connection gene includes the input node, the output node, the weight of the connection, the enable bit (indicating whether the connection is active), and an innovation number. The genetic encoding used for NEAT is designed to allow corresponding genes to be easily combined during crossover events. Genomes are linear representations of network connectivity and include the information of the two node genes being connected. This design facilitates the evolutionary process, enabling the NEAT algorithm to explore and optimise neural network structures efficiently.
The mutation and crossover operations are performed at each iteration in the NEAT algorithm. Mutation introduces random changes to a genome, which is essential for introducing new genetic material into the population and enabling the exploration of the solution space [ 27 ]. Two types of mutations are shown in Figure 4 . Add connection mutation adds a new connection between two previously unconnected nodes with a random weight, enabling the network to form new pathways and potentially discover new features and relationships in the data as shown in Figure 4a . Add node mutation splits an existing connection by inserting a new node. The original connection is disabled, and two new connections are added as shown in Figure 4b . This allows the network to evolve new layers and more complex behaviours. Crossover is a genetic operation used in NEAT to combine the genetic material of two parent genomes to produce offspring [ 42 ]. NEAT uses innovation numbers to align matching genes correctly, ensuring that similar structures from different parents are combined effectively as shown in Figure 5 . If a gene is present in one parent but not the other, it can be inherited from the more fit parent or randomly included/excluded.
The mutation operations of the NEAT algorithm.
The crossover operations of the NEAT algorithm.
The agent creates a scheduling action based on the state features, which need to accurately represent the current system state, encompassing both job and machine information. As these state features form the sole input for generating scheduling actions, they must encapsulate data crucial for evaluating these actions. In alignment with the action space and considering the fundamental details of certain jobs, eight state features ( ft 1 , ft 2 , …, ft 8 ) have been adopted from literature [ 43 - 45 ] and given in Table 3 . State features ft 1 , ft 2 , …, ft 5 represent job information in the buffer, while state features ft 6 , ft 7 , ft 8 represent machine and decision-related state information in the hybrid flow shop at stage 1. The dimensions of all state features are 1 + n ∗ 4 + m 1 ∗ 3 (where m 1 is the number of machines at stage 1 and n is the number of jobs).
No. | State characteristics | Definition |
---|---|---|
1 | = | The number of jobs in the buffer pool set ( ) at processing stage 1 |
2 | The quantity of job in the job buffer pool of processing stage 1 , = {0,1} | |
3 | The time interval between job and its release time. If job has not yet released, set the value of | |
4 | The total processing time of job in all processing stages of the job buffer pool set | |
5 | = { 1}, ∈ F | Processing time of job at stage 1 of the job buffer pool set |
6 | = { }, = 1, 2, …, | The processing time of the job being processed by machine ( = 1, 2, …, ) at stage 1 |
7 | The processing time of the job being processed by machine at stage 1 | |
8 | Remaining maintenance threshold for machine at stage 1 |
The symbols in Table 3 are described as follows. n BF represents the number of jobs in the buffer pool set ( BF ) at stage 1. Q i represents the quantity of job J i in the job buffer pool at stage 1. T i r ${T}_{i}^{r}$ represents the time interval between job J i and its release time. T 1 c represents the processing time of the job being processed by machine M 1 c ( c = 1, 2, …, m 1 ) at stage 1. T 1 c a ${T}_{1c}^{a}$ represents the processing time of the job being processed by machine M 1 c at stage 1. U 1 c r ${U}_{1c}^{r}$ represents the remaining maintenance threshold for machine M 1 c at stage 1.
The action is used to select a job from the buffer and assign it to a machine to be processed during scheduling decisions. The optimal policy of job selection in different states directly affects the final training effect and the quality of the solution. Therefore, the action should include effective actions for various production states. Six scheduling actions as follows were designed based on reference.
The action in the scheduling process involves selecting a job from the buffer and assigning it to a machine for processing. The effectiveness of job selection across different states plays a crucial role in the final training outcome and the quality of the solution. Thus, it is important that the action set encompasses effective responses to various production scenarios. As recommended by Liu et al. [ 2 ], six scheduling actions were designed to accommodate a range of production states, ensuring a comprehensive strategy for dynamic scheduling decisions. The dispatching rules for job sequence contain: ( a 1 ) FIFO; ( a 2 ) SPT; ( a 3 ) longest processing time (LPT); ( a 4 ) shortest total processing time (STPT); ( a 5 ) longest total processing time (LTPT); ( a 6 ) Skip. The processing time of SPT and LPT rules is the processing time of job at the stage 1. STPT and LTPT mean the shortest and longest total processing time (TPT) of job at all stages, and T P T = ∑ k = 1 K p i k $TPT=\sum \nolimits_{k=1}^{K}{p}_{ik}$ . If more than one job matches the dispatching rules of the job sequence, calculate the batch waste B 1 c i = U 1 c r − p i 1 $\left({B}_{1ci}={U}_{1c}^{r}-{p}_{i1}\right)$ of selected jobs based on the remaining threshold value U 1 c r ${U}_{1c}^{r}$ on machine M 1 c , and select the job with the smallest batch waste. If there are multiple jobs, randomly select one of job J i . Set the following constraints for several special situations: (1) When there are no jobs in the waiting queue, the agent can only choose scheduling action a 6 . (2) When a new job releases but there are no idle machines, select scheduling action a 6 . (3) When there are jobs in the waiting queue and all machines are idle, scheduling action a 6 cannot be selected.
The optimisation objective of this paper is to minimise the maximum completion time ( C max ), while the NEAT algorithm aims to learn the scheduling policy by maximising global rewards or fitness functions. Thus, the fitness function for NEAT is designed as Fitness = 1 C max $\text{Fitness}=\frac{1}{{C}_{\max }}$ , where C max = max C T i , K ${C}_{\max }=\max \,\left\{{CT}_{i,K}\right\}$ , and CT i , K is the completion time of the last processing stage of job J i . Maximising the fitness function in this manner is equivalent to minimising the maximum completion time, aligning the algorithm's objectives with the paper's optimisation goal.
4.1 experimental design.
The algorithms were developed in PyCharm and executed on a PC with a Windows 10 operating system, a 2.9 GHz CPU, and 16 GB of RAM to assess the computational performance. The hybrid flow shop simulation environment, along with the NEAT agent model and priority dispatching rules (PDRs), was constructed using Python 3.9. Additionally, the development of the DQN agent model was implemented using the Pytorch module in Python 3.9. Following the methodology described by Yuksel [ 41 ], the job processing time p ik , release time r i , and the number of machines at each stage m k for experimental problems are generated using a uniform distribution. The problems are categorised into three scales based on the number of stages K and n number of jobs: small scale ( K = 2, n = 10), medium scale ( K = 4, n = 50), large scale ( K = 6, n = 100). For each scale, three different sets of instances are created to serve as the training set. The specific range of parameter values is as follows: p ik and r i are distributed uniformly in the interval [1, 5], m k is distributed uniformly in the interval [1, 2], UT is set to 10 and t m is set to 2.
The performance of the algorithm highly depends upon the correct selection of parameters [ 46 , 47 ]. Therefore, to get optimal solutions for a specific problem, pre-experiments were conducted using the problem instances with K = 6 and n = 100 as example. The Taguchi method was employed to investigate the effect of population size and evaluation generations on the performance of the NEAT algorithm. Each parameter was considered at three levels: population size {100, 150, 200} and generations {15, 20, 25}. The response value from each experiment was converted into a signal-to-noise (S/N) ratio. The convergence curves of fitness values for a population size of 100 are presented in Figure 6 . Analysis of Figure 6 and S/N ratios indicates that all three schemes reached satisfactory fitness values near the 10th generation, beyond which the improvement in fitness values became stable and gradual, with the number of iterations as the termination condition. Consequently, an optimal level of each key parameters is shown in Table 4 . For a fair comparison, each algorithm was run 10 times on each problem instance to account for variability in performance. The mean and standard deviation values of the results were recorded to provide a reliable assessment of each algorithm's performance. The experimental setup was kept consistent to avoid discrepancies arising from different computational efficiencies. The parameters of the other compared algorithms were also fine-tuned using the Taguchi method as shown in Table 4 .
Convergence curves for NEAT algorithm under different evolutionary generations.
Algorithms and parameters | Value |
---|---|
NeuroEvolution of augmenting topologies (NEAT) | |
Generation | 10 |
Population | 100 |
Deep Q-network (DQN) | |
Discount factor | 0.7 |
Learning rate of training | 0.1 |
Generation | 50 |
This section assesses the NEAT scheduling method's performance using the maximum completion time ( C max ) and the average computation time T s R $\left({T}_{s}^{R}\right)$ for generating scheduling decisions. The trained model was applied to solve test problems mirroring the training set's distribution. For each scale, three sets of instance problems were generated and solved 10 times to ensure reliability. The results in Table 5 reveal that for all three problem scales, the NEAT algorithm consistently achieved optimal C max values. This success is attributed to the NEAT algorithm's ability to select the most appropriate scheduling actions based on the real-time state of the scheduling problem, thereby minimising C max . Specifically, for small-scale problems, SPT and STPT rules can also find near-optimal solutions due to the small solution space. However, the increased complexity of medium and large-scale problems hinders these scheduling rules from finding near-optimal solutions. In contrast, trained NEAT and DQN models can identify the most suitable real-time scheduling actions. NEAT outperforms the scheduling rules and DQN due to its adaptive neural network topology. The NEAT model, tailored through training, fits the problem more accurately than DQN's fixed neural network structure. Therefore, the NEAT algorithm obtains a better C max than other compared algorithms. Regarding the response time (SD) for a single decision, scheduling rules are the fastest due to their straightforward judgement and selection processes based on preset rules, requiring no complex computations. Although NEAT and DQN have longer response times, they operate within the e-05 s range, reflecting the more complex matrix calculations performed by their neural network models. With its adaptive topology that modifies the neural network's nodes and connections, the NEAT algorithm results in a non-fully connected network that demands fewer computational resources than DQN's fully connected network, thus slightly reducing the SD compared to DQN.
Algorithms | Instance 1 | Instance 2 | Instance 3 | |||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
NEAT | FIFO | SPT | LPT | STPT | LTPT | DQN | NEAT | FIFO | SPT | LPT | STPT | LTPT | DQN | NEAT | FIFO | SPT | LPT | STPT | LTPT | DQN | ||
= 2, = 10 | ||||||||||||||||||||||
24 | 27 | 24 | 28 | 26 | 29 | 24 | 21 | 22 | 21 | 26 | 25 | 25 | 21 | 32 | 32 | 32 | 33 | 32 | 35 | 32 | ||
25 | 29 | 27 | 32 | 26 | 29 | 26 | 22 | 22 | 22 | 27 | 25 | 25 | 23 | 32 | 32 | 32 | 37 | 32 | 35 | 32 | ||
0.46 | 0.98 | 1.28 | 1.28 | 0 | 0 | 0.98 | 0.5 | 0 | 0.3 | 0.46 | 0 | 0 | 0.54 | 0 | 0 | 0 | 1.34 | 0 | 0 | 0 | ||
24.3 | 27.8 | 25.4 | 29.4 | 26 | 29 | 24.8 | 21.5 | 22 | 21.9 | 26.7 | 25 | 25 | 21.9 | 32 | 32 | 32 | 36 | 32 | 35 | 32 | ||
5.39e-05 | 2.18e-07 | 2.21e-07 | 1.88e-07 | 2.42e-07 | 2.16e-07 | 4.08e-04 | 5.04e-05 | 1.88e-07 | 2.02e-07 | 2.04e-07 | 2.04e-07 | 1.90e-07 | 4.56e-04 | 5.73e-05 | 2.06e-07 | 2.10e-07 | 2.04e-07 | 2.17e-07 | 2.17e-07 | 3.48e-04 | ||
5.39e-04 | 2.18e-06 | 2.21e-06 | 1.88e-06 | 2.42e-06 | 2.16e-06 | 4.08e-03 | 5.04e-04 | 1.88e-06 | 2.02e-06 | 2.04e-06 | 2.04e-06 | 1.90e-06 | 4.56e-03 | 5.73e-04 | 2.06e-06 | 2.10e-06 | 2.04e-06 | 2.17e-06 | 2.17e-06 | 3.48e-03 | ||
= 4, = 50 | ||||||||||||||||||||||
222 | 231 | 223 | 243 | 229 | 230 | 224 | 186 | 198 | 186 | 221 | 204 | 195 | 186 | 191 | 196 | 192 | 199 | 197 | 195 | 191 | ||
227 | 238 | 233 | 254 | 233 | 232 | 235 | 190 | 212 | 192 | 234 | 209 | 200 | 196 | 196 | 201 | 196 | 207 | 199 | 199 | 203 | ||
1.92 | 2.15 | 3.04 | 3.46 | 1.33 | 0.92 | 3.30 | 1.58 | 3.77 | 2.06 | 3.44 | 1.27 | 1.33 | 2.33 | 1.74 | 1.58 | 1.2 | 2.79 | 0.92 | 1.26 | 3.10 | ||
224.9 | 234.4 | 227.5 | 249.2 | 232.2 | 231.4 | 228.9 | 187.9 | 205.7 | 188.6 | 226.5 | 206.3 | 197.2 | 191.4 | 193.4 | 198.1 | 194.6 | 203.0 | 197.6 | 197.0 | 199.0 | ||
1.89e-04 | 2.47e-07 | 2.53e-07 | 3.01e-07 | 2.50e-07 | 3.00e-07 | 6.75e-04 | 1.88e-04 | 2.84e-07 | 2.39e-07 | 2.51e-07 | 2.62e-07 | 2.41e-07 | 6.21e-04 | 1.79e-04 | 2.50e-07 | 2.28e-07 | 2.30e-07 | 2.53e-07 | 2.59e-07 | 6.47e-04 | ||
9.45e-03 | 1.24e-05 | 1.27e-05 | 1.51e-05 | 1.25e-05 | 1.50e-05 | 3.38e-02 | 9.39e-03 | 1.42e-05 | 1.19e-05 | 1.25e-05 | 1.31e-05 | 1.20e-05 | 3.11e-02 | 9.01e-03 | 1.25e-05 | 1.14e-05 | 1.15e-05 | 1.27e-05 | 1.29e-05 | 3.24e-02 | ||
= 6, = 100 | ||||||||||||||||||||||
397 | 413 | 411 | 482 | 423 | 421 | 405 | 407 | 412 | 408 | 423 | 421 | 417 | 409 | 396 | 434 | 406 | 471 | 419 | 405 | 406 | ||
415 | 426 | 424 | 498 | 429 | 427 | 420 | 418 | 420 | 422 | 434 | 427 | 423 | 417 | 405 | 447 | 417 | 481 | 426 | 409 | 414 | ||
6.39 | 3.43 | 4.25 | 4.79 | 2.29 | 2.04 | 4.08 | 3.31 | 2.62 | 3.69 | 3.26 | 2.06 | 2.0 | 2.69 | 3.1 | 3.39 | 3.36 | 2.87 | 1.89 | 1.66 | 2.73 | ||
405.9 | 421.2 | 418.4 | 491.8 | 425.5 | 424.2 | 415.3 | 411.8 | 415.1 | 414.7 | 429.0 | 423.4 | 420.0 | 412.5 | 399.7 | 441.9 | 410.9 | 475.6 | 422.8 | 407.2 | 410.6 | ||
3.46e-04 | 2.72e-07 | 2.74e-07 | 2.97e-07 | 2.92e-07 | 2.93e-07 | 8.04e-04 | 3.22e-04 | 2.43e-07 | 2.52e-07 | 2.54e-07 | 2.59e-07 | 2.60e-07 | 7.98e-04 | 3.77e-04 | 4.39e-07 | 2.85e-07 | 2.89e-07 | 4.17e-07 | 3.00e-07 | 7.90e-04 | ||
3.46e-02 | 2.72e-05 | 2.74e-05 | 2.97e-05 | 2.92e-05 | 2.93e-05 | 8.04e-02 | 3.23e-02 | 2.43e-05 | 2.52e-05 | 2.54e-05 | 2.59e-05 | 2.60e-05 | 7.98e-02 | 3.77e-02 | 4.39e-05 | 2.85e-05 | 2.89e-05 | 4.17e-05 | 3.00e-05 | 7.90e-02 |
Figure 7 illustrates a box plot comparing the maximum completion time C max and AD is required to derive the optimal scheduling strategy across three different scales using various algorithms. This visualisation helps in assessing the variability and distribution of both C max and AD among the algorithms, providing insights into their performance and efficiency in solving the scheduling problem at different complexity levels. Figure 7 demonstrates that the NEAT-based algorithm exhibits superior robustness. Figure 7a,b show that for small and medium-scale problems, both NEAT and DQN algorithms show better robustness and achieve near-optimal solutions, outperforming the SPT and LPT scheduling rules. Figure 7c indicates that for large-scale problems, the NEAT algorithm maintains the highest robustness and closest to optimal solutions, followed by DQN and SPT rules, with LPT rules trailing behind. This performance is attributed to the evolutionary and training processes of NEAT and DQN, which yield robust agent models capable of adapting to environmental changes, particularly for small and medium-sized problems. SPT rules, which prioritise jobs based on processing time, demonstrate moderately lower robustness, while other scheduling rules, which select jobs based on current states and objective values, show even less robustness. The NEAT algorithm's ability to explore a more exhaustive solution space through neural network and weight parameter evolution contributes to its optimal robustness in large-scale scenarios. DQN's fixed neural network structure, evolving only in terms of weight parameters, limits its robustness in comparison. The inherent limitations of scheduling rules in solving large-scale problems further contribute to their lower robustness.
Box plots of different algorithms at three different scales.
Regarding computational time (AD), scheduling rules consistently require the least time across all scales, followed by NEAT and DQN. The longer computation times for DQN are due to its fixed, fully connected neural network structure, necessitating extensive forward feedback and backpropagation calculations, which may not always align well with the scheduling problem. Conversely, NEAT's evolutionary algorithm approach tailors the neural network to the problem's complexity, resulting in a more straightforward network structure than DQN and, thus shorter computation times. Scheduling rules exhibit minimal response times, bypassing the complex calculations and iterative processes inherent in neural network-based methods.
Further, the statistical method, specifically the one-tailed t -test is adopted to confirm the statistical difference between the NEAT algorithm and DQN. The level of significance was set at 0.05 [ 48 ]. If NEAT was significantly worse than, statistically equivalent to, or significantly better than the comparative algorithms, the optimisation results are displayed as ‘−’, ‘∼’, or ‘+’, respectively [ 49 ]. The results are presented in Table 6 . Table 6 shows that for medium and large problem instances, the NEAT algorithm consistently outperforms the DQN algorithm. While for small problem instances, the performance of NEAT is similar to that of DQN. However, as discussed above, the NEAT algorithm demands fewer resources and thus provides a rapid response, as evident from the SD and AD values in Table 5 . Therefore, the NEAT algorithm offers superior performance compared to DQN when considering the C max , SD, and AD values.
Problem | Instance | NEAT | DQN | value | -test | ||
---|---|---|---|---|---|---|---|
Mean | Variance | Mean | Variance | ||||
= 2, = 10 | 1 | 24.3 | 0.48 | 24.8 | 1.03 | 0.0912 | ∼ |
2 | 21.5 | 0.53 | 21.9 | 0.57 | 0.0599 | ∼ | |
3 | 32 | 0 | 32 | 0 | 0.5 | ∼ | |
= 4, = 50 | 1 | 224.9 | 2.02 | 228.9 | 3.48 | 0.0028 | + |
2 | 187.9 | 1.66 | 191.4 | 2.46 | 0.0008 | + | |
3 | 193.4 | 1.84 | 199.0 | 3.27 | 8.440e-0.5 | + | |
= 6, = 100 | 1 | 405.9 | 6.74 | 415.3 | 4.30 | 0.0008 | + |
2 | 411.8 | 3.49 | 421.5 | 6.38 | 0.0003 | + | |
3 | 399.7 | 3.27 | 410.6 | 2.88 | 1.4153e-07 | + |
Generalisation performance is a crucial metric for evaluating DRL algorithms, assessing the ability to effectively apply learnt knowledge and experience to unfamiliar environments and achieve favourable outcomes [ 41 ]. To assess the generalisation performance of the NEAT scheduling method, parameters from the large-scale problem used in training were expanded to create diverse generalisation test problems. The details of these parameters are listed in Table 7 . For each generalisation test problem, 10 sets of instances were randomly generated, and each set was solved 10 times using the trained NEAT model and trained DQN model. The result, also known as win percentage (win %, representing the algorithm's superiority ratio over the 10 sets of instances in the generalisation test problem) is shown in Table 8 . This approach helps evaluate how well the NEAT algorithm adapts to changes and maintains its performance in scenarios not encountered during the training phase.
No. | Instance | Deviation percentage (%) | ||
---|---|---|---|---|
a | P5r7 | (1,5) | (1,7) | 40 |
b | P5r9 | (1,5) | (1,9) | 80 |
c | P7r5 | (1,7) | (1,5) | 40 |
d | P7r7 | (1,7) | (1,7) | 80 |
e | P7r9 | (1,7) | (1,9) | 120 |
f | P9r5 | (1,9) | (1,5) | 80 |
g | P9r7 | (1,9) | (1,7) | 120 |
h | P9r9 | (1,9) | (1,9) | 160 |
Instance | NEAT | DQN | NEAT-DQN win% | ||||||
---|---|---|---|---|---|---|---|---|---|
Max | Min | Avg | Max | Min | Avg | ||||
P5r7 | 429.9 | 393.5 | 10.17 | 410.96 | 447.2 | 413.8 | 11.70 | 428.2 | 100 |
P5r9 | 430.4 | 407.2 | 7.27 | 414.6 | 445.3 | 416.4 | 9.36 | 431.37 | 100 |
P7r5 | 605.8 | 528.7 | 28.44 | 559.48 | 637 | 557.2 | 24.47 | 583.13 | 90 |
P7r7 | 606.9 | 536.3 | 21.49 | 567.37 | 633.8 | 563.8 | 25.55 | 592.04 | 100 |
P7r9 | 612.5 | 539.3 | 24.22 | 568.71 | 629.4 | 558.1 | 22.63 | 592.99 | 100 |
P9r5 | 739.3 | 648.3 | 32.02 | 692.41 | 778.6 | 659.6 | 36.85 | 719.39 | 100 |
P9r7 | 740.7 | 649.5 | 30.58 | 703.65 | 770 | 658.4 | 36.68 | 719.76 | 80 |
P9r9 | 749.6 | 651.3 | 31.43 | 703.31 | 773.1 | 670.8 | 30.77 | 732.54 | 100 |
The data presented in Table 7 shows that the NEAT-based algorithm maintains a win rate of nearly 80% compared with DQN across all test problems. This indicates that the trained NEAT model provides better results than the trained DQN model. The reason is that since the NEAT algorithm considers the evolution of the neural network topology structure, it results in a more robust model with better generalisation to unknown environments or data.
It can be seen from the above results that NEAT algorithm outperformed DQN and other dispatching rules. The reason lies in NEAT's ability to evolve both the structure and weights of neural networks, its robustness to dynamic and uncertain environments, improved learning and generalisation capabilities, and integration of evolutionary strategies contribute to its superior performance over DQN in DHFSP. These advantages allow NEAT to develop more adaptable, efficient, and effective models tailored to the specific needs of the scheduling problem.
This paper addresses the dynamic scheduling problem in a hybrid flow shop environment, aiming to minimise the maximum completion time. It introduces a multi-agent NEAT RL approach tailored for dynamic decision-making at the initial stage, accounting for dynamic job release times, uncertain processing times, and flexible machine maintenance. Multiple features were incorporated to enhance the efficiency and effectiveness of the NEAT algorithm. A multi-agent system was designed to facilitate autonomous interaction and centralised training, aiding in the development of a scheduling policy for parallel machines. Additionally, a maintenance-specific scheduling action was introduced to optimise maintenance decisions. The performance of the proposed method is compared with that of DQN RL and five PDRs. The comparative experiments demonstrate the NEAT algorithm's superior ability to deliver effective scheduling solutions and promptly adapt to dynamic changes. Moreover, the generalisation experiments have showcased the NEAT model's strong adaptability, confirming its effectiveness in handling new, previously unseen scheduling problems within a defined parameter range. The NEAT algorithm, with its unique ability to concurrently optimise the neural network structure and weight parameters, has demonstrated superior performance in generating effective scheduling solutions and adapting rapidly to dynamic changes in the manufacturing environment.
Future research should aim to enhance the complexity management of scheduling in actual production settings. This involves developing an efficient approach to define state features that accurately represent the complex hybrid flow shop environment with minimal redundancy. Exploring multi-objective optimisation methods that align more closely with real-world conditions is crucial, particularly in developing DRL models and algorithms suited for multi-objective optimisation scheduling challenges.
Yarong Chen : Data curation; funding acquisition; resources; supervision; validation; writing – original draft. Junjie Zhang : Investigation; methodology; visualization; writing – original draft. Mudassar Rauf : Conceptualization; formal analysis; resources; supervision; validation; writing – review & editing. Jabir Mumtaz : Formal analysis; methodology; resources; visualization; writing – review & editing. Shenquan Huang : Data curation; methodology; project administration; software; visualization.
This work has been supported by the National Natural Science Foundation of China, Grant No. 51705370 and Basic Scientific Research Project of Wenzhou City, Grant No. G20240020 & G2023036.
All authors declare that they have no conflict of interest or financial conflicts to disclose.
Open research, data availability statement.
Data available on request from the authors.
Volume 6 , Issue 3
September 2024
Change password, your password must have 10 characters or more:.
Your password has been changed
Forgot your password.
Enter your email address below.
Please check your email for instructions on resetting your password. If you do not receive an email within 10 minutes, your email address may not be registered, and you may need to create a new Wiley Online Library account.
Can't sign in? Forgot your username?
Enter your email address below and we will send you your username
If the address matches an existing account you will receive an email with instructions to retrieve your username
This paper proposes a modeling and solution approach for the integrated planning of the planting and harvesting of sucrose cane and energy-cane considering multiple harvesters. An integer linear bi-objective optimization model is proposed, which seeks to find a trade-off between the maximization of the production volumes of sucrose and fiber and the minimization of the operational costs. The model considers the technical constraints of the mill, such as the milling capacity and meeting the monthly demand. A MIP-heuristic based on relax-and-fix and fix-and-optimize strategies with exact decomposition is appropriately proposed to determine approximations to Pareto optimal solutions to this problem. These approximations are used as incumbents for a branch-and-bound tree to generate potentially Pareto optimal solutions. The results reveal that the MIP-heuristic efficiently solves the problem for real and semi-random instances, generating approximate solutions with a reduced error and a reasonable computational effort. Moreover, the different solutions quantify the trade-off between cost and production volume, opening up the possibility of increasing sucrose and fiber content or decreasing the costs of solutions found. Thus, the proposed bi-objective approach, the solution technique and the different Pareto optimal solutions obtained can assist mill managers in making better decisions in sugarcane production.
This is a preview of subscription content, log in via an institution to check access.
Subscribe and save.
Price includes VAT (Russian Federation)
Instant access to the full article PDF.
Rent this article via DeepDyve
Institutional subscriptions
A multiple objective methodology for sugarcane harvest management with varying maturation periods.
More details, see the Gurobi manual at https://www.gurobi.com/documentation/current/refman/heuristics.html.
Agarwal, A. K. (2007). Biofuels (alcohols and biodiesel) applications as fuels for internal combustion engines. Progress in Energy and Combustion Science, 33 (3), 233–271.
Article Google Scholar
Akartunali, K., & Miller, A. J. (2009). A heuristic approach for big bucket multi-level production planning problems. European Journal of Operational Research, 193 (2), 396–411. https://doi.org/10.1016/j.ejor.2007.11.033
Aliano, A. F., Cantane, D. R., Isler, P. R., & Florentino, H. O. (2023). An integrated multi-objective mathematical model for sugarcane harvesting considering cumulative degree-days. Expert Systems with Applications, 232 , 120881.
Aliano, A. F., Melo, T., & Pato, M. V. (2021). A bi-objective mathematical model for integrated planning of sugarcane harvesting and transport operations. Computers & Operations Research, 134 , 105419.
Aliano, A. F., Moretti, A. C., Pato, M. V., & de Oliveira, W. A. (2021). An exact scalarization method with multiple reference points for bi-objective integer linear optimization problems. Annals of Operations Research, 296 (1), 35–69.
Aliano, F. A., de Oliveira, W. A., & Melo, T. (2022). Multi-objective optimization for integrated sugarcane cultivation and harvesting planning. European Journal of Operational Research, 309 (1), 330–344.
Beraldi, P., Ghiani, G., Grieco, A., & Guerriero, E. (2008). Rolling-horizon and fix-and-relax heuristics for the parallel machine lot-sizing and scheduling problem with sequence-dependent set-up costs. Computers & Operations Research, 35 (11), 3644–3656.
Bezanson, J., Edelman, A., Karpinski, S., & Shah, V. B. (2017). Julia: A fresh approach to numerical computing. SIAM Review, 59 (1), 65–98.
Bigaton, A., Danelon, A. F., Bressan, G., Silva, H. J. T., & Rosa, J. H. M. (2017). Previsão de custos do setor sucroenergético na região centro-sul do Brasil: safra 2017/18. Revista IPecege, 3 (3), 65–70.
Bruzzone, A. A. G., Anghinolfi, D., Paolucci, M., & Tonelli, F. (2012). Energy-aware scheduling for improving manufacturing process sustainability: A mathematical model for flexible flow shops. CIRP Annals - Manufacturing Technology, 61 (1), 459–462.
Calija, V., Higgins, A. J., Jackson, P. A., Bielig, L. M., & Coomans, D. (2001). An operations research approach to the problem of the sugarcane selection. Annals of Operations Research, 108 (1–4), 123–142.
Cárdenas-Barrón, L. E., Melo, R. A., & Santos, M. C. (2021). Extended formulation and valid inequalities for the multi-item inventory lot-sizing problem with supplier selection. Computers and Operations Research, 130 , 105234.
Cheavegatti-Gianotto, A., de Abreu, H. M. C., Arruda, P., Bespalhok Filho, J. C., Burnquist, W. L., Creste, S., di Ciero, L., Ferro, J. A., et al. (2011). Sugarcane ( saccharum x officinarum ): A reference study for the regulation of genetically modified cultivars in brazil. Tropical Plant Biology, 4 , 62–89.
Chu, S., & Majumdar, A. (2012). Opportunities and challenges for a sustainable energy future. Nature, 488 (7411), 294–303.
Cunha, J. O., Mateus, G. R., & Melo, R. A. (2022). A hybrid heuristic for capacitated three-level lot-sizing and replenishment problems with a distribution structure. Computers and Industrial Engineering, 173 , 108698. https://doi.org/10.1016/j.cie.2022.108698
Danna, E., Rothberg, E., & Pape, C. L. (2005). Exploring relaxation induced neighborhoods to improve MIP solutions. Mathematical Programming, 102 , 71–90.
Dunning, I., Huchette, J., & Lubin, M. (2017). Jump: A modeling language for mathematical optimization. SIAM Review, 59 (2), 295–320.
Ehrgott, M., & Wiecek, M. M. (2005). Multiobjective programming. Multiple Criteria Decision Analysis: State of the Art Surveys, 78 , 667–708.
Google Scholar
Etemadnia, H., Goetz, S. J., Canning, P., & Tavallali, M. S. (2015). Optimal wholesale facilities location within the fruit and vegetables supply chain with bimodal transportation options: An lp-mip heuristic approach. European Journal of Operational Research, 244 (2), 648–661.
Farias, M. E. A., Martins, M. F., & Cândido, G. A. (2021). Agenda 2030 e energias renováveis: Sinergias e desafios para alcance do desenvolvimento sustentável. Research, Society and Development, 10 (17), e13101723867.
Ferreira, D., Morabito, R., & Rangel, S. (2009). Solution approaches for the soft drink integrated production lot sizing and scheduling problem. European Journal of Operational Research, 196 (2), 697–706. https://doi.org/10.1016/j.ejor.2008.03.035
Fischetti, M., & Lodi, A. (2003). Local branching. Mathematical Programming, 98 , 23–47.
Florentino, H. O., Irawan, C., Aliano, F. A., Jones, D. F., Cantane, D. R., & Nervis, J. J. (2018). A multiple objective methodology for sugarcane harvest management with varying maturation periods. Annals of Operations Research, 267 (1), 153–177.
Florentino, H. O., Jones, D. F., Irawan, C., Ouelhadj, D., Khosravi, B., & Cantane, D. R. (2020). An optimization model for combined selecting, planting and harvesting sugarcane varieties. Annals of Operations Research, 314 , 451–469.
Florentino, H. O., & Pato, M. V. (2014). A bi-objective genetic approach for the selection of sugarcane varieties to comply with environmental and economic requirements. Journal of the Operational Research Society, 65 (6), 842–854.
García-Segura, T., Penadés-Plà, V., & Yepes, V. (2018). Sustainable bridge design by metamodel-assisted multi-objective optimization and decision-making under uncertainty. Journal of Cleaner Production, 202 , 904–915.
Giagkiozis, I., & Fleming, P. J. (2015). Methods for multi-objective optimization: An analysis. Information Sciences, 293 , 338–350.
Gurobi Optimization, LLC: Gurobi Optimizer Reference Manual (2022). https://www.gurobi.com .
Helber, S., & Sahling, F. (2010). A fix-and-optimize approach for the multi-level capacitated lot sizing problem. International Journal of Production Economics, 123 (2), 247–256.
Higgins, A. (2006). Scheduling of road vehicles in sugarcane transport: A case study at an Australian sugar mill. European Journal of Operational Research, 170 (3), 987–1000.
Higgins, A., Antony, G., Sandell, G., Davies, I., Prestwidge, D., & Andrew, B. (2004). A framework for integrating a complex harvesting and transport system for sugar production. Agricultural Systems, 82 (2), 99–115.
Higgins, A. J., & Muchow, R. C. (2003). Assessing the potential benefits of alternative cane supply arrangements in the Australian sugar industry. Agricultural Systems, 76 (2), 623–638.
Higgins, A. J., & Postma, S. (2004). Australian sugar mills optimise siding rosters to increase profitability. Annals of Operations Research, 128 (1–4), 235–249.
James, R. J. W., & Almada-Lobo, B. (2011). Single and parallel machine capacitated lotsizing and scheduling: New iterative MIP-based neighborhood search heuristics. Computers and Operations Research, 38 (12), 1816–1825. https://doi.org/10.1016/j.cor.2011.02.005
Jena, S. D., & Poggi, M. (2013). Harvest planning in the Brazilian sugar cane industry via mixed integer programming. European Journal of Operational Research, 230 (2), 374–384.
Junqueira, R., & Morabito, R. (2019). Modeling and solving a sugarcane harvest front scheduling problem. International Journal of Production Economics, 213 , 150–160. https://doi.org/10.1016/j.ijpe.2019.03.009
Kaab, A., Sharifi, M., Mobli, H., Nabavi-Pelesaraei, A., & Chau, K. (2019). Use of optimization techniques for energy use efficiency and environmental life cycle assessment modification in sugarcane production. Energy, 181 , 1298–1320.
Kaewtrakulpong, K., Takigawa, T., Koike, M., Hasegawa, H., & Bahalayodhin, B. (2008). Truck allocation planning for cost reduction of mechanical sugarcane harvesting in Thailand an application of multi-objective optimization. Journal of the Japanese Society of Agricultural Machinery, 70 (2), 62–71.
Lamsal, K., Jones, P. C., & Thomas, B. W. (2017). Sugarcane harvest logistics in Brazil. Transportation Science, 51 (2), 771–789.
Liebman, M., & Dyck, E. (1993). Crop rotation and intercropping strategies for weed management. Ecological Applications, 3 (1), 92–122.
Lima, C., Balbo, A. R., Homem, T. P. D., & Florentino, H. O. (2017). A hybrid approach combining interior-point and branch-and-bound methods applied to the problem of sugar cane waste. Journal of the Operational Research Society, 68 (2), 147–164.
Matsuoka, S. & Rubio, L. C. S. (2019). Energy cane: A sound alternative of a bioenergy crop for tropics and subtropics. In Sugarcane Biofuels , pp. 39–66. Springer.
Matsuoka, S., dos Santos, E. G. D. & Tomazela, A. L. (2017). Free Fiber Level Drives Resilience and Hybrid Vigor in Energy Cane. LAP LAMBERT Academic Publishing.
Matsuoka, S., Rubio, L., Tomazela, A., & Santos, E. (2016). A evoluçao do proálcool. AgroANALYSIS, 36 (1), 29–30.
Miettinen, K. (1999). Nonlinear Multiobjective Optimization, vol. 12. Springer.
Milan, E. L., Fernandez, S. M., & Aragones, L. M. P. (2006). Sugar cane transportation in Cuba, a case study. European Journal of Operational Research, 174 (1), 374–386.
Muchow, R. C., Higgins, A. J., Rudd, A. V., & Ford, A. W. (1998). Optimising harvest date in sugar production: A case study for the Mossman mill region in Australia: II. sensitivity to crop age and crop class distribution. Field Crops Research, 57 (3), 243–251.
Nervis, J. J. (2015). Simulação para a otimização da colheita da cana-de-açúcar.
Nikulin, Y., Miettinen, K., & Mäkelä, M. M. (2012). A new achievement scalarizing function based on parameterization in multiobjective optimization. OR Spectrum, 34 (1), 69–87.
Pathumnakul, S., & Nakrachata-Amon, T. (2015). The applications of operations research in harvest planning: A case study of the sugarcane industry in Thailand. Journal of Japan Industrial Management Association, 65 (4E), 328–333.
Poltroniere, S. C., Aliano, F. A., Caversan, A. S., Balbo, A. R., & Florentino, H. O. (2021). Integrated planning for planting and harvesting sugarcane and energy-cane for the production of sucrose and energy. Computers and Electronics in Agriculture . https://doi.org/10.1016/j.compag.2020.105956
Pongpat, P., Mahmood, A., Ghani, H. U., Silalertruksa, T., & Gheewala, S. H. (2023). Optimization of food-fuel-fibre in biorefinery based on environmental and economic assessment: The case of sugarcane utilization in Thailand. Sustainable Production and Consumption, 37 , 398–411. https://doi.org/10.1016/j.spc.2023.03.013
Pornprakun, W., Sungnul, S., Kiataramkul, C., & Moore, E. J. (2019). Determining optimal policies for sugarcane harvesting in Thailand using bi-objective and quasi-newton optimization methods. Advances in Difference Equations, 2019 , 1–15.
Qiu, M., Meng, Y., Chen, J., Chen, Y., Li, Z., & Li, J. (2023). Dual multi-objective optimisation of the cane milling process. Computers & Industrial Engineering, 179 , 109146. https://doi.org/10.1016/j.cie.2023.109146
Ramos, R. P., Isler, P. R., Florentino, H. O., Jones, D., & Nervis, J. J. (2016). An optimization model for the combined planning and harvesting of sugarcane with maturity considerations. African Journal of Agricultural Research, 11 (40), 3950–3958.
Santoro, E., Soler, E. M., & Cherri, A. C. (2017). Route optimization in mechanized sugarcane harvesting. Computers and Electronics in Agriculture, 141 , 140–146.
Santos, J. A. (2022). Análise do sistema de energia fotovoltaica do Instituto Federal do Ceará-Campus Maracanaú e sua colaboração na redução da emissão de CO2. Available on: http://conexoes.ifce.edu.br/index.php/conexoes/article/view/2168 . Retrieved February 19, 2023.
Sethanan, K., & Neungmatcha, W. (2016). Multi-objective particle swarm optimization for mechanical harvester route planning of sugarcane field operations. European Journal of Operational Research, 252 (3), 969–984.
Shirvani, N., Ruiz, R., & Shadrokh, S. (2014). Cyclic scheduling of perishable products in parallel machine with release dates, due dates and deadlines. International Journal of Production Economics, 156 , 1–12.
Silva, A. F., Marins, F. A. S., & Dias, E. X. (2015). Addressing uncertainty in sugarcane harvest planning through a revised multi-choice goal programming model. Applied Mathematical Modelling, 39 (18), 5540–5558.
Toso, E. A., Morabito, R., & Clark, A. R. (2009). Lot sizing and sequencing optimisation at an animal-feed plant. Computers & Industrial Engineering, 57 (3), 813–821.
USDA: United States Department of Agriculture. (2023). Available on: https://apps.fas.usda.gov/psdonline/app/index.html#/app/advQuery . Retrieved January 31, 2023.
Usman, M., & Balsalobre-Lorente, D. (2022). Environmental concern in the era of industrialization: Can financial development, renewable energy and natural resources alleviate some load? Energy Policy, 162 , 112780.
Varshney, D., Mandade, P., & Shastri, Y. (2019). Multi-objective optimization of sugarcane bagasse utilization in an Indian sugar mill. Sustainable Production and Consumption, 18 , 96–114.
Walfrido, A. P., Luengo, C. A., Koehlinger, J., Garzone, P., & Cornacchia, G. (2008). Sugarcane energy use: The Cuban case. Energy policy, 36 (6), 2163–2181.
Download references
The authors thank to Brazilian foundations: CNPq n \(^{\textrm{o}}\) 306518/2022-8, CNPq n \(^{\textrm{o}}\) 304218/2022-7, FAPESP 2021/03039-1,FAPESP 2022/12652-1, PROPE/PROPG/UNESP/ FUNDUNESP grant 12/2022, for the financial support and language services provided.
Gilmar Tolentino, Antônio Roberto Balbo, Sônia Cristina Poltroniere, Angelo Aliano Filho and Helenice de Oliveira Florentino have contributed equally to this work.
Department of Mathematics, State University of Sao Paulo, Bauru, São Paulo, 17033-360, Brazil
Gilmar Tolentino, Antônio Roberto Balbo & Sônia Cristina Poltroniere
Department of Mathematics, Universidade Tecnológica Federal do Paraná, Apucarana, Paraná, 86812-460, Brazil
Angelo Aliano Filho
Department of Bioestatistics, State University of Sao Paulo, Botucatu, São Paulo, 18618-690, Brazil
Helenice de Oliveira Florentino
You can also search for this author in PubMed Google Scholar
Correspondence to Angelo Aliano Filho .
Conflict of interest..
The authors declare to have no Conflict of interest.
This article does not contain any studies with human participants or animals performed by any of the authors.
Publisher's note.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Below is the link to the electronic supplementary material.
Rights and permissions.
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
Reprints and permissions
Tolentino, G., Balbo, A.R., Poltroniere, S.C. et al. A MIP-heuristic approach for solving a bi-objective optimization model for integrated production planning of sugarcane and energy-cane. Ann Oper Res (2024). https://doi.org/10.1007/s10479-024-06229-5
Download citation
Received : 06 May 2023
Accepted : 16 August 2024
Published : 02 September 2024
DOI : https://doi.org/10.1007/s10479-024-06229-5
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative
Advertisement
COMMENTS
The document provides a learning guide for a mathematics course on problem solving, mathematical investigation, and modeling. It includes an introduction, course description, learning outcomes, and table of contents organizing the topics to be covered. The topics include problems involving decimals, fractions, percentages, proportions, equations, numbers, digits, geometry, money, age, distance ...
This module is designed to provide you with a background of knowledge in Problem Solving, Mathematical Investigation, and Modeling. It focuses on the following: (1) Problem Solving and Mathematics Education ( a) Problem Solving: Definition. and Process ( b) Problem Solving and Mathematics Education ( c) Problem Solving and the Conceptual
This document provides information about a learning module on mathematical investigation and modeling. It includes details about the course, module topics, outcomes, and lessons. The first lesson discusses the importance of mathematical investigations in developing problem solving skills. It presents a sample investigation problem asking students to determine which counting numbers can be ...
This document provides a course syllabus for a 3-unit Problem Solving, Mathematics Investigation and Modelling course. The course will introduce students to mathematical approaches for solving routine and non-routine problems, as well as investigating and modeling elementary functions. It will cover topics like solving different types of problems, mathematical investigation and modeling with ...
The use of problem-solving heuristics or strategies to solve mathematical problems was popularised by Pólya (1957) in his book How to solve it (first edition in 1945). Few educators would talk about solving mathematical problems by investigation. In fact, many educators (e.g., HMI, 1985; Lee & Miller, 1997) believe that mathematical ...
Mathematical investigation refers to the sustained exploration of a mathematical situation. It distinguishes itself from problem solving because it is open-ended. I first heard about math investigations in 1990 when I attended a postgraduate course in Australia. I love it right away and it has since become one of my favorite mathematical ...
To facilitate successful mathematics investigations, teachers need to equip their classrooms with a variety of resources that cater to diverse learning styles and assist in delving deeper into concepts. Here are several key resources and how they can be integrated into teaching: 1.Manipulatives and Models. Physical objects like base-ten blocks ...
Starting with a prompt - either a picture or a question - students can explore different ways of approaching the problem and communicating their thinking. Mathematics investigation links directly to the Critical and Creative Capability in the Australian Curriculum (Version 9.0). Investigations can be differentiated by adjusting the ...
Students will have the opportunity to explore the use of problem-solving strategies or heuristics as they engage in mathematical investigations, formulate and justify conjectures, make generalizations, and communicate mathematical ideas. Teacher: MELANIE GURAT. Enrolled students: 2. Enter this course.
Optimization and Mathematical Programming. • Optimization models (also called mathematical programs) represent problem choices as decision variables and seek values that maximize or minimize objective functions of the decision variables subject to constraints on variable values expressing the limits on possible decision choices. [1.3]
This book contributes to the field of mathematical problem solving by exploring current themes, trends and research perspectives. It does so by addressing five broad and related dimensions: problem solving heuristics, problem solving and technology, inquiry and problem posing in mathematics education, assessment of and through problem solving ...
Problem Solving in Mathematics Education
Standard exercises are used for mastery of a newly learned skill - computational, use of an instrument, and even new terms or vocabulary. These are important learning activities but must be used in moderation. If our teaching is dominated by these activities, students will begin to think mathematics is about learning facts and procedures only.
Based on these models but with some modifications, an investigation model (see Fig. 1) was developed for this study to describe the interaction of these processes. An important difference between a mathematical investigation model and a problem-solving model is the additional phase of problem posing after understanding the task in investigation.
To solve a word problem, students can pick out the numbers and decide on an operation.". But through the use of mathematical modeling, students are plucked out of the hypothetical realm and plunged into the complexities of reality—presented with opportunities to help solve real-world problems with many variables by generating questions ...
The rapidly advancing fields of machine learning and mathematical modeling, greatly enhanced by the recent growth in artificial intelligence, are the focus of this special issue. This issue compiles extensively revised and improved versions of the top papers from the workshop on Mathematical Modeling and Problem Solving at PDPTA'23, the 29th International Conference on Parallel and Distributed ...
This course will introduce you to mathematical approach in solving routine and non-routine problem, as well as investigating and modeling elementary functions to describe and explore real-world data and phenomena. Problem Solving involves the application of mathematical skills and reasoning to problems encountered in everyday life. Real world problems are not presented in a neat and orderly ...
Taking into consideration the aforementioned different theories of leaning embedded on mathematical modeling, this study attempted to determine the effects of integrating mathematical modeling to the problem solving performance and math anxiety of Grade 9 students. In this study, much is given importance to the process rather than the product.
This study highlighted cognitive abilities related to creative thinking and mathematics problem-solving by implementing the Project-Based Learning Model. This research was a quasi-experiment with a pretest-posttest control group design involving 43 students in the sixth grade of two elementary schools; data was collected through test and ...
This 3 credit course aims to provide future secondary teachers experience in mathematical problem solving and investigations. Over 54 hours students will formulate and solve routine and non-routine math problems. They will learn strategies like heuristics, Polya's problem solving steps, and apply the pigeonhole principle. Assessment includes quizzes, recitation, group projects and problem ...
The teaching of mathematics in higher secondary school and universities, and its relationship with problem modelling and solving, has become a topic of debate in many countries. Over the past few years, a large body of literature has been produced on this subject [1, 4, 9, 18, 24]. Two main strategies have been identified: "Teaching ...
When solving a mathematical problem, it is possible to appeal to the ordinal property of numbers, i.e. the fact that they are ordered, or to their cardinal property, i.e. the fact that they ...
This research aims to develop a digital pedagogy model in mathematics learning integrated with computational thinking to improve problem solving abilities of junior high school students. This research is development research that follows the Plomp model including preliminary research, prototyping phase, and assessment phase which are described ...
1 INTRODUCTION. The hybrid flow shop scheduling problem (HFSP) is extensively applied in chemical, textile, steel, and semiconductor industries and is recognised as an NP-hard problem [].HFSP exhibits greater complexity compared to traditional flow shop scheduling problems, as it integrates parallel machines and flow shop scheduling issues [].In the context of smart manufacturing, the ...
This document outlines a college course on problem solving, mathematical investigation, and modeling. The 3-credit course is offered by the College of Education at Taguig City University. The course consists of 18 weekly topics that cover defining and applying various problem solving strategies and heuristics, distinguishing between routine and non-routine problems, conducting mathematical ...
This paper proposes a modeling and solution approach for the integrated planning of the planting and harvesting of sucrose cane and energy-cane considering multiple harvesters. An integer linear bi-objective optimization model is proposed, which seeks to find a trade-off between the maximization of the production volumes of sucrose and fiber and the minimization of the operational costs. The ...