Duplicate code is one of the most pungent code smells. A rule that is often used is to re-structure code once it is duplicated in three or more places.
Common duplication problems, and corresponding solutions are:
Complex classes like Tokenizer often do a lot of different things. To break such a class down, we need to identify a cohesive component within that class. A common approach to find such a component is to look for fields/methods that share the same prefixes, or suffixes. You can also have a look at the cohesion graph to spot any un-connected, or weakly-connected components.
Once you have determined the fields that belong together, you can apply the Extract Class refactoring. If the component makes sense as a sub-class, Extract Subclass is also a candidate, and is often faster.
While breaking up the class, it is a good idea to analyze how other classes use Tokenizer, and based on these observations, apply Extract Interface, too.
1 | <?php |
||
19 | final class Tokenizer |
||
20 | { |
||
21 | /** |
||
22 | * parse and tokenize input string |
||
23 | * @param string $plainData |
||
24 | * @throws SyntaxErrorException |
||
25 | * @throws BadMethodCallException |
||
26 | * @return array |
||
27 | * @throws \Hoa\Ustring\Exception |
||
28 | 9 | * @throws \InvalidArgumentException |
|
29 | * @throws \LogicException |
||
30 | 9 | */ |
|
31 | public static function tokenize(string $plainData) : array |
||
35 | |||
36 | /** |
||
37 | * SphinxConfigurationParser constructor. |
||
38 | * @internal |
||
39 | 9 | * @param string $string |
|
40 | * @throws BadMethodCallException |
||
41 | 9 | * @throws \Hoa\Ustring\Exception |
|
42 | 9 | */ |
|
43 | 9 | private function __construct(string $string) |
|
48 | |||
49 | /** |
||
50 | 9 | * @internal |
|
51 | * @param string $string |
||
52 | 9 | * @return string |
|
53 | */ |
||
54 | private function removeComments(string $string) : string |
||
58 | |||
59 | /** |
||
60 | 9 | * @internal |
|
61 | * @return Tokenizer |
||
62 | * @throws \LogicException |
||
63 | 9 | * @throws \InvalidArgumentException |
|
64 | 2 | * @throws SyntaxErrorException |
|
65 | */ |
||
66 | 2 | private function tokenizeInternal() : Tokenizer |
|
76 | |||
77 | 9 | /** |
|
78 | * @internal |
||
79 | 8 | * @throws SyntaxErrorException |
|
80 | * @throws \InvalidArgumentException |
||
81 | * @throws \LogicException |
||
82 | 8 | */ |
|
83 | private function extractSection() |
||
111 | 9 | ||
112 | 9 | /** |
|
113 | 9 | * @internal |
|
114 | 8 | * @throws SyntaxErrorException |
|
115 | * @throws \InvalidArgumentException |
||
116 | 1 | * @throws \LogicException |
|
117 | */ |
||
118 | private function extractSectionType() |
||
132 | 8 | ||
133 | 8 | /** |
|
134 | 8 | * @internal |
|
135 | 8 | * @throws SyntaxErrorException |
|
136 | 8 | * @throws \InvalidArgumentException |
|
137 | 1 | * @throws \LogicException |
|
138 | 1 | */ |
|
139 | View Code Duplication | private function extractSectionName() |
|
154 | 8 | ||
155 | /** |
||
156 | 8 | * @internal |
|
157 | 6 | * @throws SyntaxErrorException |
|
158 | * @throws \InvalidArgumentException |
||
159 | * @throws \LogicException |
||
160 | 4 | */ |
|
161 | 3 | private function extractInheritance() |
|
179 | 3 | ||
180 | 3 | /** |
|
181 | 3 | * @internal |
|
182 | 3 | * @throws SyntaxErrorException |
|
183 | 3 | * @throws \InvalidArgumentException |
|
184 | * @throws \LogicException |
||
185 | */ |
||
186 | View Code Duplication | private function extractInheritanceName() |
|
201 | 6 | ||
202 | /** |
||
203 | 6 | * @internal |
|
204 | 1 | * @throws SyntaxErrorException |
|
205 | * @throws \LogicException |
||
206 | * @throws \InvalidArgumentException |
||
207 | 6 | */ |
|
208 | 2 | private function extractOptions() |
|
231 | |||
232 | /** |
||
233 | * @internal |
||
234 | 6 | * @throws SyntaxErrorException |
|
235 | * @throws \InvalidArgumentException |
||
236 | 6 | * @throws \LogicException |
|
237 | */ |
||
238 | private function extractOption() |
||
253 | |||
254 | /** |
||
255 | * @internal |
||
256 | 6 | * @throws SyntaxErrorException |
|
257 | * @throws \InvalidArgumentException |
||
258 | 6 | * @throws \LogicException |
|
259 | */ |
||
260 | 6 | View Code Duplication | private function extractOptionName() |
275 | |||
276 | /** |
||
277 | 5 | * @internal |
|
278 | * @throws SyntaxErrorException |
||
279 | 5 | * @throws \LogicException |
|
280 | 5 | * @throws \InvalidArgumentException |
|
281 | */ |
||
282 | 5 | private function extractOptionValue() |
|
316 | |||
317 | 5 | /** |
|
318 | 5 | * @internal |
|
319 | 5 | */ |
|
320 | private function saveCurrentSection() |
||
326 | |||
327 | /** |
||
328 | 2 | * @internal |
|
329 | */ |
||
330 | private function saveCurrentOption() |
||
335 | |||
336 | /** |
||
337 | * @internal |
||
338 | * @return array |
||
339 | 5 | */ |
|
340 | private function getEmptySectionData() : array |
||
349 | |||
350 | /** |
||
351 | * @internal |
||
352 | * @return array |
||
353 | */ |
||
354 | private function getEmptyOptionData() : array |
||
361 | |||
362 | /** |
||
363 | * @var StringStream |
||
364 | */ |
||
365 | private $stream; |
||
366 | |||
367 | /** |
||
368 | * Result of tokenize input string |
||
369 | * @var array |
||
370 | */ |
||
371 | private $tokens = []; |
||
372 | |||
373 | /** |
||
374 | * temporary storage of tokens for one section |
||
375 | * @var array |
||
376 | */ |
||
377 | private $currentSection = [ |
||
378 | 'type' => '', |
||
379 | 'name' => '', |
||
380 | 'inheritance' => '', |
||
381 | 'options' => [] |
||
382 | ]; |
||
383 | /** |
||
384 | * temporary storage of tokens for one option |
||
385 | * @var array |
||
386 | */ |
||
387 | private $currentOption = [ |
||
388 | 'name' => '', |
||
389 | 'value' => '' |
||
390 | ]; |
||
391 | } |
Duplicated code is one of the most pungent code smells. If you need to duplicate the same code in three or more different places, we strongly encourage you to look into extracting the code into a single class or operation.
You can also find more detailed suggestions in the “Code” section of your repository.