As a library developer, it’s possible you’ll create a well-liked utility that lots of of
1000’s of builders depend on every day, reminiscent of lodash or React. Over time,
utilization patterns may emerge that transcend your preliminary design. When this
occurs, it’s possible you’ll want to increase an API by including parameters or modifying
perform signatures to repair edge instances. The problem lies in rolling out
these breaking modifications with out disrupting your customers’ workflows.
That is the place codemods are available in—a robust instrument for automating
large-scale code transformations, permitting builders to introduce breaking
API modifications, refactor legacy codebases, and preserve code hygiene with
minimal guide effort.
On this article, we’ll discover what codemods are and the instruments you may
use to create them, reminiscent of jscodeshift, hypermod.io, and codemod.com. We’ll stroll by way of real-world examples,
from cleansing up function toggles to refactoring part hierarchies.
You’ll additionally discover ways to break down complicated transformations into smaller,
testable items—a apply generally known as codemod composition—to make sure
flexibility and maintainability.
By the tip, you’ll see how codemods can develop into an important a part of your
toolkit for managing large-scale codebases, serving to you retain your code clear
and maintainable whereas dealing with even probably the most difficult refactoring
duties.
Breaking Modifications in APIs
Returning to the situation of the library developer, after the preliminary
launch, new utilization patterns emerge, prompting the necessity to lengthen an
For easy modifications, a fundamental find-and-replace within the IDE may work. In
extra complicated instances, you may resort to utilizing instruments like sed
or awk
. Nonetheless, when your library is broadly adopted, the
scope of such modifications turns into tougher to handle. You’ll be able to’t make sure how
extensively the modification will influence your customers, and the very last thing
you need is to interrupt present performance that doesn’t want
updating.
A typical method is to announce the breaking change, launch a brand new
model, and ask customers emigrate at their very own tempo. However this workflow,
whereas acquainted, usually would not scale properly, particularly for main shifts.
Take into account React’s transition from class elements to perform elements
with hooks—a paradigm shift that took years for big codebases to totally
undertake. By the point groups managed emigrate, extra breaking modifications have been
usually already on the horizon.
For library builders, this example creates a burden. Sustaining
a number of older variations to assist customers who haven’t migrated is each
pricey and time-consuming. For customers, frequent modifications danger eroding belief.
They might hesitate to improve or begin exploring extra secure alternate options,
which perpetuating the cycle.
However what when you may assist customers handle these modifications mechanically?
What when you may launch a instrument alongside your replace that refactors
their code for them—renaming capabilities, updating parameter order, and
eradicating unused code with out requiring guide intervention?
That’s the place codemods are available in. A number of libraries, together with React
and Subsequent.js, have already embraced codemods to clean the trail for model
bumps. For instance, React gives codemods to deal with the migration from
older API patterns, just like the outdated Context API, to newer ones.
So, what precisely is the codemod we’re speaking about right here?
What’s a Codemod?
A codemod (code modification) is an automatic script used to rework
code to observe new APIs, syntax, or coding requirements. Codemods use
Summary Syntax Tree (AST) manipulation to use constant, large-scale
modifications throughout codebases. Initially developed at Fb, codemods helped
engineers handle refactoring duties for big tasks like React. As
Fb scaled, sustaining the codebase and updating APIs grew to become
more and more tough, prompting the event of codemods.
Manually updating 1000’s of information throughout totally different repositories was
inefficient and error-prone, so the idea of codemods—automated scripts
that remodel code—was launched to deal with this drawback.
The method sometimes entails three essential steps:
- Parsing the code into an AST, the place every a part of the code is
represented as a tree construction. - Modifying the tree by making use of a metamorphosis, reminiscent of renaming a
perform or altering parameters. - Rewriting the modified tree again into the supply code.
By utilizing this method, codemods make sure that modifications are utilized
constantly throughout each file in a codebase, decreasing the prospect of human
error. Codemods may also deal with complicated refactoring situations, reminiscent of
modifications to deeply nested constructions or eradicating deprecated API utilization.
If we visualize the method, it will look one thing like this:
Determine 1: The three steps of a typical codemod course of
The concept of a program that may “perceive” your code after which carry out
computerized transformations isn’t new. That’s how your IDE works once you
run refactorings like
Basically, your IDE parses the supply code into ASTs and applies
predefined transformations to the tree, saving the consequence again into your
information.
For contemporary IDEs, many issues occur below the hood to make sure modifications
are utilized accurately and effectively, reminiscent of figuring out the scope of
the change and resolving conflicts like variable identify collisions. Some
refactorings even immediate you to enter parameters, reminiscent of when utilizing
order of parameters or default values earlier than finalizing the change.
Use jscodeshift in JavaScript Codebases
Let’s take a look at a concrete instance to grasp how we may run a
codemod in a JavaScript undertaking. The JavaScript neighborhood has a number of
instruments that make this work possible, together with parsers that convert supply
code into an AST, in addition to transpilers that may remodel the tree into
different codecs (that is how TypeScript works). Moreover, there are
instruments that assist apply codemods to total repositories mechanically.
Probably the most standard instruments for writing codemods is jscodeshift, a toolkit maintained by Fb.
It simplifies the creation of codemods by offering a robust API to
manipulate ASTs. With jscodeshift, builders can seek for particular
patterns within the code and apply transformations at scale.
You should use jscodeshift
to determine and change deprecated API calls
with up to date variations throughout a whole undertaking.
Let’s break down a typical workflow for composing a codemod
manually.
Clear a Stale Characteristic Toggle
Let’s begin with a easy but sensible instance to display the
energy of codemods. Think about you’re utilizing a function
toggle in your
codebase to manage the discharge of unfinished or experimental options.
As soon as the function is reside in manufacturing and dealing as anticipated, the subsequent
logical step is to wash up the toggle and any associated logic.
As an example, take into account the next code:
const knowledge = featureToggle('feature-new-product-list') ? { identify: 'Product' } : undefined;
As soon as the function is absolutely launched and not wants a toggle, this
could be simplified to:
const knowledge = { identify: 'Product' };
The duty entails discovering all cases of featureToggle
within the
codebase, checking whether or not the toggle refers to
feature-new-product-list
, and eradicating the conditional logic surrounding
it. On the similar time, different function toggles (like
feature-search-result-refinement
, which can nonetheless be in growth)
ought to stay untouched. The codemod must perceive the construction
of the code to use modifications selectively.
Understanding the AST
Earlier than we dive into writing the codemod, let’s break down how this
particular code snippet appears in an AST. You should use instruments like AST
Explorer to visualise how supply code and AST
are mapped. It’s useful to grasp the node sorts you are interacting
with earlier than making use of any modifications.
The picture beneath reveals the syntax tree by way of ECMAScript syntax. It
accommodates nodes like Identifier
(for variables), StringLiteral
(for the
toggle identify), and extra summary nodes like CallExpression
and
ConditionalExpression
.
Determine 2: The Summary Syntax Tree illustration of the function toggle examine
On this AST illustration, the variable knowledge
is assigned utilizing a
ConditionalExpression
. The check a part of the expression calls
featureToggle('feature-new-product-list')
. If the check returns true
,
the consequent department assigns { identify: 'Product' }
to knowledge
. If
false
, the alternate department assigns undefined
.
For a job with clear enter and output, I want writing checks first,
then implementing the codemod. I begin by defining a adverse case to
guarantee we don’t by accident change issues we wish to depart untouched,
adopted by an actual case that performs the precise conversion. I start with
a easy situation, implement it, then add a variation (like checking if
featureToggle is known as inside an if assertion), implement that case, and
guarantee all checks cross.
This method aligns properly with Take a look at-Pushed Improvement (TDD), even
when you don’t apply TDD commonly. Figuring out precisely what the
transformation’s inputs and outputs are earlier than coding improves security and
effectivity, particularly when tweaking codemods.
With jscodeshift, you may write checks to confirm how the codemod
behaves:
const remodel = require("../remove-feature-new-product-list"); defineInlineTest( remodel, {}, ` const knowledge = featureToggle('feature-new-product-list') ? { identify: 'Product' } : undefined; `, ` const knowledge = { identify: 'Product' }; `, "delete the toggle feature-new-product-list in conditional operator" );
The defineInlineTest
perform from jscodeshift lets you outline
the enter, anticipated output, and a string describing the check’s intent.
Now, working the check with a traditional jest
command will fail as a result of the
codemod isn’t written but.
The corresponding adverse case would make sure the code stays unchanged
for different function toggles:
defineInlineTest( remodel, {}, ` const knowledge = featureToggle('feature-search-result-refinement') ? { identify: 'Product' } : undefined; `, ` const knowledge = featureToggle('feature-search-result-refinement') ? { identify: 'Product' } : undefined; `, "don't change different function toggles" );
Writing the Codemod
Let’s begin by defining a easy remodel perform. Create a file
known as remodel.js
with the next code construction:
module.exports = perform(fileInfo, api, choices) { const j = api.jscodeshift; const root = j(fileInfo.supply); // manipulate the tree nodes right here return root.toSource(); };
This perform reads the file right into a tree and makes use of jscodeshift’s API to
question, modify, and replace the nodes. Lastly, it converts the AST again to
supply code with .toSource()
.
Now we are able to begin implementing the remodel steps:
- Discover all cases of
featureToggle
. - Confirm that the argument handed is
'feature-new-product-list'
. - Change the complete conditional expression with the consequent half,
successfully eradicating the toggle.
Right here’s how we obtain this utilizing jscodeshift
:
module.exports = perform (fileInfo, api, choices) { const j = api.jscodeshift; const root = j(fileInfo.supply); // Discover ConditionalExpression the place the check is featureToggle('feature-new-product-list') root .discover(j.ConditionalExpression, { check: { callee: { identify: "featureToggle" }, arguments: [{ value: "feature-new-product-list" }], }, }) .forEach((path) => { // Change the ConditionalExpression with the 'consequent' j(path).replaceWith(path.node.consequent); }); return root.toSource(); };
The codemod above:
- Finds
ConditionalExpression
nodes the place the check calls
featureToggle('feature-new-product-list')
. - Replaces the complete conditional expression with the resultant (i.e.,
{
), eradicating the toggle logic and leaving simplified code
identify: 'Product' }
behind.
This instance demonstrates how simple it’s to create a helpful
transformation and apply it to a big codebase, considerably decreasing
guide effort.
You’ll want to jot down extra check instances to deal with variations like
if-else
statements, logical expressions (e.g.,
!featureToggle('feature-new-product-list')
), and so forth to make the
codemod strong in real-world situations.
As soon as the codemod is prepared, you may check it out on a goal codebase,
such because the one you are engaged on. jscodeshift gives a command-line
instrument that you should use to use the codemod and report the outcomes.
$ jscodeshift -t transform-name src/
After validating the outcomes, examine that every one practical checks nonetheless
cross and that nothing breaks—even when you’re introducing a breaking change.
As soon as happy, you may commit the modifications and lift a pull request as
a part of your regular workflow.
Codemods Enhance Code High quality and Maintainability
Codemods aren’t simply helpful for managing breaking API modifications—they will
considerably enhance code high quality and maintainability. As codebases
evolve, they usually accumulate technical debt, together with outdated function
toggles, deprecated strategies, or tightly coupled elements. Manually
refactoring these areas could be time-consuming and error-prone.
By automating refactoring duties, codemods assist hold your codebase clear
and freed from legacy patterns. Recurrently making use of codemods lets you
implement new coding requirements, take away unused code, and modernize your
codebase with out having to manually modify each file.
Refactoring an Avatar Part
Now, let’s take a look at a extra complicated instance. Suppose you’re working with
a design system that features an Avatar
part tightly coupled with a
Tooltip
. At any time when a person passes a identify
prop into the Avatar
, it
mechanically wraps the avatar with a tooltip.
Determine 3: A avatar part with a tooltip
Right here’s the present Avatar
implementation:
import { Tooltip } from "@design-system/tooltip"; const Avatar = ({ identify, picture }: AvatarProps) => { if (identify) { return ( <Tooltip content material={identify}> <CircleImage picture={picture} /> </Tooltip> ); } return <CircleImage picture={picture} />; };
The objective is to decouple the Tooltip
from the Avatar
part,
giving builders extra flexibility. Builders ought to have the ability to determine
whether or not to wrap the Avatar
in a Tooltip
. Within the refactored model,
Avatar
will merely render the picture, and customers can apply a Tooltip
manually if wanted.
Right here’s the refactored model of Avatar
:
const Avatar = ({ picture }: AvatarProps) => { return <CircleImage picture={picture} />; };
Now, customers can manually wrap the Avatar
with a Tooltip
as
wanted:
import { Tooltip } from "@design-system/tooltip"; import { Avatar } from "@design-system/avatar"; const UserProfile = () => { return ( <Tooltip content material="Juntao Qiu"> <Avatar picture="/juntao.qiu.avatar.png" /> </Tooltip> ); };
The problem arises when there are lots of of Avatar usages unfold
throughout the codebase. Manually refactoring every occasion can be extremely
inefficient, so we are able to use a codemod to automate this course of.
Utilizing instruments like AST Explorer, we are able to
examine the part and see which nodes characterize the Avatar
utilization
we’re focusing on. An Avatar
part with each identify
and picture
props
is parsed into an summary syntax tree as proven beneath:
Determine 4: AST of the Avatar part utilization
Writing the Codemod
Let’s break down the transformation into smaller duties:
- Discover
Avatar
utilization within the part tree. - Test if the
identify
prop is current. - If not, do nothing.
- If current:
- Create a
Tooltip
node. - Add the
identify
to theTooltip
. - Take away the
identify
fromAvatar
. - Add
Avatar
as a baby of theTooltip
. - Change the unique
Avatar
node with the brand newTooltip
.
To start, we’ll discover all cases of Avatar (I’ll omit a number of the
checks, however you must write comparability checks first).
defineInlineTest(
{ default: remodel, parser: "tsx" },
{},
`
<Avatar identify="Juntao Qiu" picture="/juntao.qiu.avatar.png" />
`,
`
<Tooltip content material="Juntao Qiu">
<Avatar picture="/juntao.qiu.avatar.png" />
</Tooltip>
`,
"wrap avatar with tooltip when identify is supplied"
);
Much like the featureToggle
instance, we are able to use root.discover
with
search standards to find all Avatar nodes:
root .discover(j.JSXElement, { openingElement: { identify: { identify: "Avatar" } }, }) .forEach((path) => { // now we are able to deal with every Avatar occasion });
Subsequent, we examine if the identify
prop is current:
root
.discover(j.JSXElement, {
openingElement: { identify: { identify: "Avatar" } },
})
.forEach((path) => {
const avatarNode = path.node;
const nameAttr = avatarNode.openingElement.attributes.discover(
(attr) => attr.identify.identify === "identify"
);
if (nameAttr) {
const tooltipElement = createTooltipElement(
nameAttr.worth.worth,
avatarNode
);
j(path).replaceWith(tooltipElement);
}
});
For the createTooltipElement
perform, we use the
jscodeshift API to create a brand new JSX node, with the identify
prop utilized to the Tooltip
and the Avatar
part as a baby. Lastly, we name replaceWith
to
change the present path
.
Right here’s a preview of the way it appears in
Hypermod, the place the codemod is written on
the left. The highest half on the appropriate is the unique code, and the underside
half is the reworked consequence:
Determine 5: Run checks inside hypermod earlier than apply it to your codebase
This codemod searches for all cases of Avatar
. If a
identify
prop is discovered, it removes the identify
prop
from Avatar
, wraps the Avatar
inside a
Tooltip
, and passes the identify
prop to the
Tooltip
.
By now, I hope it’s clear that codemods are extremely helpful and
that the workflow is intuitive, particularly for large-scale modifications the place
guide updates can be an enormous burden. Nonetheless, that is not the entire
image. Within the subsequent part, I’ll make clear a number of the challenges
and the way we are able to deal with these less-than-ideal points.