forked from ryanmcdermott/code-review-tips
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathREADME.md
454 lines (377 loc) · 14.8 KB
/
README.md
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
# code-review-tips
## Table of Contents
1. [Introduction](#introduction)
2. [Why Review Code?](#why-review-code)
3. [Basics](#basics)
4. [Readability](#readability)
5. [Side Effects](#side-effects)
6. [Limits](#limits)
7. [Security](#security)
8. [Performance](#performance)
9. [Testing](#testing)
10. [Miscellaneous](#miscellaneous)
## Introduction
Code reviews can inspire dread in both reviewer and reviewee. Having your
code analyzed can feel as invasive as being screened by the TSA as you go off
to your vacation. Even worse, reviewing other people's code can feel like
a painful and ambiguous exercise, searching for problems and not even knowing
where to begin.
This project aims to provide some solid tips for how to review the code that
you and your team write. All examples are written in JavaScript, but the advice
should be applicable to any project of any language. This is by no means an
exhaustive list, but hopefully this will help you catch as many bugs as
possible long before users ever see your feature.
## Why Review Code?
Code reviews are a necessary part of the software engineering process because
you alone can't catch every problem in a piece of code you
write. That's ok though! Even the best basketball players in the world miss
shots.
Having others review our work ensures that we deliver the best product to users
and with the least amount of errors. Make sure your team implements a code
review process for new code that is introduced into your codebase. Find a
process that works for you and your team. There's no one size fits all. The
important point is to do code reviews as regularly as possible.
## Basics
### Code reviews should be as automated as possible
Avoid discussing details that can be handled by a static analysis tool. Don't
argue about nuances such as code formatting and whether to use `let` or `var`.
Having a formatter and linter can save your team a lot of time from reviews
that your computer can do for you.
### Code reviews should avoid API discussion
These discussions should happen before the code is even written. Don't try to
argue about the floor plan once you've poured the concrete
foundation.
### Code reviews should be kind
It's scary to have your code reviewed and it can bring about feelings of
insecurity in even the most experienced developer. Be positive in your language
and keep your teammates comfortable and secure in their work!
## Readability
### Typos should be corrected
Avoid nitpicking as much as you can and save it for your linter, compiler, and
formatter. When you can't, such as in the case of typos, leave a kind comment
suggesting a fix. It's the little things that make a big difference sometimes!
### Variable and function names should be clear
Naming is one of the hardest problems in computer science. We've all given names
to variables, functions, and files that are confusing. Help your teammate out
by suggesting a clearer name, if the one you're reading doesn't make sense.
```javascript
// This function could be better named as namesToUpperCase
function u(names) {
// ...
}
```
### Functions should be short
Functions should do one thing! Long functions usually mean that they are doing
too much. Tell your teammate to split out the function into multiple different
ones.
```javascript
// This is both emailing clients and deciding which are active. Should be
// 2 different functions.
function emailClients(clients) {
clients.forEach((client) => {
const clientRecord = database.lookup(client);
if (clientRecord.isActive()) {
email(client);
}
});
}
```
### Files should be short
Just like functions, a file should be about one thing. A file represents a
module and a module should do one thing for your codebase.
For example, if your module is called `fake-name-generator` it should just be
responsible for creating fake names like "Keyser Söze". If the
`fake-name-generator` also includes a bunch of utility functions for querying a
database of names, that should be in a separate module.
There's no rule for how long a file should be, but if it's long like below and
includes functions that don't relate to one another, then it should probably
be split apart.
```javascript
1: import _ from 'lodash';
2: function generateFakeNames() {
3: // ..
4: }
...
1128: function queryRemoteDatabase() {
1129: // ...
1130: }
```
### Exported functions should be documented
If your function is intended to be used by other libraries, it helps to add
documentation so users of it know what it does.
```javascript
// This needs documentation. What is this function for? How is it used?
export function networkMonitor(graph, duration, failureCallback) {
// ...
}
```
### Complex code should be commented
If you have named things well and the logic is still confusing, then it's time
for a comment.
```javascript
function leftPad (str, len, ch) {
str = str + '';
len = len - str.length;
while (true) {
// This needs a comment, why a bitwise and here?
if (len & 1) pad += ch;
// This needs a comment, why a bit shift here?
len >>= 1;
if (len) ch += ch;
else break;
}
return pad + str;
}
```
## Side Effects
### Functions should be as pure as possible
```javascript
// Global variable is referenced by the following function.
// If we had another function that used this name, now it'd be an array and it
// could break it. Instead it's better to pass in a name parameter
let name = 'Ryan McDermott';
function splitIntoFirstAndLastName() {
name = name.split(' ');
}
splitIntoFirstAndLastName();
```
### I/O functions should have failure cases handled
Any function that does I/O should handle when something goes wrong
```javascript
function getIngredientsFromFile() {
const onFulfilled = (buffer) => {
let lines = buffer.split('\n');
return lines.forEach(line => <Ingredient ingredient={line}/>)
};
// What about when this rejected because of an error? What do we return?
return readFile('./ingredients.txt').then(onFulfilled);
}
```
## Limits
### Null cases should be handled
If you have a list component for example, all is well and good if you display a
nice beautiful table that shows all its data. Your users love it and you get a
promotion! But what happens when no data comes back? What do you show in the
null case? Your code should be resilient to every case that can occur. If
there's something bad that can happen in your code, eventually it will happen.
```javascript
class InventoryList {
constructor(data) {
this.data = data;
}
render() {
return (
<table>
<tbody>
<tr>
<th>
ID
</th>
<th>
Product
</th>
</tr>
// We should show something for the null case here if there's
// nothing in the data inventory
{Object.keys(this.data.inventory).map(itemId => (
<tr key={i}>
<td>
{itemId}
</td>
<td>
{this.state.inventory[itemId].product}
</td>
</tr>
))}
</tbody>
</table>
);
}
}
```
### Large cases should be handled
In the list above, what would happen if 10,000 items came back from the
inventory? In that case you need some form of pagination or infinite scroll.
Be sure to always assess the potential edge cases in terms of volume, especially
when it comes to UI programming.
### Singular cases should be handled
```javascript
class MoneyDislay {
constructor(amount) {
this.amount = amount;
}
render() {
// What happens if the user has 1 dollar? You can't say plural "dollars"
return (
<div className="fancy-class">
You have {this.amount} dollars in your account
</div>
);
}
}
```
### User input should be limited
Users can potentially input an unlimited amount of data to send to you. It's
important to set limits if a function takes any kind of user data in.
```javascript
router.route('/message').post((req, res) => {
const message = req.body.content;
// What happens if the message is many megabytes of data? Do we want to store
// that in the database? We should set limits on the size.
db.save(message);
});
```
### Functions should handle unexpected user input
Users will always surprise you with the data they give you. Don't expect that
you will always get the right type of data or even any data in a request from a
user. [And don't rely on client-side validation alone](https://twitter.com/ryconoclast/status/885523459748487169)
```javascript
router.route('/transfer-money').post((req, res) => {
const amount = req.body.amount;
const from = user.id;
const to = req.body.to;
// What happens if we got a string instead of a number as our amount? This
// function would fail
transferMoney(from, to, amount);
});
```
## Security
Data security is the most important aspect of your application. If users can't
trust you with their data, then you won't have a business. There are numerous
different types of security exploits that can plague an app, depending on the
particular language and runtime environment. Below is a very small and
incomplete list of common security problems. Don't rely on this alone! Automate
as much security review as you can on every commit, and perform routine security
audits.
### XSS should not be possible
Cross-site scripting (XSS), is one of the largest vectors for security attacks
on a web application. It occurs when you take user data and include it in your
page without first properly sanitizing it. This can cause your site to execute
source code from remote pages.
```javascript
function () {
let badge = document.getElementsByClassName('badge');
let nameQueryParam = getQueryParams('name');
/**
* What if nameQueryParam was `<script>sendCookie(document.cookie)</script>`?
* If that was the query param, a malicious user could lure a user to click a
* link with that as the `name` query param, and have the user unknowingly
* send their data to a bad actor.
*/
badge.children[0].innerHTML = nameQueryParam;
}
```
### Personally Identifiable Information (PII) should not leak
You bear an enormous weight of responsibility every time you take in user data.
If you leak data in URLs, in analytics tracking to third parties, or even expose
data to employees that shouldn't have access, you greatly hurt your users and
your business. Be careful with other people's lives!
```javascript
router.route('/bank-user-info').get((req, res) => {
const name = user.name;
const id = user.id
const socialSecurityNumber = user.ssn;
// There's no reason to send a socialSecurityNumber back in a query parameter
// This would be exposed in the URL and potentially to any middleman on the
// network watching internet traffic
res.addToQueryParams({
name,
id,
socialSecurityNumber
})
});
```
## Performance
### Functions should use efficient algorithms and data structures
This is different for every particular case, but use your best judgment to see
if there are any ways to improve the efficiency of a piece of code. Your users
will thank you for the faster speeds!
```javascript
// If mentions was a hash data structure, you wouldn't need to iterate through
// all mentions to find a user. You could simply return the presence of the
// user key in the mentions hash
function isUserMentionedInComments(mentions, user) {
let mentioned = false;
mentions.forEach(mention => {
if (mention.user === user) {
mentioned = true;
}
})
return mentioned;
}
```
### Important actions should be logged
Logging helps give metrics about performance and insight into user behavior.
Not every action needs to be logged, but decide with your team what makes sense
to keep track of for data analytics. And be sure that no personally identifiable
information is exposed!
```javascript
router.route('/request-ride').post((req, res) => {
const currentLocation = req.body.currentLocation;
const destination = req.body.destination;
requestRide(user, currentLocation, destination).then(result => {
// We should log before and after this block to get a metric for how long
// this task took, and potentially even what locations were involved in ride
// ...
});
});
```
## Testing
### New code should be tested
All new code should include a test, whether it fixes a bug, or is a new feature.
If it's a bug fix it should have a test proving that the bug is fixed. And if
it's a new feature, then every component should be unit tested and there should
be an integration test ensuring that the feature works with the rest of the
system.
### Tests should actually test all of what the function does
```javascript
function payEmployeeSalary(employeeId, amount, callback) {
db.get('EMPLOYEES', employeeId).then(user => {
return sendMoney(user, amount);
}).then(res => {
if (callback) {
callback(res);
}
return res;
})
}
const callback = (res) => console.log('called', res);
const employee = createFakeEmployee('john jacob jingleheimer schmidt');
const result = payEmployeeSalary(employee.id, 1000, callback);
assert(result.status === enums.SUCCESS);
// What about the callback? That should be tested
```
### Tests should stress edge cases and limits of a function
```javascript
function dateAddDays(dateTime, day) {
// ...
}
let dateTime = '1/1/2017'
let date1 = dateAddDays(dateTime, 5);
assert(date1 === '1/6/2017');
// What happens if we add negative days?
// What happens if we add fractional days: 1.2, 8.7, etc.
// What happens if we add 1 billion days?
```
## Miscellaneous
> _"Everything can be filed under miscellaneous"_
> George Bernard Shaw
### TODO comments should be tracked
TODO comments are great for letting you and your fellow engineers that something
needs to be fixed later. Sometimes you gotta ship code and wait to fix it
later. But eventually you'll have to clean it up! That's why you should track it
and give a corresponding ID from your issue tracking system so you can schedule
it and keep track of where the problem is in your codebase.
### Commit messages should be clear and accurately describe new code
We've all written commit messages like "Changed some crap", "damn it",
"ugg one more to fix this stupid bug". These are funny and satisfying, but not
helpful when you're up on a Saturday morning because you pushed code on a Friday
night and can't figure out what the bad code was doing when you `git blamed` the
commit. Write commit messages that describe the code accurately, and include
a ticket number from your issue tracking system if you have one. That will make
searching through your commit log much easier.
### The code should do what it's supposed to do
This seems obvious, but most reviewers don't have the time or take the time to
manually test every user-facing change. It's important to make sure the business
logic of every change is as per design. It's easy to forget that when you're
just looking for problems in the code!