# Solution
## Performance
I spent around 6 hours implementing this.
## Domain model
The original assignment leaves some questions unanswered; I tried to answer them myself:
* **Are tracking numbers unique across the entire system, or only within a single carrier?**
The sample data makes it seem as if these were not carriers' tracking numbers but parcellab's own tracking numbers
(serving as an abstraction over actual carriers).
On the other hand, the assignment says that customers should be able to look shipments up by tracking number and carrier.
In the end, I decided to use carrier + tracking number as an identifier (meaning that the carrier is required in the request,
and that requests with the wrong carrier will result in a 404).
* **For what point exactly should I retrieve the weather?**
The assignment mentions ZIP code in passing in an unrelated section.
But ZIP codes are not a great way to determine the weather in the receiver's area, because they can be very large
(e.g. 0872 in Australia is over 1500 kilometers across, with an area larger than Germany and France combined).
On the other hand, the addresses in the seed data are all fake ("Street 1" etc.), and one cannot get weather for them.
It is not clear why anybody would even need this data, or how they are going to use it.
(Using the location of a pickup point would probably make more sense than using the recipient's address, but we don't have that data.)
In the end, I decided to use addresses, and to replace some of the fake addresses in the seed data with real ones.
Which brings me to the next question...
* **What is the supposed scenario in which caching weather data will be beneficial?**
Do we assume that the application should return data for a lot of different packages to different addresses,
with only infrequent requests (e.g. once a day) for the same package?
Then caching weather data by location will not be of any benefit, instead only increasing our cache size
(the requests will never hit the cache, because every request is for a different location).
Or, alternatively, we could use coarser coordinates for caching, e.g. rounding them to 0.1 degree (roughly 11km or less),
so that we could still reuse the weather data even for different (but close enough) addresses.
This can be done in `src/packages.service.ts` (a sketch of the idea follows after this list).
Or do we assume that the application is going to receive a lot of requests for a small number of packages?
For simplicity, in this implementation I went with the latter assumption.
* **How to get weather, knowing only the recipient address (including the zip code)?**
Most, if not all, weather APIs accept location as a parameter, not an address.
So in addition to integrating with a weather API, I also had to integrate with a map (geocoder) API
in order to resolve addresses to locations (latitude + longitude).
* **What do tracking numbers even mean? Why are they not unique?**
I only noticed this at the last moment: in the seed data, there are multiple rows
with the same carrier + tracking number, same addresses, same statuses, but different products.
I guess this is because the data is supposed to come from some kind of SQL join query, returning one row per product.
I don't think this is a good way to get the data for such an API endpoint, but that's what the source data is.
Had I noticed it earlier, I would have written my code accordingly;
but since I only noticed it at the very end,
I just did a quick workaround to demonstrate that everything works, and changed the tracking numbers to be unique.
* **What weather data should we use?**
It is not clear why anybody would even need this data, or how they are going to use it.
I decided to use the current weather data instead of forecasts.
Since we're refetching the weather data if it's more than 2 hours old,
the "current weather" data will never be more than 2 hours out of date, and will still stay somewhat relevant.
* **What data to return? What if location or weather APIs are unavailable or return an error?**
I decided to always return the package data, and to return weather data only if both the location and the weather API calls succeeded.
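To illustrate the coarser-coordinates alternative mentioned above, here is a minimal sketch of what such a cache key could look like; the helper name and the `Location` shape are hypothetical, not actual code from `src/packages.service.ts`:
```typescript
// Hypothetical helper: derive a weather cache key from coordinates rounded
// to 0.1 degree (roughly 11km), so that nearby addresses share one cache entry.
type Location = { latitude: number; longitude: number };

function coarseWeatherCacheKey(location: Location): string {
  const lat = Math.round(location.latitude * 10) / 10;
  const lon = Math.round(location.longitude * 10) / 10;
  // toFixed(1) keeps the key stable despite floating-point noise
  return `weather:${lat.toFixed(1)}:${lon.toFixed(1)}`;
}

// Example: two Stuttgart addresses a few hundred meters apart map to the same key:
// coarseWeatherCacheKey({ latitude: 48.7758, longitude: 9.1829 }) === 'weather:48.8:9.2'
// coarseWeatherCacheKey({ latitude: 48.7784, longitude: 9.1801 }) === 'weather:48.8:9.2'
```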
## Decisions made, implementation details
* **Which framework to use**
Nest.js provides a good starting point, and I have already created services with Nest.js in the past,
so I decided to use it here as well to save time on learning new boilerplate.
* **Dependency injection**
All integrations, storages, etc. have proper generalized interfaces;
it is easy enough to switch to another weather / location provider or to another database,
simply by implementing another integration and switching to it as a drop-in replacement
in `src/app.module.ts`.
Additionally, since Nest.js unfortunately does not support typechecking for dependencies yet,
I implemented additional types as a drop-in replacement for some of the Nest.js ones (`src/dependencies.ts`);
they're quite limited, but they have all the features I used from Nest.js,
and they do support typechecking for dependencies.
* **Which APIs to use**
I decided to use Open-Meteo for weather and OSM Nominatim for location resolving,
because they are free / open-source, don't require any sign-ups / API tokens,
and are easy enough to use.
However, they have strict request limits for the free tier, so in my integration
I limited interaction with them to one request per second.
* **Resolving addresses to locations**
One problem with OSM Nominatim is that it returns all objects that match the query.
And very often, there are several different objects matching the same address
(e.g. the building itself, plus all the organizations in it, or other buildings with address supplements).
So I take the mean of all the locations returned by OSM Nominatim, and check if all returned locations lie
within 0.01 degree (1.1km or less) from the mean.
If they do, this means that all the results are close enough together to probably actually refer
to more or less the same location, so I return the mean.
If they don't, the results probably refer to different locations, and we cannot determine
which of those locations is the one we're supposed to return (for example, imagine the address
"Hauptbahnhof, Germany"), so I throw an error (a sketch of this check follows after this list).
* **Caching**
The assignment says that the weather data should not be fetched more frequently than every two hours for the same location.
(It also says "zip code", but then again, 0872 Australia.)
So I'm caching the weather data for two hours (TTL in `src/clients/weather.ts`).
But I'm also caching the location data for a day, because it's unlikely to change very often
(it certainly doesn't change as often as the weather),
and because I don't want to send too many requests to OSM.
* **Database implementation**
For simplicity, and to make this solution self-contained, I decided to use an in-memory database
for the packages, with a simulated 50ms latency (to make it feel more like a remote database).
If needed, it can be replaced by any other database, by creating a new implementation of `PackagesRepository`.
* **Cache implementation**
The same in-memory database (just a JS `Map`) is used, with simulated 20ms latency.
If needed, it can be replaced by any other caching solution,
by creating a new implementation of `ClearableKeyValueStorage`.
(Also, I should have created an intermediate caching layer handling the TTLs / expirations, sketched after this list,
but I didn't have time for that; so this logic _and_ the logic of loading data and storing it in the cache
both currently live in `storage/cache.ts`.)
* **Throttling**
One problem with complicated things like caching is that the naive implementations are not really concurrent-safe,
with the code assuming that the related cache state will not change while we're working with that cache.
Of course it doesn't really work like that, but as long as there is only one copy of our application running,
and no other applications work with the same data, we can simulate this easily enough by making sure
that concurrency-unsafe code is only executed once at a time for every set of data it operates on.
Since all that code in this application is supposed to be more or less idempotent, there is no need to wait
for it to resolve before calling it again; if some data is requested again for the same key, we can simply
return the previous (not-yet-resolved) promise.
This is implemented in `src/utils/throttle.ts` (a simplified sketch follows after this list).
Basically this means that if, for example, the location for the same address is requested twice at the same time,
the underlying function (checking the cache, querying the API, and storing the result in the cache on a miss)
will only be called once, and both callers will get the same promise.
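To make the "Resolving addresses to locations" logic above more concrete, here is a rough sketch of the mean-and-spread check (illustrative names, simplified to plain degree distances; not the actual integration code):
```typescript
type Location = { latitude: number; longitude: number };

// All geocoder results must lie within ~0.01 degree (roughly 1.1km) of their mean;
// otherwise the address is considered ambiguous.
const MAX_SPREAD_DEGREES = 0.01;

function consolidateGeocoderResults(results: Location[]): Location {
  if (results.length === 0) {
    throw new Error('Address could not be resolved to any location');
  }
  const mean: Location = {
    latitude: results.reduce((sum, r) => sum + r.latitude, 0) / results.length,
    longitude: results.reduce((sum, r) => sum + r.longitude, 0) / results.length,
  };
  const allClose = results.every(
    (r) =>
      Math.hypot(r.latitude - mean.latitude, r.longitude - mean.longitude) <=
      MAX_SPREAD_DEGREES,
  );
  if (!allClose) {
    // e.g. "Hauptbahnhof, Germany" matches many unrelated places
    throw new Error('Address is ambiguous: geocoder results are too far apart');
  }
  return mean;
}
```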
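The intermediate caching layer mentioned in the "Cache implementation" item could look roughly like this (a sketch of the intended shape under assumed interface names, not the code that currently lives in `storage/cache.ts`):
```typescript
// Assumed minimal storage interface; the real one is ClearableKeyValueStorage.
interface KeyValueStorage<T> {
  get(key: string): Promise<T | undefined>;
  set(key: string, value: T): Promise<void>;
}

type Timestamped<T> = { value: T; fetchedAt: number };

// Read-through cache: return the cached value if it is younger than ttlMs,
// otherwise call `load`, store the fresh value, and return it.
async function getWithTtl<T>(
  storage: KeyValueStorage<Timestamped<T>>,
  key: string,
  ttlMs: number,
  load: () => Promise<T>,
): Promise<T> {
  const cached = await storage.get(key);
  if (cached !== undefined && Date.now() - cached.fetchedAt < ttlMs) {
    return cached.value;
  }
  const value = await load();
  await storage.set(key, { value, fetchedAt: Date.now() });
  return value;
}
```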
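And the promise-reuse idea from the "Throttling" item, in a form simplified compared to `src/utils/throttle.ts`:
```typescript
// While a call for a given key is still pending, every additional caller gets
// the same promise instead of triggering the underlying work again.
function deduplicateByKey<T>(
  fn: (key: string) => Promise<T>,
): (key: string) => Promise<T> {
  const pending = new Map<string, Promise<T>>();
  return (key) => {
    const existing = pending.get(key);
    if (existing) {
      return existing;
    }
    const promise = fn(key).finally(() => pending.delete(key));
    pending.set(key, promise);
    return promise;
  };
}

// Usage sketch: two simultaneous lookups of the same address trigger one geocoder call.
// const resolveOnce = deduplicateByKey((address) => geocoder.resolve(address));
```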
## How to use the app
To lint: `npm run lint`.
To test: `npm run test` and `npm run test:e2e`.
To start: `npm run start`.
This is a RESTful API. To get package info, send a GET request to e.g. `/packages/UPS/TN12345679`.
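For example (assuming the app runs on the default port 3000, as in the benchmark below; the exact response shape is not spelled out here):
```typescript
// Node 18+ provides a global fetch
async function main() {
  const response = await fetch('http://localhost:3000/packages/UPS/TN12345679');
  const body = await response.json();
  // Per the description above, the body contains the package data,
  // plus weather data when both the geocoding and weather lookups succeed.
  console.log(response.status, body);
}

main();
```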
## Discussion points
> What were the important design choices and trade-offs you made?
See above ("Domain model", "Decisions made").
> What would be required to deploy this application to production?
First of all, we would need to identify what problem we are solving,
because this application does not seem to actually solve one.
Why would anybody need such an endpoint? How are they going to use it?
Without answering these questions, there is no point in deploying this application anywhere,
and there cannot be any clear understanding of the non-functional requirements.
We would also need production-ready, reliable (and probably paid) APIs for geocoding and weather,
an actual database (or API) for retrieving package data (instead of the in-memory key-value record),
and an actual caching solution (e.g. Redis), which would also mean rethinking how `throttle` is used here.
> What would be required to scale this application to handle 1000 requests per second?
This application already handles over 3000 requests per second with ease,
even with an artificial 20ms cache access latency (and artificial 50ms DB access latency).
Granted, this is all for the same URL, and it can be slower if different URLs are requested.
```
❯ ab -c 500 -n 100000 http://127.0.0.1:3000/packages/UPS/TN12345679
This is ApacheBench, Version 2.3 <$Revision: 1903618 $>
Percentage of the requests served within a certain time (ms)
100% 259 (longest request)
```
But the only reasons why it can be slower (in terms of rps) for different URLs are:
* Sending more requests to remote APIs
(this only affects local resources, because more bandwidth is used,
more requests are created and responses parsed, more sockets are open at the same time, etc.);
* Using larger caches
(the performance of a JS `Map` naturally depends on how many entries it holds,
and there are also memory constraints).
But, depending on the usage profile (how many different packages are requested, and how often?),
1000 requests per second should be doable.
# Nest project readme
## Description
[Nest](https://github.com/nestjs/nest) framework TypeScript starter repository.
## Installation
```bash
$ npm install
```
## Running the app
```bash
# development
$ npm run start

# watch mode
$ npm run start:dev

# production mode
$ npm run start:prod
```
## Test
```bash
# unit tests
$ npm run test

# e2e tests
$ npm run test:e2e

# test coverage
$ npm run test:cov
```
## Support
Nest is an MIT-licensed open source project. It can grow thanks to the sponsors and support by the amazing backers. If you'd like to join them, please [read more here](https://docs.nestjs.com/support).
## Stay in touch
- Author - [Kamil Myśliwiec](https://kamilmysliwiec.com)
- Website - [https://nestjs.com](https://nestjs.com/)
- Twitter - [@nestframework](https://twitter.com/nestframework)
## License
Nest is [MIT licensed](LICENSE).
