Forensics Question: | |
OS Version: | |
Tools: |
The full blog version of this article can be found at www.doubleblak.com/blog/locations
Even though iOS Location Services has been around for a number of years now, there still seems to be some confusion about how and why it does what it does and how reliable the data is. I have testified in court a few times in relation to the data obtained from Location Services but since I recently incorporated this type of data into ArtEx, I thought now was a good time to do a really in-depth study of this technology.
This is a large post, but rather than break it up into smaller pieces, I thought I'd just segment it instead. Please bear with it and click the buttons to reveal additional information.
Also please note that I am not an expert in the mathematics required to calculate location information. What I am presenting in parts of this blog is simplified data to the best of my understanding as a result of research I have done. If you see anything that is incorrect please do not hesitate in contacting me and I will make the required changes. Ultimately, this post is about Location Services, but I couldn't write it without going over some other aspects of how it works.
To make this a little easier, I've also made this into 2 PDF documents for easier printing. The download links are available at the bottom of the page.
To my knowledge, all Smart Phones are Assisted-GPS devices (A-GPS) rather than straight GPS devices like in-car Sat-Nav systems are.
"Assisted-GPS" is the term used for the collection of technologies used by devices to determine their location.
Using Satellite GPS is the most accurate method for determining location but is not always practical (as described later), therefore other technologies already built into smart phones can be employed instead. Each technique has its own pros and con’s. An overview of each technology is shown below:
Select from the options below
Cell Siting WiFi Crowd Sourcing GPS
Pros : Fast, Very Low Power
Cons : Lowest accuracy
Cell Siting estimates the location of a device by using the cell signal from the cell tower which it is currently connected to and those around it in a process called Trilateration.
The accuracy of Cell Siting is drastically affected by the density of the towers. More Towers = Better Accuracy.
A device can uses the Received Signal Strength Indicator (RSSI) to calculate it’s approximate distance from the cell tower to which it is connected. The RSSI signal should be fairly consistant (notwithstanding obstacles) and so the lower the signal the further the device is from the tower.
Cell Towers constantly send out a signal which is picked up by the device. The device evaluates all of the signals received and selects the best quality signal to connect with. This is the tower that would be used if you made a phone call or sent a text message etc.
As mentioned, RSSI can be affected by obstacles, effectively weakening the signal which is why the distance is only an approximation.
For ease of understanding, I have grouped the distances in my examples into bands. So Under 1KM is Band 1, between 1KM and 2KM is Band 2 and so on.
This the only cell tower in range, it is therefore the strongest signal. The device "selects" this tower, they handshake and communication can occur.
So when trying to identify it's location, the device knows both tower ID and RSSI. In the example above, the phone would use the RSSI to calculate that it's distance would be around 4 to 5 km which I have placed into Band 5.
Some cell towers do have Omni-Directional antennae; which means a single antenna has 360 degree coverage. However, it is far more common to have numerous antennae on a single tower and all antenna focused in different directions. These are configured to best suit the location. We'll cover that in more detail later. For now, let's keep things simple with 3 antennae that each cover 33% of the radius.
This diagram shows an Cell Tower with 3 antenna, each servicing approximately 120 degrees (Realistically, there's likely to be some overlap). This now means that we know tower, antenan and band. We can work out that the device must be in the dark grey area indicated.
In remote, sparsely populated cell tower locations, this may be as good as you get. But in locations with multiple cell towers in relatively close proximity, the device can use the tower/antenna/band information from the towers to pinpoint it's location more accurately. The secondary towers do not "handshake" with the device, it is just passively emitting a signal which is picked up and measured.
This diagram shows how by using the same information (Tower/Antenna/Band) from 2 towers, the location can be more accurately identifed. The device has to be within the grey area because this is the only area where the cell tower location overlaps.
Adding a third or forth tower into the mix increases the accuracy even more.
Finally, the location can be pinpointed fairly accurately by using more towers and looking where the values overlap.
The screenshot below (from https://www.ertyu.org/steven_nikkel/cancellsites.html) shows the cell tower locations in and around Calgary, Alberta. If you apply the above logic to this map, it's easy to see why the accuracy of Cell Site information can vary so much. Devices in locations with sparse towers simply do not have the option of using multiple towers to determine their location.
As mentioned above, each tower can have numerous antenna, all configured to point different directions and cover different amounts of area
The configuration of the antenna is defined by AZIMUTH and BEAM WIDTH.
The AZIMUTH is simply the degrees clockwise from NORTH and The BEAM WIDTH is the degrees clockwise away from the AZIMUTH.
This image shows 5 antennae;
Colour |
|
|
---|---|---|
Green |
|
|
Blue |
|
|
Yellow |
|
|
Pink |
|
|
Cyan |
|
|
You can see that the area in Green starts at 0deg and travels 30deg. So the Azimuth is 0 and the Beam Width is 30.
The antenna configuration is determined by load; On the example above, the Cyan antenna is covering the same amount of degrees as the Green and Blue antennaes combined. This means that there are two antennae serving 120 degrees instead of one and would be used in more densley populated areas where you would expect the need for connectively to be higher.
Geography can play a massive role in cell tower reception; with mountains, buildings, bodies of water etc. all causing obstruction and reflection of the signal. This is great when it comes to getting a better signal to make calls, but is not so good when trying to work out the device location. Signal reflection means that the device may actually be connecting to an antenna not pointed in that direction at all.
Other factors that can affect cell tower location is number of towers (as discussed above), network traffic or maintenance.
It is important to note that Cell Tower locating is NOT as accurate as many people would like to believe it is. Being connected to a particular tower does not necessarily mean that it is the closet tower.
It's also worth noting that cell towers have a maximum theoretical range of 31km (~19 miles) but a more realistic range would be less than 15 km ~(10 miles).
Cell Tower locations are usually available (at a cost) from the cell phone providers. This data will likely include the Identifier, the Physical Address of the tower, the Azimuth and Beam width and Range etc. This information is good to know but due to the reliability issues described above, should never be used as conclusive evidence of location.
Another source of cell tower data is www.opencellid.org; an open source database of average locations. Users basically submit their actual location information to the OpenCellID servers along with information about which tower they are connected to at the time. Then, when a user requests to know where a specific tower is, the user submitted locations are averaged.
I actually lean to preferring the OpenCellID data compared to the actual cell tower location. I like it because it's real world data. You don't need to consider obstruction and reflection etc. as the location presented has already been subjected to real obstruction and reflection. Courts however seem to prefer actual Cell Tower addresses. Presenting both, and acknowledging that neither is perfect is in my opinion, the best way to go however.
Cell Towers are typically identified by several pieces of information (I say typically because there are sometimes differences in notation and probably different names/acroynms in different countries too)
Mobile Country Code (MCC) : Quite simply the country that the cell tower is in.
Mobile Network Code (MNC) : The owner/operator of the cell tower.
Cell ID (CID) : The Cell Tower identifier number.
To confuse matters a little more, the tower could be indentified as LONG or SHORT notation. A LONG CID is a combination of an RNC (Radio Network Controller) and the CID.
If you have a SHORT CID that you need converting to LONG:
LONG ID = 65536 * RNC + CID
If you have a LONG CID that you need converting to SHORT :
RNC = Long CID / 65536 (integer division) CID = Long CID mod 65536 (modulo operation)
Source : https://en.wikipedia.org/wiki/Mobile_country_code http://wiki.opencellid.org/wiki/FAQ
Pros : More accurate than Cell Siting / Relatively low power Cons: Not as accurate as GPS / Won't work in remote areas
WiFi Crowd Sourcing works in a similar way to Cell Siting except with WiFi networks.
Devices are constantly scanning for WiFi networks in an attempt to find a network it recognizes (ie one that you have connected to previously) and connect to it for you. Since this scan is occurring anyway, it makes sense to make additional use of this activity.
A WiFi network has an approximate radius of around 70m outdoors but only 45m indoors (due to obstructions such as walls). Obviously, as the device is further away, the signal strength becomes weaker. As with Cell Tower connections, the measurement of the signal strength is known as RSSI (received Signal Strength Indicator). The RSSI value can be used to approximate how far away the receiver is from the emitter. The stronger the signal, the closer the reciever is to the emitter.
The diagram below illustrates a WiFi signal in Red and the device in Blue.
The fact that the device is able to see the network at all means that it must be close; within about 65m (as an approximate once you consider the signal interferences). This would still actually give a fairly wide area, encompassing at least the 4 houses visible within the red circle.
If RSSI is calculated, then the devices approximate distance from the source can also be calculated. In our example above, this would mean that the phone would have to be at roughly the distance it is shown, although it could still be anywhere on the circumference.
Another flaw to this would be that the exact location of every router would need to be known by the phone. If a homeowner moved to a new house or even just moved the router to a new location in the current house, the location information would be affected.
While still based on the similar idea explained above, what actually happens is that a device uses a number of the WiFi networks that it can see, all at the same time. These are not networks that the user has necessarily ever connected. Think about when you are first setting up your new device and it presents you with all of the network names it can see for you to choose from (invariably including the ever funny droll "FBI SURVEILLANCE VAN")...
This is the same street as above but with the locations and theoretical radii of all WiFi networks shown.
If we place the device over on the right, you can see that the blue radius only intersects with 4 of the networks.
These 4 networks create a unique "fingerprint" that is not going to be found anywhere else in the world.
That unique fingerprint allows the device to work out its locations with decent accuracy.
On this example: On this example:
If the device moved down, it would lose If the device moved left it would lose sight
sight of network 12, but gain sight of the of network 12, but gain sight of the network 27. network 3.
Moving up would lose network 21 but gain Moving right would lose network 8 but network 7. gain network13.
But how does the unique fingerprint get resolved to a physical location? Wardriving basically.
Someone literally drove around recording their own GPS coordinates and the names/signal strengths of all visible WiFi networks at that time. This could have been done by Google or Apple while creating map data, by third party companies such as Skyhook or Wigle or by device users themselves who agree to send that kind of data back to the phone manufacturer / app developers in the form of diagnostics data. Note this paragraph on Apple's website:
Remember this paragraph for later on in this post too!
However the data is obtained, it results in a huge database of WiFi networks and their locations; a smaller subset of which is stored locally on each device.
And it makes sense. If the device had to go online everytime the location was needed, battery life would go down and data usage would go up. By having a subset of local networks saved on the device, the location can quickly be determined without the battery/data usage overheads. Many of the tests I have done have been with a non-internet connected phone and the locations have been resolved just fine.
Services such as Wigle.net map out the WiFi networks in a searchable way. This also demonstrates the massive difference between highly populated and lesser populated areas.
Bluetooth beacons can also be used in a similar way and are typically found in stores. For example, when you get near an Apple Store, you phone may pop up with a little "Hey! Your near an Apple Store" notification. The beacon location can also be used to track your location which is referenced when you turn off Bluetooth with the message "Airdrop, Airplay, FindMy and Location Services use Bluetooth".
Source : https://en.wikipedia.org/wiki/Wi-Fi_positioning_system
Source : https://support.apple.com/en-us/HT203033
Source : https://en.wikipedia.org/wiki/Bluetooth_low_energy_beacon
Pros : Most accurate
Cons : Most power consumption / slowest / heavily affected by interference
Finally, GPS can be used to locate the device. GPS is essentially a network of over 30 satellites orbiting the earth each of which contain a synchronized atomic clock and emit a signal which includes (but is not limited to) the time of the signal, an identifier for the satellite and the satellites current location.
The signal travels through radio waves to the earth at 299,792,458 metres/second (Speed of Light) but is slowed down by factors such as atmosphere and geology. As the satellites orbit at 20,200km above the earth, that means the signal takes somewhere around 0.088728049323 seconds to reach the centre of the earth and less to reach the surface.
Notice that the receiver has worked out an approximate sphere, not radius. This is because satellites are trying to track in 3 dimensions, not 2 like Cell Siting and WiFi Crowd Sourcing.
The general principle is the same though as once a minimum of 3 satellite signals are received /calculated, the device has 3 intersecting spheres to identify its latitude and longitude.
A single satellite is not very useful
A secondary satellite helps to A third satellite is required to find
as it would cover a vast distance narrow down the area. the users latitude/longitude.
(Anywhere within the red circle)
The timestamp that was sent by each satellite can be used to calculate even more precise location information by comparing the infinitesimally small differences in the time the signals were received .
One huge thing that GPS can do that both Cell Siting and WiFi Sourcing cannot do, is give the devices altitude.
Cell Siting and WiFi Sourcing both assume you are "ground" level, so using the method of trilateration explained above is fine. (WiFi 'fakes' the altitude by having a stored list of known altitudes for "ground" level in a given location it is not the devices actual altitude).
GPS however is able to work out the devices Altitude the same way as it works out the Lat/Long. This is the benefit of working with Spheres not Circles. Although it should be noted that: a) A forth satellite is required to calculate Altitude
b) Altitude is not 100% accurate. This is mainly due to the "geoid" shape used to represent the earth not being a perfect match to the earth itself.
Source : https://en.wikipedia.org/wiki/Geoid
Source : https://gulpmatrix.com/how-gps-satellite-navigation-works/
Source
Before you continue, I don't want you to miss out on the rest of the information in the Cell Siting, WiFi Crowd Sourcing and GPS sections. This is a reminder that you may want to scroll back to the top and move to the next section.
iOS comes with an API known as 'Location Services' for easily requesting the device location from the device.
Developers can specify how accurate they need the location information to be. This means that applications that just need a general area can ask for low accuracy data (a city or area for example) but applications that require more accuracy such as navigation services can request the higher accuracy; striking a balance between speed and accuracy as the app dictates.
Prior to iOS8, a "Significant" change in location meant when a device changes from one cell area to another and for the most part that was fine. But when iOS8 came out, Location Services had been overhauled to include Geo-Fencing services. It's what allows location based reminders such as "Hey Siri, Remind me to do X when I leave work".
Using Geo-Fences meant that it needed to be much more accurate that cell tower locations would allow and so WiFi crowd sourcing became a norm. It makes sense, because as mentioned above, WiFi networks are already being scanned anyway to find a network to connect to. Monitoring it for significant location changes is a small amount of extra programming. Basically, the device keeps track of your location and once you have moved a significant distance, it checks to see if there is a Geo-Fence setup for either the area you have entered or the area you have left and responds appropriately.
The iOS device has a large quantity of locations and WiFi networks stored locally in Cache.sqlite and threebars.sqlite (and possibly elsewhere too) so it is able to resolve locations from WiFi networks without even needing an internet connection. This list of networks appears to refresh every so often in order to keep up-to-date.
Now that we've covered the basics of how devices obtain their location, I'd like to take a closer look at when devices get their location data.
I'm sure that most people are familiar with "Frequent Locations" but for the sake of completeness I will describe it again.
The idea is that every time a device visits a location a record is kept that the device was there from time A to time B. When the device returns to that same location again, a second record is created and therefore the tally of "total visits" to that location increases.
Eventually, when the device has returned to the same location enough times, it is considered a
"Frequent Location". If you have an iPhone, you can check yourself what your Frequent Locations are by going to Settings > Privacy > Location Services > System Services > Significant Locations (Note the additional security requirement at this stage; this is why this data only appears on a Full FileSystem extraction).
The actual algorithms that Apple use to determine what is and what is not a frequent location, or even when to record a location visit in the cache are not documented anywhere. I have done exhaustive testing to try to help answer these questions..
I have an iPhone 6 running iOS12.4.1 which I use for testing. It doesn't have a SIM card and only has a WiFi connection when I'm at home. I placed this phone in the centre console of my car as I drove around the city. The phone was never even touched for the entirety of the journeys. It's also worth noting that this was the first time I'd taken this device to any of these locations.
I extracted the phone several times a day to monitor the changes and plotted out some of my journeys using ArtEx..
Over the course of several days (16th June to 23rd June), with minimal usage of the device, it still cached over 7,000 location records in com.apple.routined/Cache.sqlite database..
There was generally a noteable difference on the daily plots depending on how much traveling had been done.
Date |
|
|
|
---|---|---|---|
16th June |
|
|
|
17th June |
|
|
|
18th June |
|
|
|
19th June |
|
|
|
20th June |
|
|
|
21st June |
|
|
|
22nd June |
|
|
|
Notes:
Although the drive around the city on the 20th was approximately the same duration as the drive on the 21st, there was more time spent parked.
Although the 22nd was spent at home, the phone was being constantly moved around between rooms.
So as you can see, with virtually no usage of the device, it was still recording my location on average 42 times every hour, with increased frequency when in motion.
As for what causes the location to be recorded? I'm still not entirely sure. But remember that paragraph from Apples website from earlier that I told you to remember? In particular the bit that says:
" If you're traveling (for example, in a car) and Location Services is on, a GPS-enabled iOS device will also periodically send GPS locations, travel speed, and barometric pressure information to Apple to be used for building up Apple's crowd-sourced road-traffic and indoor pressure databases."
At random(?) times, iOS will just get a GPS fix and send that data to Apple. Awesome as that should mean even better location information that with WiFi Crowd Sourcing.
The time difference between records is sporadic at best; with gaps being anywhere between a single second and 20 minutes or more.
I am of the opinion that the sporadic nature of the data is a result of several things;
Being in a location with no/weak WiFi networks - As the location is determined by scanning for local WiFi networks, it stands to reason that no networks means no location data.
Unrecognized WiFi networks - maybe the networks that are being seen by the device aren't in the devices location database?
As stated, it's just periodic in order to update the Apple servers.
**Update : A complete oversight on my part is that as pointed out above, by test device does not have a SIM card installed. I decided to take a look at my real device and found that some of the cached locations were a 1000m+. This is clearly too large for a WiFi network so left me thinking that either the source WiFi data is inaccurate (unlikely as I would have probably observed the same low accuracy reading on my test phone) or that Cell Tower locations can be used too. I'm guess that switching between Cell Towers causes a location to be recorded but this is a little difficult to test. It's certainly good to bear in mind though and can account for some way-out accuracies.
Accuracy
The cache.sqlite database includes "Horizontal Accuracy" as a field which is a value in metres. Basically, it is an indicator of how accurate the phone believes the GPS co-ordinates to be.
ArtEx uses the Latitude and Longitude to plot the location on a map and draws an approximate radius around it using the "HorizontalAccuracy" information. The maps created by ArtEx look like this:
For anyone who hasn't used ArtEx, I should mention that it only draws maps when the Horizontal Accuracy is under 100m. This is simply to save on processing/memory and to make it easier to home in on the most accurate location data.
After plotting the map information for my journeys, I manually went though all of the created maps and verified their accuracy, including verifying the accuracy of the locations that were not plotted to maps.
Click on the links below to see a breakdown of the journeys in this case study
Journey 1 Journey 2
So now we have an idea of how often locations are saved in the Cache database and how accurate they are. But what flips the status from simply a location that was visited to one that is designated a "Frequent Location"?
Let's start by taking a look at the tables found in Local.sqlite and Cloud.sqlite (Both very similar content but as the names suggest, one is local on the device and one is designed to be sent to the cloud).
The three tables I'm interested in right now are:
ZRTLEARNEDLOCATIONOFINTERESTMO ZRTLEARNEDLOCATIONOFINTERESTTRANSITIONMO
Firstly, ZRTLEARNEDLOCATIONOFINTERESTMO is a list of the device's Frequent Locations. Of note, it includes ZLOCATIONLATITUDE, ZLOCATIONLONGITUDE, ZDATAPOINTCOUNT, ZPLACECREATIONDATE and ZPLACEEXPIRATIONDATE but it does not include any information about when they were actually visited.
I noticed that of the 6 locations, all had the same expiry date and the creation dates were not unique either.
You can see that records 1 and 2 share a Creation Date, as do records 4, 5 and 6 (well, within a few seconds anyway).
This means that the Frequent Location record in this table cannot possibly be made at the time a location is revisited and thus must happen afterwards, during a single process that works out all visits and creates the appropriate frequent location records in this table.
I typically found that this process ran every 2 to 4 days but don't quote me on that. There really wasn't any discernable pattern. I expected maybe every X days or every X records but neither seemed to be right.
The table ZRTLEARNEDLOCATIONOFINTERESTTRANSITIONMO relates to the movement between locations; You may have noticed on your own phone when reading Frequent Locations it often says something to the effect of "Arrived via a 10 min. drive". This table is how it knows that you drove and how long it took.
The ZLOCATIONOFINTEREST field relates to the Z_PK on the ZRTLEARNEDLOCATIONOFINTERESTMO table.
Again we have Creation and Expiry Dates, but now we also have ZSTARTDATE and ZSTOPDATE which relate to the journey to the frequent location. This table also has a
field ZPREDOMINANTMOTIONACTIVITYTYPE which is the mode of travel. 4 appears to be car whereas 1 appears to be walking.
ZRTLEARNEDLOCATIONOFINTERESTMO is a log of the time actually spent at the defined Frequent Locations.
Again you can see that a lot of records share the same creation date, suggesting that a block process creates these records at the same time.
You can see that there are:
3 records at 2020/06/10 at 07:38:10 (Green)
3 records at 2020/06/10 at 11:22:41 (Yellow)
2 records at 2020/06/17 at 06:59:56 (Pink)
2 records at 2020/06/18 at 07:45:05 (Orange) 6 records at 2020/06/22 at 18:36:31 (Red)
You will also notice that the creation dates are not in date order (It's ordered by the Z_PK field) and the CreationDates are also incredibly similar to the PlaceCreationDates of the ZRTLEARNEDLOCATIONOFINTERESTMO table.
So we can assume that there is a process that runs every few days that determines which locations have been visited enough to upgrade them to "Frequent Locations". The presentation below goes into a little more detail about how this probably works.
Interestingly, in the Cloud.sqlite database is a table called ZRTADDRESSMO which is the resolved address for each of the Frequent Location records. This includes Locality, City, Road, Area of Interest etc.. But this table is not found in Local.sqlite.
Note for testing : The records are created once a condition has ended. For example, the Frequent Location Visit record is only recorded in the database once I leave the location. Don't do test extractions expecting to see when you arrived at home/office if you are still there. Likewise if you were imaging a device at the scene, you will not get the time the device arrived there.
We should also define what a Frequent Location is in terms of area.
Obviously, to be useful in knowing when I'm at "Home" or "Work" or wherever, it's no good to have a specific Lat/Long coordinate. For example, I want my phone to know I'm at Home regardless if I'm pulling onto my driveway or at the far reaches of my backyard. For my Work address, the area may be even larger. So there must be some tolerance.
My testing has shown that once i am around 80m from my home address, my phone registers that I've now left that Frequent Location. I tested this by slowly moving away from the house and stopping frequently for a set amount of time. Once I extracted the phone, I knew how far away from my house I was at the time the "Exit" event occurred.
As this has been consistent at around 80m, I don't believe this is the result of a coincidence that my phone decided to create a record at that time and I instead believe that this is directly related to my phone leaving the WiFi range of my home network triggering a location check.
Of course, 80m is a fairly massive area...
The good thing about this is that just being close to a frequent location is probably enough to keep the record of it (instead of losing it as a purged cache record). The downside is that if you need highly accurate data, it may not be good enough.
**Update : According to further research, it seems that 250m is the radius used for Frequent Locations.
Another location artifact of note is the vehicle event locations stored
in ~/private/var/mobile/Library/Caches/com.apple.routined/local.sqlite. This appears to be somewhat related to the bluetooth connection information found in KnowledgeC.
The idea is that when I'm in my car, my phone is connected via Bluetooth. Once I stop driving and turn the car off, the bluetooth connection ends and a vehicle event of "Parked Car" is created.
In practice, I've found it to be cleverer than it sounds.
I conducted several tests with different variables:
While in motion, Bluetooth turned off on the phone: No vehicle event was recorded.
While in motion, Bluetooth turned off on the car stereo: No vehicle event was recorded.
Drive to location A, turn engine off and leave the phone static in the car for ~25 minutes: No vehicle event was recorded.
Drive to location A, turn engine off and walk away from the car with the phone: vehicle event created before I was even 100m away from the car.
I have tested these scenarios multiple times, with a few variables such as with/without network connectivity etc. and it appears pretty consistent that is the combination of Bluetooth Disconnection followed by walking that causes the phone to realise you have gotten out of the car and therefore create the Parked Car event.
This was important for a case I was working where I was able to correlate the bluetooth connection/disconnection events of a device to frequent location events.
The thought process was that when the bluetooth connects, it was because the engine started and a journey (ie. a Exit Frequent Location in this case) was anticipated.
Entering a Frequent Location (ie arriving home) would precede a Bluetooth Disconnection as the user arrived home and exited the car.
The issue was that there were bluetooth disconnection and reconnection event with no associated Frequent Location information. This suggested the user parked the car briefly but there was absolutely no Vehicle Park events for the day in question.
My determination was that the phone never left the car, hence a park event was never recorded. The time of the bluetooth disconnection/connection was extremely relevant and showed that *something* occurred at that time. This hypothesis was later supported by additional evidence found soon after..
** Update : As pointed out by Josh Hickman (@josh_hickman1 / thebinaryhick.blog), CarPlay also appears to create Vehicle Events. Alas, my car does not have CarPlay so I am unable to test. And the last time I drove a CarPlay vehicle was well over the 2 months that the vehicle events appear to be stored for.
**Update 2 : Further testing shows my hypothesis may be incorrect. On a single trip, I parked 3 times without getting out of the car. But only 2 park events were recorded.
Finally, it would be remiss of me to write an article about iOS Cell Siting and Location Services without mentioning netusage.db.
netusage.db can be found in ~/private/var/networkd/netusage.db and contains a table called ZNETWORKATTACHMENT.
This table keeps track of all cell towers connected to; but it only keeps the FIRST and LAST DATE the connection is made.
For example; My phone connects to tower A for the first time. A record is created with the Tower ID and today's date as both the FIRST and LAST connected dates.
When I connect to that tower again the day after, the existing record is updated with a new LAST connected date.
It's not great, but I used it (in absence of much other location data) to prove that a device had left the city and headed west, contradicting the suspect's story. Without going into too much detail, the device had numerous cell tower connection records that all showed a FIRST connection date of the date of the offence and I used their order of creation in the database to work out a rough direction of travel.
Of course, this was only really helpful because the distances involved were way too much to be the result of network handoff. The truly beautiful thing was that the cell tower connections pretty much matched the connections of my victim's device; for whom we also had Cached Location data which confirmed the location data obtained from the cell tower connections.
Reviewers found the content of the paper to be informative and detailed. They identified some grammatical issues and found that there were links/references to items that were not included with the paper. They recommended that those items be included or removed.
Reviewers indicated that understanding how location data has come to be on a device and the correct interpretation of that data is critically important. Location data being generated by the device indicating it was in a particular location versus data that has been sent to that device in the form of media images or parsed from social media, website visits, or email can be pivotal.
Some of the reviewers conducted their own research and were able to validate some of the author’s findings.
Future work could study the relation between the accuracy reported by the device and the errors effectively observed in real world situations. Future work could also include updating/verifying that the database structure is still the same in more recent iOS versions.
It would be helpful to conduct more research on the effect of having multiple devices visiting frequent locations at different times as well as the significance of the ZRTSETTLEDSTATETRANSITION table and potential factors affecting the radius associated with a specific frequent location.
Additionally, testing regarding trips with origins that are frequent locations but destinations that are not, the timing of writing vehicle park/frequent location events to the database, and dependence of location logging on an initial Wi-Fi connection would be helpful.
Jessica Hyde (Methodology Review, Validated Review Using Reviewer Generated Datasets)
Arica Kulm (Methodology Review, Validated Review Using Reviewer Generated Datasets)
Troy Pugliese (Methodology Review)
Linda Shou (Methodology Review, Validated Review Using Reviewer Generated Datasets)
Hannes Spichiger (Methodology Review)