Virtual File System
The virtual file system (VFS) is a component of the Consulo that encapsulates most of its activity for working with files represented as Virtual File.
It serves the following main purposes:
- Providing a universal API for working with files regardless of their actual location (on disk, in an archive, on an HTTP server, etc.)
- Tracking file modifications and providing both old and new versions of the file content when a change is detected.
- Providing a possibility to associate additional persistent data with a file in the VFS.
To provide the last two features, the VFS manages a persistent snapshot of some of the user's hard disk contents. The snapshot stores only those files which have been requested at least once through the VFS API, and is asynchronously updated to match the changes happening on the disk.
The snapshot is application level, not project level - so, if some file (for example, a class in the JDK) is referenced by multiple projects, only one copy of its contents will be stored in the VFS.
All VFS access operations go through the snapshot.
If some information is requested through the VFS APIs and is not available in the snapshot, it is loaded from disk and stored into the snapshot. If the information is available in the snapshot, the snapshot data is returned. The contents of files and the lists of files in directories are stored in the snapshot only if that specific information was accessed. Otherwise, only file metadata like name, length, timestamp, attributes are stored.
NOTE This means that the state of the file system and the file contents displayed in the Consulo UI comes from the snapshot, which may not always match the disk's actual contents. For example, in some cases, deleted files can still be visible in the UI for some time before the deletion is picked up by the Consulo.
The snapshot is updated from disk during refresh operations, which generally happen asynchronously. All write operations made through the VFS are synchronous - i.e., the contents are saved to disk immediately.
A refresh operation synchronizes the state of a part of the VFS with the actual disk contents. Refresh operations are explicitly invoked by the Consulo or plugin code - i.e., when a file is changed on disk while the IDE is running, the change will not be immediately picked up by the VFS. The VFS will be updated during the next refresh operation, which includes the file in its scope.
Consulo refreshes the entire project contents asynchronously on startup. By default, it performs a refresh operation when the user switches to it from another app. Still, users can turn this off via Settings | Appearance & Behavior | System Settings | Synchronize external changes [...].
On Windows, Mac, and Linux, a native file watcher process is started that receives file change notifications from the file system and reports them to the Consulo. If a file watcher is available, a refresh operation looks only at the files that have been reported as changed by the file watcher. If no file watcher is present, a refresh operation walks through all directories and files in the refresh scope.
Refresh operations are based on file timestamps. If a file's contents were changed, but its timestamp remained the same, the Consulo will not pick up the updated contents.
There is currently no facility for removing files from the snapshot. If a file was loaded there once, it remains there forever unless it was deleted from the disk, and a refresh operation was called on one of its parent directories.
The VFS itself does not honor ignored files listed in Settings | Editor | File Types and folders to ignore and excluded folders listed in Project Structure | Modules | Sources | Excluded. If the application code accesses them, the VFS will load and return their contents. In most cases, the ignored files and excluded folders must be skipped from processing by higher-level code.
During the lifetime of a running instance of an Consulo IDE, multiple VirtualFile
instances may correspond to the same disk file.
They are equal, have the same hashCode
, and share the user data.
Synchronous and Asynchronous Refreshes
From the point of view of the caller, refresh operations can be either synchronous or asynchronous. In fact, the refresh operations are executed according to their own threading policy. The synchronous flag simply means that the calling thread will be blocked until the refresh operation (which will most likely run on a different thread) is completed.
Both synchronous and asynchronous refreshes can be initiated from any thread. If a refresh is initiated from a background thread, the calling thread must not hold a read action, because otherwise, a deadlock would occur. See Consulo Architectural Overview for more details on the threading model and read/write actions.
The same threading requirements also apply to functions like LocalFileSystem.refreshAndFindFileByPath()
, which perform a partial refresh if the file with the specified path is not found in the snapshot.
In nearly all cases, using asynchronous refreshes is strongly preferred.
If there is some code that needs to be executed after the refresh is complete, the code should be passed as a postRunnable
parameter to one of the refresh methods:
In some cases, synchronous refreshes can cause deadlocks, depending on which locks are held by the thread invoking the refresh operation.
Virtual File System Events
All changes happening in the virtual file system, either due to refresh operations or caused by user actions, are reported as virtual file system events. VFS events are always fired in the event dispatch thread and in a write action.
The most efficient way to listen to VFS events is to implement BulkFileListener
and to subscribe with it to the VirtualFileManager.VFS_CHANGES
topic.
A non-blocking variant AsyncFileListener
is also available in 2019.2 or later.
See How do I get notified when VFS changes? for implementation details.
WARNING VFS listeners are application level and will receive events for changes happening in all the projects opened by the user. You may need to filter out events that aren't relevant to your task (e.g., via
ProjectFileIndex.isInContent()
).
VFS events are sent both before and after each change, and you can access the old contents of the file in the before event.
Note that events caused by a refresh are sent after the changes have already occurred on disk.
So when you process the beforeFileDeletion
event, for example, the file has already been deleted from disk.
However, it is still present in the VFS snapshot, and you can access its last contents using the VFS API.
Note that a refresh operation fires events only for changes in files that have been loaded in the snapshot.
For example, if you accessed a VirtualFile
for a directory but never loaded its contents using VirtualFile.getChildren()
, you may not get fileCreated
notifications when files are created in that directory.
If you loaded only a single file in a directory using VirtualFile.findChild()
, you will get notifications for changes to that file, but you may not get created/deleted notifications for other files in the same directory.