Trends in Ediscovery and Litigation Support
- Big Data and Ediscovery
- Top Ediscovery Trends of 2013
- Modus and Flat-fee pricing
- Email no longer top ediscovery priority
- How Clearwell and Symantec could benefit each other
- Sony and Geohot
- LegalTech New York 2011 Recap
- Litigation Support Software Options
- Free Ediscovery Course Online
- Ediscovery Jobs coming in 2011
- Digital Copiers as an Ediscovery Source
- Eighth Circuit Court Skips Opportunity to Enforce Legal Holds
There’s been a lot of talk about how leveraging Big Data in eDiscovery is going to revolutionize the industry. And in many ways this trend will almost certainly prove to be true. But sometimes the vendors pushing these tools can put the cart ahead of the horse. Unfortunately, learning valuable insights from Big Data is not as simple as pointing an eDiscovery tool at a data set and pressing a button.
Big Data has become a hip buzzword in the past year or two, and for good reason. IBM suggests that 90 percent of the data we can use in the world today has been created in the last two years alone. There are a couple of approaches to sifting through this massive data set.
The first approach is to hire a data scientist. These are people who are experts in data analysis and can help you get a solid grasp on what data you are looking at, and how the terabytes of data you have collected in eDiscovery can be used to your advantage.
The other approach is a bit more organic – make sure you have tools at your disposal to let your document review team come up with new ways of searching for documents. Many of the traits that make a great data scientist are also present in the analytical minds of attorneys. Don’t just lock your document review team in a room and review every single document!
One thing that’s difficult for lawyers to part with is the notion of the keyword search. Instead, as technology-assisted review (TAR) tools become more robust, they can be used to find important documents that could never be identified with a simple keyword search.
A new way to search documents is by using what’s called “concept searching” – the idea behind this is to find the true intent of the documents you are searching by using contextual clues. By analyzing an entire document for contextual clues rather than just picking words out of a document, it’s easy to reduce the number of false positives and make review more efficient. This is also especially useful when dealing with homonyms and common acronyms.
Another key new eDiscovery tool to point at large sets of data is that of predictive coding. Predictive coding works by relying upon reviewers to properly code a set of documents with a categorization, and the predictive coding engine then analyzes those documents and determines whether others are likely to be coded in a certain way (such as Responsive, Privileged, etc.). This makes document review much more efficient, as coders can merely verify that a document has been coded properly rather than making the determination themselves.
There are many ways that Big Data will soon be fully utilized by eDiscovery vendors. There is always a new technology on the horizon promising to make litigation simpler and cheaper. In reality, the best place to invest your dollars is not only in cutting-edge products, but with employees who know how to leverage these new document review tools to get results.
There were many changes to the ediscovery landscape in 2012 – I think many judges are getting sick and tired of counsel being unable to coorperate on ediscovery matters. The changes made to the Federal Rules of Civil Procedure were made in 2006, and seven years later attorneys still are unable to figure it out.
Which beings me to my first prediction for the ediscovery landscape: more sanctions! That actually stands in contrast to what many vendors are predicting – Kroll expects less sanctions in 2013, as firms will be able to get a better grasp on the vast amounts of data. But Big Data is not the only issue at work here – there are plenty of clever (and somewhat defensible) ways that attorneys can reduce the scope of their discovery by pointing to a product without knowing exactly what it does.
My second prediction is of course the rise of technology-assisted review (TAR). This term means many things to many people, but I consider it to include advanced analytics, data sampling, and predictive coding. The predictive coding aspect is probably going to be the most challenging technology to put into place, as the system needs to be “trained” with hot and responsive documents in order to work, but it can greatly reduce the high cost of linear document review.
My final “prediction” – and I’m loathe to call it that – is that law firms and document review systems will continue to be breached by attackers looking to gain an edge into a particular company or piece of litigation. Increased security will need to be deployed for ediscovery review software, and at least one major law firm will be forced to report a breach this year.
And with that, my ediscovery predictions for 2013 are complete – I have more predictions, but would prefer to work them into some future blog posts instead.
Just noticed an article in TechCrunch which noted Modus is attempting to sell their eDiscovery services at a flat-fee rate. Many companies are attempting this strategy now, as lawyers seek to cut costs by eliminating traditional billing methods. The question among eDiscovery vendors is whether or not they can leverage “Big Data” (and I don’t really like that term because it’s a buzzword but it’s there) to their advantage. Given the vast amount of tools out there that can analyze vast quantities of data, I would bet that it’s going to be the way of the future. Forget massive-scale document reviews – if you’re looking for a needle in a haystack, why not reduce the size of the haystack?
According to a survey released by Symantec, email is no longer the primary source of ediscovery responses. While email can provide important communications and contextual information, loose files (such as Microsoft Office documents) and database files are being produced on an increasingly regular basis. The inclusion of databases in particular indicates that more and more litigation could be focused on theft of intellectual property – when a disgruntled employee leaves one company for another, customer databases are a ripe target for theft. Of course, confidential and proprietary documents are also high on the list, so look for companies to become even more litigious as their IP is increasingly at risk.
When the news of the acquisition of Clearwell by Symantec was announced last month, it didn’t seem to make much sense at first glance. Why would an antivirus company want to suddenly purchase one of the top ediscovery solutions on the market?
While Clearwell is not the shining star of the industry it once was, perhaps Symantec could leverage Clearwell’s search technology into a desktop-level applet used by many of the enterprise security/ediscovery vendors. The antivirus market has grown from merely searching for signatures of known viruses to searching for patterns in data. Symantec’s knowledge in this arena can be complimented by an ediscovery solution, which is also similarly designed to look for patterns in data.
I don’t know how this will play out, but it’s difficult to ignore the parallels between the two industries! I’ll wait for a while longer, then it might be time to update my comparison of document review platforms.
There are a couple of interesting developments in the case brought by Sony against George Hotz, one of the hackers who helped jailbreak the Playstation 3.
The first development is that a judge has allowed a subpoena to be issued to Geohot’s webhost, revealing the IP addresses of anyone who has visited Geohot’s website (Geohot is the online pseudonym for George Hotz – I’m sure you can figure out it’s origins) since January 2009. In addition, subpoenas have been issued to YouTube and Twitter for information relating to George Hotz on those sites as well.
The latest development is that now George Hotz’s Paypal records can also be accessed, as Sony will attempt to use this information to show that California has jurisdiction in this case, rather than New Jersey, where Hotz resides.
What do you think? Is the magistrate judge in this case overstepping his bounds, or does Sony have a right to dig up all the dirt they can on this hacker?
It’s a bit late for a LegalTech recap, but better late than never, right?
Gabe Acevedo was covering LegalTech for the great Above the Law blog. While that blog doesn’t usually focus specifically on ediscovery issues, this year they brought in an ediscovery pro to cover the event. I’m not surprised that people like to tout the benefits of “the cloud” but back off when pressed for details. There’s still the impression that the use of tech buzzwords will automatically translate to success in the legal field, which is obviously not the case. In my opinion, there are still opportunities for the right technology team to come up with a great ediscovery product that is useful, secure and “buzzword compliant.”
The other big thing this year is the emergence of predictive coding. While this technology has been around for awhile, it’s finally matured enough to be intergrated into some ediscovery products. I’ll have a roundup of litigation support products that implement predictive coding in a future blog post.
When it comes to choosing litigation support software, there are a few options out there. You need to assess what data you have (or could request via discovery), how you are going to analyze that data, and how you are going to leverage that data to better help your client.
There is no one-size-fits-all solution for litigation software, and you have to make some important decisions when selecting an appropriate platform. First, how much data will you be receiving and in what format will it be in? If the only responsive documents are some emails, what format will they be provided to you in? Can you just open and review the files through a native application on your computer? If so, great! But if some responsive documents are part of a complex specialized software package (for example, AutoCAD files), will you be able to review them properly? You may need a litigation support vendor to convert those files to TIFF for you.
Better yet, have those special files reviewed by an expert – converting documents to TIFF can eliminate the actual value of a file. Think of what you use a spreadsheet for – you can have formulas in certain rows and columns, you can sort columns by any number of criteria, etc. Now, if someone took away your computer and replaced it with a printout of that spreadsheet, try getting some work done! That’s what happens when a heavy-handed solution is used where some precision is required.
Another thing to consider with your litigation support software is how many people will be using it to code? Factor in the cost of actually training people on how to properly use the software. Too may times we have had to deal with users who only bothered to learn enough to get by, but when a deadline comes up, they need to know how to perform complex searches, bates number documents, endorse documents, and get them to the court by 5:00 P.M.! Often, after the deadline has passed, that same user will come back and say “boy if I had known what this software could do earlier, I wouldn’t have needed to rush.”
So with ediscovery software, there are many factors to consider. Anyone who flat out tells you that XYZ platform is “the best” probably has limited experience with other platforms. Each has its use and purpose, but of course, we all have our own preferences!
Ralph Losey has made an ediscovery training course available online. According to the site, the bulk of the coursework will take 50 to 150 hours to complete. Sure, you only get the first 15 modules for free (there are sixty modules total). But registration for this course is free (it’s called Basic Registration), so at least you can take it out for a test-drive and if it’s not up your alley, you won’t be forced to continue down that path. Registration is open to everyone, so I’m probably going to take a look at this and see what’s inside. It’s not everyday you get free ediscovery training – here’s a list of the modules from the syllabus.
According to Gabe’s Guide to Ediscovery, the job market for ediscovery professionals is about to heat up. Does this mean I need to demand a raise or look for a new job?
I wonder how the candidates are going to differentiate themselves from one another – experience is important, but so is the ability to use the right tools for the job. Legal services and litigation support vendors will be able to leverage this ediscovery surge into more lucrative contracts that have dried up over the past couple of years.
If the rise in work for ediscovery professionals continues, this may also lead to some traction on ediscovery certification efforts. Considering the relative size and projected growth of the industry, it’s as good a time as any to start some sort of standardization for professionals. Right now there are many diverse skill sets that ediscovery professionals have, but in order to move upward (or even laterally) within the industry there needs to be some sort of quality control over the workforce.
Recently, a team over at CBS News uncovered a new potential security hole at many corporations – the digital copier. Essentially, they purchased several copiers used, then managed to “scrape” documents off of each of the copiers hard drives, exposing sensitive documents previously thought to be deleted. The question then becomes, what needs to happen to a modern printer in the event of a legal hold?
I don’t think that documents that require special software to extract should be produced during discovery. During the normal course of business, these documents were copied or scanned and should reside in other, more accessible places. Of course, this all changes if it turns out that a party is acting in bad faith.
I would also not be surprised if, as one result of increased ediscovery productions, companies started looking toward “dumb” technology in order to better serve their needs. Attorneys still communicate with experts over the phone because electronic correspondence is discoverable in some circumstances. Similarly, companies might decide that the added features of these new fancy copiers aren’t worth the bother if they’re just creating more headache (and expense) for the legal team.
Just ran across this case which took place earlier this year. The Eighth Circuit court declined to enforce sanctions against the City of St. Paul for failure to enforce a legal hold. If you read further, you’ll discover that one reason the court did not act on this is that plaintiff’s counsel did not properly brief the court on this issue, and did not show any evidence that defendants acted in bad faith.
In short, the district court (first the magistrate judge and as affirmed by the district court judge) held that spoliation sanctions for the failure to implement a litigation hold were not warranted. The Eighth Circuit affirmed using an abuse of discretion standard of review. In other words, did the lower court’s opinion deviate from applicable law in an arbitrary or unreasonable way. The Eighth Circuit held that the record failed to establish prejudice to the plaintiffs or an intentional destruction of evidence or withholding of evidence in an attempt to suppress the truth. For anyone keeping score on litigation hold decisions, this is very helpful to an organization that accidentally loses or destroys electronically stored evidence – with or without a legal hold in place.
Early Case Assessment is becoming an important part of the ediscovery landscape. Those who have thorough ECA practices will often see an increase in ROI on cases as analytics tools become more robust. Proper ECA practices allow a law firm to plan strategically for litigation, in addition to reducing overall ediscovery costs. While ECA is not a money-saving silver bullet in every case, there are steps your law firm can take to reduce overall cost while improving the quality of output.
The first step is often to identify the key players in the pending litigation. This is where you’re going to want to question key witnesses and construct an organizational chart of parties that may be involved. With this data, you can better evaluate whether this litigation is going to be quick, or if you may need to dig in for a protracted battle.
At this stage, you now have enough information to issue litigation holds for key custodians and be able to determine the potential scope of ediscovery. Server shares, mail accounts, document management software, and backup tapes should all be evaluated for litigation responsiveness.
There are further steps, of course, but at this point you should have a grasp of the amount of data that could potentially be involved in this litigation. Additional steps will depend on the amount of data you have identified and collected, but you will likely want to load this into a document review platform (if the data set is small and is ready for attorney review) or into a tool with more robust features such as Clearwell or EMC’s Kazeon.
Early Case Assessment is an important tool for reducing overall cost and better preparing you for litigation. The longer you ignore tools and processes like this, the faster you will fall behind the ediscovery curve.
Fulbright & Jaworski have released their 7th Annual Litigation Trends Survey. EDD Update has a few of the highlights, including the result that regulation compliance is becoming a chief area of concern for survey respondents.
I also noticed that there are a high number of respondents that believe the Federal Rules of Civil Procedure should be pared down to limit the scope of electronic discovery in civil cases. I have to generally agree, but I wonder what type of data should be off-limits for ediscovery? If anything, I would like tape backups or other media that is not used during the normal course of business to be discoverable only if data was found to be intentionally or maliciously deleted.
Of course, I’m still somewhat surprised that more firms aren’t employing social media to their advantage, but then again, I wonder how much of an advantage can be really be had in this industry. For example, the exposure/usage of a Facebook group is probably minimal for a law firm, but that’s no reason to avoid having a simple blog on your website. I wouldn’t be surprised to see some ambitious junior associates willing to write a few paragraphs every month on interesting legal topics, and it offers a stage for impressing senior partners.
Even one of the biggest names in data storage cannot escape a costly mistake when it comes to eDiscovery.
SanDisk was requested by opposing counsel to produce the hard disk images that were used by two former employees. No problem, right? SanDisk had already put away the laptops temporarily (in compliance with a legal hold), but decided they would like to use the laptops again, so they moved the former employees’ images to their servers.
Then, of course, they lost the data.
In the end, SanDisk was sanctioned not because they couldn’t find what they were looking for, and not because of the process they used to recover the data (they took it from a tape backup), but because they had not been forthcoming about what really happened to the data. Because of this, plaintiffs endured a longer delay than was originally anticipated, hence the need for the ediscovery sanctions.
Last time, we discussed some trends in the ediscovery market. One trend we’re seeing within organizations is an increased focus on information management.
This information management focus comes from a perfect storm of problems that companies are facing right now. Regulatory compliance is at the forefront of most technology-related decisions, but other factors play a role in deciding how to get a grip on how information flows within an organization.
One important aspect that any information management system must account for is the ability to create a legal hold. When an employee or company may the subject of litigation, it is important that documents are properly preserved to avoid evidence spoliation. Having a single appliance that can defensibly handle this task would be of great benefit to any company. Most solutions we are seeing deployed now are much more patchwork, as companies also worry about things like DLP (data loss prevention) and the security of proprietary information.
A few weeks ago, the annual Socha-Gelbmann Electronic Discovery Survey was released, highlighting several trends in the ediscovery market.
First, let’s start with market size: it appears that the overall ediscovery market grew in 2009, up 10% from 2008. This is modest growth in good years, but in these lean times it shows that law firms have found value and are actually saving money by using ediscovery services. Expect to see beat-the-market growth from the vendors who do the best job of culling document sets down, leading to less attorney document review work.
However, another recent trend may be pushing documents out of the hands of ediscovery vendors. Targeted collection and preservation has made some of these document culling tools less useful, as attorneys gain experience in requesting specific documents during litigation. The usefulness of the blanket “all documents relating to XYZ” request is outweighed by the cost of culling those documents down and then reviewing them. This will be especially true as the data deluge continues, with ediscovery document sources becoming more numerous with no end in sight.
This is good news for many law firms engaged in litigation. Instead of paying contract attorneys to review large swaths of varied documents, you can have those lawyers (and your experts) focus on a smaller set of documents, reducing cost.
Next post (later this week!) we’ll include some information on how the 2010 Socha-Gelbmann Electronic Discovery Survey relates to information management, and we’ll discuss a few other trends that we expect to continue in 2010 in the ediscovery market.
Clearwell has taken the lead in the mobile ediscovery market with their new product, iClearwell. It’s a mobile app for the iPhone and iPad which allows end users to quickly get information about cases and Clearwell appliances.
Clearwell is taking advantage of the law firms who have slowly been migrating to the iPhone as their preferred mobile platform. While the once-ubiquitous Blackberry continues to be a major player for many reasons (IT support and familiarity being important, not to mention a keyboard with tactile response), development and visibility for iPhone apps remains a distinct advantage for Apple.
Time will tell if this means that Clearwell is likely to develop for that other important mobile OS, Android. I think Android has a bright future, but the full computing power of the OS has yet to be harnessed in the corporate world.
This new app is also a great marketing tool for Clearwell – it shows that they are responsive to the market, and makes Clearwell data easy to manipulate, display, and ultimately show off to clients. CW’s web interface looks nice and everything, but this gives a legitimate reason to show off your new iPad!
Quick news on the ediscovery front – AccessData and CT Summation are going to merge. Summation iBlaze is one of the more popular document review platforms around, loved and despised by many litigation support professionals – it all depends on who you talk to and what coast you’re on. CT Summation has other tools as well, such as CaseVault and Discovery Cracker.
AccessData has typically been more focused on the data collection side of the EDRM model. Their core business is focused on computer forensics and cyber security, but this acquisition allows them to expand a bit more into the ediscovery realm. It will be interesting to see how they integrate their Forensic Toolkit (FTK) and incident response software into CT Summation’s review and processing platforms.
Interesting article in the New York Times about the modern use of cell phones. Well, it’s not so much that they’re phones, but more that they’re devices for sending and receiving data. Increasingly, people are using their cell phones in a manner we have typically associated with computers – checking Twitter, Facebook, email, etc. Those who work on the collection side of the ediscovery realm need to come up with a way to collect this data and present it so that it can be either manually reviewed or loaded into a document review platform.
This is a bit of a new challenge – while most companies think about the challenge of culling large data sets (email archives, for example), a new emphasis will also need to be placed on the collection of small amounts of data from disperate and uncontrollable environments. Company XYZ may be able to cull and produce responsive documents within their possession, but who is responsible for the collection of relevant information that may be posted to Facebook? What if that information is set to “private” – is it discoverable, and how do you go about collecting it?