The original goal of Google Flu Trends (GFT) was to provide accessible data on influenza-like illness in order to reduce reporting delays, increase the spatial resolution of data, and provide information on countries outside the United States of America. Clearly, NDS cannot replace physician and laboratory data, though it can be used to augment the surveillance coming out of systems collecting that type of data. We define NDS as those data streams whose content is initiated directly by the user (patient) themselves. Even after electronic medical records (EMR) are at the fingertips of public health decision-makers and researchers, NDS will provide a snapshot of activity that is unrelated to the medical encounter. At the same time, no prediction is certain, as the future rarely repeats itself in the same way as the past. This suggests that large-scale experiments combining NDS could explore these behaviors. Despite the recent cessation of GFT, Google provided a living system for NDS surveillance. Although this is a challenge for translating NDS signals into estimates of disease incidence, it presents a unique opportunity to study health-seeking behavior. For example, NDS has facilitated an exploration of population-level changes in health-related behaviors following changes in tobacco-related policy or after unpredictable events such as celebrity deaths or cancer diagnoses. In these cases, although NDS-based systems are being asked to estimate data that are actually being collected, those data are not available quickly enough for use in public health decision making. For example, there are well-documented cases of failure when the training set does not contain important dynamics of the system. While the lack of validation is troubling, there is a deeper issue: many existing standards for validation may be inadequate for use on disease surveillance systems using NDS. 
This has also been done in the context of hospitalizations in Texas, mental illness, psychological manifestations of physical morbidities, and search queries from clinical decision support sites, such as UpToDate. Furthermore, the need for model validation highlights the importance, often overlooked in the existing NDS literature, of maintaining traditional/existing systems. However, access to these high-resolution data sets varies by public health level (local, state, federal, and international) as well as by user group: researchers, public health authorities, and the private sector. This would exclude data sources such as electronic health records, disease registries, vital statistics, electronic lab reporting, emergency department visits, ambulance call data, school absenteeism, prescription pharmacy sales, and serology, among others. Much of the recent criticism of GFT seems to stem from two issues: the first is the effect of changing user behavior during anomalous events, and the second is whether real-time nowcasting of influenza using GFT adds value to the existing systems available to public health authorities. Therefore, a component of validation must be the use of data that is publicly available (or at least available to researchers) for training and testing of NDS. Forecasting, however, requires ample historical data. Since the release of GFT, similar NDS-based systems have been developed to extend surveillance to places where resource or other constraints limit the availability of direct clinical or laboratory surveillance data and to improve the timeliness of detection and forecasting of disease incidence. This publication arose from a Santa Fe Institute workshop entitled "Next Generation Surveillance for the Next Pandemic." We wish to thank the attendees of this workshop, held May 18th-22nd, 2014 at the Santa Fe Institute in Santa Fe, NM, USA. Without these systems, it would be impossible to validate and update NDS-enabled systems. 
Despite this narrower definition, our suggestions for improving NDS surveillance may also be applicable to more established surveillance systems, participatory systems (e.g., Flu Near You, influenzaNet), and new data streams aggregated from established systems, such as Biosense 2.0 and the ISDS DiSTRIBuTE network. These validation procedures should include both best practices in machine learning and best practices from surveillance system design, such as assessing the proportion of persons identified who are true positives for the disease under surveillance. Realizing their potential will require more rigorous standards of validation and improved collaboration between researchers in academia, the private sector, and public health. The second criticism, the need for nowcasting, may depend on the user's access to different data sources. Finally, while the field has been critical of Google and GFT, it is because we are able to criticize: no other NDS-based system had continuously provided public health predictions for as long as GFT, many NDS surveillance systems had not been as carefully evaluated, and fewer still had been implemented prospectively. Novel data streams (NDS), such as web search data or social media updates, hold promise for enhancing the capabilities of public health surveillance. NDS should provide robust, long-term surveillance solutions. Peer review of systems must carefully evaluate validation relative to established surveillance systems. At a minimum, our definition of NDS would include Internet search data and social media, such as Google searches, Google Plus, Facebook, and Twitter posts, as well as Wikipedia access logs, restaurant reservation and review logs, non-prescription pharmacy sales, news source scraping, and prediction markets. 
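The surveillance-design measure mentioned above, the proportion of persons identified who are true positives, is commonly called the positive predictive value. A minimal sketch of how it could be computed follows; the function name and the boolean-list inputs are illustrative assumptions, not part of any system described here.

```python
from typing import Sequence


def positive_predictive_value(flagged: Sequence[bool],
                              confirmed: Sequence[bool]) -> float:
    """Proportion of flagged persons who are true positives.

    flagged[i]   -- hypothetical: the surveillance system identified person i
    confirmed[i] -- hypothetical: person i truly has the disease
                    (e.g., by laboratory confirmation)
    """
    if len(flagged) != len(confirmed):
        raise ValueError("flagged and confirmed must have the same length")
    true_pos = sum(1 for f, c in zip(flagged, confirmed) if f and c)
    false_pos = sum(1 for f, c in zip(flagged, confirmed) if f and not c)
    if true_pos + false_pos == 0:
        raise ValueError("no persons were flagged; PPV is undefined")
    return true_pos / (true_pos + false_pos)
```

Computing this measure requires confirmed case data from an established system, which is one reason validation depends on such systems remaining in place.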
Of 66 papers identified, only 27 (41%) performed any validation, only one stated that the source code was available, and while some used publicly available data, no papers publicly shared the data used in their analyses. NDS, by their very definition, do not have a long track record of use. For that reason, we advocate developing models by repeated training and testing on subsets of the data, with a final validation set held back entirely during model construction. As a result, developing NDS-based surveillance systems presents a number of challenges, many of which are comparable to those faced by systems comprised of more established data sources such as physician visits or laboratory test results. All authors read and approved the final manuscript. Recent efforts by Google and Twitter to better engage with the research community represent an important first step. NDS can help us understand and monitor health-related behavior, but little recent work has focused on this area. Identifying these opportunities will necessitate the involvement of public health authorities and an appreciation of the diversity of objectives and scales across agencies at different levels (local, state, national, international). In 2008, Google developed an algorithm that translates search queries into an estimate of the number of individuals with influenza-like illness who visit primary healthcare providers. All authors attended the Santa Fe Institute workshop entitled "Next Generation Surveillance for the Next Pandemic" and contributed to the intellectual content of the manuscript. 
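The procedure advocated above, repeated training and testing on subsets with a final validation set held back entirely, could be organized as in the following sketch. It assumes a chronologically ordered series and uses rolling-origin folds so that each test block follows its training block in time; that fold scheme and both function names are illustrative assumptions rather than a prescribed method.

```python
from typing import List, Sequence, Tuple


def hold_back_validation(series: Sequence[float],
                         holdout_size: int) -> Tuple[List[float], List[float]]:
    """Split a time-ordered series into a development part and a final
    validation part (the most recent observations), which is never
    touched during model construction."""
    if not 0 < holdout_size < len(series):
        raise ValueError("holdout_size must be between 1 and len(series) - 1")
    cut = len(series) - holdout_size
    return list(series[:cut]), list(series[cut:])


def rolling_origin_folds(n: int, n_folds: int,
                         min_train: int) -> List[Tuple[range, range]]:
    """Return (train_indices, test_indices) pairs over n development points.

    Every training window starts at index 0 and grows fold by fold; each
    test block begins immediately after its training window ends.
    """
    if n_folds < 1 or min_train < 1 or n - min_train < n_folds:
        raise ValueError("not enough data for the requested folds")
    block = (n - min_train) // n_folds
    folds = []
    for k in range(n_folds):
        train_end = min_train + k * block
        # The last fold absorbs any remainder so all points are used.
        test_end = train_end + block if k < n_folds - 1 else n
        folds.append((range(0, train_end), range(train_end, test_end)))
    return folds
```

A model would be fit and scored on each fold during development, and only the final chosen model would be scored once on the held-back validation part.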
NDS can increase the timeliness of surveillance information, improve temporal or spatial resolution of surveillance, add surveillance to places with no existing systems, improve dissemination of data, and measure unanticipated outcomes of interest. For public health authorities with access to high-resolution data on reported cases of influenza, simple autoregressive models can be used to nowcast with high accuracy. In this paper, we outline a conceptual framework for integrating NDS into current public health surveillance. Similarly, posting a Tweet about a "healthy recipe" is likely a different action than searching for a "healthy recipe": the former is an act of broadcasting information, while the latter is an act of searching for information. Although ready access to aggregated information from these excluded sources is novel in many health settings, our focus here is on those streams which are both directly initiated by the user and also not already maintained by public health departments or other health professionals. These steps are comparable to those prescribed for evaluating more established systems. While these steps help to ensure the validity of models, given the volatile nature of disease processes and human behavior (non-linear and non-stationary dynamics), it may be technically impossible to design robust surveillance systems using proxy data and regression models alone. The most studied example of the potential benefits and unique challenges associated with NDS comes from Google Flu Trends. Answering this question requires accurate forecasting of the spread of confirmed cases as well as analysis of the number of deaths and recoveries. Lastly, after these development and validation steps, models should be openly evaluated prospectively to further support their validity. A second important distinction is that surveillance needs, potential benefits, and general utility vary by country, region, and locality. 
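The simple autoregressive nowcast mentioned above, for authorities who already hold high-resolution case data, can be illustrated with a first-order model fit by ordinary least squares. This is a sketch under stated assumptions: the model order (one lag), the function names, and the input of evenly spaced case counts are all illustrative choices, not the method of any specific agency.

```python
from typing import Sequence, Tuple


def fit_ar1(series: Sequence[float]) -> Tuple[float, float]:
    """Least-squares fit of y[t] = intercept + slope * y[t-1]."""
    if len(series) < 3:
        raise ValueError("need at least three observations")
    x = list(series[:-1])  # lagged values y[t-1]
    y = list(series[1:])   # current values y[t]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    if sxx == 0:
        raise ValueError("lagged values are constant; slope is undefined")
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def nowcast_next(series: Sequence[float],
                 intercept: float, slope: float) -> float:
    """Estimate the not-yet-reported value from the latest reported one."""
    return intercept + slope * series[-1]
```

A model of this kind can also serve as a baseline: an NDS-based model adds value for such a user only if it improves on what lagged case reports alone already predict.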
A clear definition of appropriate baseline models is critical to assessing the improvement offered by new models that use novel data streams. The utility of models may be both better understood and easier to validate in real time and under continued prospective evaluation. In some cases, NDS can be used to assess behavior, something that remains a challenge for traditional case-based surveillance. This is an issue for existing systems as well, which may be limited in understanding such responses. Web-based participatory surveillance experience. In real-time.