CGIAR Platform for Big Data in Agriculture
Plan of Work and Budget 2021

CGIAR, Platform for Big Data in Agriculture

The CGIAR Platform for Big Data in Agriculture is a cross-cutting program of the global CGIAR consortium of non-profit research institutes, which look into virtually every aspect of food security: genomics, breeding, agroecology, climate science, and the socioeconomic drivers and context of food systems change. The Platform tends to data standards and data sharing, digital innovation strategy and technology transfer, and research into the intersection of digital technologies and agricultural development in emerging regions.

CGIAR is a global research partnership for a food secure future dedicated to reducing poverty, enhancing food and nutrition security, and improving natural resources.

https://bigdata.cgiar.org/

Table of Contents
1 ADJUSTMENTS TO THEORIES OF CHANGE
2 PLANS AND EXPECTED PROGRESS TOWARDS OUTCOMES
3 FINANCIAL PLAN FOR THE COMING YEAR, INCLUDING USE OF W1/2
TABLES
TABLE 2A. PLANNED MILESTONES
TABLE 2B. PLANNED EVALUATIONS/REVIEWS, IMPACT ASSESSMENTS AND LEARNING EXERCISES
TABLE 2C. PLANNED MAJOR NEW COLLABORATIONS
TABLE 3. PLANNED BUDGET
TABLE 4. ESTIMATED 2020 CARRYOVER & 2021 BUDGET TABLE

1 ADJUSTMENTS TO THEORIES OF CHANGE

The Platform for Big Data has over the course of four years developed a cross-cutting digital innovation function for CGIAR. The Platform set out to demonstrate that data standards, sharing, and analysis; partnerships and technical communities of practice; and applied innovation processes, strategy, and management can mutually reinforce each other to accelerate inclusive, impactful digitization in our sector and keep CGIAR abreast of, and evolving with, digital disruption in how CGIAR research is done and the contexts in which it is delivered.
The Platform Theory of Change (TOC) has both organizational and sector-level milestones, focused on helping CGIAR and partners increase their capacity to embrace big data and information and communications technologies (ICTs) through:

1. enhanced collaboration across Centers, Programs and partners in using state-of-the-art data standards, analytics and ICTs;
2. enabling unrestricted discoverability of inter-linked datasets to understand and tackle multi-faceted food security challenges in new data- and evidence-driven ways;
3. leveraging CGIAR expertise in the broader big data and ICT sphere, to establish CGIAR as an innovative digital thought leader; and
4. development of initiatives using proven big data innovations to drive agricultural growth in developing countries.

This Theory of Change has not changed, and the Platform can point to important progress under each of these goals. Implementation of the Platform, however, has highlighted the importance of change management within the System to achieve the full potential of digital tools and technologies within our organization and our sector. In 2021, the Platform aims to help launch a new One CGIAR digital strategy building on or helping to embed key learnings from the Platform, and will promote ways to tend to the change management needed to mainstream it across the new, more unified CGIAR organization.

Understanding the role of CGIAR in facilitating sector-wide digital transformation has deepened over the course of the past four years. As a result, the Platform now places more emphasis on strategic alliances to help our partners apply global data standards for making development data Findable, Accessible, Interoperable and Reusable (FAIR), and to demonstrate the value of FAIR data via analytic pipelines in support of commonly encountered use cases (e.g. climate adaptation options by location, analysis of return on investment of fertilizer use).
The Platform will also synthesize evaluation and learnings from the Inspire Challenge, CGIAR’s signature digital innovation process, regarding how innovation strategy and management can be applied to increase the uptake and impact of digital innovations in agricultural research for development.

2 Plans and Expected Progress Towards Outcomes

The Platform for Big Data catalyzes data-driven innovation across CGIAR through data standards and sharing; analytics; partnerships and technical communities of practice. These approaches are reinforced by synergistic innovation processes to accelerate inclusive, impactful digitization in agriculture, helping CGIAR evolve in step with rapid changes in the research landscape, and establishing CGIAR technical leadership in the global digital agriculture sector. 2021 will be a year of transition to a unified CGIAR, and BIG DATA will contribute to the digital dimensions of this vision and seek to launch a new digital strategy in support of it.

Building momentum for digital innovation throughout the One CGIAR transition will be a central theme for the Platform in 2021. Applied, large-scale modeling that combines CGIAR expertise with the computing power and data of industry partners will demonstrate the power of unified research infrastructure. Seven technical communities of practice, now totalling 5,000 members, will leverage domain expertise from across CGIAR and beyond to promote specific proofs of concept demonstrating the power of collaborative networks for One CGIAR. The Platform will seek to crowd in new investors in the Inspire Challenge, CGIAR’s signature digital innovation process for fostering data-driven innovations, and integrate this process with CGIAR-wide research innovation strategy and management. The whole of the Platform team will be engaged in supporting the transition to a digital One CGIAR.
Module One: ORGANIZE

Under Module One (“Organize”) the Platform will continue to build a global knowledge base and data ecosystem supporting agricultural research for development through the Global Agricultural Research Data and Innovation Network (GARDIAN), a pan-CGIAR and partnership-driven data ecosystem. The GARDIAN team will continue to enable search across all CGIAR repositories and connect more repositories from strategic partners, substantially building on GARDIAN’s 170,000 publications and almost 28,000 datasets. Module One will work with the CGIAR Excellence in Agronomy (EiA) initiative, a key partner and client which will also contribute to the data standards, management, and analytic capabilities on offer for CGIAR and other researchers.

The data ecosystem will be enriched by a data management toolkit with guidelines, services and tools, allowing users to more easily align with best practices towards making data assets open, FAIR-standards compliant (i.e. data that are findable, accessible, interoperable, and reusable), and ethically managed. The FAIR data discoverable via GARDIAN will help researchers derive value from large amounts of data, aided by workshops to build CGIAR data science capacity. These efforts will be augmented by data processing and machine learning algorithms available through GARDIAN, and by an analytic environment, Collaborative GARDIAN Labs (CGLabs). In 2021 the team will prototype connectors between GARDIAN data and two commonly-used crop simulation models (WOFOST and DSSAT) leveraging data-to-model translators developed by the University of Florida as part of the Agricultural Model Intercomparison and Improvement Project (AgMIP).

CGIAR’s open and FAIR knowledge base will be further enriched through GARDIAN’s FAIR data workflow for helping researchers easily apply data management standards to high-value legacy data.
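As an illustration of what applying data management standards to a legacy dataset can involve, the following is a minimal sketch in Python of a metadata completeness check grouped by the four FAIR principles. It is not GARDIAN’s FAIR data workflow or scoring method: the field names, the grouping in `FAIR_CHECKS`, the functions `fair_completeness` and `missing_fields`, and the sample record are all hypothetical and chosen only for illustration.

```python
# Hypothetical mapping from each FAIR principle to a few illustrative
# metadata fields. These names are assumptions for this sketch, not
# CG Core elements and not GARDIAN's actual FAIR indicators.
FAIR_CHECKS = {
    "findable": ("identifier", "title", "keywords"),
    "accessible": ("access_url", "access_rights"),
    "interoperable": ("format", "vocabulary_terms"),
    "reusable": ("license", "provenance"),
}


def fair_completeness(record: dict) -> dict:
    """Return, per FAIR principle, the fraction of listed fields that are filled in."""
    scores = {}
    for principle, fields in FAIR_CHECKS.items():
        present = sum(1 for name in fields if record.get(name))
        scores[principle] = present / len(fields)
    return scores


def missing_fields(record: dict) -> list:
    """Return the sorted names of all listed fields that are empty or absent."""
    return sorted(
        name
        for fields in FAIR_CHECKS.values()
        for name in fields
        if not record.get(name)
    )


# A made-up legacy record with only partial metadata.
legacy_record = {
    "identifier": "doi:10.0000/example",  # placeholder, not a real DOI
    "title": "Maize trial, 2009",
    "format": "CSV",
    "license": "CC-BY-4.0",
}

if __name__ == "__main__":
    for principle, score in fair_completeness(legacy_record).items():
        print(f"{principle}: {score:.2f}")
    print("To fill in:", ", ".join(missing_fields(legacy_record)))
```

A check of this kind only measures whether metadata fields are present; judging whether a dataset is genuinely reusable still requires review of the values themselves.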
The Platform will facilitate efforts for data to be “born FAIR” through a user-friendly Agronomy Ontology, the Socioeconomic Ontology (SEOnt) and an improved Agronomy Field Information Management System (AgroFIMS). AgroFIMS allows agronomists to generate FAIR standards-compliant field books for data collection via any one of three freely available applications. Module One will also continue to improve the CGIAR Expert Finder that allows users to discover CGIAR’s research by location, activity, subject area, funder and more.

Module Two: CONVENE

The “Convene” module of the Platform will engage across CGIAR and our global networks of collaborators, beneficiaries, funders, and impact partners as we build collective actions to shape the future of digital agriculture, propelling the “One CGIAR” reform. In 2021 the Platform will hold virtual events exploring the partnerships, data, and innovations that will support and accelerate CGIAR contributions to the Impact Areas of the CGIAR 2030 strategy. The Platform will build on the momentum of the CGIAR Convention on Big Data in Agriculture, which is becoming a reference in the digital agriculture space for innovation strategy, technical agenda-setting, collective action, and showcasing the depth and breadth of CGIAR’s data science capabilities and its research and partner networks. The Convention has steadily increased fundraising, and 2021 will be a critical year for determining if this event can continue.

The Platform is a bridge to state-of-the-art innovation in industry, and exciting new partnerships will unfold in 2021. BIG DATA, Google X, Digital Green, Hewlett Packard Enterprise, and Yara will pilot solutions for responsible management and use of farm and farmer data for large-scale analytics and digital financial services.
The Platform will deepen its research collaborations with the French Digital Sciences Institute (INRIA) and Cambridge University into emerging topics at the intersection of Artificial Intelligence and agricultural research for development, to ensure that AI can be used to mitigate, rather than exacerbate, systemic risks in global food security.

BIG DATA technical Communities of Practice (CoP) will engage with ongoing initiatives to offer their substantial intellectual and social capital as needed, and highlight community-specific data, methods, tools, and analytic products with an eye to mainstreaming these across CGIAR through proofs of concept and partner engagement:

The Data-Driven Agronomy CoP will facilitate cross-regional and transdisciplinary learning on digitally enabled extension. In collaboration with the Policies, Institutions, and Markets Program, the CoP will define and develop a roadmap for the digital “extension agent of the future”. The effort will bridge elements of agricultural analytics, common digital agriculture tools and approaches, human-computer interaction, and socioeconomics, building on and contributing to digitally-enabled agricultural extension across CGIAR. The community plans to feature the effort in a series of webinars, discussion briefs, and research papers.

The Crop Modeling CoP will continue improving global coordination of crop modeling efforts. Through strategic collaborations with Purdue University, University of Florida and CGIAR centers, the CoP will close crop modeling knowledge and data gaps and make more data available for crop modeling purposes. To do this, the CoP will develop tools to classify environments, test remote-sensed data in crop simulations, and promote guidelines and recommendations for minimum datasets to effectively leverage crop models.
The CoP will also coordinate big data analysis through the International Wheat Improvement Network (IWIN) with CIMMYT, and work towards increasing digital data collection capacity with the Alliance of Bioversity-CIAT (Alliance). Several webinars and training workshops are planned for knowledge sharing, capacity and partnership building, and to collectively address current challenges.

The Ontologies CoP enhances CGIAR ontologies and vocabularies as valid reference ontologies that are becoming more widely adopted for research data annotation (e.g., Crop Ontology, Small Fisheries and Aquaculture Ontology). With the active participation of members from the public and private sector, the CoP will continue increasing the robustness of its products and connecting the results to standard open public semantic frameworks for agri-food data (including AGROVOC and the Food Ontology, among others). Through linked data tagging, this effort supports the cross-domain research needed to drive impacts under the CGIAR Research Strategy and enables complex data queries through GARDIAN and other data discovery platforms. The CoP will produce recommendations regarding the governance of the ontologies developed by the working groups and align activities with the needs of the One CGIAR Strategy.

The Socioeconomic Data CoP will implement a Socioeconomic Ontology with an Ontology Independent Metadata Schema to enhance the interoperability of messy socioeconomic data (together with the Ontologies CoP), develop partnerships to validate digital trust and transparency, and continue to support the informal group of CGIAR Internal Review Boards on the ethics of human subjects data collection under One CGIAR.

The Geospatial Data CoP will build the community’s technical capacity to develop crop analytics that contribute to in-season crop predictions (e.g., crop area planted, phenology, yield prediction) and build external alliances to accelerate their use.
The community will assess and propose actions to address cross-cutting geospatial analysis capabilities needed to implement the CGIAR research strategy, further develop a pan-CGIAR tool for guiding investments in climate adaptation for small agricultural producers, and collaborate with the Excellence in Breeding Platform on defining key operating procedures and data annotation and management practices for drone data.

The Information and Data Management CoP will continue tackling issues relating to enhancing the CG Core metadata standard, CGIAR’s data and publications repositories, and leveraging best practices towards Open, FAIR, and secure data assets. CoP working groups will collaborate to consistently implement a revised One CGIAR Policy tackling Open and FAIR data assets, and help address CGIAR-wide Data Management Maturity Assessment recommendations. These efforts will be supported through monthly meetings, by enhancing capacity to help researchers annotate and upload data assets to repositories, and via workshops to test and refine the Platform’s data management tools and services.

Module Three: INSPIRE

Module Three (“Inspire”) is the innovation component of the Platform, with a portfolio of twenty-one digital innovation projects linking CGIAR researchers with external partners through the Inspire Challenge. In 2020, the Platform awarded COVID-19 ‘Rapid Response Grants’ to established Inspire Challenge projects that proposed an additional, localized impact in terms of COVID response, recovery, and long-term resilience. The results of these projects will be known in early 2021. The 2020 Inspire Challenge selection process included additional selection criteria related to response, recovery, and resilience. In 2021 the final cohort of seven new start-up projects, winners of the 2020 Challenge, will be implemented, and three scale-up grant awardees will enter their final year.
The team will focus on showcasing existing projects to potential funders and strategic allies and on synthesis and learning from its portfolio and the five years of the Challenge. This synthesis will provide important learning for future digital innovation under One CGIAR. Without additional funding, no new innovation grants will be awarded in 2021. In previous years an array of partners (seed companies, agribusiness, multilateral lending banks, and development agencies) have expressed interest in potentially funding the Challenge. In 2021 the Platform will assess ways to enable this vital new digital innovation process to continue promoting the breadth and depth of CGIAR research, linking it to new partnerships and avenues to impact at scale under One CGIAR.

Program Management Unit: Collaboration with the GENDER Platform

BIG DATA is working to enable gender data to be leveraged to its full potential to improve our understanding of relationships between gender, agriculture, and rapidly digitizing economies and societies where CGIAR works. Working with the Generating Evidence and New Directions for Equitable Results (GENDER) Platform, we will advance the visibility of sex-disaggregated and gender-sensitive research by building researchers’ capacity for metadata tagging and searching. Gender researchers’ individual and organizational risks will be reduced through support for responsible and ethical management of sensitive data. Further, the GENDER Platform resource hub will source publications and data from GARDIAN.

In 2020 the Platform included more gender-sensitive selection criteria for Inspire Challenge agricultural research and development projects. In 2021, eleven projects in total (four Rapid Response, seven Inspire Challenge) will be reviewed at their one-year mark for insights related to gender and big data in agricultural innovation.
We aim to capture and share relevant learnings with the wider agricultural innovation sector. The Platforms will promote human-centered design (HCD) methodologies for the development of digital tools and services in CGIAR. In 2021, the Platform aims to synthesize the HCD research into an educational output and pilot it with real human-computer interface issues.

3 Financial Plan for the coming year, including use of W1/2

Module 1 ORGANIZE: Almost half of the total W1/W2 funds allocated to the Organize Module in 2021 will be disbursed to partners across CGIAR, and collaborators outside the System. Just over 15% is earmarked for enhancements to the GARDIAN data ecosystem.

Module 2 CONVENE: Module Two will continue to be implemented between PMU and the Communities of Practice. The CoPs will drive pilot/proofs of concept that demonstrate their research innovations in practice and seek to embed them in the One CGIAR structure, and the PMU will continue to build alliances that leverage the array of BIG DATA Platform offerings, particularly the CGLabs analytic environment and the GARDIAN ecosystem.

Module 3 INSPIRE: The Inspire Challenge, CGIAR’s signature digital innovation process, has begun to attract W3 funding (over US$500k to date) and has a growing portfolio of projects with an array of impactful stories. This presents an opportunity to build it into a new fundraising and partnership development mechanism in support of the evolving portfolio under One CGIAR. The Platform (similar to CRPs) was curtailed by one year as the One CGIAR process takes root, which means that the Challenge will need to shut down and make no new innovation grants unless it can be integrated with pan-CGIAR innovation efforts. All Inspire Challenge funds will be fully executed by the end of 2021.
Additional explanations for table 3 (optional):

TABLES

TABLE 2A. Planned Milestones

Columns: Mapped to Sub-IDO; Module; 2022 FP outcomes; Milestones; Indicate of the following (status vs. proposal); Means of verification; CGIAR cross-cutting markers for the milestone (for Gender, for Youth, for CapDev, for CC). Sub-IDOs marked CC are cross-cutting; {primary} marks the primary Sub-IDO.

M1 Outcome 1.1. A demand-driven analytics environment is available.

Mapped to Sub-IDO:
- {primary} CC Enhanced institutional capacity of partner research organizations
- CC Increased capacity for innovations in partner research organizations
- CC Improved forecasting of impacts of climate change and targeted technology development
- CC Increased capacity for innovation in partner development organizations and in poor and vulnerable communities

2021 - 1.1.6. 2021: At least two high-value data products created through CoP engagement and by leveraging GARDIAN, with analytics made available through Collaborative GARDIAN (CG) Labs.
  Status vs. proposal: Reworded/rephrased from proposal
  Means of verification: Data cleaning and processing scripts, and/or augmented datasets of relevance to CGIAR’s core mission openly available via GARDIAN data ecosystem.
  Markers (Gender / Youth / CapDev / CC): 1 / N/A / 2 / 1

2021 - 1.1.7. 2021: At least one cloud-based machine learning application developed in conjunction with BIG DATA CoPs and scientists from and beyond CGIAR.
  Status vs. proposal: Reworded/rephrased from proposal
  Means of verification: Cloud-based machine learning application/s of relevance to CGIAR scientists openly available via GARDIAN data ecosystem.
  Markers (Gender / Youth / CapDev / CC): 1 / N/A / 2 / 1

2021 - 1.1.8. 2021: At least two new services/scripts that can be used to clean and process GARDIAN data.
  Status vs. proposal: Reworded/rephrased from proposal
  Means of verification: Services/scripts to clean, process, analyze, and/or more easily derive insight from available data assets openly available via the GARDIAN data ecosystem.
  Markers (Gender / Youth / CapDev / CC): 1 / N/A / 2 / 1

2021 - 1.1.9. 2021: At least one data-to-model pipeline for the generation of model-ready data, tested and available for wider use via CG Labs.
  Status vs. proposal: Reworded/rephrased from proposal
  Means of verification: Data-to-model translators available via CG Labs to enable decision support by easing the ability to leverage crop simulation models.
  Markers (Gender / Youth / CapDev / CC): N/A / N/A / 2 / N/A

2021 - 1.1.10. 2021: Data science approaches for innovation in agricultural research and wider use of CG Labs enabled.
  Status vs. proposal: New/changed
  Means of verification: Enhanced data science capacity and creation of high-value data products.
  Markers (Gender / Youth / CapDev / CC): 1 / 1 / 2 / 1

M1 Outcome 1.2. CGIAR resources are discoverable and reused.

Mapped to Sub-IDO:
- {primary} CC Enhanced institutional capacity of partner research organizations
- CC Increased capacity for innovations in partner research organizations
- CC Increase capacity of beneficiaries to adopt research outputs
- CC Increased capacity for innovation in partner development organizations and in poor and vulnerable communities
- CC Improved capacity of women and young people to participate in decision-making

2021 - 1.2.9. 2021: GARDIAN includes data assets from at least two new partner repositories beyond CGIAR.
  Status vs. proposal: New/changed
  Means of verification: At least 200 new datasets discoverable via GARDIAN.
  Markers (Gender / Youth / CapDev / CC): 1 / 1 / 2 / 1

2021 - 1.2.10: 2021: At least 50 datasets scoring at least 4/5 for all FAIR indicators annotated using GARDIAN’s FAIR workflow and PII checker.
  Status vs. proposal: New/changed
  Means of verification: At least 50 datasets with high FAIR scores discoverable via GARDIAN.
  Markers (Gender / Youth / CapDev / CC): 1 / 1 / 2 / 1

2021 - 1.2.11: 2021: Early prototype of a smart data query and aggregation tool developed to mine GARDIAN’s data pool and enable easy aggregation of relevant datasets.
  Status vs. proposal: New/changed
  Means of verification: Alpha version of a tool that allows easier dataset aggregation available via GARDIAN.
  Markers (Gender / Youth / CapDev / CC): 1 / 1 / 2 / 1

2021 - 1.2.12: 2021: Early prototype of pan-CGIAR “Expert Finder” developed in consultation with early-adopter Centers.
  Status vs. proposal: New/changed
  Means of verification: Alpha version of a tool that allows users to discover CGIAR’s research by location, activity, Center etc. made openly available.
  Markers (Gender / Youth / CapDev / CC): 1 / 1 / 2 / 1

2021 - 1.2.13.: 2021: GARDIAN interface and data exploration enhancements to enable easier leveraging of data assets and data management tools and services.
  Status vs. proposal: New/changed
  Means of verification: New GARDIAN user interface, providing enhanced data exploration and download, and access to data management tools and services.
  Markers (Gender / Youth / CapDev / CC): 1 / 1 / 2 / 1

M1 Outcome 1.3. Standards and semantics are utilized to enable FAIR (Findable, Accessible, Interoperable and Reusable) agricultural data.

Mapped to Sub-IDO:
- CC Increased capacity for innovations in partner research organizations
- CC Increased capacity for innovation in partner development organizations and in poor and vulnerable communities
- {primary} CC Increase capacity of beneficiaries to adopt research outputs
- CC Improved forecasting of impacts of climate change and targeted technology development

2021 - 1.3.11: 2021: User-friendly workflows and tools, training, and documentation for easy refining and implementation of the CG Core Metadata Schema v.2.0 and ontologies across Center publications and data repositories and GARDIAN.
  Status vs. proposal: Reworded/rephrased from proposal
  Means of verification:
    1. At least three trainings on best practices in data management, including data annotation with CG Core metadata elements and ontology concepts.
    2. User-driven enhancements to data annotation workflows.
    3. Data prioritization framework available via GARDIAN to help users identify high-value legacy datasets for FAIRification.
  Markers (Gender / Youth / CapDev / CC): 1 / N/A / 2 / N/A

2021 - 1.3.12: 2021: Further development of AgroFIMS to pilot: Easy generation of field books and digital data collection from “non-traditional” multi-locational agronomic survey and demo trials; use of AgroFIMS with the Open Data Kit (ODK) platform, in addition to the KDSmart app currently enabled.
  Status vs. proposal: New/changed
  Means of verification:
    1. Beta version of AgroFIMS tested for data collection from agronomic surveys via KDSmart and other apps (e.g. ODK).
    2. Further use-driven enhancements to the Agronomy Ontology (e.g. for data dictionary-type applications) openly available for wide use.
  Markers (Gender / Youth / CapDev / CC): N/A / N/A / 2 / 1

2021 - 1.3.13. 2021: Agronomy Ontology optimized for AgroFIMS trial and survey functionalities, and easier to interact with for novice users.
  Status vs. proposal: New/changed
  Means of verification: Enhanced ontology available via GitHub and indexed by Ontology Lookup Service.
  Markers (Gender / Youth / CapDev / CC): N/A / N/A / 2 / 1

M1 Outcome 1.4. Enhance capacity, catalyze culture change to further CGIAR OA/OD compliance and public goods mandate.

Mapped to Sub-IDO:
- {primary} CC Enhanced institutional capacity of partner research organizations
- CC Increased capacity for innovations in partner research organizations
- CC Increased capacity for innovation in partner development organizations and in poor and vulnerable communities
- CC Enhanced individual capacity in partner research organizations through training and exchange

2021 - 1.4.7: 2021: Course materials and webinars for researchers and data managers on best practices for data management and maximizing FAIRness of CGIAR resources, with a focus on gender aspects.
  Status vs. proposal: New/changed
  Means of verification:
    1. At least two trainings on best practices in data management towards open and FAIR outcomes, with a focus on enabling better gender disaggregated data.
    2. Course on FAIR Data Management (“Best Practices for Open, FAIR, and Ethical Data”) offered at least once.
  Markers (Gender / Youth / CapDev / CC): 2 / N/A / 2 / N/A

2021 - 1.4.8: 2021: At least two workshops and/or trainings for data and information managers and researchers on ways to render datasets FAIR, including through the use of standards-compliant field books for data collection.
  Status vs. proposal: New/changed
  Means of verification:
    1. Documentation and videos to enable collection of FAIR agronomic data, and to improve FAIRness of legacy data.
    2. At least two workshops/trainings for researchers and data managers on the use of standards-compliant field books to generate FAIR data.
  Markers (Gender / Youth / CapDev / CC): N/A / N/A / 2 / N/A

M2 Outcome 2.1. CGIAR is more broadly engaged in the big data community.

Mapped to Sub-IDO:
- CC Enhanced institutional capacity of partner research organizations
- CC Enhanced individual capacity in partner research organizations through training and exchange
- {primary} CC Increased capacity for innovations in partner research organizations

2021 - 2.1.8: 2021: CoPs around geospatial data, socioeconomic data, ontologies, data-driven agronomy, information and data management, livestock data, and crop modeling establish CoP networks across CGIAR and produce outputs addressing key constraints in data and analytics.
  Status vs. proposal: Reworded/rephrased from proposal
  Means of verification: 2020 Flagship products from CoPs released and applied in 2021. 2021 Products from CoPs (shareable, harmonized data across disciplines, position papers, small pilots).
  Markers (Gender / Youth / CapDev / CC): 1 / 1 / 2 / N/A

2021 - 2.1.9: 2021: New CoP on Information and Data Management incorporated into BIG DATA governance.
  Status vs. proposal: Reworded/rephrased from proposal
  Means of verification: Terms of reference for Information and Data Management CoP approved and adopted by BIG DATA Platform governance bodies. Updated CGIAR standards for disaggregation and discovery of gender and youth data.
  Markers (Gender / Youth / CapDev / CC): 1 / 1 / 2 / N/A

2021 - 2.1.10: 2021: Hold high-level Convention on Big Data in Agriculture, with wide participation of CGIAR and non-CGIAR actors. Establishment of collaborative agreements.
  Status vs. proposal: Reworded/rephrased from proposal
  Means of verification: 2021 Virtual convention events, convention reports, and communications materials.
  Markers (Gender / Youth / CapDev / CC): 1 / 1 / 1 / 1

2021 - 2.1.11: 2021: Develop a pre-competitive, pro-commercial, multi-stakeholder alliance for food security research, data sharing, and technology matchmaking and transfer.
  Status vs. proposal: New/changed
  Means of verification: Alliance Memorandum of Understanding.
  Markers (Gender / Youth / CapDev / CC): N/A / N/A / 1 / N/A

M2 Outcome 2.2. CGIAR increases its capacity to work on priority topics more quickly, more effectively and at greater scale.

Mapped to Sub-IDO:
- CC Enhanced institutional capacity of partner research organizations
- {primary} CC Enhanced individual capacity in partner research organizations through training and exchange
- CC Increased capacity for innovations in partner research organizations

2021 - 2.2.4: 2021: New, high-frequency computational methods applied to sustainability analysis leveraging CGIAR and partner data.
  Status vs. proposal: New/changed
  Means of verification: New analytic products of high-frequency computational methods applied to sustainability analysis, leveraging CGIAR and partner data.
  Markers (Gender / Youth / CapDev / CC): 0 / 0 / 2 / 0

M2 Outcome 2.3. CGIAR develops as a learning organization.

Mapped to Sub-IDO:
- {primary} CC Enhanced institutional capacity of partner research organizations
- CC Increased capacity for innovations in partner research organizations

2021 - 2.3.13: 2021: Implement an aligned, pan-CGIAR action plan for information technology tools, processes, and infrastructure.
  Status vs. proposal: New/changed
  Means of verification: Launch CGIAR Digital Strategy in Q2.
  Markers (Gender / Youth / CapDev / CC): 1 / 1 / 2 / 1

2021 - 2.3.14: 2021: Establish or leverage shared big data services to further pan-CGIAR collective action on priority research themes (especially cloud services enhancing pan-CGIAR research efforts including on climate adaptation and ecosystem services).
  Status vs. proposal: Reworded/rephrased from proposal
  Means of verification: At least two high-value shared services investments that expand our capacity to work with large datasets.
  Markers (Gender / Youth / CapDev / CC): N/A / N/A / 1 / N/A

2021 - 2.3.15: 2021: Develop capacity building activities linked to high demand data science skills.
  Status vs. proposal: Reworded/rephrased from proposal
  Means of verification: At least two workshops (online and in person) to build CGIAR big data capacity. Pilot online data science academy for CGIAR researchers.
  Markers (Gender / Youth / CapDev / CC): N/A / N/A / 2 / N/A

M3 Outcome 3.1. CGIAR shows how data-driven approaches yield results in poverty reduction, enhanced nutrition or environmental benefits.

Mapped to Sub-IDO:
- CC Improved forecasting of impacts of climate change and targeted technology development
- Optimized consumption of diverse nutrient-rich foods
- {primary} Increased resilience of agro-ecosystems and communities, especially those including smallholders

2020 extended to 2021 - 3.1.6. Synthesis of Inspire project successes and failures in 2019; best practice guidance provided.
  Status vs. proposal: Identical to proposal
  Means of verification: Synthesis report examining the Inspire Challenge in light of digital innovation strategy.
  Markers (Gender / Youth / CapDev / CC): 0 / 0 / 1 / 0

2021 - 3.1.11: 2021: Implement up to seven start up Inspire projects (awarded in 2020) and up to four scale up projects awarded in 2019.
  Status vs. proposal: Identical to proposal
  Means of verification: 2020/2021 Cohort project reports; completed selection process and awards for up to four new Inspire and four scale-up Inspire projects.
  Markers (Gender / Youth / CapDev / CC): 1 / 1 / 1 / 1

2021 - 3.1.12: 2021: Synthesis of Inspire project successes and failures over the course of 2017-2021, policy documents, best-practice guidance.
  Status vs. proposal: Identical to proposal
  Means of verification: Synthesis report examining the Inspire Challenge in light of digital innovation strategy.
  Markers (Gender / Youth / CapDev / CC): N/A / N/A / 2 / N/A

TABLE 2B. Planned evaluations/reviews, impact assessments and learning exercises

Columns: Platform; Module; Status; Planned studies/learning exercises in the coming year; Geographic scope; Who is commissioning this study.

BigData; M3; On Going
  Planned study: End-Project Report consolidating project successes, failures and lessons learnt from the prototyping of Croppie
  Geographic scope: National, Regional, Peru
  Commissioned by: Alliance of Bioversity and CIAT

BigData; M1,M2; On Going
  Planned study: Meta-analysis of digital agriculture from clearinghouse
  Geographic scope: Global
  Commissioned by: Alliance of Bioversity and CIAT

BigData; M1,M2; On Going
  Planned study: Digital strategy research
  Geographic scope: Global
  Commissioned by: Alliance of Bioversity and CIAT

TABLE 2C. Planned major new collaborations

Columns: Name of Platform/CRP or non-CGIAR collaborator; Brief description of collaboration and value added.

USDA - U.S. Department of Agriculture: Collaborate to address issues relating to crop interoperability.
SCiO - Big Data in Food Systems: Collaborate to enhance the GARDIAN data ecosystem.
ICARDA: Collaboration focused on enabling best practices towards open and FAIR(ER) data through the TRANSFORM module of the Excellence in Agronomy initiative.
CIMMYT: Collaboration focused on enabling best practices towards open and FAIR(ER) data through the TRANSFORM module of the Excellence in Agronomy initiative.
IRRI: Collaboration focused on enabling best practices towards open and FAIR(ER) data through the TRANSFORM module of the Excellence in Agronomy initiative.
IITA: Collaboration focused on enabling best practices towards open and FAIR(ER) data through the TRANSFORM module of the Excellence in Agronomy initiative.
ICRISAT: Collaboration focused on enabling best practices towards open and FAIR(ER) data through the TRANSFORM module of the Excellence in Agronomy initiative.
CIAT: Collaboration focused on enabling best practices towards open and FAIR(ER) data through the TRANSFORM module of the Excellence in Agronomy initiative.
| | Collaboration focused on enabling best practices towards open and FAIR(ER) AfricaRice data through the TRANSFORM module of the Excellence in Agronomy initiative. |
| FAO - Food and Agriculture Organization of the United Nations | Collaboration to: (1) enable GARDIAN data assets to be discoverable via FAO-AGRIS; (2) address issues relating to crop interoperability. |
| The World Bank | Work with the World Bank data management team to enable wider testing, refinement, and use of GARDIAN FAIR data workflow and associated data management resources. |
| UF - University of Florida | Collaboration on building model-ready pipelines into Collaborative GARDIAN (CG) Labs. |
| EMBRAPA - Empresa Brasileira de Pesquisa Agropecuária | Collaborate to address issues relating to crop interoperability. |
| Accenture | Assist in the finalization of an appropriate digital strategy for the CGIAR and identify key priorities for the BIG DATA Platform. |
| CAM - University of Cambridge | The Centre for the Study of Existential Risk and the BIG DATA Platform are conducting horizon scanning research to stay abreast of emerging topics in digital agriculture. |
| CIMMYT | Leading two Big Data Platform Communities of Practice (Socioeconomic data and crop modeling) and participating actively in the Geospatial community of practice. |
| University of Edinburgh | Leading the Livestock Data for Decisionmaking Community of Practice, an externally hosted CoP of the BIG DATA Platform. |
| ILRI | BIG DATA will join an ILRI committee guiding information infrastructure and governance. ILRI is also a key partner in the Information and Data Managers Community of Practice that is joining Big Data. |
| X - the Moonshot Factory | Collaboration on model development for large-scale, dynamic crop modeling |
| UC Davis - University of California, Davis | Inspire winner 2020: “Citizen-H2D3: Pilot in Rwanda”. UC-Davis will lead activities in data analysis. |
| CSIRO - Commonwealth Scientific and Industrial Research Organisation | Inspire winner 2020: “Citizen-H2D3: Pilot in Rwanda”. CSIRO will lead activities in data collection and analysis. All partners will contribute to project and results dissemination. |
| RECONCILE - Resource Conflict Institute | Inspire winner 2020: “Big data in resilience of rangeland communities”. Partners in Kenya and Kyrgyzstan will support the piloting. |
| GMV - GMV | Inspire winner 2020: “Big data in resilience of rangeland communities”. GMV is responsible for the technical aspects of the data platform development. |
| KyRICH - Kyrgyz Research Institute of Crop Husbandry | Inspire winner 2020: “Big data in resilience of rangeland communities”. Partners in Kenya and Kyrgyzstan will support the piloting. |
| WOCAT - World Overview of Conservation Approaches and Technologies | Inspire winner 2020: “Big data in resilience of rangeland communities”. Partners in Kenya and Kyrgyzstan will support the piloting. |
| NOAA - National Oceanic and Atmospheric Administration (United States) | Inspire winner 2020: “The ClimaCell Locust Project”. NOAA will serve as data partner. |
| IFDC - International Fertilizer Development Center | Inspire winner 2020: “N-ALLyzer: From Nitrogen to ALL other nutrients”. IFDC will contribute with existing and new relevant data on yield and nutrient uptake under a wide range of environmental and management conditions; AI, and modeling. |
| OptionLine - Optionline LLC | Inspire winner 2020: “N-ALLyzer: From Nitrogen to ALL other nutrients”. Optionline: help with a combined approach of Machine Learning and Artificial Intelligence (AI), App development and greenhouse trials. |
| Producers Direct | Inspire winner 2020: “Croppie - the PhotoCropping app”. Producers Direct will manage the project; facilitate pilot testing activities with smallholders through in-country teams and farmer / youth networks in Peru and Uganda; develop the prototype app; support for training data collection. |
| tec - Tecnológico de Costa Rica | Inspire winner 2020: “Hola Talia - Boosting extension service through AI”. Instituto Tecnológico de Costa Rica will provide expertise on artificial intelligence. |
| ALIN Africa - Arid Landscape Initiative | Inspire winner 2020: “Big data in resilience of rangeland communities”. Partners in Kenya and Kyrgyzstan will support the piloting. |
| ClimaCell.org | Inspire winner 2020: “The ClimaCell Locust Project”. ClimaCell.org will be coordinating and project managing the activities, and ensuring the voices of the vulnerable are represented in the product design and strategy. ClimaCell is ClimaCell.org’s technological partner. |
| HPE - Hewlett Packard Enterprise | Collaboration on data fabric solutions and supercomputing for agroclimatic modeling |
| PureScan AI - PureScan AI | Inspire winner 2020: “Rapid, Low-Cost Aflatoxin detection using AI”. PureScan AI will support: product development; data transparent platform/application development; on-ground implementation, industry connects. |
| IDEO.org | Inspire winner 2020: “Croppie - the PhotoCropping app”. IDEO.org will support user-experience design, including gamification and use incentives. |
| RAB - Rwanda Agriculture and Animal Resources Development Board | Inspire winner 2020: “Citizen-H2D3: Pilot in Rwanda”. RAB will support project management, data collection, and data analysis. |
| NCAR - National Center for Atmospheric Research | Inspire winner 2020: “The ClimaCell Locust Project”. NCAR will serve as data partner. |
| CORBANA - Corporación Bananera Nacional | Inspire winner 2020: “Hola Talia - Boosting extension service through AI”. CORBANA and MAG will provide in-depth agronomic knowledge and data sources for the project, as well as facilitating the testing of a live prototype with Costa Rican banana farmers. |
| MAG - Ministerio de Agricultura y Ganadería (Costa Rica) | Inspire winner 2020: “Hola Talia - Boosting extension service through AI”. CORBANA and MAG will provide in-depth agronomic knowledge and data sources for the project, as well as facilitating the testing of a live prototype with Costa Rican banana farmers. |
| Viamo | Inspire winner 2020: “Citizen-H2D3: Pilot in Rwanda”. VIAMO will lead activities in data collection. |
| Accenture | Assist in the finalization of an appropriate digital strategy for the CGIAR and identify key priorities for the Big Data Platform |

TABLE 3. Planned Budget

| PLANNED BUDGET | W1/W2 2020 Carryover | W1/W2 2021 Budget | W3/Bilateral | Center Own fund | Total | COMMENTS ON MAJOR CHANGES |
| M1 | $730,827.00 | $885,600.00 | $90,000.00 | $0.00 | $1,706,427.00 | |
| M2 | $878,967.00 | $1,054,400.00 | $0.00 | $0.00 | $1,933,367.00 | |
| M3 | $519,169.00 | $550,000.00 | $400,000.00 | $0.00 | $1,469,169.00 | |
| CRP Management & Support Cost | $83,495.00 | $600,000.00 | $80,000.00 | $0.00 | $763,495.00 | |
| Strategic Competitive Research grant | $0.00 | $0.00 | $0.00 | $0.00 | $0.00 | |
| Platform Total | $2,212,458.00 | $3,090,000.00 | $570,000.00 | $0.00 | $5,872,458.00 | |

TABLE 4.
ESTIMATED 2020 CARRYOVER & 2021 BUDGET TABLE

| BUDGET 2021 (W1/W2) | Carryover from 2020 | 2021 Budget | Total 2021 Budget | Comments on Major changes |
| Personnel | $1,134,398 | $1,639,458 | $2,773,856 | |
| Consultancy | $350,000 | $231,909 | $581,909 | |
| Travel | $137,000 | $128,988 | $265,988 | |
| Operational Expenses | $579,000 | $840,498 | $1,419,498 | |
| Collaborators & Partnerships | $0 | $0 | $0 | |
| Capital & Equipment | $0 | $0 | $0 | |
| Indirect costs | $12,060 | $249,147 | $261,206 | |
| Platform Total Budget | $2,212,458 | $3,090,000 | $5,302,458 | |
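The arithmetic in Table 4 can be cross-checked with a short script. The sketch below is illustrative only and is not part of the Platform's budgeting process: the figures are transcribed from the table in whole US dollars, the column order (carryover from 2020 first, then 2021 budget) is an assumption read from the table header, and the names `TABLE_4`, `column_sums` and `row_discrepancies` are hypothetical.

```python
# Figures transcribed from Table 4 (W1/W2 funds), in whole US dollars.
# Assumed column order: (carryover from 2020, 2021 budget, stated total).
TABLE_4 = {
    "Personnel": (1_134_398, 1_639_458, 2_773_856),
    "Consultancy": (350_000, 231_909, 581_909),
    "Travel": (137_000, 128_988, 265_988),
    "Operational Expenses": (579_000, 840_498, 1_419_498),
    "Collaborators & Partnerships": (0, 0, 0),
    "Capital & Equipment": (0, 0, 0),
    "Indirect costs": (12_060, 249_147, 261_206),
}

# "Platform Total Budget" row as printed in the table.
STATED_PLATFORM_TOTAL = (2_212_458, 3_090_000, 5_302_458)


def column_sums(table):
    """Sum each of the three numeric columns over all cost categories."""
    return tuple(sum(row[i] for row in table.values()) for i in range(3))


def row_discrepancies(table, tolerance=1):
    """List categories whose stated total differs from the sum of the
    first two columns by more than `tolerance` dollars."""
    return [
        name
        for name, (first, second, stated) in table.items()
        if abs(first + second - stated) > tolerance
    ]


if __name__ == "__main__":
    print("Column sums:     ", column_sums(TABLE_4))
    print("Stated totals:   ", STATED_PLATFORM_TOTAL)
    print("Rows off by > $1:", row_discrepancies(TABLE_4))
    print("Rows off by > $0:", row_discrepancies(TABLE_4, tolerance=0))
```

With a one-dollar tolerance every row reconciles. With zero tolerance only the indirect costs row is flagged, where the two components sum to one dollar more than the stated total; this is presumably a rounding effect from cents omitted in the table.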