{"ymdb_id":"YMDB00579","created_at":"2011-05-29T18:32:46.000Z","updated_at":"2016-09-08T18:35:39.000Z","name":"7,8-Dihydroneopterin","cas":"1218-98-0","state":"Solid","melting_point":"","description":"7,8-Dihydroneopterin is the precursor to 6-hydroxymethyl-7,8-dihydropterin in the 6-hydroxymethyl-dihydropterin diphosphate biosynthesis pathway. 6-Hydroxymethyl-dihydropterin diphosphate is the pterin precursor for the biosynthesis of several important cofactors, including tetrahydrofolate, methanopterin and sarcinapterin. [Biocyc PWY-6147]","experimental_water_solubility":"","experimental_logp_hydrophobicity":"","location":"mitochondrion;cytoplasm","synthesis_reference":null,"chebi_id":"17001","hmdb_id":"HMDB02275","kegg_id":"C04874","pubchem_id":"659","cs_id":"24816137","foodb_id":null,"wikipedia_link":null,"biocyc_id":"DIHYDRONEOPTERIN-P3","iupac":"2-amino-6-[(1S,2R)-1,2,3-trihydroxypropyl]-1,4,7,8-tetrahydropteridin-4-one","traditional_iupac":"2-amino-6-[(1S,2R)-1,2,3-trihydroxypropyl]-7,8-dihydro-1H-pteridin-4-one","logp":"-3.3354137886666666","pka":"13.244737730644115","alogps_solubility":"1.86e+00 
g/l","alogps_logp":"-2.10","alogps_logs":"-2.14","acceptor_count":"9","donor_count":"6","rotatable_bond_count":"3","polar_surface_area":"152.56000000000003","refractivity":"69.657","polarizability":"24.004282973536625","formal_charge":"0","physiological_charge":"0","pka_strongest_basic":"0.2826065573151887","pka_strongest_acidic":"9.371218174035606","bioavailability":"1","number_of_rings":"2","rule_of_five":"0","ghose_filter":"0","veber_rule":"0","mddr_like_rule":"0","synonyms":["2-amino-4-hydroxy-6-(D-erythro-1,2,3-trihydroxypropyl)-7,8-dihydropteridine","2-Amino-4-hydroxy-6-(D-erythro-1',2',3'-trihydroxypropyl)-7,8-dihydropteridine","2-amino-6-[(1S,2R)-1,2,3-trihydroxypropyl]-7,8-dihydropteridin-4(1H)-one","2-amino-6-[(1S,2R)-1,2,3-trihydroxypropyl]-7,8-dihydropteridin-4(3H)-one","2-Amino-7,8-dihydro-6-(1,2,3-trihydroxypropyl)-4(1H)-pteridinone","7,8-Dihydro-D-erythro-neopterin","7,8-Dihydro-D-neopterin","7,8-Dihydroneopterin","D-erythro-7,8-Dihydroneopterin","Dihydroneopterin","NPR"],"pathways":[{"name":"Folate biosynthesis","kegg_map_id":"00790"},{"name":"tetrahydrofolate biosynthesis","kegg_map_id":null}],"growth_conditions":[],"references":[{"pubmed_id":21051339,"citation":"UniProt Consortium (2011). \"Ongoing and future developments at the Universal Protein Resource.\" Nucleic Acids Res 39:D214-D219."},{"pubmed_id":18846089,"citation":"Herrgard, M. J., Swainston, N., Dobson, P., Dunn, W. B., Arga, K. Y., Arvas, M., Bluthgen, N., Borger, S., Costenoble, R., Heinemann, M., Hucka, M., Le Novere, N., Li, P., Liebermeister, W., Mo, M. L., Oliveira, A. P., Petranovic, D., Pettifer, S., Simeonidis, E., Smallbone, K., Spasic, I., Weichart, D., Brent, R., Broomhead, D. S., Westerhoff, H. V., Kirdar, B., Penttila, M., Klipp, E., Palsson, B. O., Sauer, U., Oliver, S. G., Mendes, P., Nielsen, J., Kell, D. B. (2008). 
\"A consensus yeast metabolic network reconstruction obtained from a community approach to systems biology.\" Nat Biotechnol 26:1155-1160."}],"proteins":[{"created_at":"2011-05-26T23:22:21.000Z","updated_at":"2011-07-22T17:54:09.000Z","name":"Repressible alkaline phosphatase","uniprot_id":"P11491","uniprot_name":"PPB_YEAST","enzyme":true,"transporter":false,"gene_name":"PHO8","num_residues":566,"molecular_weight":"63003.60156","theoretical_pi":"5.33","general_function":"Involved in catalytic activity","specific_function":"A phosphate monoester + H(2)O = an alcohol + phosphate","reactions":[{"id":1309,"direction":"\u003e","locations":"cytoplasm","altext":null,"export":true,"pw_reaction_id":null,"source":null},{"id":2499,"direction":"\u003e","locations":"Cytoplasmic;Cytoplasm; Single-pass membrane protein (Potential).;Secreted;Vacuole membrane","altext":"A phosphate monoester + H(2)O = an alcohol + phosphate.","export":false,"pw_reaction_id":null,"source":null},{"id":2613,"direction":"\u003e","locations":"Vacuole membrane; Single-pass membrane protein (Potential).","altext":"(2E,6E)-farnesyl diphosphate + H(2)O = (2E,6E)-farnesol + diphosphate.","export":false,"pw_reaction_id":null,"source":null},{"id":2614,"direction":"\u003e","locations":"Vacuole membrane; Single-pass membrane protein (Potential).","altext":"beta-D-fructose 2,6-bisphosphate + H2O = beta-D-fructofuranose 2-phosphate + phosphate","export":false,"pw_reaction_id":null,"source":null}],"signal_regions":"None","transmembrane_regions":"34-59","pdb_id":null,"cellular_location":"Vacuole membrane; Single-pass membrane protein (Potential).","genbank_gene_id":"M21134","genbank_protein_id":"172164","gene_card_id":"PHO8","chromosome_location":"chromosome 4","locus":"YDR481C","synonyms":[],"enzyme_classes":["3.1.3.1","3.1.7.6","3.1.3.54"],"go_classes":[{"category":"Component","description":" Not Available"},{"category":"Function","description":" catalytic activity"},{"category":"Function","description":" hydrolase 
activity"},{"category":"Function","description":" hydrolase activity, acting on ester bonds"},{"category":"Function","description":" phosphoric ester hydrolase activity"},{"category":"Function","description":" phosphatase activity"},{"category":"Process","description":" metabolic process"}],"pfams":[{"name":"Alk_phosphatase","identifier":"PF00245"}],"pathways":[{"name":"Folate biosynthesis","kegg_map_id":"00790"}],"gene_sequence":"ATGATGACTCACAGATTACCAAGCGAACAGACACGTCTTGTTCCTGGATCTGACTCGAGCTCTCGTCCTAAAAAGAGACGGATCTCGAAGAGATCGAAGATAATAGTATCCACTGTGGTCTGTATTGGTTTGTTGCTGGTTTTAGTGCAACTGGCATTTCCAACCAGTTTTGCTATACGTTCTGCATCACACAAGAAGAAGAATGTCATATTCTTCGTGACGGATGGAATGGGACCTGCGTCTTTGTCCATGGCAAGATCATTCAACCAGCACGTTAATGATCTACCTATAGACGACATCCTGACACTTGACGAGCATTTTATCGGGTCTTCGAGAACAAGGTCCTCGGACTCACTAGTCACTGACTCAGCTGCGGGCGCCACCGCATTTGCATCTGCATTAAAGAGCTATAATGGGGCCATCGGCGTTGATCCGCATCACAGACCTTGTGGTACAGTGCTCGAGGCGGCTAAGTTAGCAGGTTATCTCACTGGGCTTGTTGTTACGACAAGAATCACAGATGCCACTCCGGCTTCATTCAGTTCACATGTTGATTACAGATGGCAGGAAGATCTTATTGCTACGCACCAGTTAGGCGAATATCCTCTGGGGAGAGTAGTAGACTTACTCATGGGTGGCGGCAGGAGCCATTTCTACCCTCAAGGCGAAAAGGCTTCTCCATATGGTCACCACGGTGCTAGAAAAGATGGAAGAGATTTAATCGATGAGGCTCAATCTAATGGCTGGCAGTACGTGGGTGACCGTAAAAATTTTGATTCTTTGTTGAAAAGCCACGGTGAAAACGTTACTTTTCCTTTCTTGGGTCTCTTTGCAGATAACGACATTCCTTTTGAAATTGACAGAGATGAAAAGGAGTATCCTTCTCTTAAGGAACAAGTCAAGGTGGCTTTGGGTGCCTTGGAAAAAGCTTCCAATGAGGACAAAGATTCTAATGGGTTTTTCTTAATGGTGGAAGGCTCCAGAATTGACCACGCTGGTCACCAAAACGATCCAGCTTCACAGGTAAGGGAGGTGTTGGCTTTCGATGAAGCATTCCAATACGTTTTGGAGTTTGCTGAAAATTCCGATACGGAGACTGTTTTGGTCTCTACGTCGGATCATGAAACGGGTGGTTTAGTCACTTCAAGACAAGTCACTGCTAGCTATCCTCAGTATGTCTGGTATCCACAGGTGCTTGCCAACGCCACCCATTCAGGTGAGTTTTTGAAAAGAAAACTAGTAGATTTTGTTCATGAACATAAGGGCGCAAGCAGCAAAATAGAAAATTTCATCAAACATGAGATCCTGGAAAAGGATTTGGGCATTTACGATTACACGGAAAGTGATTTAGAAACATTAATTCATTTGGATGACAACGCAAATGCTATCCAAGACAAACTGAATGACATGGTATCATTTAGGGCCCAAATTGGCTGGACCACGCATGGTCACAGCGCAGTCGACGTCAACATATATGCATACGCCAATAAGAAAGCTACATGGTCATATGTCTTGAACAATTTACAAGGCAATCACGAAAATACGGAAGTTGGCCAGTTCTTAGA
AAACTTTTTAGAATTGAACCTGAACGAAGTCACTGATTTGATTAGGGACACCAAACACACATCAGATTTTGATGCTACAGAAATAGCTAGTGAAGTACAACATTATGACGAATACTACCATGAGTTGACCAACTGA","protein_sequence":"MMTHTLPSEQTRLVPGSDSSSRPKKRRISKRSKIIVSTVVCIGLLLVLVQLAFPSSFALRSASHKKKNVIFFVTDGMGPASLSMARSFNQHVNDLPIDDILTLDEHFIGSSRTRSSDSLVTDSAAGATAFACALKSYNGAIGVDPHHRPCGTVLEAAKLAGYLTGLVVTTRITDATPASFSSHVDYRWQEDLIATHQLGEYPLGRVVDLLMGGGRSHFYPQGEKASPYGHHGARKDGRDLIDEAQSNGWQYVGDRKNFDSLLKSHGENVTLPFLGLFADNDIPFEIDRDEKEYPSLKEQVKVALGALEKASNEDKDSNGFFLMVEGSRIDHAGHQNDPASQVREVLAFDEAFQYVLEFAENSDTETVLVSTSDHETGGLVTSRQVTASYPQYVWYPQVLANATHSGEFLKRKLVDFVHEHKGASSKIENFIKHEILEKDLGIYDYTDSDLETLIHLDDNANAIQDKLNDMVSFRAQIGWTTHGHSAVDVNIYAYANKKATWSYVLNNLQGNHENTEVGQFLENFLELNLNEVTDLIRDTKHTSDFDATEIASEVQHYDEYYHELTN"},{"created_at":"2011-05-27T09:05:09.000Z","updated_at":"2011-07-22T17:54:37.000Z","name":"ATPase GET3","uniprot_id":"Q12154","uniprot_name":"GET3_YEAST","enzyme":true,"transporter":false,"gene_name":"GET3","num_residues":354,"molecular_weight":"39353.39844","theoretical_pi":"4.61","general_function":"Inorganic ion transport and metabolism","specific_function":"ATPase required for the post-translational delivery of tail-anchored (TA) proteins to the endoplasmic reticulum. Recognizes and selectively binds the transmembrane domain of TA proteins in the cytosol. This complex then targets to the endoplasmic reticulum by membrane-bound receptors GET1 and GET2, where the tail-anchored protein is released for insertion. This process is regulated by ATP binding and hydrolysis. ATP binding drives the homodimer towards the closed dimer state, facilitating recognition of newly synthesized TA membrane proteins. ATP hydrolysis is required for insertion. Subsequently, the homodimer reverts towards the open dimer state, lowering its affinity for the GET1-GET2 receptor, and returning it to the cytosol to initiate a new round of targeting. 
Cooperates with the HDEL receptor ERD2 to mediate the ATP-dependent retrieval of resident ER proteins that contain a C-terminal H-D-E-L retention signal from the Golgi to the ER. Involved in low-level resistance to the oxyanions arsenite and arsenate, and in heat tolerance","reactions":[{"id":1458,"direction":"\u003e","locations":"cytoplasm","altext":null,"export":true,"pw_reaction_id":null,"source":null}],"signal_regions":"None","transmembrane_regions":"None","pdb_id":null,"cellular_location":"Cytoplasm. Endoplasmic reticulum. Golgi apparatus.","genbank_gene_id":"AY693164","genbank_protein_id":"51013779","gene_card_id":"GET3","chromosome_location":"chromosome 4","locus":"YDL100C","synonyms":["Arsenical pump-driving ATPase","Arsenite-stimulated ATPase","Golgi to ER traffic protein 3","Guided entry of tail-anchored proteins 3"],"enzyme_classes":["3.6.-.-"],"go_classes":[{"category":"Component","description":" Not Available"},{"category":"Function","description":" binding"},{"category":"Function","description":" nucleoside binding"},{"category":"Function","description":" purine nucleoside binding"},{"category":"Function","description":" adenyl nucleotide binding"},{"category":"Function","description":" adenyl ribonucleotide binding"},{"category":"Function","description":" ATP binding"},{"category":"Process","description":" chemical homeostasis"},{"category":"Process","description":" ion homeostasis"},{"category":"Process","description":" cellular ion homeostasis"},{"category":"Process","description":" cellular cation homeostasis"},{"category":"Process","description":" cellular metal ion homeostasis"},{"category":"Process","description":" biological regulation"},{"category":"Process","description":" regulation of biological quality"},{"category":"Process","description":" homeostatic 
process"}],"pfams":[],"pathways":[],"gene_sequence":"ATGGATTTAACCGTGGAACCTAATTTGCACTCTTTAATTACCTCTACCACTCATAAGTGGATTTTCGTTGGTGGTAAAGGTGGTGTTGGTAAGACTACTTCATCATGTTCCATTGCTATCCAAATGACTTTGAGTCAACCAAACAAACAGTTCCTACTGATCTCTACTGATCCTGCCCATAACTTAAGTGATGCATTCGGTGAGAAATTTGGTAAAGACGCCAGAAAGGTGACAGGCATGAATAATCTATCATGTATGGAAATCGATCCATCCGCTGCTTTGAAGGATATGAACGACATGGCAGTTTCACGCGCTAACAATAACGGAAGTGACGGTCAAGGTGACGATCTAGGAAGCTTGCTTCAAGGTGGTGCTCTTGCTGATTTGACCGGTTCCATCCCTGGTATCGACGAAGCTTTATCCTTCATGGAAGTCATGAAGCACATTAAAAGGCAAGAACAGGGCGAAGGTGAAACCTTCGATACTGTTATTTTTGACACTGCTCCAACTGGCCACACATTAAGATTTCTACAACTACCAAATACTTTATCCAAGCTTTTGGAAAAGTTCGGTGAAATTACCAACAAATTGGGCCCAATGCTAAACTCTTTTATGGGCGCAGGTAATGTCGATATCTCTGGAAAATTGAACGAGTTAAAGGCTAATGTCGAGACCATCAGACAACAATTCACGGATCCTGACCTAACGACTTTTGTTTGCGTTTGTATCAGTGAATTCTTATCCTTATATGAAACTGAAAGACTAATTCAGGAATTGATTTCCTACGATATGGACGTTAATTCCATCATTGTCAACCAATTATTATTTGCTGAAAACGATCAAGAGCACAACTGTAAGAGATGTCAGGCAAGATGGAAGATGCAAAAGAAGTACTTGGACCAAATCGACGAATTGTACGAAGATTTCCATGTCGTTAAAATGCCATTATGTGCTGGAGAAATCAGAGGCTTAAATAACTTAACAAAGTTCTCACAGTTCCTAAACAAAGAATATAACCCTATTACTGATGGCAAAGTCATTTATGAGTTAGAAGATAAGGAATAG","protein_sequence":"MDLTVEPNLHSLITSTTHKWIFVGGKGGVGKTTSSCSIAIQMALSQPNKQFLLISTDPAHNLSDAFGEKFGKDARKVTGMNNLSCMEIDPSAALKDMNDMAVSRANNNGSDGQGDDLGSLLQGGALADLTGSIPGIDEALSFMEVMKHIKRQEQGEGETFDTVIFDTAPTGHTLRFLQLPNTLSKLLEKFGEITNKLGPMLNSFMGAGNVDISGKLNELKANVETIRQQFTDPDLTTFVCVCISEFLSLYETERLIQELISYDMDVNSIIVNQLLFAENDQEHNCKRCQARWKMQKKYLDQIDELYEDFHVVKMPLCAGEIRGLNNLTKFSQFLNKEYNPITDGKVIYELEDKE"},{"created_at":"2011-05-29T04:30:10.000Z","updated_at":"2011-07-22T17:54:37.000Z","name":"Folic acid synthesis protein FOL1","uniprot_id":"P53848","uniprot_name":"FOL1_YEAST","enzyme":true,"transporter":false,"gene_name":"FOL1","num_residues":824,"molecular_weight":"93119.10156","theoretical_pi":"6.4","general_function":"Involved in dihydropteroate synthase activity","specific_function":"Catalyzes three sequential steps of tetrahydrofolate 
biosynthesis","reactions":[{"id":1140,"direction":"\u003e","locations":"mitochondrion","altext":null,"export":true,"pw_reaction_id":null,"source":null},{"id":1457,"direction":"\u003e","locations":"mitochondrion","altext":null,"export":true,"pw_reaction_id":null,"source":null},{"id":1461,"direction":"\u003e","locations":"mitochondrion","altext":null,"export":true,"pw_reaction_id":null,"source":null},{"id":1462,"direction":"\u003e","locations":"mitochondrion","altext":null,"export":true,"pw_reaction_id":null,"source":null},{"id":2843,"direction":"\u003e","locations":"Mitochondrion membrane","altext":"2-amino-4-hydroxy-6-(D-erythro-1,2,3-trihydroxypropyl)-7,8-dihydropteridine = 2-amino-4-hydroxy-6-hydroxymethyl-7,8-dihydropteridine + glycolaldehyde.","export":false,"pw_reaction_id":null,"source":null},{"id":2844,"direction":"\u003e","locations":"Mitochondrion membrane","altext":"ATP + 2-amino-4-hydroxy-6-hydroxymethyl-7,8-dihydropteridine = AMP + (2-amino-4-hydroxy-7,8-dihydropteridin-6-yl)methyl diphosphate.","export":false,"pw_reaction_id":null,"source":null},{"id":2845,"direction":"\u003e","locations":"Mitochondrion membrane","altext":"(2-amino-4-hydroxy-7,8-dihydropteridin-6-yl)methyl diphosphate + 4-aminobenzoate = diphosphate + dihydropteroate.","export":false,"pw_reaction_id":null,"source":null},{"id":4141,"direction":"\u003e","locations":null,"altext":null,"export":true,"pw_reaction_id":"PW_R006442","source":"Smpdb"},{"id":4142,"direction":"\u003e","locations":null,"altext":null,"export":true,"pw_reaction_id":"PW_R006443","source":"Smpdb"},{"id":4143,"direction":"\u003e","locations":null,"altext":null,"export":true,"pw_reaction_id":"PW_R006444","source":"Smpdb"}],"signal_regions":"None","transmembrane_regions":"None","pdb_id":null,"cellular_location":"Mitochondrion membrane","genbank_gene_id":null,"genbank_protein_id":null,"gene_card_id":"FOL1","chromosome_location":"chromosome 14","locus":"YNL256W","synonyms":["Dihydroneopterin 
aldolase","DHNA","FASA","FASB","2-amino-4-hydroxy-6-hydroxymethyldihydropteridine pyrophosphokinase","6-hydroxymethyl-7,8-dihydropterin pyrophosphokinase","PPPK","7,8-dihydro-6-hydroxymethylpterin-pyrophosphokinase","HPPK","FASC","Dihydropteroate synthase","DHPS","Dihydropteroate pyrophosphorylase","FASD"],"enzyme_classes":["4.1.2.25","2.7.6.3","2.5.1.15"],"go_classes":[{"category":"Component","description":" Not Available"},{"category":"Function","description":" diphosphotransferase activity"},{"category":"Function","description":" 2-amino-4-hydroxy-6-hydroxymethyldihydropteridine diphosphokinase activity"},{"category":"Function","description":" aldehyde-lyase activity"},{"category":"Function","description":" dihydroneopterin aldolase activity"},{"category":"Function","description":" catalytic activity"},{"category":"Function","description":" dihydropteroate synthase activity"},{"category":"Function","description":" transferase activity"},{"category":"Function","description":" transferase activity, transferring phosphorus-containing groups"},{"category":"Function","description":" lyase activity"},{"category":"Function","description":" carbon-carbon lyase activity"},{"category":"Function","description":" transferase activity, transferring alkyl or aryl (other than methyl) groups"},{"category":"Process","description":" folic acid and derivative metabolic process"},{"category":"Process","description":" folic acid and derivative biosynthetic process"},{"category":"Process","description":" pteridine and derivative metabolic process"},{"category":"Process","description":" metabolic process"},{"category":"Process","description":" tetrahydrofolate biosynthetic process"},{"category":"Process","description":" cellular metabolic process"},{"category":"Process","description":" cellular aromatic compound metabolic 
process"}],"pfams":[{"name":"FolB","identifier":"PF02152"},{"name":"HPPK","identifier":"PF01288"},{"name":"Pterin_bind","identifier":"PF00809"}],"pathways":[{"name":"Folate biosynthesis","kegg_map_id":"00790"},{"name":"tetrahydrofolate biosynthesis","kegg_map_id":null}],"gene_sequence":"ATGTCAAAGCTATTTTCTACTGTCAATTCTGCAAGACATAGTGTACCACTAGGCGGCATGAGAGATTATGTGCACATTAAGAAACTAGAGATGAATACAGTTCTTGGGCCTGATTCCTGGAATCAATTAATGCCTCAGAAATGTCTACTAAGCTTAGATATGGGTACAGATTTTAGTAAATCTGCGGCTACGGATGATTTGAAATATTCTCTAAATTATGCAGTTATTTCTCGTGATTTGACGAATTTCGTCAGCAAAAAAAAGAATTGGGGTTCTGTTTCTAATTTGGCTAAATCTGTGTCTCAATTTGTTATGGACAAATATTCTGGTGTCGAGTGTCTGAATTTAGAAGTGCAGGCGGATACAACGCATATTAGAAGTGACCACATATCTTGTATTATTCAACAAGAAAGAGGGAATCCAGAATCACAGGAATTTGACGTTGTTAGGATATCTGAGTTAAAAATGTTGACTTTGATTGGTGTTTTCACCTTTGAGAGACTTAAGAAACAGTATGTAACTTTGGATATAAAGTTGCCTTGGCCAAAGAAAGCCGAATTGCCACCGCCAGTGCAAAGCATAATTGATAACGTTGTCAAGTTTGTGGAGGAATCAAATTTCAAGACTGTGGAAGCTCTTGTAGAATCTGTGTCAGCTGTTATTGCCCATAACGAGTATTTTCAAAAGTTTCCAGATTCGCCTTTGGTGGTGAAGGTTTTGAAATTAAACGCAATCACAGCCACAGAAGGTGTTGGTGTAAGCTGTATTAGAGAGCCCAGGGAGATTGCGATGGTAAATATTCCATATCTTTCCTCCATACATGAATCGTCTGATATTAAGTTCCAATTGTCTTCATCACAAAACACTCCTATTGAGGGTAAAAATACATGGAAAAGAGCGTTTTTAGCGTTTGGTTCAAACATTGGGGACCGTTTCAAACACATTCAAATGGCGTTGCAATTATTATCAAGGGAAAAAACGGTTAAATTACGGAATATTTCGTCTATTTTTGAAAGTGAACCAATGTATTTCAAAGATCAAACCCCTTTCATGAATGGGTGTGTTGAGGTGGAGACATTACTGACCCCAAGCGAATTATTAAAATTGTGTAAAAAAATTGAATATGAAGAGTTGCAAAGAGTCAAGCATTTTGATAATGGTCCGAGAACAATAGATCTGGATATTGTTATGTTTTTGAATAGCGCCGGAGAAGATATTATAGTAAATGAACCGGATTTGAATATACCGCATCCTAGAATGCTGGAGAGGACTTTCGTTCTTGAGCCGTTATGTGAATTAATATCCCCCGTTCACCTTCATCCTGTGACAGCGGAACCCATTGTAGACCATTTAAAACAGTTATACGACAAACAGCATGATGAAGATACCTTATGGAAATTAGTTCCATTGCCTTATCGTAGTGGTGTGGAGCCTAGATTTTTGAAATTCAAGACCGCTACAAAACTTGACGAATTTACTGGAGAAACAAACAGAATTACTGTTTCACCTACATATATCATGGCTATCTTCAACGCTACACCAGATTCATTTTCCGATGGAGGTGAGCATTTTGCGGACATTGAAAGTCAATTGAATGATATCATTAAATTGTGTAAAGACGCATTATATTTGCATGAGAGCGTCATCATCGACGTTGGAGGGTGTTCTACCAGGCCTAACTCTATTCAGGCGTCTGAGGAAGAAGAAATACG
CAGGTCTATCCCATTAATTAAGGCCATTAGAGAAAGCACTGAGTTACCGCAAGATAAAGTCATACTATCCATTGATACTTATCGTTCCAATGTCGCTAAAGAAGCGATTAAAGTTGGAGTGGATATTATTAATGATATTTCGGGAGGTTTATTTGACAGCAACATGTTTGCCGTAATTGCAGAGAACCCAGAAATTTGTTATATTTTATCACACACACGTGGTGATATTTCAACGATGAATAGGCTGGCGCATTACGAAAATTTTGCATTGGGTGATTCTATTCAGCAAGAATTTGTTCATAATACCGACATTCAGCAGCTAGACGACTTGAAAGACAAAACAGTGTTAATCAGGAATGTTGGTCAAGAAATTGGCGAAAGGTATATCAAAGCGATTGATAATGGAGTAAAGCGCTGGCAAATTCTAATCGACCCTGGACTTGGTTTTGCTAAGACCTGGAAGCAAAACTTACAAATTATTAGACATATCCCCATTTTAAAGAACTACTCATTCACCATGAACTCAAACAATTCGCAAGTGTATGTTAACCTCAGAAATATGCCCGTTTTATTGGGTCCATCGCGCAAAAAATTCATTGGACATATCACAAAAGATGTGGATGCGAAGCAAAGAGACTTTGCTACTGGAGCGGTGGTAGCGTCGTGTATTGGTTTCGGCAGCGACATGGTTAGGGTCCATGACGTTAAAAATTGTTCGAAGAGCATTAAATTAGCAGATGCTATTTATAAAGGTTTGGAATAA","protein_sequence":"MSKLFSTVNSARHSVPLGGMRDYVHIKKLEMNTVLGPDSWNQLMPQKCLLSLDMGTDFSKSAATDDLKYSLNYAVISRDLTNFVSKKKNWGSVSNLAKSVSQFVMDKYSGVECLNLEVQADTTHIRSDHISCIIQQERGNPESQEFDVVRISELKMLTLIGVFTFERLKKQYVTLDIKLPWPKKAELPPPVQSIIDNVVKFVEESNFKTVEALVESVSAVIAHNEYFQKFPDSPLVVKVLKLNAITATEGVGVSCIREPREIAMVNIPYLSSIHESSDIKFQLSSSQNTPIEGKNTWKRAFLAFGSNIGDRFKHIQMALQLLSREKTVKLRNISSIFESEPMYFKDQTPFMNGCVEVETLLTPSELLKLCKKIEYEELQRVKHFDNGPRTIDLDIVMFLNSAGEDIIVNEPDLNIPHPRMLERTFVLEPLCELISPVHLHPVTAEPIVDHLKQLYDKQHDEDTLWKLVPLPYRSGVEPRFLKFKTATKLDEFTGETNRITVSPTYIMAIFNATPDSFSDGGEHFADIESQLNDIIKLCKDALYLHESVIIDVGGCSTRPNSIQASEEEEIRRSIPLIKAIRESTELPQDKVILSIDTYRSNVAKEAIKVGVDIINDISGGLFDSNMFAVIAENPEICYILSHTRGDISTMNRLAHYENFALGDSIQQEFVHNTDIQQLDDLKDKTVLIRNVGQEIGERYIKAIDNGVKRWQILIDPGLGFAKTWKQNLQIIRHIPILKNYSFTMNSNNSQVYVNLRNMPVLLGPSRKKFIGHITKDVDAKQRDFATGAVVASCIGFGSDMVRVHDVKNCSKSIKLADAIYKGLE"}]}