Identification |
---|
Name | Transposon Ty2-OR1 Gag-Pol polyprotein |
---|
Synonyms | - TY2A-TY2B
- Transposon Ty2 TYA-TYB polyprotein
- Capsid protein
- CA
- Ty2 protease
- PR
- Integrase
- IN
- Reverse transcriptase/ribonuclease H
- RT
- RT-RH
|
---|
Gene Name | TY2B-OR1 |
---|
Enzyme Class | |
---|
Biological Properties |
---|
General Function | Involved in DNA binding |
---|
Specific Function | Integrase (IN) targets the VLP to the nucleus, where a subparticle preintegration complex (PIC) containing at least integrase and the newly synthesized dsDNA copy of the retrotransposon must transit the nuclear membrane. Once in the nucleus, integrase performs the integration of the dsDNA into the host genome |
---|
Cellular Location | Cytoplasm. Nucleus |
---|
SMPDB Pathways | |
---|
KEGG Pathways | |
---|
SMPDB Reactions | Not Available |
---|
KEGG Reactions | Not Available |
---|
Metabolites | YMDB ID | Name | View |
---|
YMDB00219 | Pyrophosphate | Show | YMDB00220 | Thiamine | Show | YMDB00862 | hydron | Show | YMDB00890 | water | Show |
|
---|
GO Classification | Component |
---|
cytoplasm | cell part | intracellular part | Function |
---|
nucleic acid binding | RNA binding | DNA binding | binding | Process |
---|
cellular macromolecule metabolic process | DNA metabolic process | DNA integration | metabolic process | macromolecule metabolic process |
|
---|
Gene Properties |
---|
Chromosome Location | Not Available |
---|
Locus | YOR192C-B |
---|
Gene Sequence | >5313 bp
ATGGAATCCCAACAATTATCTCAAAATTCACCCACTTTTCATGGTAGCGCCTATGCTTCG
GTTACTTCTAAGGAAGTCCCATCAAATCAAGATCCGTTAGCCGTTTCAGCTTCCAATTTA
CCGGAATTTGATAGAGATTCCACTAAGGTTAATTCTCAACAAGAGACAACACCTGGGACA
TCAGCTGTTCCAGAGAACCATCATCATGTCTCTCCTCAACCTGCTTCAGTACCACCTCCA
CAGAATGGACAGTACCAACAGCACGGCATGATGACCCCAAACAAAGCTATGGCTTCTAAC
TGGGCACATTACCAACAACCATCTATGATGACGTGTTCACATTATCAAACGTCACCTGCG
TATTATCAACCGGACCCACACTATCCGCTGCCACAGTATATCCCACCGCTGAGTACTTCC
TCACCTGATCCAATCGGTTCACAGGATCAACACTCTGAAGTACCTCAAGCTAAGACAAAG
GTGAGAAATAATGTCTTACCACCACACACTTTAACATCAGAAGAAAACTTTTCTACATGG
GTTAAATTTTACATCAGATTTTTGAAGAACTCTAATCTCGGTGACATTATTCCAAATGAC
CAGGGTGAAATCAAAAGACAAATGACTTATGAAGAACATGCGTATATATACAATACCTTC
CAAGCATTTGCCCCATTTCATTTATTGCCAACATGGGTAAAACAAATTTTAGAAATTAAT
TATTCTGACATCCTTACAGTCCTTTGTAAAAGTGTGTCCAAAATGCAAACTAACAATCAA
GAATTAAAAGATTGGATAGCTCTTGCCAACCTTGAGTACAACGGAAGTACATCTGCTGAT
ACATTTGAAATTACAGTCAGCACGATCATTCAAAGGCTAAAAGAAAACAATATCAATGTT
AGCGACAGATTGGCCTGTCAACTAATACTTAAAGGTCTATCCGGTGACTTCAAATACCTA
CGTAATCAATATCGTACCAAAACGAACATGAAACTTTCCCAATTATTCGCTGAAATTCAA
TTAATATATGACGAAAATAAAATCATGAATCTAAATAAACCGTCCCAATACAAACAACAC
AGCGAATACAAAAATGTTTCTCGCACATCTCCAAACACGACTAACACGAAGGTTACAACT
CGTAATTATCATAGAACAAATAGTTCAAAACCAAGAGCAGCAAAAGCTCACAATATTGCT
ACATCCAGTAAATTCTCAAGGGTGAACAATGATCACATTAATGAATCAACCGTTTCATCA
CAATACTTAAGCGATGACAACGAACTTAGTCTTGGCCAGCAACAGAAAGAATCTAAGCCA
ACACGCACAATAGACTCGAATGACGAACTACCTGATCACCTTCTTATTGATTCAGGAGCT
TCGCAAACGCTTGTCAGATCAGCCCATTATTTACACCATGCAACACCCAATTCTGAAATA
AACATAGTCGATGCTCAAAAACAAGACATTCCTATAAATGCCATTGGTAATCTTCACTTC
AACTTTCAGAACGGCACCAAAACATCAATAAAAGCACTACACACACCAAACATAGCCTAT
GATCTATTAAGTTTGAGTGAGCTGGCTAACCAAAATATTACTGCCTGCTTTACCAGAAAC
ACTTTAGAAAGATCGGATGGTACAGTACTAGCTCCCATAGTCAAACATGGAGACTTTTAC
TGGTTATCTAAAAAATACCTAATTCCTTCGCACATTTCAAAGCTAACAATAAACAACGTC
AACAAAAGCAAAAGCGTAAATAAATATCCATATCCGTTAATACATCGAATGCTTGGACAT
GCTAACTTCCGAAGTATTCAGAAGTCTCTTAAGAAGAATGCAGTTACATATTTGAAAGAA
TCGGATATTGAATGGTCTAACGCTAGCACATATCAATGTCCTGACTGTCTAATCGGCAAA
AGCACGAAACATAGGCATGTCAAAGGATCACGACTAAAGTACCAAGAATCATATGAGCCT
TTTCAGTACTTGCATACCGATATATTTGGTCCTGTACATCACTTACCGAAAAGTGCACCT
TCTTACTTTATATCGTTTACAGATGAGAAAACCAGATTCCAATGGGTGTACCCATTACAC
GACCGTCGTGAAGAATCTATCCTCAATGTTTTTACATCGATATTAGCATTTATTAAGAAC
CAATTCAATGCTCGCGTTCTAGTTATCCAGATGGATCGTGGCTCCGAGTACACTAACAAA
ACTCTTCATAAGTTCTTTACGAACAGAGGTATTACTGCATGCTATACAACCACGGCAGAT
TCTAGAGCACACGGTGTCGCTGAACGATTAAATCGTACTTTATTAAACGATTGTCGCACA
CTGCTTCATTGCAGTGGTCTACCAAATCATCTATGGTTCTCAGCAGTCGAATTTTCTACT
ATAATCAGAAATTCATTAGTCTCACCAAAAAACGATAAATCTGCAAGACAACATGCAGGT
TTAGCTGGACTGGACATTACTACTATACTACCTTTCGGTCAACCGGTTATAGTTAACAAC
CATAATCCCGACTCGAAAATACATCCTCGTGGCATTCCAGGTTACGCCTTACATCCATCA
CGAAACTCTTATGGCTATATTATCTATCTTCCATCCTTAAAAAAGACAGTAGATACTACC
AATTACGTTATATTACAAAACAAGCAAACGAAATTGGACCAGTTCGACTACGATACACTC
ACTTTTGATGATGATCTCAATCGTTTAACAGCCCATAACCAATCTTTTATTGAACAAAAT
GAAACGGAGCAGTCATATGATCAAAATACAGAATCTGATCATGACTATCAATCGGAGATT
GAAATAAACTCTGATCCTCTAGTGAACGATTTCTCGTCCCAATCATTGAACCCTTTACAA
TTAGACAAGGAACCAGTCCAAAAGGTACGTGCACCAAAAGAAGTTGATGCCGACATATCT
GAATACAATATTCTTCCATCTACTATACGATCTCGTACACCCCATATCATTAATAAAGAG
AGTACCGAAATGGGTGGTACCATTGAATCAGATACTACTTCACCTAGACACTCGTCTACC
TTCACTGCACGAAACCAAAAGCGACCTGGTAGTCCCAACGATATGATTGATTTGACCTCA
CAGGATAGAGTTAATTATGGACTTGAAAACATCAAAACTACACGTTTGGGTGGTACGGAG
GAACCATATATTCAACGAAATAGTGATACAAATATCAAATACAGGACTACAAATAGTACG
CCCTCAATAGATGACCGTTCGTCCAACAGTGAATCCACTACTCCCATCATCTCCATAGAA
ACAAAGGCTGCATGTGATAATACACCCTCCATTGATACGGATCCGCCAGAATATCGATCT
TCTGACCATGCGACTCCTAATATAATGCCTGACAAATCCTCAAAAAATGTTACGGCTGAT
TCTATTCTTGACGACCTCCCACTTCCTGACTTAACCCATCAATCTCCTACGGACACTTCT
GATGTTTCAAAAGATATTCCACACATACACTCTCGTCAGACTAATTCCAGTTTGGGTGGT
ATGGATGATTCTAATGTTCTGACTACTACCAAAAGTAAGAAAAGATCATTAGAAGATAAT
GAAACTGAAATTGAGGTATCCCGAGACACATGGAATAATAAGAATATGAGAAGTCTGGAA
CCACCAAGATCGAAGAAACGCATAAATTTAATTGCAGCAATAAAAGGAGTGAAATCGATC
AAACCAGTTCGAACGACCTTAAGATATGATGAAGCAATTACTTATAATAAAGACAACAAA
GAAAAAGACAGATATGTTGAAGCTTATCATAAAGAAATTAGCCAACTATTGAAAATGAAC
ACTTGGGATACAAACAAATATTATGATAGAAATGACATAGATCCTAAAAAAGTAATAAAC
TCAATGTTTATATTTAACAAGAAACGTGATGGTACACACAAAGCTAGATTTGTTGCAAGA
GGCGACATTCAACACCCCGATACATATGATTCTGATATGCAATCCAATACCGTACATCAC
TATGCACTGATGACGTCACTGTCAATCGCATTAGACAACGACTATTATATCACACAGCTG
GACATATCCTCTGCTTACTTATATGCTGATATCAAAGAAGAATTATACATAAGACCTCCA
CCACATTTAGGTTTGAATGATAAATTACTACGTTTGAGAAAATCACTCTATGGTTTGAAA
CAAAGTGGTGCAAACTGGTATGAAACCATTAAATCATATTTAATAAATTGTTGCGACATG
CAAGAAGTTCGCGGATGGTCATGCGTATTTAAGAATAGTCAAGTAACAATTTGCTTATTC
GTTGACGACATGATTTTATTTAGCAAAGACTTAAATGCAAATAAGAAAATCATAACAACA
CTCAAGAAACAATACGATACAAAGATAATAAATCTGGGTGAAGGTGATAACGAAATTCAG
TACGACATACTTGGATTAGAGATCAAATATCAAAGAAGCAAGTACATGAAATTAGGTATG
GAAAAATCCTTGACAGAAAAATTACCCAAACTAAACGTACCTTTGAACCCAAAAGGAAAG
AAACTTAGAGCTCCAGGTCAACCAGGTCATTATATAGACCAGGATGAACTAGAAATAGAT
GAAGATGAATACAAAGAGAAAGTGCATGAAATGCAAAAGTTGATTGGTCTAGCTTCATAT
GTTGGATATAAATTTAGATTTGACTTACTATACTACATCAACACACTTGCTCAACATATA
CTATTCCCCTCTAGGCAAGTTTTAGACATGACATATGAGTTAATACAATTCATGTGGGAC
ACTAGAGATAAACAATTAATATGGCACAAAAACAAACCTACCAAGCCAGATAATAAACTA
GTCGCAATAAGCGATGCTTCATATGGTAACCAACCATATTACAAGTCACAAATTGGTAAC
ATTTTCCTACTCAACGGAAAAGTGATTGGAGGAAAGTCGACAAAGGCTTCGTTAACATGC
ACTTCAACTACAGAAGCAGAAATACACGCAGTCAGTGAAGCTATACCGCTATTGAATAAC
CTCAGTCACCTTGTGCAAGAACTTAACAAGAAACCAATTATTAAAGGCTTACTTACTGAT
AGTAGATCAACGATCAGTATAATTAAGTCTACAAATGAAGAGAAATTTAGAAACAGATTT
TTTGGCACGAAGGCAATGAGACTTAGAGATGAAGTATCAGGTAATAATTTATACGTATAC
TACATCGAGACCAATAAGAACATTGCTGATGTGATGACAAAACCTCTTCCGATAAAAACA
TTTAAACTATTAACTAACAAATGGATTCATTAG |
---|
Protein Properties |
---|
Pfam Domain Function | |
---|
Protein Residues | 1770 |
---|
Protein Molecular Weight | 201982.0 |
---|
Protein Theoretical pI | 8.44 |
---|
Signalling Regions | |
---|
Transmembrane Regions | |
---|
Protein Sequence | >Transposon Ty2-OR1 Gag-Pol polyprotein
MESQQLSQNSPTFHGSAYASVTSKEVPSNQDPLAVSASNLPEFDRDSTKVNSQQETTPGT
SAVPENHHHVSPQPASVPPPQNGQYQQHGMMTPNKAMASNWAHYQQPSMMTCSHYQTSPA
YYQPDPHYPLPQYIPPLSTSSPDPIGSQDQHSEVPQAKTKVRNNVLPPHTLTSEENFSTW
VKFYIRFLKNSNLGDIIPNDQGEIKRQMTYEEHAYIYNTFQAFAPFHLLPTWVKQILEIN
YSDILTVLCKSVSKMQTNNQELKDWIALANLEYNGSTSADTFEITVSTIIQRLKENNINV
SDRLACQLILKGLSGDFKYLRNQYRTKTNMKLSQLFAEIQLIYDENKIMNLNKPSQYKQH
SEYKNVSRTSPNTTNTKVTTRNYHRTNSSKPRAAKAHNIATSSKFSRVNNDHINESTVSS
QYLSDDNELSLGQQQKESKPTRTIDSNDELPDHLLIDSGASQTLVRSAHYLHHATPNSEI
NIVDAQKQDIPINAIGNLHFNFQNGTKTSIKALHTPNIAYDLLSLSELANQNITACFTRN
TLERSDGTVLAPIVKHGDFYWLSKKYLIPSHISKLTINNVNKSKSVNKYPYPLIHRMLGH
ANFRSIQKSLKKNAVTYLKESDIEWSNASTYQCPDCLIGKSTKHRHVKGSRLKYQESYEP
FQYLHTDIFGPVHHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESILNVFTSILAFIKN
QFNARVLVIQMDRGSEYTNKTLHKFFTNRGITACYTTTADSRAHGVAERLNRTLLNDCRT
LLHCSGLPNHLWFSAVEFSTIIRNSLVSPKNDKSARQHAGLAGLDITTILPFGQPVIVNN
HNPDSKIHPRGIPGYALHPSRNSYGYIIYLPSLKKTVDTTNYVILQNKQTKLDQFDYDTL
TFDDDLNRLTAHNQSFIEQNETEQSYDQNTESDHDYQSEIEINSDPLVNDFSSQSLNPLQ
LDKEPVQKVRAPKEVDADISEYNILPSTIRSRTPHIINKESTEMGGTIESDTTSPRHSST
FTARNQKRPGSPNDMIDLTSQDRVNYGLENIKTTRLGGTEEPYIQRNSDTNIKYRTTNST
PSIDDRSSNSESTTPIISIETKAACDNTPSIDTDPPEYRSSDHATPNIMPDKSSKNVTAD
SILDDLPLPDLTHQSPTDTSDVSKDIPHIHSRQTNSSLGGMDDSNVLTTTKSKKRSLEDN
ETEIEVSRDTWNNKNMRSLEPPRSKKRINLIAAIKGVKSIKPVRTTLRYDEAITYNKDNK
EKDRYVEAYHKEISQLLKMNTWDTNKYYDRNDIDPKKVINSMFIFNKKRDGTHKARFVAR
GDIQHPDTYDSDMQSNTVHHYALMTSLSIALDNDYYITQLDISSAYLYADIKEELYIRPP
PHLGLNDKLLRLRKSLYGLKQSGANWYETIKSYLINCCDMQEVRGWSCVFKNSQVTICLF
VDDMILFSKDLNANKKIITTLKKQYDTKIINLGEGDNEIQYDILGLEIKYQRSKYMKLGM
EKSLTEKLPKLNVPLNPKGKKLRAPGQPGHYIDQDELEIDEDEYKEKVHEMQKLIGLASY
VGYKFRFDLLYYINTLAQHILFPSRQVLDMTYELIQFMWDTRDKQLIWHKNKPTKPDNKL
VAISDASYGNQPYYKSQIGNIFLLNGKVIGGKSTKASLTCTSTTEAEIHAVSEAIPLLNN
LSHLVQELNKKPIIKGLLTDSRSTISIIKSTNEEKFRNRFFGTKAMRLRDEVSGNNLYVY
YIETNKNIADVMTKPLPIKTFKLLTNKWIH |
---|
References |
---|
External Links | |
---|
General Reference | - Dujon, B., Albermann, K., Aldea, M., Alexandraki, D., Ansorge, W., Arino, J., Benes, V., Bohn, C., Bolotin-Fukuhara, M., Bordonne, R., Boyer, J., Camasses, A., Casamayor, A., Casas, C., Cheret, G., Cziepluch, C., Daignan-Fornier, B., Dang, D. V., de Haan, M., Delius, H., Durand, P., Fairhead, C., Feldmann, H., Gaillon, L., Kleine, K., et, a. l. .. (1997). "The nucleotide sequence of Saccharomyces cerevisiae chromosome XV." Nature 387:98-102.9169874
- Kim, J. M., Vanguri, S., Boeke, J. D., Gabriel, A., Voytas, D. F. (1998). "Transposable elements and genome organization: a comprehensive survey of retrotransposons revealed by the complete Saccharomyces cerevisiae genome sequence." Genome Res 8:464-478.9582191
- Lesage, P., Todeschini, A. L. (2005). "Happy together: the life and times of Ty retrotransposons and their hosts." Cytogenet Genome Res 110:70-90.16093660
|
---|