Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tcur_2887 |
Symbol | |
ID | 8604231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermomonospora curvata DSM 43183 |
Kingdom | Bacteria |
Replicon accession | NC_013510 |
Strand | + |
Start bp | 3368268 |
End bp | 3369953 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | X-Pro dipeptidyl-peptidase domain protein |
Protein accession | YP_003300470 |
Protein GI | 269127100 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000279173 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTCTCA TCGAGCGCCT GCCCGCTCCG GCCCGCCGGC TGCCGCTCCC CCGCGGCCGC TTCCGCTACA CCGTGCAGCG CGACATCCCG GTACCGATGC CGGACGGGGT GACGCTGCTG GCCGACCGCT ACGCCCCCGC CGGCGTGCCC GACGCGCCGA CCATCTTGGT CCGCACCCCG TACGGGCGCC GCGGGCTGCC GGCCGTCCTG GCCGGGCTGC CGCTGGTCCC CTTCGGGTTC CAGCTCCTGG TGCAAAGCGT CCGCGGCACC TTCGGCTCCG GCGGGGAGTT CGACCCGCTG GGCAGCGAGC AGGCCGACGG GCTGGCCACC GTGCGCTGGA TGCGCAGCCA GCCGTGGTTC ACCGGGTCCT TCGCCACCTA CGGCGCCAGC TACCTGGGGT ATTCCTCCTG GGCGATCGCC GCCGAGGCCG GGCCGGAGCT CAAGGCCATC TCCGCGCAGG TCACCGCCTC CTCCTTCCGC GACGCCGCCT ACGTGGGCGG CTCGTTCGCC CTGGAGACCG TGCTGACCTG GTCGGATCTG ACCTTCCGGC AGGAACGGCC GCTGGGCATG GTCAACGGGG CGCTGCTGGC CAAACGGCGG GCCCGGCGGG CCGCGCGCAG CGGCCGGCCG CTGGCGGAGC TGGACGAGCT GGCCACCGGC GCGATGGTGC GCTTCTACCA GGACCTGCTG GCCAACGACG AGCCCGGCGC CGAGTACTGG CGCAGCCGCG ACTTCACCGG CACCCTCGAC CGGGTCACCG CCCCGGTGAC GCTGCTGGGC GGCTGGTACG ACATCTTCCT GCCCTGGCAG CTGAAGGACT ACCTGGCGCT GCGCGCCGCC GGGCGGCGCC CCTACCTGAC CATCGGCCCG TGGTGGCACT TCGACGCCCG GCACGGGATC GCGGCGCTGC GCGAGTCGCT GGCCTGGTTC CGCGCCCACC TGCGGGGCGA CTTCTCCCGG GTGCGCGCCA ACCCGGTGCG GGTGTACCTC ACCGGGGCCG GGCAGTGGCG CGACTACCGC GACTGGCCGC CGCCGGGCAT GCGGCCGGTC CGCTGGCACC TGCACCCCGG GGGCGGGCTC GGGGAGGACG GGCCGCAGCC GGGCGAGCCC AGCCGCTACC TGTTCGACCC CGCCGACCCG ACCCCCTCGC TGGCCGGGCC GAGCGTGCTG GGCAGCTGCA AGCCGGTGGA CCAGCGCCCG GTGGAAAAAC GCGCCGATGT GCTGGTGTTC ACCTCCGCGC CGCTGCGGGC GGACCTGGAC GTCATCGGCC CGGTGGAGGC GGAGCTGTTC GTGCGCTCCG ACCGCGAGCA CACCGACTTC GTGGTGCGCT TGTGCGACGT CGCCCCCGAC GGCACCTCGC TGAACCTGTG CGAGGGCGCA CGGCGGCTGC GCCCGGGCGA CCCGGCCCCC GGCCCCGACG GGGTCCGCCG GGTGCGGGTG GAGCTGTGGC CGGTCGGCCA CCGCTTCCGC CGCGGCCACC GCGTCCGCGT GCACGTGGCC AGCGGGGCGT TCCCCGTCGT GGCCGTCAAC CCCGGCACCG GCGAGCCGCT GGGCAGCGCC ACCGCCCGCC TCGTCGCACG CCAGCAGGTG CTGCACGACC CGGACCACCC GTCGGCGATC CACCTGCCGG TGGTCGAGCG GGCCGCCGAG CACACCGTGT CCGACCCGGC GGCCGAGCAG GTCTGA
|
Protein sequence | MPLIERLPAP ARRLPLPRGR FRYTVQRDIP VPMPDGVTLL ADRYAPAGVP DAPTILVRTP YGRRGLPAVL AGLPLVPFGF QLLVQSVRGT FGSGGEFDPL GSEQADGLAT VRWMRSQPWF TGSFATYGAS YLGYSSWAIA AEAGPELKAI SAQVTASSFR DAAYVGGSFA LETVLTWSDL TFRQERPLGM VNGALLAKRR ARRAARSGRP LAELDELATG AMVRFYQDLL ANDEPGAEYW RSRDFTGTLD RVTAPVTLLG GWYDIFLPWQ LKDYLALRAA GRRPYLTIGP WWHFDARHGI AALRESLAWF RAHLRGDFSR VRANPVRVYL TGAGQWRDYR DWPPPGMRPV RWHLHPGGGL GEDGPQPGEP SRYLFDPADP TPSLAGPSVL GSCKPVDQRP VEKRADVLVF TSAPLRADLD VIGPVEAELF VRSDREHTDF VVRLCDVAPD GTSLNLCEGA RRLRPGDPAP GPDGVRRVRV ELWPVGHRFR RGHRVRVHVA SGAFPVVAVN PGTGEPLGSA TARLVARQQV LHDPDHPSAI HLPVVERAAE HTVSDPAAEQ V
|
| |