Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dgeo_0298 |
Symbol | |
ID | 4058022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Deinococcus geothermalis DSM 11300 |
Kingdom | Bacteria |
Replicon accession | NC_008025 |
Strand | + |
Start bp | 289216 |
End bp | 290289 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641229300 |
Product | peptidase M42 |
Protein accession | YP_603770 |
Protein GI | 94984406 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1363] Cellulase M and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTCC TGTCTGCCCC GACCTCCGCC GCCCGGCCCG GGGACTTCGA CCTGCCGTAC ACGACCGACC TCCTCCTGCG CCTGCTGAAC ACCCCCAGCC CGACCGGGTT CACCGAGGCC GCCGTGCGTC TGCTGGAAGG GGAGCTGGAC GCGCTCGGCG TGCCGCACCG CCGCAGCAAG AAGGGGGCGC TGACCTGGGA GATCGCGGGA CAACCTGGCC AGCCGCACAC CACCTTCAGC GGTCACGTGG ACACGCTGGG CGCCATGGTG AAGGAGATCA AGGAGAACGG GCGGCTGCGT CTCTTTCCCT TGGGCGGCTA CGACTGGGCC ACCATCGAGG GCGAGTACGT GCAGGTCCAC ACCGGGCGGG GCGAGGCCGT CACCGGGACG GTCGTCAACA CCCACCAGAG CACCCACGTT CACGGCCCTG CCCTACGGGA GCTGCGGCGC GAGCAGGCGG TGATGGAAGT CCGGCTGGAC GCTCCCACCA CCTCTCCGGA GGAGACGCGG GCGCTGGGCA TCGAGGTGGG CGACTTCGTG AGCTTCGATC CCCGCGCCAC CCTGACGGAC GCCGGGTACA TGAAGAGCCG CCACCTCGAC AACAAGGCCG CGGTTGCCGT GTTCTTGGGC GTGACCCGTG CCCTGCTGGA GGCGCCACCT GCTCGCACGG TCGCCTTCCA CGTCACCACC TACGAGGAGG TCGGGCACGG GGCCGCCACC GGGATTCCGC CCCACACCGA CGAGCTGATC GCGGTGGACA TGGCCGCCGT GGGCGAGGGG CAGACCAGCA GCGAGCACCA CGTCACCCTC TGCGTGGCCG ACAGCGGCGG GCCATATGAC CACGCGCTCG GCAATCGGCT GCGGGCGGCG GCACGGCGGG CCGGGCTGGA GTTGCGGGTA GACCTCTACC CCTACTACGC TTCGGACGGA ACGGCGGCCT GGCGCGCGGG CGGCGACTAT CCGGTTGCCC TGATTGGACC TGGGGTGGAC GCGAGCCATG CTTACGAGCG CACCCACCTG GACGCGTTGC GGGCGACGGC AGAACTGATG CTGGCGCATG TGCGGGGAGA GTGA
|
Protein sequence | MTVLSAPTSA ARPGDFDLPY TTDLLLRLLN TPSPTGFTEA AVRLLEGELD ALGVPHRRSK KGALTWEIAG QPGQPHTTFS GHVDTLGAMV KEIKENGRLR LFPLGGYDWA TIEGEYVQVH TGRGEAVTGT VVNTHQSTHV HGPALRELRR EQAVMEVRLD APTTSPEETR ALGIEVGDFV SFDPRATLTD AGYMKSRHLD NKAAVAVFLG VTRALLEAPP ARTVAFHVTT YEEVGHGAAT GIPPHTDELI AVDMAAVGEG QTSSEHHVTL CVADSGGPYD HALGNRLRAA ARRAGLELRV DLYPYYASDG TAAWRAGGDY PVALIGPGVD ASHAYERTHL DALRATAELM LAHVRGE
|
| |