Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dgeo_2838 |
Symbol | |
ID | 4074067 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Deinococcus geothermalis DSM 11300 |
Kingdom | Bacteria |
Replicon accession | NC_008010 |
Strand | + |
Start bp | 185999 |
End bp | 187279 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641228642 |
Product | twin-arginine translocation pathway signal |
Protein accession | YP_594341 |
Protein GI | 94972301 |
COG category | [R] General function prediction only |
COG ID | [COG2041] Sulfite oxidase and related enzymes |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGATC AGCTCGCCCT CTCCCGCCGC GCCGCCCTGA GCCTGCTTGG GGGTGCCGGT GCTGCCCTGG CTCTCGGGCG CGGCGCTCTG GCCCAGGCGG CCGTTCCCAC CACGCCTGCT CCCACCTTCA GTGGTCCTGG GCCGCTCGAG TCCTGGAACG GCCTAGGACC GCTGATCGCC CTGCCGCAGA AGGTGCCGCT CATTCGGCTG GTGGACCGCC CCCCGCTCTA CGAGACGCCC CGCGCGTACT TTCAAAGTCC GCTTACCCCC GCCGCCGCCT TCTTTGTCCG CTCTAACCTG GCGCTCTTCC CCGCCAGTAT CGACCTCACC ACCTGGCGGC TGAAGGTCGA GGGCAACGTC CGCAGGCCGC GGTCGTTCAG TCTGGCGGCG CTGCTTCGCG ACTTCGAGCC GGTCAGTGTG ACTGCCGTCA TGCAGTGCAC CGGCAACAGC CGTTCGCGTT TTCAACCCCG CCGCCCTGGC GGACAGTGGG GCAACGGGGC GATGGGCTGC GCCACCTGGA CCGGCGTGCG GCTGCGCGAC CTGCTGGACC GTGCAGGCAT CCAGAGCGGC GGCGTGCAGG TGCAGTTTCA GGGGCTGGAC CAGGGAGCGG GGGCACCGGG GAGCGGCGGG GCCGAGTACA AGAAGAGCCT TGACCTCGAT GACCCGGTGC TGGACGAGTG CATTGTCGCC TATGCCATGA ATGGCCAGCC GCTGCCACTC TTGAACGGCT TCCCGGTGCG GCTGGTGGTG CCTGGATACT TCGCCACGTA CTGGATGAAG ACCCTGAGTT TTATCCGCGT GCTCACCGAA CCCGATACCA ATTTCTGGAT GGCCAGTGCG TACCTTCAGC CCGACAATCC CCGTGGCACA ACGACACCGC AGGCGGTCAA GGACAAGAAG GTCAAGTTCC GGCCCGTGGG CAGCATGCCG GTACGCTCCT TTATCATGAC GCCTGACGAG ACCGTCAAGG TCCCGGCGGG CTTGCCGATC ACCGTGCAGG GCCTGGCCCT AAGTGGGCGG GGGGCCGTCA CCAAGGTCGA AGTGTCCACC GATGGCGGCA AAACCTGGCG GAACGCTCAA TTGGGTCAGG ATCTCGGGAA ATACGCCTTC CGGCCCTGGA GCTTTGCGTG GACGCCAAAG CAACCGGGGC AGTACACCCT GGCCGTCCGC GCCACCGATG CGAGCGGGGC CACCCAGACA GATCAGCCCA TCTGGAATCC GTCCGGTTAC CTCTGGAACA CCATCGAGCG CCAGACGGTC ACTGTCGGGC AGACCGGGTA A
|
Protein sequence | MDDQLALSRR AALSLLGGAG AALALGRGAL AQAAVPTTPA PTFSGPGPLE SWNGLGPLIA LPQKVPLIRL VDRPPLYETP RAYFQSPLTP AAAFFVRSNL ALFPASIDLT TWRLKVEGNV RRPRSFSLAA LLRDFEPVSV TAVMQCTGNS RSRFQPRRPG GQWGNGAMGC ATWTGVRLRD LLDRAGIQSG GVQVQFQGLD QGAGAPGSGG AEYKKSLDLD DPVLDECIVA YAMNGQPLPL LNGFPVRLVV PGYFATYWMK TLSFIRVLTE PDTNFWMASA YLQPDNPRGT TTPQAVKDKK VKFRPVGSMP VRSFIMTPDE TVKVPAGLPI TVQGLALSGR GAVTKVEVST DGGKTWRNAQ LGQDLGKYAF RPWSFAWTPK QPGQYTLAVR ATDASGATQT DQPIWNPSGY LWNTIERQTV TVGQTG
|
| |