Gene Dgeo_2838 details


Gene Information

Locus tag: Dgeo_2838
Symbol:
ID: 4074067
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Deinococcus geothermalis DSM 11300
Kingdom: Bacteria
Replicon accession: NC_008010
Strand:
Start bp: 185999
End bp: 187279
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 67%
IMG OID: 641228642
Product: twin-arginine translocation pathway signal
Protein accession: YP_594341
Protein GI: 94972301
COG category: [R] General function prediction only
COG ID: [COG2041] Sulfite oxidase and related enzymes
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
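
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(185999, 187279)
print(length, protein_length(length))  # 1281 426
```

The results match the listed Gene Length (1281 bp) and Protein Length (426 aa).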


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATGATC AGCTCGCCCT CTCCCGCCGC GCCGCCCTGA GCCTGCTTGG GGGTGCCGGT 
GCTGCCCTGG CTCTCGGGCG CGGCGCTCTG GCCCAGGCGG CCGTTCCCAC CACGCCTGCT
CCCACCTTCA GTGGTCCTGG GCCGCTCGAG TCCTGGAACG GCCTAGGACC GCTGATCGCC
CTGCCGCAGA AGGTGCCGCT CATTCGGCTG GTGGACCGCC CCCCGCTCTA CGAGACGCCC
CGCGCGTACT TTCAAAGTCC GCTTACCCCC GCCGCCGCCT TCTTTGTCCG CTCTAACCTG
GCGCTCTTCC CCGCCAGTAT CGACCTCACC ACCTGGCGGC TGAAGGTCGA GGGCAACGTC
CGCAGGCCGC GGTCGTTCAG TCTGGCGGCG CTGCTTCGCG ACTTCGAGCC GGTCAGTGTG
ACTGCCGTCA TGCAGTGCAC CGGCAACAGC CGTTCGCGTT TTCAACCCCG CCGCCCTGGC
GGACAGTGGG GCAACGGGGC GATGGGCTGC GCCACCTGGA CCGGCGTGCG GCTGCGCGAC
CTGCTGGACC GTGCAGGCAT CCAGAGCGGC GGCGTGCAGG TGCAGTTTCA GGGGCTGGAC
CAGGGAGCGG GGGCACCGGG GAGCGGCGGG GCCGAGTACA AGAAGAGCCT TGACCTCGAT
GACCCGGTGC TGGACGAGTG CATTGTCGCC TATGCCATGA ATGGCCAGCC GCTGCCACTC
TTGAACGGCT TCCCGGTGCG GCTGGTGGTG CCTGGATACT TCGCCACGTA CTGGATGAAG
ACCCTGAGTT TTATCCGCGT GCTCACCGAA CCCGATACCA ATTTCTGGAT GGCCAGTGCG
TACCTTCAGC CCGACAATCC CCGTGGCACA ACGACACCGC AGGCGGTCAA GGACAAGAAG
GTCAAGTTCC GGCCCGTGGG CAGCATGCCG GTACGCTCCT TTATCATGAC GCCTGACGAG
ACCGTCAAGG TCCCGGCGGG CTTGCCGATC ACCGTGCAGG GCCTGGCCCT AAGTGGGCGG
GGGGCCGTCA CCAAGGTCGA AGTGTCCACC GATGGCGGCA AAACCTGGCG GAACGCTCAA
TTGGGTCAGG ATCTCGGGAA ATACGCCTTC CGGCCCTGGA GCTTTGCGTG GACGCCAAAG
CAACCGGGGC AGTACACCCT GGCCGTCCGC GCCACCGATG CGAGCGGGGC CACCCAGACA
GATCAGCCCA TCTGGAATCC GTCCGGTTAC CTCTGGAACA CCATCGAGCG CCAGACGGTC
ACTGTCGGGC AGACCGGGTA A
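
The gene sequence is laid out in space-separated blocks of ten bases, so it needs to be cleaned before any calculation such as GC content. A minimal sketch, assuming the sequence is pasted in as plain text; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first two blocks of the record are used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence above, used as a small example.
first_blocks = clean_sequence("ATGGATGATC AGCTCGCCCT")
print(len(first_blocks), gc_fraction(first_blocks))
```

Applying the same two functions to the full gene sequence gives a value that can be compared with the GC content field of the record.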
 
Protein sequence
MDDQLALSRR AALSLLGGAG AALALGRGAL AQAAVPTTPA PTFSGPGPLE SWNGLGPLIA 
LPQKVPLIRL VDRPPLYETP RAYFQSPLTP AAAFFVRSNL ALFPASIDLT TWRLKVEGNV
RRPRSFSLAA LLRDFEPVSV TAVMQCTGNS RSRFQPRRPG GQWGNGAMGC ATWTGVRLRD
LLDRAGIQSG GVQVQFQGLD QGAGAPGSGG AEYKKSLDLD DPVLDECIVA YAMNGQPLPL
LNGFPVRLVV PGYFATYWMK TLSFIRVLTE PDTNFWMASA YLQPDNPRGT TTPQAVKDKK
VKFRPVGSMP VRSFIMTPDE TVKVPAGLPI TVQGLALSGR GAVTKVEVST DGGKTWRNAQ
LGQDLGKYAF RPWSFAWTPK QPGQYTLAVR ATDASGATQT DQPIWNPSGY LWNTIERQTV
TVGQTG
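
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table in plain Python rather than using a bioinformatics library. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted), and `translate` is an illustrative name, not a database function.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGGATGATC AGCTCGCCCT CTCCCGCCGC"))  # MDDQLALSRR
```

The output matches the first block of the protein sequence listed above.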