Gene Dgeo_2103 details

Gene Information

Locus tag: Dgeo_2103
Symbol: engA
ID: 4058200
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Deinococcus geothermalis DSM 11300
Kingdom: Bacteria
Replicon accession: NC_008025
Strand:
Start bp: 2211813
End bp: 2213138
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 62%
IMG OID: 641231142
Product: GTP-binding protein EngA
Protein accession: YP_605566
Protein GI: 94986202
COG category: [R] General function prediction only
COG ID: [COG1160] Predicted GTPases
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
            [TIGR03594] ribosome-associated GTPase EngA
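
The length fields in this record agree with its coordinates, assuming the start and end positions are 1-based and inclusive (a common convention for such records, but an assumption here) and that the protein length does not count the stop codon. A minimal Python sketch of that arithmetic follows; the function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2211813, 2213138)
print(length_bp)                  # 1326
print(protein_length(length_bp))  # 441
```

Both results match the Gene Length (1326 bp) and Protein Length (441 aa) values listed in the record.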


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAAAG TCGCGATTGT GGGAAGGCCG AATGTCGGCA AGTCCAGTTT GTTCAATCGC 
CTGGTGGGGC GGCGTGAGGC CGTGGTGGCC GATTTCCCGG GCGTGACGCG GGATGCCAAG
GAAGGGCTGA TGCTCTACCA CAATCACCGC ATTGTCCTGG TGGATACCGG CGGGCTGTGG
AGCGGCGACG AGTGGGAACA GGCCATCCGC GAGAAGGCCG AGTGGGCGAT GGAAGGCGCA
CAAGCGGTGA TCTTTGTGGT GGATCCGCGC GAGGGCCTCA CCGCTGCTGA TTACGAGGTG
GCCGACTGGC TGCGCCGACT GGGCAAGCCA GTGATTGTGG CCGCCAACAA GATCGACAGC
CCCAAGCATG ACGTGTATCT GGCCGAGCTG TGGGGCCTGG GTTTCGGCGA TCCAGTGGCA
ATCAGCGCCG AACACGCCCG CGGACTGGAC GACCTGATGG AGCGCGTGAT GGCGCACCTT
CCCGCCGACG AGGAGGACGT CCCGGAAGTT GCGCCCATCC GAATCTCTCT GATTGGCCGT
CCGAATGTGG GCAAGTCCAG CCTGCTGAAC GCCATAACCC AGAGCGAGCG CGCCATTGTC
GCGGACCAGC CCGGCACCAC CCGCGACAGC CTGGACGTGG AATGGAATTA TGGCGGCCAG
CGCTTCGTGC TGGTGGATAC GGCGGGCATC CGCAAAAAGC CCGACACCGC CATCGAGGAA
TACGCCATCC AGCGCAGCGA GGCCGCGATC GAACGCAGCG ATATCATCTG GCTGGTGGTC
AACGCAACGG AGATCGGTGA CCATGAACTC AAGCTCGCCA ATCTGGCCTA CGACAGCGGC
AAGCCGGTCA TCGTGGTGGT GAACAAGTGG GATCTGGTGC CCGACGAGGC CCTCAAGCAG
ACAGAAAAGG AGCTGAACCA GAAGCTTCAC CACATCGCCT ACGCACCGCG CGTGTACACC
AGTGCGATCA ACGACTACGG CATCCACGAC ATGCTGGCCG AGGCGATGAA ACTCTATGAG
AAGTGGCAAA GCCGCATTCC CACCGCCGAG CTCAACCGCT GGCTGGAAAT CTGGCAGATG
CGTCAGGCAG TGCCCAACTT CCACGGCAAG CCCTTGAAGA TGTACTTCAT GACGCAGGTG
GAAACGGCAC CTCCTACCTT TGCCATCTTC TGCAACCGCG CCGACTTCGT GACCCGTGCC
TATGAGGGCT TCCTCCAAAA CCGTATTCGT GAGGACCTCG GATTGGCCGG GATTCCGGTC
AGGCTCAAGT GGAAGGAGAA AGGGCCGTAT AAGAAGGGGA AGAAGGGCGA GGAGGCCGAG
GCGTAA
 
Protein sequence
MQKVAIVGRP NVGKSSLFNR LVGRREAVVA DFPGVTRDAK EGLMLYHNHR IVLVDTGGLW 
SGDEWEQAIR EKAEWAMEGA QAVIFVVDPR EGLTAADYEV ADWLRRLGKP VIVAANKIDS
PKHDVYLAEL WGLGFGDPVA ISAEHARGLD DLMERVMAHL PADEEDVPEV APIRISLIGR
PNVGKSSLLN AITQSERAIV ADQPGTTRDS LDVEWNYGGQ RFVLVDTAGI RKKPDTAIEE
YAIQRSEAAI ERSDIIWLVV NATEIGDHEL KLANLAYDSG KPVIVVVNKW DLVPDEALKQ
TEKELNQKLH HIAYAPRVYT SAINDYGIHD MLAEAMKLYE KWQSRIPTAE LNRWLEIWQM
RQAVPNFHGK PLKMYFMTQV ETAPPTFAIF CNRADFVTRA YEGFLQNRIR EDLGLAGIPV
RLKWKEKGPY KKGKKGEEAE A
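
The sequences in this record are printed in blocks of ten characters. The following is a minimal Python sketch of how such a listing could be joined into one string and sanity-checked. The helper names are hypothetical, and the GC figure is computed as a plain percentage of G and C bases, which is an assumption about how the record's rounded 62% value was derived.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def join_blocks(listing: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(listing.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First and last lines of the gene sequence listing, for illustration only.
first_line = join_blocks(
    "ATGCAAAAAG TCGCGATTGT GGGAAGGCCG AATGTCGGCA AGTCCAGTTT GTTCAATCGC"
)
print(len(first_line))                # 60
print(first_line.startswith("ATG"))   # True
print(looks_like_cds(join_blocks("GCGTAA")))  # True
```

Applied to the full gene listing, `join_blocks` should yield a string whose length equals the Gene Length field, and `looks_like_cds` should confirm that it ends in a stop codon (the listing ends in TAA).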