Gene Dgeo_0162 details


Gene Information

Locus tag: Dgeo_0162
Symbol:
ID: 4058408
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Deinococcus geothermalis DSM 11300
Kingdom: Bacteria
Replicon accession: NC_008025
Strand:
Start bp: 150443
End bp: 152131
Gene Length: 1689 bp
Protein Length: 562 aa
Translation table: 11
GC content: 67%
IMG OID: 641229159
Product: hypothetical protein
Protein accession: YP_603634
Protein GI: 94984270
COG category: [R] General function prediction only
COG ID: [COG1418] Predicted HD superfamily hydrolase
TIGRFAM ID: [TIGR00277] uncharacterized domain HDIG
            [TIGR03319] conserved hypothetical protein YmdA/YtgF


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.183423
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.57312
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATGT TGTACTTCGT GCTGGCGCTC CTGGTGGGGT TAGCAGGCGG GTTTTTCGTC 
GGACAGGCGC GCGGGCGGCA ACAAAGGGCC ACCCTTGATG ACCAGCTCCA GCGGGAAGCG
CGGGCCGAGG CGGAACGCAT CCGGACACAG GCGGACGCCG AGGCCCGGCA GCTGCGCGAG
CAGGCTGAGC AACGCCTGCA AGACGCAGCG CGGCGCCTGC AAGAAGCCGA CGACCGGGAA
CGCCAAGTCA CCCTTCAACT GGAAGCGCAA AGGGAGCAGC TTCAGGCCGT TCGCGCCCAG
ATCGAGGCCG AGCGGGCACG GGCCGCCCAG GACGCCGCGC GCGAACGCGA GACACTCAGC
GCTGACCGGC AGGAAACCCG GCGCGAACGT GAGGAACTCA AGCGCGAGAT CGAGCGCCTC
AACCGCCGGG CCGAGCAGCT CGACGCGCGG GGTGACAAGC TCGACGCCCT CGAGGAACGT
CTAGAAGGAC AACTCCACGC ACTGGCCCAG CAGGAGGCTG AACTGGCGGA GCGCAGCCGC
CAGGTGGACC TCAAGCTCTA CGAGGTTGCA GGCCTCACCC CCGAAGCTGC GCGCGAACAG
ATCCTCCGGC AGCTTGACGC CGAACTGGAG GAGGAAAAAG CCATCCGGGT CAAGGCGATG
ACCGAGCGGG CAACAGCAGA GGCCAGGCGT ACCGCCCGCA ACGTGATCGC ACAGGCCATT
CAGCGCAGTG CCAGCGAGAC CAGCAGCCAG ATGAGCGTGT CGGTAGTGCC CATTCCCAAT
GACGCCATGA AGGGCCGTCT GATTGGGCGC GAGGGGCGCA ATATCCGCGC GTTTGAGGCG
CTGACCGGCG TGGACCTGAT CATCGACGAC ACGCCCGAGG CGGTCATCTT GTCGAGCTTC
AACCCGGTGC GGCGTGAGGT GGCCCGCCAC GTGCTGGAAG CGCTGGTGGC CGATGGGCGC
ATCCACCCCA CCCGCATTGA GGAGATGGTT CACAAGGCCC AGGATGAGAT GAAGAGCTTC
ATCCACGCCC AGGGCGAGGA GGCGGCCATC GAGTCAGGCG TGGTGGGCCT CAAGCCGGGG
CTGGTGCAGT TGCTCGGAAG GATGTACTTC CGCTCCAGCT ATGGCCAGAA CGTGCTGAAG
CACTCCGTGC AGGTCGCGCA CCTCACCGGC ATCATGGCCG ATGAGCTGGG GCTGGACGCG
GCTCTCGCCC GCCGCGCTGG GCTGATGCAC GACATCGGCA AGAGCATCGA CCGCGAGATC
GAGGGCACCC ACGTCGAGAT CGGCATCAAC CTCGCCAAAC GCTTCGGGGA GCCGCCGGAA
GTGATCGATG CCATCGCGCA CCACCACGAC CCCGAGAACG GCGAGACGCT GTACTCGGTG
TTGGTGGCCG CCGCCGACGC GATCAGCGCC GCCCGGCCCG GAGCCCGCCG CGAGGAACTC
GAAGCCTATG TGCGGCGCCT GGAACAGCTC GAACAGATTG CCATTGCCTT TCCCGGTGTG
CAGCAGGCCT ACGCGATCCA GGCGGGCCGC GAGGTGCGCG TGCTGGTGCA ACCCGAGAAG
GTCACCGACG CCCAAGCCAC CCTGCTCGCC CGTGAGATCG CCGGACGCAT CGAGCAGGAC
ATGGAGTACC CCGGCCAGGT GCAGGTCACA GTGGTGCGCG AGAGCCGCGC CGTGGAGGTC
GCCCGGTAA
 
Protein sequence
MNMLYFVLAL LVGLAGGFFV GQARGRQQRA TLDDQLQREA RAEAERIRTQ ADAEARQLRE 
QAEQRLQDAA RRLQEADDRE RQVTLQLEAQ REQLQAVRAQ IEAERARAAQ DAARERETLS
ADRQETRRER EELKREIERL NRRAEQLDAR GDKLDALEER LEGQLHALAQ QEAELAERSR
QVDLKLYEVA GLTPEAAREQ ILRQLDAELE EEKAIRVKAM TERATAEARR TARNVIAQAI
QRSASETSSQ MSVSVVPIPN DAMKGRLIGR EGRNIRAFEA LTGVDLIIDD TPEAVILSSF
NPVRREVARH VLEALVADGR IHPTRIEEMV HKAQDEMKSF IHAQGEEAAI ESGVVGLKPG
LVQLLGRMYF RSSYGQNVLK HSVQVAHLTG IMADELGLDA ALARRAGLMH DIGKSIDREI
EGTHVEIGIN LAKRFGEPPE VIDAIAHHHD PENGETLYSV LVAAADAISA ARPGARREEL
EAYVRRLEQL EQIAIAFPGV QQAYAIQAGR EVRVLVQPEK VTDAQATLLA REIAGRIEQD
MEYPGQVQVT VVRESRAVEV AR
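
The fields in this record can be cross-checked against each other. The following is a minimal Python sketch, not part of the source database. The function names (strip_blocks, gene_length, protein_length, gc_percent) are hypothetical helpers introduced for illustration. It assumes the Start bp and End bp coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon, both of which are consistent with the values listed above.

```python
def strip_blocks(raw: str) -> str:
    """Remove the spaces and line breaks that lay the sequence out in 10-letter blocks."""
    return "".join(raw.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Gene length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = strip_blocks(raw)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # Coordinates and lengths as listed in the Gene Information section.
    length = gene_length(150443, 152131)
    print("gene length (bp):", length)
    print("protein length (aa):", protein_length(length))

    # First line of the gene sequence, copied from the Sequence section.
    first_line = "ATGAACATGT TGTACTTCGT GCTGGCGCTC CTGGTGGGGT TAGCAGGCGG GTTTTTCGTC"
    print("bases on first line:", len(strip_blocks(first_line)))
    print("GC% of first line:", round(gc_percent(first_line), 1))
```

Passing the full gene sequence to gc_percent would give a value to compare with the listed GC content of 67%. The sketch does not hard-code that result.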