Gene Dgeo_0041 details


Gene Information

Locus tag: Dgeo_0041
Symbol:
ID: 4057007
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Deinococcus geothermalis DSM 11300
Kingdom: Bacteria
Replicon accession: NC_008025
Strand:
Start bp: 37477
End bp: 38694
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 66%
IMG OID: 641229037
Product: alcohol dehydrogenase GroES-like protein
Protein accession: YP_603513
Protein GI: 94984149
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
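
The coordinate and length fields above are related by simple arithmetic. The following Python sketch checks that relationship, assuming that the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon (neither convention is stated in the record, but both are consistent with its numbers). The function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 37477
end_bp = 38694
gene_length_bp = 1218
protein_length_aa = 405


def span_length(start: int, end: int) -> int:
    """Length of a feature whose start and end are both inclusive (assumed)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid codons in an unspliced CDS, excluding the stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)
print(computed_length, computed_protein)  # 1218 405
```

Both computed values match the Gene Length and Protein Length fields of the record.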


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.950338
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGCGC TGGTGTGGCA GGGCATCAAC CGGGTGGGAG TGGAGCGCGT TCCCGATCCC 
ACGATTCTCC AACCCACCGA CGCCATCGTG CGCGTGACCG CGACCGCGAT CTGCGGCTCG
GACCTGCACC TGCTCGACGG CTACATCCCG AGCATGGTGA AGGGCGACAT CCTCGGGCAC
GAGTTCATGG GCGAGGTGGT GGAGGTCGGC TCCGCAGTCC GGCGCATCCG GGTCGGGGAC
CGGGTGATTG TGCCCTTTCC GATCGCCTGC GGCAAATGCT GGTACTGCCA GCACGGCCTG
ACCTCGCTGT GTGACAACTC CAACCCCAAC CCCAAGCTTG CGGAGACGCT GTGGGGCTAC
GCCGGCGCCG GCATCTACGG CTACTCGCAC ATCACAGGGG GATACGCGGG CGGCCAAGCG
CAGTTTGCCC GCACCGTCTA CGCCGACGCC AACCTCTATC CGGTGCCCGA GGGCCTGACC
GACGAGCAGG TGCTCTTCCT GACCGACATC CTCCCCACCG GCTACATGGC CGCCGAACAC
AGCAACATCC AGCCGGGCGA CGTGGTGACG GTATTTGGGG CAGGGCCAGT TGGCCTCTTC
ACGGTCATGA GCGCCTTCCT GCTGGGAGCA GGACGGGTGA TCTCGATTGA CCGCTTCGAC
GACCGCTTGA AGCTCGCGCG CCAGCTGGGC GCCGAGACGA TCAACTACGA GGCGGACAAT
GTCTTTGAGC GCCTGAAGGA ACTGACCGGC GGGCGTGGCC CCGACAGCGT AGTGGACGCG
GTGGGCATGG AGTCGCACGG CACCGGCCTA GGCGGCATCT ACGACGCCGT CAAGCAGACC
ACCCGCGTGC TGGAAACAGA GCGCCCCCAC GCCCTGCGCG CCGCGATCAT GGCCTGCCGC
AAGGGAGGCA CCGTCAGCGT GCCGGGAGTA TACGGCGGCC TAGCGGACAA GATCCCAGTG
GGCGCCTTGA TGAACAAGGG CCTTACCCTG CGCACCGGGC AAACCCACGT TCACCGCTAT
CTTGATACCC TGACCCAACA CATCCTGCGC GGCGACATCG ACCCCACCGT GATCATCACC
CACCGCCTGA GCCTGGACGA GGCGCCGCGG GGCTACCAGC TGTTCAAGCA CAAGCACGAC
GGCTGCATCA AGTGTGTGCT CGACCCCTGG GCCGATCCCA AGGAGCACGC GCCGACGTCG
CCTCAGCCGG AGACCTGA
 
Protein sequence
MKALVWQGIN RVGVERVPDP TILQPTDAIV RVTATAICGS DLHLLDGYIP SMVKGDILGH 
EFMGEVVEVG SAVRRIRVGD RVIVPFPIAC GKCWYCQHGL TSLCDNSNPN PKLAETLWGY
AGAGIYGYSH ITGGYAGGQA QFARTVYADA NLYPVPEGLT DEQVLFLTDI LPTGYMAAEH
SNIQPGDVVT VFGAGPVGLF TVMSAFLLGA GRVISIDRFD DRLKLARQLG AETINYEADN
VFERLKELTG GRGPDSVVDA VGMESHGTGL GGIYDAVKQT TRVLETERPH ALRAAIMACR
KGGTVSVPGV YGGLADKIPV GALMNKGLTL RTGQTHVHRY LDTLTQHILR GDIDPTVIIT
HRLSLDEAPR GYQLFKHKHD GCIKCVLDPW ADPKEHAPTS PQPET
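
The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The Python sketch below illustrates this on the first and last blocks of the gene sequence. It assumes the gene sequence is shown in coding orientation (it begins with ATG) and uses the standard codon-to-amino-acid assignments, which table 11 shares with the standard code for internal codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of any database tool.

```python
# Standard codon assignments in NCBI order (first base T, C, A, G; then
# second base; then third base). Table 11 uses the same amino acids for
# internal codons; its alternative start codons are not handled here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


first_block = "ATGAAGGCGC TGGTGTGGCA GGGCATCAAC"  # start of the gene sequence
last_block = "CCTCAGCCGG AGACCTGA"                # end of the gene sequence
print(translate(first_block))  # MKALVWQGIN
print(translate(last_block))   # PQPET
```

The two outputs match the first ten and last five residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field.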