Gene Dgeo_0449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDgeo_0449 
Symboltdh 
ID4059162 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDeinococcus geothermalis DSM 11300 
KingdomBacteria 
Replicon accessionNC_008025 
Strand
Start bp461925 
End bp463010 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content64% 
IMG OID641229461 
ProductL-threonine 3-dehydrogenase 
Protein accessionYP_603921 
Protein GI94984557 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR00692] L-threonine 3-dehydrogenase
[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.333655 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCCC CTTCTCGCCT TCCCAACATG CCCACTGTGC CCACCACCAT GAAGGCCCTC 
AGCAAACAGG AAGCTCGCCC GGGCCTTTGG ATGATCGAAA CCGAGGTGCC CACGCCCGGC
CCCAATGATC TGCTGATTCG CGTGAAGAAC AGCTCCATCT GCGGCACGGA TGTCCATATC
TACAGGTGGG ACGAGTGGGC GCAGAAGACC ATTCCTGTTC CGATGGTGGT CGGCCACGAG
TATGTGGGTG TGGTGGCCGG GATGGGCAGT GAGGTGCGCG GCTTTGAGAT CGGTGACCGC
GTGAGCGGAG AGGGCCACGT CACCTGCGGC CACTGCCGCA ACTGCCGCGC AGGCCGCCGC
CACCTCTGCC GCAACACGCT GGGGGTGGGT GTCAACCGGC CCGGCTCCTT TGCGGAATAT
CTCGTGTTGC CCGCCTTCAA CGCCTTTAAG ATTCCCGACG ACATCCCCGA CGAGATCGCC
GCGATCTTTG ATCCCTTTGG CAACGCGGTC CACACCGCCC TCAGCTTTGA TCTGGTGGGC
GAAGATGTGC TGATCACCGG CGCAGGGCCA ATCGGTGTGA TGGCCGCCGC TGTCGCGCGG
CATGTGGGCG CGCGCAACGT GGTCGTGACC GACGTGAACG ACTACCGCCT GGACCTCGCC
CGCCGGATGG GCGCCACCCG CGCCGTGAAT GTCGCACGTG AGGACCTCTG GGAAGTCGCC
CGCTCGGAGC TGGGCATGAC GGAGGGCTTT GACGTGGGGA TGGAGATGAG CGGGAGCGGT
GCCGCCTTCG CGCAGATGAT CCGCGTGATG AACAACGGCG GCAAAATCGC CATCCTGGGC
ATTCCCTCCG GGCACGTGGA CATCGACTGG AACGACGTGA TCTTCAAGGG TCTGACCCTC
AAGGGCATCT ATGGCCGCGA GATGTTCGAG ACGTGGTACA AGATGACGGC CCTGATCCAG
TCGGGCCTGG ACCTCACGCC CGTGCTGACG CACCGTTTCG GCATCGACGA CTACCAGAAG
GGCTTCGACG CGATGCTGGG CGGGCAGAGC GGCAAGGTGA TTCTGGACTG GGAGGCGGGG
GTCTAG
 
Protein sequence
MTAPSRLPNM PTVPTTMKAL SKQEARPGLW MIETEVPTPG PNDLLIRVKN SSICGTDVHI 
YRWDEWAQKT IPVPMVVGHE YVGVVAGMGS EVRGFEIGDR VSGEGHVTCG HCRNCRAGRR
HLCRNTLGVG VNRPGSFAEY LVLPAFNAFK IPDDIPDEIA AIFDPFGNAV HTALSFDLVG
EDVLITGAGP IGVMAAAVAR HVGARNVVVT DVNDYRLDLA RRMGATRAVN VAREDLWEVA
RSELGMTEGF DVGMEMSGSG AAFAQMIRVM NNGGKIAILG IPSGHVDIDW NDVIFKGLTL
KGIYGREMFE TWYKMTALIQ SGLDLTPVLT HRFGIDDYQK GFDAMLGGQS GKVILDWEAG
V