Gene GM21_0049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0049 
Symboltdh 
ID8135348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp60331 
End bp61374 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content64% 
IMG OID644867666 
ProductL-threonine 3-dehydrogenase 
Protein accessionYP_003019894 
Protein GI253698705 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR00692] L-threonine 3-dehydrogenase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.000000160501 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCGAAGA CCATGCAGGC GCTGGTTAAG AAATACCCGA AGCCCGGGCT GTGGCTCGAC 
GAAGTCCCCG TCCCGGAGGT TGGGATCAAC GACGTGCTGA TCAAGGTCCA CAAGACCGCG
GTCTGCGGCA CCGATCTGCA CATCTGGGAC TGGAACGACT GGGCCCGTAA AACCATTCCG
GTCCCCATGG TGGTGGGCCA CGAGTTCGTG GGACGGGTGG CCGCCATGGG AAGCAACGTC
GCCGACCTGA ACATCGGGGA CATCGTCTCC GGCGAGGGGC ACATCGTCTG CGGCAGGTGC
CGCAACTGCC TGGCCGGCAG GCGCCACCTC TGCAAGGACA CCAACGGGGT GGGGGTCAAC
CGCGCCGGCG CTTTCGCCGA GTACATCTGC ATCCCGGTCA CCAACGTCTG GCACGCCGAC
CCCACCATCC CCATGGAAAT CCTGGGGATC TTCGATCCCT TCGGCAACGC GACCCACACC
ACCCTCGCCT TCCCCATCCT GGGGGAGGAC GTACTCATCA CCGGCGCCGG CCCGATCGGC
ATCATGGCGA CGGCCATAGC CCGCCACGCC GGGGCGCGCT ACATCGTGGT GACCGACCTG
AACCAGTACC GGCTCGACCT GGCGAAGAAG ATGGGGGCGA CGGTGGCCTT GAACGTCAGG
GAGGGGACCC TGGCACAGGT GCGGCAGCAG CTGGGGATGA AGGAGGGGTT CGACGTGGGG
CTGGAGATGT CGGGAAACGG CGACGCCTTC AAGGAGATGC TGTCCAACAT GTGCCACGGC
GGCAAGATCG CCATGCTGGG GCTCCCTTCT GCGGATATCT CCATCGACTG GAACCAGGTG
ATCTTCAACA TGCTGACCAT CAAGGGGATC TACGGCCGGG AGATGTACGA GACCTGGTAC
CTGATGCAGT CCCTGATCAA GATCGGGCTG GATCTCTCGC CGGTCATCAC GCACCGGATG
CACTACACGC AGTTCGAGGA GGCGTTCCGG GTGATGAGCA CCGGCAACGC GGGGAAGGTG
ATGCTCAACT GGGTCGAGGA GTGA
 
Protein sequence
MPKTMQALVK KYPKPGLWLD EVPVPEVGIN DVLIKVHKTA VCGTDLHIWD WNDWARKTIP 
VPMVVGHEFV GRVAAMGSNV ADLNIGDIVS GEGHIVCGRC RNCLAGRRHL CKDTNGVGVN
RAGAFAEYIC IPVTNVWHAD PTIPMEILGI FDPFGNATHT TLAFPILGED VLITGAGPIG
IMATAIARHA GARYIVVTDL NQYRLDLAKK MGATVALNVR EGTLAQVRQQ LGMKEGFDVG
LEMSGNGDAF KEMLSNMCHG GKIAMLGLPS ADISIDWNQV IFNMLTIKGI YGREMYETWY
LMQSLIKIGL DLSPVITHRM HYTQFEEAFR VMSTGNAGKV MLNWVEE