Gene GM21_0122 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0122 
Symbol 
ID8135425 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp154253 
End bp155458 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content64% 
IMG OID644867742 
ProductAlcohol dehydrogenase GroES domain protein 
Protein accessionYP_003019966 
Protein GI253698777 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value0.566245 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGGCAG TATGCTGGCA TGGCAAGCAG GACGTTCGGG TGGACACCGT GCCCGACCCG 
GAGATCGTAC AGAAGGGGGA CGTCATCGTT AAAGTCGCCC TCACCTGCAT CTGCGGTTCC
GACCTGCACC TTTACAACGG CTACGTCCCC ACCATGAAAA AGGGGGACAT CCTGGGGCAC
GAGTTCGTCG GCGAGATCGT CGCGGCCGGC CCCGGCGTCT CCCGCTTCAG GGTCGGCGAC
CGCGTCATCG TTCCGTTTCC GATCAGTTGC GGCGCGTGCT GGTACTGCAA GCACGAGCTC
TGGTCGCTTT GCGACAACAC CAACCCGAAT TCCTGGATGA TGGAAAACAT CTACGGCGAC
ACCGGCGGCG GGATCTTCGG CTACTCCCAT CTCTACGGAG GATATGCCGG CGGTCAGGCC
GAGTACGTCC GGGTACCTTT CGCCGACGTG GGGCTAGAGA AGATACCGGA CGGGATACCG
TACGAGCAGG TGGTGCTCTT AACCGACATC ATGCCCACCG GCTACCAGGC CGCAGTCTAC
TGCAACATCA ACCCGGGCGA TACCGTCGCC GTCTGGGGGT GCGGGCCGGT GGGGCTCCTG
GCCATGAAGT CGGCCAAGCT TCTGGGGGCC GAGCGGGTGA TCGGCATCGA CCGTTTCCCC
GACCGGCTGC AGATGGCGCA CAGCCAGTGC CAGGCCGAGG TTATCAACTA CGAGGAGGTG
GACGTGGCCG AGCAGCTGCA GAACATGACC GGCGGGCGCG GCCCCGATTC CTGCATCGAC
GCGGTGGGGC TTGAGGCCCG CGGGACCGGC ATCGAGGACG TCTACGACCT GGTGAAGCAG
ACGCTGCGCC TGGAAACCGA CCGTGCTTCC GCGCTGCGCC AGCTGGTGAG GGCGTGCCGC
AAGGGGGGGA CCCTGTCCAT CTCGGGGGTC TACAGCGGGT TCATCGACAA GTTCCCCATG
GGGGCCATCT TCGCCAAAGG GCTCACCGTG CGCGGAGGGC AGGCCCACGT GCACAAGTAC
CTCCCCCACC TGGTGAAGCT GGTCGCGGAG CAGCAGATCG ATCCCTCCTG CATCATCACG
CATTGGATCT CGCTGGAGGA GGCGCCTGCC GGCTACCGCA CCTTCCTGAA GAAGCAGGAT
TCCTGCATCA AGATCGCCCT CAAACCGGAG CACGCCGCCC CGAAAAGCGA ACCCGCCTCA
GCATGA
 
Protein sequence
MRAVCWHGKQ DVRVDTVPDP EIVQKGDVIV KVALTCICGS DLHLYNGYVP TMKKGDILGH 
EFVGEIVAAG PGVSRFRVGD RVIVPFPISC GACWYCKHEL WSLCDNTNPN SWMMENIYGD
TGGGIFGYSH LYGGYAGGQA EYVRVPFADV GLEKIPDGIP YEQVVLLTDI MPTGYQAAVY
CNINPGDTVA VWGCGPVGLL AMKSAKLLGA ERVIGIDRFP DRLQMAHSQC QAEVINYEEV
DVAEQLQNMT GGRGPDSCID AVGLEARGTG IEDVYDLVKQ TLRLETDRAS ALRQLVRACR
KGGTLSISGV YSGFIDKFPM GAIFAKGLTV RGGQAHVHKY LPHLVKLVAE QQIDPSCIIT
HWISLEEAPA GYRTFLKKQD SCIKIALKPE HAAPKSEPAS A