Gene GM21_0400 details

Gene Information

Locus tag: GM21_0400
Symbol:
ID: 8135708
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 475395
End bp: 476453
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 62%
IMG OID: 644868018
Product: Alcohol dehydrogenase zinc-binding domain protein
Protein accession: YP_003020239
Protein GI: 253699050
COG category: [R] General function prediction only
COG ID: [COG1064] Zn-dependent alcohol dehydrogenases
TIGRFAM ID:
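
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 475395
end_bp = 476453
protein_length_aa = 352

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus one stop codon.
expected_cds_bp = (protein_length_aa + 1) * 3

print(gene_length_bp)   # 1059
print(expected_cds_bp)  # 1059
```

Both values agree with the reported gene length of 1059 bp.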


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 73
Fosmid unclonability p-value: 0.654367
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCAACG CCAAAGCCTA CTCCGCTGCC AGCGCCACTT CGCCGCTGGC TTCGACCACC 
ATCCCCCGCC GCGAACCGAC CGAGCGCGAC GTGCAGATCG AGATCCTTTT TTGCGGCATC
TGCCACTCCG ACCTGCACTC CGTGCGTAAC GAGTGGAGCA GCGTCATGCC GACGATCTAC
CCCATTGTTC CCGGCCACGA AATAGTCGGA CGTGTAACAA AGGTCGGATC CGCGGTCACC
AATTTCAAAC CGGGCGACCT GGCGGGGGTC GGCTGCCTGG TCGATTCGGA CCAAAGCTGC
CCCCATTGCC ACGATGATCT TGAGCAGTTA TGCCCGAACC AGACCCTCAC CTTCAACTCG
CCCGACAAAC ACCTCGGGGG CGTCACCTAC GGCGGCTACT CCGAGAGCAT CGTGGTGGAC
GAACACTTCG TACTGCACGT TCCGGAGAAC CTGGAACTCG CCGGTGTCGC GCCCTTGCTC
TGCGCGGGGA TCACTACCTA CTCCCCGATA CACCGCTGGG GCGACATCAA GGGCAAAAAG
GTCGGCATCA TCGGCCTGGG CGGCCTGGGT CACATGGGGG TCAAGTTCGC CCGCGCCTTC
GGAGCCCGGG TCGTCGTCTT CACCACCTCG CCCGGAAAGA GAGAGGATGC GCTGCGTCTG
GGGGCGGACG AAGTCATCGT TTCCACCAAC GCCCAAGAGA TGCTGCTGCA CGCCGGGAGT
TTCGATTTCA TCCTCGACAC CATCGCCGCC GATCACGACA TCAACGCATA CCTGAACATG
CTCGCCCACG ACGGCAACCT CACCCTGGTA GGTGCGCCGG AGAAGCCTCT CGCCGTCTCC
GCCTTCGCCC TTCTCTTCGG TCGCCGCAGC CTCTCCGGCT CCATCATCGG CGGCATCAAG
GAGACCCAGG AGATGCTCGA TTTCTGCGGC GCGCACAACA TCACCGCCGA CGTGGAGGTC
ATCCCCATTC AAAAAGTAAA CGAGGCCTAC GAGCGGCTGC TCAAGTCCGA TGTGAAGTAC
CGCTTCTCCA TCGACATGGC TTCGCTCAAA GCCGAATAA
 
Protein sequence
MPNAKAYSAA SATSPLASTT IPRREPTERD VQIEILFCGI CHSDLHSVRN EWSSVMPTIY 
PIVPGHEIVG RVTKVGSAVT NFKPGDLAGV GCLVDSDQSC PHCHDDLEQL CPNQTLTFNS
PDKHLGGVTY GGYSESIVVD EHFVLHVPEN LELAGVAPLL CAGITTYSPI HRWGDIKGKK
VGIIGLGGLG HMGVKFARAF GARVVVFTTS PGKREDALRL GADEVIVSTN AQEMLLHAGS
FDFILDTIAA DHDINAYLNM LAHDGNLTLV GAPEKPLAVS AFALLFGRRS LSGSIIGGIK
ETQEMLDFCG AHNITADVEV IPIQKVNEAY ERLLKSDVKY RFSIDMASLK AE
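
The gene and protein sequences above can be compared with a short script. This is a sketch only: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table is the standard elongation table written out by hand, which translation table 11 shares with table 1 (alternative start codons are not handled). Only the first 30 bases and the last 9 bases of the gene sequence are used here; pasting in the full sequence works the same way.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
# Assumption: table 11 assigns the same amino acids as table 1 for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and uppercase the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons in frame 1; stop codons are shown as '*'."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks and the final 9 bases of the gene sequence above.
first_blocks = "ATGCCCAACG CCAAAGCCTA CTCCGCTGCC"
last_bases = "GCCGAATAA"

print(translate(first_blocks))  # MPNAKAYSAA
print(translate(last_bases))    # AE*
print(gc_fraction(first_blocks))
```

The translated fragments match the start (`MPNAKAYSAA`) and the end (`AE`, followed by the stop codon) of the protein sequence listed above.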