Gene GM21_0121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0121 
Symbol 
ID8135424 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp153355 
End bp154215 
Gene Length861 bp 
Protein Length286 aa 
Translation table11 
GC content64% 
IMG OID644867741 
Productshort-chain dehydrogenase/reductase SDR 
Protein accessionYP_003019965 
Protein GI253698776 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones79 
Fosmid unclonability p-value0.984934 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGACCG AGGAAGGAAA ACAGTTCCCG CCGCAGCGTC AGGCGCAGCC GGGAAAAGAA 
GCAGAGATGA CGCCGAGGCC CAAAAGCGGC GAGTTCGAAT ACCGGGGGGC CGGGAAGCTG
CAGGGGAAAA CGGCCCTCAT CACCGGCGGC GACAGCGGCA TCGGGCGTGC CGTCGCCATC
GCCTTCGCCC GCGAGGGGGC GAACGTCGCT TTCGGATACC TGGAGGAAGA CCAGGACGCG
AAAGAGACCC GGGACATCGT GGAGCGGGAG GGGGGGCGCT GCCTCGCCTT CCGCGGCGAC
GTGGGTCAGG AGCAGTTCTG CCTCGACATT GTCAAAAAGA CGTTGGAGGC ATTCGGCCGG
CTGGACATAG TGGTGAACAA CGCGGCCGAG CAGCATTACC GCGAGGGCAT CGAAGAGATC
TCCTCGGAGC AGTTGGAGCG GACCTTCAGG ACCAACATCT TTTCCTATTT CTATCTGGTT
AAGGCCGCGC TCAAGCACCT GCAAGAGGGA TCCCGGATCA TCAACACCAC CTCGGTCACC
GCCTACAAGG GAAACCCCAA CCTCCTCGAT TACTCCTCCA CCAAGGGGGC CATCGTCGCC
TTCACCCGCT CCCTCGCGCT GTCGCTCGCC GACAAGGGGA TCCTGGTGAA CGCCGTCGCC
CCCGGTCCCA TCTGGACCCC GCTCATCCCC GGAACCTTCC CGGAGGAAAA GACGGAGCAG
TTCGGCGAGA ACGTGCTTTT GAAGCGGGCG GGACAGCCGG TGGAAGTGGC CCACAGCTAC
GTCTTCCTCG CCTCCGAAGG AGGCTCCTAC ATGACCGGGC AGGTGCTGCA CCCAAACGGC
GGAACAATCG TCGGGGGTTA G
 
Protein sequence
MPTEEGKQFP PQRQAQPGKE AEMTPRPKSG EFEYRGAGKL QGKTALITGG DSGIGRAVAI 
AFAREGANVA FGYLEEDQDA KETRDIVERE GGRCLAFRGD VGQEQFCLDI VKKTLEAFGR
LDIVVNNAAE QHYREGIEEI SSEQLERTFR TNIFSYFYLV KAALKHLQEG SRIINTTSVT
AYKGNPNLLD YSSTKGAIVA FTRSLALSLA DKGILVNAVA PGPIWTPLIP GTFPEEKTEQ
FGENVLLKRA GQPVEVAHSY VFLASEGGSY MTGQVLHPNG GTIVGG