Gene GM21_3403 details


Gene Information

Locus tag: GM21_3403
Symbol:
ID: 8138770
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3934558
End bp: 3935568
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 61%
IMG OID: 644871020
Product: NAD-dependent epimerase/dehydratase
Protein accession: YP_003023185
Protein GI: 253701996
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0451] Nucleoside-diphosphate-sugar epimerases
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 137
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAAGA TACTGGTTAC CGGCGCTGCA GGATTCATCG GATTTCATCT CTCGGAAAAG 
CTCCTCGCCA AGGGGTGCGA GGTGGTCGGC CTGGACAACT TGAACGACTA TTACGAGGTC
GCCCTCAAGG AGGGGAGGCT CTCCCGGCTG GAGGGAAAGC CGGGCTTTCG TTTCGCGCGC
ATGAACCTGG AGGACCGCGA GGGGATCAAG GAACTCTTCG CCGCCGAGAA GTTCGACTCC
GTGGTGAACC TGGCCGCGCA AGCCGGGGTC CGCTACTCGA TCGAAAACCC TTACGTCTAC
ATCGACAGCA ACCTCTCCGG TTTCATCAAC ATCCTGGAGG GGTGCCGCCA CAACAAGGTG
GGACACCTGG TCTACGCCTC CTCATCCTCG GTATACGGCG CCAACACCAC CATGCCTTTT
TCGGTGCACC ACAACGTGGA CCATCCCGTC TCGCTCTACG CCGCCACCAA GAAGGCCAAC
GAGCTGATGG CGCACACCTA TTCCAGCCTC TACGGGCTCC CCACCACGGG GCTGCGCTTT
TTCACCGTAT ATGGGCCTTG GGGGCGCCCC GACATGGCGC TCTTTCTCTT CACCAAGGCG
ATCCTAGAGG GGAAACCGAT CGACGTCTTC AACTACGGGA AGATGCAGCG CGACTTCACC
TTCATCGACG ACATCGTGGA AGGTGTCGCC CGCGTGATCG ACAGCGTCCC CGCAGGCGAC
CCCGGCTGGA GCGGCGCGAA CCCCGATCCG GGAACGAGCT ATGCCCCTTA CAAGATCTAC
AACATCGGCA ACAACAACCC GGTGGAGCTT ATGCGCTTCA TCGAGGTGCT GGAAAAGGCG
CTGGGGAAAG AGGCGCAGAA GAACCTGCTC CCGATTCAGG CCGGCGACGT CCCGGCGACC
TACGCCGACG TCGACGACCT GATGCGGGAC GTCGGCTTCA AGCCGGCCAC CTCCATCGAG
GACGGGATCG CGCGCTTCGT CGCCTGGTAC CGCGATTTCT ACAAGGTTTG A
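
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before its length or base composition can be checked against the Gene Length and GC content fields. The following is a minimal sketch of that check, not part of the record itself: the helper names `clean_sequence` and `gc_content` are hypothetical, and only the first line of the sequence is used as a short excerpt.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, used here only as an illustrative excerpt.
excerpt = clean_sequence(
    "ATGGCAAAGA TACTGGTTAC CGGCGCTGCA GGATTCATCG GATTTCATCT CTCGGAAAAG"
)

print(len(excerpt))                   # number of bases in the excerpt
print(round(gc_content(excerpt), 3))  # GC fraction of the excerpt only
```

Passing the full gene sequence through the same two helpers would be expected to reproduce the 1011 bp length and roughly the 61% GC content listed in the record; the excerpt alone covers only 60 bases, so its GC fraction differs from the whole-gene value.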
 
Protein sequence
MAKILVTGAA GFIGFHLSEK LLAKGCEVVG LDNLNDYYEV ALKEGRLSRL EGKPGFRFAR 
MNLEDREGIK ELFAAEKFDS VVNLAAQAGV RYSIENPYVY IDSNLSGFIN ILEGCRHNKV
GHLVYASSSS VYGANTTMPF SVHHNVDHPV SLYAATKKAN ELMAHTYSSL YGLPTTGLRF
FTVYGPWGRP DMALFLFTKA ILEGKPIDVF NYGKMQRDFT FIDDIVEGVA RVIDSVPAGD
PGWSGANPDP GTSYAPYKIY NIGNNNPVEL MRFIEVLEKA LGKEAQKNLL PIQAGDVPAT
YADVDDLMRD VGFKPATSIE DGIARFVAWY RDFYKV
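
The protein sequence follows from the gene sequence by codon-wise translation under translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code. The sketch below illustrates this on the first and last codons of the gene only; `CODON_TABLE` and `translate` are hypothetical names chosen for the example, and the code is an illustration rather than the method used to produce this record.

```python
from itertools import product

# Standard codon-to-amino-acid assignments, which translation table 11 shares.
# Codons are enumerated in TCAG order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases and last 12 bases of the gene sequence in this record.
first_codons = "ATGGCAAAGA TACTGGTTAC CGGCGCTGCA GGATTCATCG GATTTCATCT CTCGGAAAAG"
last_codons = "TACAAGGTTTGA"

print(translate(first_codons))  # compare with the start of the protein sequence
print(translate(last_codons))   # compare with the end of the protein sequence
print((1011 - 3) // 3)          # coding codons once the stop codon is excluded
```

The final line mirrors the arithmetic linking the two length fields: 1011 bp is 337 codons, and removing the terminal TGA stop codon leaves the 336 residues reported as Protein Length.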