Gene GM21_3511 details


Gene Information

Locus tag: GM21_3511
Symbol:
ID: 8138883
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4052888
End bp: 4053898
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 55%
IMG OID: 644871130
Product: UDP-glucose 4-epimerase
Protein accession: YP_003023290
Protein GI: 253702101
COG category: [G] Carbohydrate transport and metabolism; [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1086] Predicted nucleoside-diphosphate sugar epimerases
TIGRFAM ID:
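
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length includes the stop codon. The record does not state either convention, and the function names are hypothetical.

```python
# Values copied from the record; the coordinate convention is an assumption.
START_BP = 4052888
END_BP = 4053898


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints: 1011 336
```

Under these assumptions the computed values agree with the Gene Length (1011 bp) and Protein Length (336 aa) fields.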


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.000000000024389
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTCAAGA ACAAAACCCT ACTGATAACG GGCGGTACAG GATCGTTCGG AAACGCGGTG 
CTGCAACGTT TTCTCAATAC CGACATCGGC GAGATCCGCA TCCTGAGCCG GGACGAGAAG
AAGCAGGAAG ATATGCGCAT CAGTCTCAAC AACCCGAAGG TGAAGTTCTA CATAGGGGAC
GTGCGCACCT ATGACAGCAT CGACTTTGCC ATGAAGGGAG TCGATCTTGT CTTTCACGCG
GCAGCGTTGA AGCAGGTCCC TTCGTGTGAG TTCTACCCGA TGGAAGCGGT GCGGACCAAC
ATCCTCGGCG CGGAGAACGT ACTCAATGCG GCTTACGCCA ACAAGGTGAA AAAGGTCATT
GTCCTCAGTA CCGACAAGGC GGTGTATCCC ATCAACGCCA TGGGGCTGTC GAAGGCCATG
ATGGAAAAGC TGATGGTGGC GAAAGCGCGC ATGATGTCGG CTGGCGACAC CATTTTCTGC
GCCACCCGCT ACGGCAATGT CATGGCTTCG CGCGGCTCGG TCATCCCCCT TTTCGTGAAG
CAGATCAAGG AGGGTAAACC TCTCACCATA ACGGACCCGA ACATGACGCG GTTCCTGATG
TCGTTGGAGG AGTCGGTGGA CCTGGTCCTT TACGCCTTCC AGCATGCGAG TTCCGGAGAC
ATCTTCATTC AGAAAGCACC GGCTTCGACC ATTCTCGACC TCGCCGTCGC AGTGAAAGAG
GTCTTCCAAG CGAAGAACGA GATCAAGGTT ATTGGAACGA GGCACGGGGA GAAGCTGTAT
GAGTCGCTGG TGAACCGGGA AGAGATGGCA AGAAGCATCG ATCTCGGCGG GCACTACAGG
ATTCCGGCGG ACAACCGGGA TCTCAACTAC AACAAGTTCT TCGTGGAAGG GCAGGTCGAG
ATCGCGGAGA TCGACGACTA TACCTCGCAC AATACGCAGA GACTGACCGT TCCCGAGGTG
AAGGAACTTT TGCTGACCCT CCCGTTCATC CAGGAGGAGC TCAATGCTTA A
 
Protein sequence
MFKNKTLLIT GGTGSFGNAV LQRFLNTDIG EIRILSRDEK KQEDMRISLN NPKVKFYIGD 
VRTYDSIDFA MKGVDLVFHA AALKQVPSCE FYPMEAVRTN ILGAENVLNA AYANKVKKVI
VLSTDKAVYP INAMGLSKAM MEKLMVAKAR MMSAGDTIFC ATRYGNVMAS RGSVIPLFVK
QIKEGKPLTI TDPNMTRFLM SLEESVDLVL YAFQHASSGD IFIQKAPAST ILDLAVAVKE
VFQAKNEIKV IGTRHGEKLY ESLVNREEMA RSIDLGGHYR IPADNRDLNY NKFFVEGQVE
IAEIDDYTSH NTQRLTVPEV KELLLTLPFI QEELNA
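
The gene and protein sequences are printed in space-separated blocks of ten characters. As a rough sketch of how they relate, the Python below strips the block spacing, computes a GC fraction, and translates DNA codon by codon. It uses the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the sketch does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty DNA sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First and last 30-odd bases of the gene sequence shown above.
    print(translate("ATGTTCAAGA ACAAAACCCT ACTGATAACG"))  # prints: MFKNKTLLIT
    print(translate("CAGGAGGAGC TCAATGCTTA A"))           # prints: QEELNA
```

The two printed fragments match the first ten and last six residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would allow a comparison with the GC content field.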