Gene GM21_2538 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2538 
Symbol 
ID8137880 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2966644 
End bp2967765 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content66% 
IMG OID644870147 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_003022337 
Protein GI253701148 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.00000014544 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAGGGA GCGTTCTAAT CACAGGGGGA GCCGGATTCA TTGGATCTCA CCTGGCAGAT 
GAGCTTCTTC GTCACGGTTA CCGCGTCCGC GTTCTGGACA GTCTGGTGCC GCAGGTTCAT
GGACCGGATG GGAACCGACC CGGCTACCTG GATCCGGAGG TCGAACTGAT CAAGGGGGAC
GTGCGGGACC AGGCCGCCGT GCAGCGGGCT TTGGCCGGGA CCGAGGCGGT GTTCCACCTG
GCCGCCATGG TTGGCGTGGG GCAGAGCATG TACGAGATCG AGCAGTACAC CTCGGTCAAC
AACTGCGGCA CCGCCGCGCT CCTGGAGGCG ATGGCTCACG CCAAGGGGCA TCGCAAGCTG
ATCGTCGCCT CCAGCATGAG CATCTACGGC GAGGGGCGCT ACCGCGACGC CGACGGGCTT
TGCTACGACG ACGTCAGGCG CCCCCTGGAG CAGTTGCAGC GTGGGCGCTG GGAGCCGTTC
AACCGCAGGA GCGAGCCGCT GCGCCCGGTG GCGACGCCCG AGGACAAGTC GCCGTCGCTT
GCCTCGGTCT ACGCCCTTTC CAAGTACGAC CAGGAGCGTA TGGCGCTCAT CGTCGGGGAG
TGTTACCGCA TCCCGGTGAT CGCGCTGCGC TTTTTCAACG TCTACGGCAC CAGGCAGGCG
CTCTCCAACC CCTACACCGG GGTGCTCGCC ATCTTCGCCT CCAGGCTCAT GAACGGCAAC
CCGCCTCGCA TCTACGAGGA CGGGTTGCAG CAGCGCGACT TCGTCAGCGT GCACGACGTG
GTGACCGCCT GCCGCCTGGC GCTTCAGGTG GACCAGCGGC AGGCGCAACT CTTCAACATC
GGCAGCGGCG CCAACATCAG CGTCCTCGAG GTGCTGCAGC GCTTCCGCCG TGTGCTCAAC
TGCGACGGTA TCGAGCCGGA GATCACCGGC AACTACCGGG CCGGCGACAT CAGGCACTGC
TTCGCCGACA TAAGCTCCGC CCGCTCCATC CTGGGATACG CCCCGAGGGT CTCCTTCGAC
GAGGGGCTTG CCGAGCTGGC CGGCTGGCTG GAAGGGGAGG TCGCCATAGA CCGCGTCTCC
GAGGCGCATG CCGAACTCAC CCAGCGGGGG TTGACGCTAT GA
 
Protein sequence
MAGSVLITGG AGFIGSHLAD ELLRHGYRVR VLDSLVPQVH GPDGNRPGYL DPEVELIKGD 
VRDQAAVQRA LAGTEAVFHL AAMVGVGQSM YEIEQYTSVN NCGTAALLEA MAHAKGHRKL
IVASSMSIYG EGRYRDADGL CYDDVRRPLE QLQRGRWEPF NRRSEPLRPV ATPEDKSPSL
ASVYALSKYD QERMALIVGE CYRIPVIALR FFNVYGTRQA LSNPYTGVLA IFASRLMNGN
PPRIYEDGLQ QRDFVSVHDV VTACRLALQV DQRQAQLFNI GSGANISVLE VLQRFRRVLN
CDGIEPEITG NYRAGDIRHC FADISSARSI LGYAPRVSFD EGLAELAGWL EGEVAIDRVS
EAHAELTQRG LTL