Gene GM21_1494 details

Gene Information

Locus tag: GM21_1494
Symbol:
ID: 8136823
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1746369
End bp: 1748030
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 65%
IMG OID: 644869106
Product: dihydroxy-acid dehydratase
Protein accession: YP_003021308
Protein GI: 253700119
COG category: [E] Amino acid transport and metabolism
              [G] Carbohydrate transport and metabolism
COG ID: [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase
TIGRFAM ID: [TIGR00110] dihydroxy-acid dehydratase
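
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed gene length) and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1746369, 1748030)
print(length)                     # 1662
print(protein_length_aa(length))  # 553
```

Both results agree with the Gene Length and Protein Length fields in this record.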


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value: 0.201704
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTAGCG ACACTATCAC CCAAGGTTTG GAACGGACTC CGCACCGCGC GCTCTTGAAA 
GGGACCGGGC TTCCGCAAAG CGAGATGGGG AAGCCGTTCA TCGGCATCGC TACCAGCTTC
ACCGATCTTA TTCCGGGGCA CGTCGGCATG CGCGACCTGG AGCGTTTCAT CGAGAAGGGG
GTCCACACCG GCGGCGGTTA TTCCTTCTTC TTCGGGATTC CCGGGGTCTG CGACGGCATC
TCCATGGGGC ACAAGGGGAT GCACTACTCG CTCCCCACCC GCGAGTTGAT CGCCGACATG
GTTGAGTCGG TCGCCGAGGC GCATCGCCTG GACGGCCTCG TGCTCTTGAC CAACTGCGAC
AAGATCACCC CGGGCATGCT CATGGCTGCC GCGAGGCTCG ACATCCCCTG CATCGTGGTC
ACCGCCGGTC CCATGATGAG CGGCCGCGGC GACGCAGGCC GGAAGTACTC CTTCGTCACC
GACACCTTCG AGGCCATGGC GCGCTACAAG GCGGGGGTCA TCGACGACGC GGAGCTTGCG
CGCTGCGAGG AGAACGCCTG CCCGGGCATG GGTTCCTGCC AGGGGCTCTT CACCGCCAAC
ACCATGGCCA TACTCACCGA GACCCTCGGC ATGAGCCTGC CGCGCTGCGG CACGGCACTC
GCCGTCTCCG CGCTCAAGCG CCGCATCGCC TTCGCCTCGG GCGAGCGCAT CGTGGACCTG
GTGCGCCAGA ACATCACCCC GCGCTCCATA ATGACCCGCG AGGCGTTCGA GAACGCCATA
AGGGTCGACC TGGCCTTGGG CGGCTCTTCC AACACGGTGC TGCACCTTCT CGCCATCGCC
CACGAGGCAG GGGTCGAGCT TCCCCTTGAG ACCTTCGACA TCCTCGCCAA GGAGACCCCG
CAGCTTGCCT CCATGAACCC GGCGGGCGAG CATTTCATGG AAGACCTGGA CGTGGCCGGC
GGCGTCGCCG GGGTGCTGAA GCAGTTGGGC GACAAGATCC ATGACTGCCC GACCCTGATG
GGGCTCAGCA CCAAGGAGAT CGCGGCGAGC CTTAAGGGAG TCGACGAGGA AGTGATCCAC
CCCCTCTCGA ACCCGGTCAA GAAGGAAGGT GGCATCGCGG TTCTCTTCGG CAACATCTGC
CCCAAGGGCG CTGTGGTCAA GCAGTCGGGC GTATCCGACC AGATGATGAA GTTCACCGGC
ACCGCGCGCT GCTTCGACTC CGAGGACAAG GCGATGGCCG CCATGATGGG TGGCGTGGTG
AAGGGGGGCG ACGTGGTCGT CATCCGCTAC GAAGGGCCCA AAGGGGGACC GGGGATGCGC
GAGATGCTCG CTCCCACCGC CGCGCTCATG GGGCTTGGCC TGGGCGACTC GGTCGCGCTC
ATCACCGACG GGCGCTTCTC CGGCGGCACA CGTGGCCCCT GCATCGGTCA CATCGCGCCC
GAAGCTGCGG CGGGGGGACC GATTGCTTTC ATTGAGGACG GCGACACCAT TGAACTGGAC
ATTCCGGCAC GTTCGCTCAA GGTCATGGTG AGTGACGAAG TGCTGGCAGA AAGGCGCGCC
CGCTGGGTCG CCCCCGAGCC GAAGATCAAG AAGGGTTGGC TCGCCCGCTA CGCGAAGGTG
GTTACCTCGG CCCACACCGG CGCCATCACC ACCGCTGAAT AA
 
Protein sequence
MRSDTITQGL ERTPHRALLK GTGLPQSEMG KPFIGIATSF TDLIPGHVGM RDLERFIEKG 
VHTGGGYSFF FGIPGVCDGI SMGHKGMHYS LPTRELIADM VESVAEAHRL DGLVLLTNCD
KITPGMLMAA ARLDIPCIVV TAGPMMSGRG DAGRKYSFVT DTFEAMARYK AGVIDDAELA
RCEENACPGM GSCQGLFTAN TMAILTETLG MSLPRCGTAL AVSALKRRIA FASGERIVDL
VRQNITPRSI MTREAFENAI RVDLALGGSS NTVLHLLAIA HEAGVELPLE TFDILAKETP
QLASMNPAGE HFMEDLDVAG GVAGVLKQLG DKIHDCPTLM GLSTKEIAAS LKGVDEEVIH
PLSNPVKKEG GIAVLFGNIC PKGAVVKQSG VSDQMMKFTG TARCFDSEDK AMAAMMGGVV
KGGDVVVIRY EGPKGGPGMR EMLAPTAALM GLGLGDSVAL ITDGRFSGGT RGPCIGHIAP
EAAAGGPIAF IEDGDTIELD IPARSLKVMV SDEVLAERRA RWVAPEPKIK KGWLARYAKV
VTSAHTGAIT TAE
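
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons, and that case is not handled here because this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example, not a library API.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used on this page) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_60 = "ATGCGTAGCG ACACTATCAC CCAAGGTTTG GAACGGACTC CGCACCGCGC GCTCTTGAAA"
print(translate(first_60))        # MRSDTITQGLERTPHRALLK

# Last 12 bases of the gene sequence, ending in the TAA stop codon.
print(translate("ACCGCTGAAT AA"))  # TAE
```

The two outputs match the first 20 and last 3 residues of the protein sequence listed above.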