Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1494 |
Symbol | |
ID | 8136823 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1746369 |
End bp | 1748030 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644869106 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_003021308 |
Protein GI | 253700119 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 0.201704 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAGCG ACACTATCAC CCAAGGTTTG GAACGGACTC CGCACCGCGC GCTCTTGAAA GGGACCGGGC TTCCGCAAAG CGAGATGGGG AAGCCGTTCA TCGGCATCGC TACCAGCTTC ACCGATCTTA TTCCGGGGCA CGTCGGCATG CGCGACCTGG AGCGTTTCAT CGAGAAGGGG GTCCACACCG GCGGCGGTTA TTCCTTCTTC TTCGGGATTC CCGGGGTCTG CGACGGCATC TCCATGGGGC ACAAGGGGAT GCACTACTCG CTCCCCACCC GCGAGTTGAT CGCCGACATG GTTGAGTCGG TCGCCGAGGC GCATCGCCTG GACGGCCTCG TGCTCTTGAC CAACTGCGAC AAGATCACCC CGGGCATGCT CATGGCTGCC GCGAGGCTCG ACATCCCCTG CATCGTGGTC ACCGCCGGTC CCATGATGAG CGGCCGCGGC GACGCAGGCC GGAAGTACTC CTTCGTCACC GACACCTTCG AGGCCATGGC GCGCTACAAG GCGGGGGTCA TCGACGACGC GGAGCTTGCG CGCTGCGAGG AGAACGCCTG CCCGGGCATG GGTTCCTGCC AGGGGCTCTT CACCGCCAAC ACCATGGCCA TACTCACCGA GACCCTCGGC ATGAGCCTGC CGCGCTGCGG CACGGCACTC GCCGTCTCCG CGCTCAAGCG CCGCATCGCC TTCGCCTCGG GCGAGCGCAT CGTGGACCTG GTGCGCCAGA ACATCACCCC GCGCTCCATA ATGACCCGCG AGGCGTTCGA GAACGCCATA AGGGTCGACC TGGCCTTGGG CGGCTCTTCC AACACGGTGC TGCACCTTCT CGCCATCGCC CACGAGGCAG GGGTCGAGCT TCCCCTTGAG ACCTTCGACA TCCTCGCCAA GGAGACCCCG CAGCTTGCCT CCATGAACCC GGCGGGCGAG CATTTCATGG AAGACCTGGA CGTGGCCGGC GGCGTCGCCG GGGTGCTGAA GCAGTTGGGC GACAAGATCC ATGACTGCCC GACCCTGATG GGGCTCAGCA CCAAGGAGAT CGCGGCGAGC CTTAAGGGAG TCGACGAGGA AGTGATCCAC CCCCTCTCGA ACCCGGTCAA GAAGGAAGGT GGCATCGCGG TTCTCTTCGG CAACATCTGC CCCAAGGGCG CTGTGGTCAA GCAGTCGGGC GTATCCGACC AGATGATGAA GTTCACCGGC ACCGCGCGCT GCTTCGACTC CGAGGACAAG GCGATGGCCG CCATGATGGG TGGCGTGGTG AAGGGGGGCG ACGTGGTCGT CATCCGCTAC GAAGGGCCCA AAGGGGGACC GGGGATGCGC GAGATGCTCG CTCCCACCGC CGCGCTCATG GGGCTTGGCC TGGGCGACTC GGTCGCGCTC ATCACCGACG GGCGCTTCTC CGGCGGCACA CGTGGCCCCT GCATCGGTCA CATCGCGCCC GAAGCTGCGG CGGGGGGACC GATTGCTTTC ATTGAGGACG GCGACACCAT TGAACTGGAC ATTCCGGCAC GTTCGCTCAA GGTCATGGTG AGTGACGAAG TGCTGGCAGA AAGGCGCGCC CGCTGGGTCG CCCCCGAGCC GAAGATCAAG AAGGGTTGGC TCGCCCGCTA CGCGAAGGTG GTTACCTCGG CCCACACCGG CGCCATCACC ACCGCTGAAT AA
|
Protein sequence | MRSDTITQGL ERTPHRALLK GTGLPQSEMG KPFIGIATSF TDLIPGHVGM RDLERFIEKG VHTGGGYSFF FGIPGVCDGI SMGHKGMHYS LPTRELIADM VESVAEAHRL DGLVLLTNCD KITPGMLMAA ARLDIPCIVV TAGPMMSGRG DAGRKYSFVT DTFEAMARYK AGVIDDAELA RCEENACPGM GSCQGLFTAN TMAILTETLG MSLPRCGTAL AVSALKRRIA FASGERIVDL VRQNITPRSI MTREAFENAI RVDLALGGSS NTVLHLLAIA HEAGVELPLE TFDILAKETP QLASMNPAGE HFMEDLDVAG GVAGVLKQLG DKIHDCPTLM GLSTKEIAAS LKGVDEEVIH PLSNPVKKEG GIAVLFGNIC PKGAVVKQSG VSDQMMKFTG TARCFDSEDK AMAAMMGGVV KGGDVVVIRY EGPKGGPGMR EMLAPTAALM GLGLGDSVAL ITDGRFSGGT RGPCIGHIAP EAAAGGPIAF IEDGDTIELD IPARSLKVMV SDEVLAERRA RWVAPEPKIK KGWLARYAKV VTSAHTGAIT TAE
|
| |