Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_1954 |
Symbol | |
ID | 5833791 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2186317 |
End bp | 2188047 |
Gene Length | 1731 bp |
Protein Length | 576 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641367755 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001639424 |
Protein GI | 163851381 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.160377 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGCGC GTCAGACCGA CAAGTCGAAG CTGCCGAGCC GGCACGTGAC GGAGGGGCCC GAGCGGGCGC CCCACCGCTC GTACCTCTAC GCCATGGGCC TGACGACCGA GCAGATCCAC CAGCCGCTGG TCGGCGTCGC CTCGTGCTGG AACGAGGCCG CGCCCTGCAA CATCTCGCTG ATGCGCCAAG CCCAGGCCGT GAAGAAGGGT GTCGCCGCCG CCAAGGGCAC TCCGCGCGAG TTCTGCACCA TCACCGTCAC CGACGGCATC GCCATGGGCC ATGGCGGTAT GCGCGCCTCG CTGCCTTCCC GCGAGGTCAT CGCCGATTCG GTCGAGCTGA CGATCCGCGG CCATTCCTAC GACGCCCTCG TGGGGCTGGC CGGCTGCGAC AAGTCCCTGC CCGGCATGAT GATGGCCATG GTGCGCCTCA ACGTGCCCTC GATCTTCATC TATGGCGGCT CGATCCTGCC GGGCTCGTTC CGCGGCCGGC CGGTTACGGT GCAGGATCTG TTCGAGGCGG TCGGCAAGGT CGCCGTCGGC GACATGAGCC TCGACGACCT CGACGAGCTG GAGCGGGTTG CCTGCCCCTC GGCCGGCGCC TGCGGCGCGC AGTTCACCGC CAACACCATG GCCACCGTCT CCGAGGCGAT CGGCCTCGCG CTGCCCTACT CGGCCGGCGC GCCTGCCCCT TACGAGATCC GCGACCAATT CTGCGCCGCC GCCGGCGAGA AGGTGATGGA GCTGATCGCC AAGAACATCC GCCCGCGCGA CATCGTCACC CGCAAGGCGC TGGAGAACGC CGCCGCGACG GTCGCGGCCT CGGGCGGCTC GACCAACGCG GCCCTGCACC TGCCGGCGAT CGCGCATGAA TGCGGCATCG AGTTCACCCT GTTCGACGTC GCCGAGATCT TCCGCAAGAC CCCCTACATC GCCGACTTGA AGCCCGGCGG GCGCTATGTG GCCAAGGACA TGTTCGAGGT CGGCGGCATC CCGCTGCTGA TGAAGACGCT GCTCGACCAC GGCTACCTGC ACGGCGACTG CCTCACCGTC ACCGGCCGCA CCATCGCCGA GAACCTCGCC AAGGTCGCCT GGAACCCGGA TCAGGACGTG GTGCGCCCGG CCGACAAGCC CATCACCGTC ACCGGCGGCG TGGTGGGCCT GCGCGGCAAT CTCGCCCCCG AGGGCGCGAT CGTGAAGGTC GCCGGCATGC CGCCCGAGGC CCAGGTCTTC ACCGGCCCGG CCCGCGTCTT CGACGGCGAG GAAGCCTGTT TCGAGGCGGT GCAGAACCGC ACCTACAAGC CCGGCGACGT TCTGGTCATC CGTTACGAGG GCCCGAAGGG AGGCCCCGGC ATGCGCGAGA TGCTCTCGAC CACCGCCGCC CTCTACGGCC AGGGCATGGG CGACAAGGTG GCCCTCATCA CCGACGGGCG CTTCTCCGGC GCGACCCGCG GCTTCTGCGT CGGCCATGTC GGCCCCGAGG CTGCCATCGG CGGGCCGATC GGCCTGCTGC GCGACGGCGA CATCATCACC CTCGATGCGA TCAAGGGCAC GCTCGACGTG GCGCTCTCCG ACGAGGAACT GGCCCAGCGT CGCAGCGAAT GGACGCCGCG GGGCAATGCC GCGACCTCCG GCTACCTCTG GAAATACGCG CAGTCCGTCG GGCCTGCAGT GAACGGGGCC GTGACGCATC CGGGCGGCGC GGGGGAGACG AACGTCTATG CCGACATCTA G
|
Protein sequence | MDARQTDKSK LPSRHVTEGP ERAPHRSYLY AMGLTTEQIH QPLVGVASCW NEAAPCNISL MRQAQAVKKG VAAAKGTPRE FCTITVTDGI AMGHGGMRAS LPSREVIADS VELTIRGHSY DALVGLAGCD KSLPGMMMAM VRLNVPSIFI YGGSILPGSF RGRPVTVQDL FEAVGKVAVG DMSLDDLDEL ERVACPSAGA CGAQFTANTM ATVSEAIGLA LPYSAGAPAP YEIRDQFCAA AGEKVMELIA KNIRPRDIVT RKALENAAAT VAASGGSTNA ALHLPAIAHE CGIEFTLFDV AEIFRKTPYI ADLKPGGRYV AKDMFEVGGI PLLMKTLLDH GYLHGDCLTV TGRTIAENLA KVAWNPDQDV VRPADKPITV TGGVVGLRGN LAPEGAIVKV AGMPPEAQVF TGPARVFDGE EACFEAVQNR TYKPGDVLVI RYEGPKGGPG MREMLSTTAA LYGQGMGDKV ALITDGRFSG ATRGFCVGHV GPEAAIGGPI GLLRDGDIIT LDAIKGTLDV ALSDEELAQR RSEWTPRGNA ATSGYLWKYA QSVGPAVNGA VTHPGGAGET NVYADI
|
| |