Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mext_2675 |
Symbol | |
ID | 5834967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2989008 |
End bp | 2990843 |
Gene Length | 1836 bp |
Protein Length | 611 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641368475 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001640137 |
Protein GI | 163852094 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.261828 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.272323 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGCCT ATCGCTCTCG CACCACCACC CATGGCCGCA ACATGGCCGG CGCCCGCGGC TTGTGGCGCG CCACCGGCAT GAAGGACTCG GATTTCGGCA AGCCGATCAT CGCGGTGGTG AACTCGTTCA CGCAGTTCGT GCCCGGCCAC GTCCACCTGA AGGATCTCGG CCAGCTCGTG GCCCGTGAGA TCGAGCAGGC GGGCGGTGTG GCCAAGGAAT TCAACACCAT CGCGGTCGAT GACGGCATCG CCATGGGCCA TGACGGGATG CTCTACTCGC TGCCGTCGCG CGAGCTGATC GCCGACAGCG TCGAGTACAT GGTCAACGCG CATTGCGCCG ACGCCATGGT CTGCATCTCG AATTGCGACA AGATCACCCC CGGCATGCTG ATGGCGGCGC TGCGCCTCAA CATCCCGGCG GTCTTCGTCT CCGGCGGACC GATGGAGGCG GGCAAGGTCG TGATGAACGG CGTCACCCGC AAGTTCGACC TCGTCGATGC GATGGTGGCT GCGGCCGATG ACCGGGTCTC GGACGAGGAC GTCGCCGTCA TCGAGCGCTC GGCCTGCCCG ACCTGCGGCT CGTGCTCGGG CATGTTCACG GCCAATTCCA TGAACTGCCT CACCGAGGCG CTCGGCCTGT CGCTGCCCGG CAACGGCTCG ACGCTCGCGA CCCATGCCGA CCGCAAGCGC CTGTTCGTCG AGGCCGGCCA CCTCATCGTC GATCTGGCCC GCCGCCACTA CGAGCAGGAC GACGCCAGCG TCCTGCCGCG CTCGATCGCG ACGATGGCCG CCTTCGAGAA CGCGATGACC CTCGACATCG CCATGGGCGG TTCGACCAAC ACGGTGCTGC ACCTGCTCGC TGCCGCGCAT GAGGGCGAGG TGCCCTTCAC CATGGCCGAC ATAGACCGGC TGTCACGCCG GGTGCCGGTT CTGTGCAAGG TCGCCCCGGC CGTCGCCAAC GTCCACATGG AGGACGTGCA CCGGGCCGGC GGCATCATGG GGATCCTGGG CGAACTCGAC CGCGCCGGTC TGATCGACCG GTCCGTCGGC AATGTTGGCT CGGGTACGCT CGGCGCGGCG CTCGACCGCT GGGACGTCAA GAAGACGCAA AGCGAGTCGG TCCAGACCTT CTTCCGCGCC GCGCCCGGCG GCGTGCCGAC CCAGGTGGCC TTCAGCCAGG CCTCGCGCTG GGACGAACTC GACCTCGACC GCGAGGCCGG TGTCATCCGC TCGGCCGAGC ACGCCTACTC GAAGGATGGC GGGCTGGCCG TGCTCTACGG CAACCTCGCC GAGGACGGCT GCATCGTGAA GACGGCGGGC GTCGATGCCT CGATCCTGAC CTTCACCGGG ACCGCCCACG TCTTCGAGAG CCAAGATGCG TCGGTGGATG CCATCCTCAA CGGCCGCGTG AAAGCCGGCG AGGTGGTGCT GATCCGCTAC GAGGGCCCCC GCGGCGGCCC CGGCATGCAG GAGATGCTCT ATCCCACGAG CTACCTGAAA TCGAAGGGCC TCGGCAAAGC CTGCGCGCTG GTCACCGACG GCCGATTCTC CGGCGGCTCC TCGGGCCTCT CCATCGGCCA CGTCTCGCCG GAAGCCGCCG AGGGAGGTCT GATCGGCCTC GTCGAGCAGG GCGACCGGAT CGAGATCGAC ATCCCGAACC GACGCATCCA CCTCGCCGTG GACGAGGCCG TGCTCGCCGA GCGCCGCGCC GCCCGCGAGG CGGAAGGCTG GTGTCCGGCC ACGCCCCGCA AGCGCAAGGT CAGCACTGCG CTCCGCGCCT ACGCGATGCT CGCCACCAGC GCCGCCAAGG GGGCGGTGAG ACGAATCGAG CCGTAG
|
Protein sequence | MPAYRSRTTT HGRNMAGARG LWRATGMKDS DFGKPIIAVV NSFTQFVPGH VHLKDLGQLV AREIEQAGGV AKEFNTIAVD DGIAMGHDGM LYSLPSRELI ADSVEYMVNA HCADAMVCIS NCDKITPGML MAALRLNIPA VFVSGGPMEA GKVVMNGVTR KFDLVDAMVA AADDRVSDED VAVIERSACP TCGSCSGMFT ANSMNCLTEA LGLSLPGNGS TLATHADRKR LFVEAGHLIV DLARRHYEQD DASVLPRSIA TMAAFENAMT LDIAMGGSTN TVLHLLAAAH EGEVPFTMAD IDRLSRRVPV LCKVAPAVAN VHMEDVHRAG GIMGILGELD RAGLIDRSVG NVGSGTLGAA LDRWDVKKTQ SESVQTFFRA APGGVPTQVA FSQASRWDEL DLDREAGVIR SAEHAYSKDG GLAVLYGNLA EDGCIVKTAG VDASILTFTG TAHVFESQDA SVDAILNGRV KAGEVVLIRY EGPRGGPGMQ EMLYPTSYLK SKGLGKACAL VTDGRFSGGS SGLSIGHVSP EAAEGGLIGL VEQGDRIEID IPNRRIHLAV DEAVLAERRA AREAEGWCPA TPRKRKVSTA LRAYAMLATS AAKGAVRRIE P
|
| |