Gene Information |
Locus tag | Mext_1996 |
Symbol | |
ID | 5831557 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 2226136 |
End bp | 2227518 |
Gene Length | 1383 bp |
Protein Length | 460 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641367797 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001639466 |
Protein GI | 163851423 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II |
|
|
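The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    codons, remainder = divmod(gene_length, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The last codon is the stop codon, so it adds no residue.
    return gene_length, codons - 1


# Values copied from the record for Mext_1996.
gene_len, protein_len = cds_lengths(2226136, 2227518)
print(gene_len, protein_len)  # 1383 460
```

Both results agree with the Gene Length (1383 bp) and Protein Length (460 aa) fields.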
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGAGC GGTGGACCCC CAAAACCTGG CGCAACCTGC CGATCCATCA GGTCCCGTCC TATCCGGACG CGGGCGCTCT CCAGGCCGTC GAGGCGCAAC TCGCGAGCTT TCCGCCGCTC GTTTTTGCCG GTGAGGCGCG CAAGCTGAAG GCGGCCCTGG CGCGCGTCGG GGCGGGCGAA GCCTTCCTCC TGCAAGGCGG CGACTGCGCC GAGAGCTTCG ACGAGCATTC GGCCGACAAC ATCCGCGACT TCTTCCGCGT CTTCCTGCAG ATGGCGATGG TGCTCACCTT CGCGGGCGGC TCGCCCGTGG TGAAGGTCGG CCGCATTGCC GGGCAGTTCG CCAAGCCGCG CTCCTCGCCG ACCGAGACGC TCGACGGCGT GGCGCTGCCG AGCTACCGCG GCGACATCAT CAACGGCCTG ACCTTCAACG AGGAGGCCCG GATCCCCGAT CCGCGCCGTC AGCTCGAGGC CTACCGGCAG TCGGCGGCGA CGCTGAACCT GCTGCGCGCC TTCGCCACCG GCGGCTACGC CAACCTCGAG AACGCGCATC GCTGGATGCT CGGCTTCGTC AAGGACAGCC CGCAATCCTC GCGCTACCGC GATGTGGCCG AGCGGATGTC GGACGCGCTC GACTTCATGC GCGCCATCGG CATCAACCCG GAGACGCACC AGGAGGTCCG CACGACGGAC TTCTACACCA GCCACGAGGC GCTGCTGCTC GGCTACGAGG AGTCGCTGAC CCGCGTCGAT TCGACGAGCG GCGATTGGTA CGCGACTTCC GGCCATATGC TCTGGATCGG CGACCGCACC CGCCAGCCGG ACCACGCCCA TGTGGAATAT GCCCGCGGCA TCAAGAACCC GATCGGCCTG AAATGCGGGC CCTCGACCAC GGCCGAAGGG CTGATCCGCC TGATCGATCT CCTCAACCCG GAGAACGAGG CCGGGCGCCT CAGCCTGATC TGCCGCTTCG GCGCGGACAA GGTCGGCGAC CATCTGCCGG GGCTGATCCG CACGGTGCAG CGCGAGGGCC GCAACGTGGT GTGGGTGTGC GATCCGATGC ACGGCAACAC TATCGCTGCC GGCCGCTACA AGACGCGGCC GTTCGAGCGG GTGATGCAGG AGATCGAGGG CTTCTTCGGC GTGCACCGCG CGGAAGGCAC GATCGCCGGC GGCATCCATC TGGAAATGAC CGGCAAGGAC GTGACCGAAT GCACCGGCGG CGCCCGTGCC CTGACGGCGG ATGACCTTCA GGACCGCTAC CACACCTACT GCGACCCGCG CCTCAACGCG GAGCAGGCGC TGGAGGTGGC GTTCCTGACC GCCGAACTGG TGAAGCGCGA GCGCGCCGAG ATCGAGCGCC CGCGCCTCGA CGCGGCGGAG TAG
|
Protein sequence | MAERWTPKTW RNLPIHQVPS YPDAGALQAV EAQLASFPPL VFAGEARKLK AALARVGAGE AFLLQGGDCA ESFDEHSADN IRDFFRVFLQ MAMVLTFAGG SPVVKVGRIA GQFAKPRSSP TETLDGVALP SYRGDIINGL TFNEEARIPD PRRQLEAYRQ SAATLNLLRA FATGGYANLE NAHRWMLGFV KDSPQSSRYR DVAERMSDAL DFMRAIGINP ETHQEVRTTD FYTSHEALLL GYEESLTRVD STSGDWYATS GHMLWIGDRT RQPDHAHVEY ARGIKNPIGL KCGPSTTAEG LIRLIDLLNP ENEAGRLSLI CRFGADKVGD HLPGLIRTVQ REGRNVVWVC DPMHGNTIAA GRYKTRPFER VMQEIEGFFG VHRAEGTIAG GIHLEMTGKD VTECTGGARA LTADDLQDRY HTYCDPRLNA EQALEVAFLT AELVKRERAE IERPRLDAAE
|
| |
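The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few codons. The following is a minimal sketch, not the database's own method. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the tables differ in which codons may act as start codons, which does not matter here because this gene begins with ATG). The helper name `translate` and the two excerpt variables are hypothetical; the excerpts are the first 30 and last 12 bases of the gene sequence above.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# assumed here to apply to translation table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the gene sequence of Mext_1996.
first_30 = "ATGGCCGAGC GGTGGACCCC CAAAACCTGG"
last_12 = "GCGGCGGAG TAG"

print(translate(first_30))  # MAERWTPKTW
print(translate(last_12))   # AAE
```

The first ten residues and the final three residues match the start (MAERWTPKTW) and end (AAE) of the protein sequence, and the gene ends with the stop codon TAG.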