Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A3771 |
Symbol | |
ID | 4786000 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 3989574 |
End bp | 3990698 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640092354 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_001022959 |
Protein GI | 124268955 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0244727 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGCCC TGTCCGCCAA CGTTCCCGAG GGTTGGGCCG CGCCCGCCGA CAAGACCAGC CAGACCGATG ACGAACGGAT CGAGGACGTG ATGCCACTGC CGCCGCCCGA GCACCTGATC CGCTTCTTCC CGGTGCGCGG CACGCCCGTC GAGGAGCTGG TGTCCAGCAC CCGCCGCCGC ATCCGCGACA TCATGCGCGG CAATGACGAC CGCCTGCTGG TCATCATGGG CCCGTGCTCG ATCCACGACC CGGTGGCGGC GGTGGACTAC GCGCGCAAGC TCAAGGCCCA GCGCGACAAG TACGCCGACA CGCTGGAGAT CGTGATGCGC GTGTACTTCG AGAAGCCGCG CACCACCGTC GGCTGGAAGG GCCTGATCAA TGACCCCTAC CTCGACGAGA GCTTCCGCAT CGACGAAGGC CTGCGCATTG CGCGCCAGCT GCTGCTGGAG ATCGGCCGCC TCGGGCTGCC GGCCGGCAGC GAGTTCCTCG ACGTGATCTC GCCGCAGTAC ATCGGCGACC TGATCTCCTG GGGCGCCATC GGTGCGCGCA CCACCGAAAG CCAGGTCCAC CGCGAGCTCG CCTCGGGCAT CAGCGCGCCG ATCGGCTTCA AGAACGGCAC CGACGGCAAC ATCAAGATCG CCACGGACGC CATCCAGTCC GCCAGCCGGC CGCACCACTT CCTGTCGGTG CACAAGAACG GCCAGGTCGC GATCGTCGAG ACCCGCGGCA ACGCCGATTG CCACGTCATC CTGCGCGGCG GCAAGACGCC GAACTACGAC GCCAGCAGTG TCGGGGCGGC CTGCGCCGAA CTCGGCAAGG CCGGGCTGCC GGCCTCGCTG ATGGTCGACT GCTCCCACGC CAACAGCAGC AAGCAGCACC AGAAGCAGAT CGACGTGGCG CGCGACGTCG CCGACCAGTT GGCCGGCGGC AGCCGCCAGG TCTTCGGCGT GATGGTCGAG AGCCACCTGA GCGCCGGCGC CCAGAAGTTC AGCGCGGGCA AGGACGACCC GGCGAAGCTC GCCTACGGCC AGAGCATCAC GGACGCCTGC ATCGGCTGGG ACGATTCACT GGAGGTGCTG GGCGTGCTCA GCGCCGCGGT GGCGGCCCGG CGCGGACGGG GCTGA
|
Protein sequence | MNALSANVPE GWAAPADKTS QTDDERIEDV MPLPPPEHLI RFFPVRGTPV EELVSSTRRR IRDIMRGNDD RLLVIMGPCS IHDPVAAVDY ARKLKAQRDK YADTLEIVMR VYFEKPRTTV GWKGLINDPY LDESFRIDEG LRIARQLLLE IGRLGLPAGS EFLDVISPQY IGDLISWGAI GARTTESQVH RELASGISAP IGFKNGTDGN IKIATDAIQS ASRPHHFLSV HKNGQVAIVE TRGNADCHVI LRGGKTPNYD ASSVGAACAE LGKAGLPASL MVDCSHANSS KQHQKQIDVA RDVADQLAGG SRQVFGVMVE SHLSAGAQKF SAGKDDPAKL AYGQSITDAC IGWDDSLEVL GVLSAAVAAR RGRG
|
| |