Gene Information |
Locus tag | Mpe_A1425 |
Symbol | |
ID | 4783938 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 1533405 |
End bp | 1534511 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640089991 |
Product | chorismate synthase |
Protein accession | YP_001020622 |
Protein GI | 124266618 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
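The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the terminating stop codon. The record states neither convention explicitly, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1533405
end_bp = 1534511

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1107 368
```

Under these assumptions the computed values agree with the Gene Length (1107 bp) and Protein Length (368 aa) fields.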
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0865922 |
Fosmid hitchhiking | No
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGGCA GCACCCTCGG CCACTTGTTT TGCGTCACCA ACTTCGGTGA ATCCCACGGA CCCGCCATCG GTTGCGTGAT CGACGGCTGT CCGCCCGGCA TGAGCCTGAG CGAGGCCGAC ATCCAGCCCG AGCTGGACCG CCGCCGTCCC GGCACCTCGC GCCACGTGAC CCAGCGCAAT GAGCCCGACG CGGTGGAGAT CCTGTCCGGC GTCCACGAGG GCCGCACCAC CGGCACACCG ATCTGCCTGC TGATCCGCAA CACCGACCAG CGCAGCAAGG ACTACGGCAA CATCGTCCAG ACCTTCCGCC CCGGCCACGC CGACTACACC TACTGGCACA AGTACGGCCT GCGCGACCCG CGCGGCGGTG GTCGCAGCTC GGCGCGCCTC ACCGCGCCGA TGGTCGGTGC CGGCGCGGTG GCGAAGAAGT GGCTCAAGGA ACACCACGGC ATCGCCTTTC GCGGCGGCAT GGCGGCGCTG GGCGAGATCG ACATCGCCTT CGAGGGCTGG CAGCATGTGC CGGACAACCC CTTCTTCGCG CCCAACGCCA GCCAGATCGG CCAGCTCGAG GACTTCATGG ACGCCTTGCG CAAGGAGGGC GATTCGGTCG GCGCGCGCAT CGTTGTCGAG GCCACCGGCG TGCCGGTCGG CTGGGGCGAG CCGCTGTTCG ACAAGCTCGA CGCCGACATC GCCCACGTGA TGATGGGGCT CAATGCCGTC AAGGGCGTCG AGATCGGTGC GGGCTTCGCG AGCGTCGCGC ACCGCGGCTC GATGCACGGC GACGAACTCA CGCCCCAGGG CTTTCGCAGC AACCACGCCG GCGGTGTGCT CGGCGGCATC AGCACCGGGC AGGATATCCG CGTATCGATC GCGATCAAGC CCACCAGCTC GATCCGCACG CCACGCCAGT CGATCGACCT GCAGGGCCAG CCGGCGACGG TCGAGACCTT CGGCCGCCAC GACCCCTGCG TCGGCATCCG CGCCACGCCG ATCGCCGAGG CGCTGCTGGC GCTGGTGCTG ATGGACCATG CACTGCGCCA CCGCGCGCAG TGCGGCGACG TGCGCCTGCC GGTGGCGCCG ATCGCGGCGC ACCTGCCGGA CGCCTGA
|
Protein sequence | MSGSTLGHLF CVTNFGESHG PAIGCVIDGC PPGMSLSEAD IQPELDRRRP GTSRHVTQRN EPDAVEILSG VHEGRTTGTP ICLLIRNTDQ RSKDYGNIVQ TFRPGHADYT YWHKYGLRDP RGGGRSSARL TAPMVGAGAV AKKWLKEHHG IAFRGGMAAL GEIDIAFEGW QHVPDNPFFA PNASQIGQLE DFMDALRKEG DSVGARIVVE ATGVPVGWGE PLFDKLDADI AHVMMGLNAV KGVEIGAGFA SVAHRGSMHG DELTPQGFRS NHAGGVLGGI STGQDIRVSI AIKPTSSIRT PRQSIDLQGQ PATVETFGRH DPCVGIRATP IAEALLALVL MDHALRHRAQ CGDVRLPVAP IAAHLPDA
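The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a short standard-library sketch. It assumes that the codon-to-amino-acid assignments of translation table 11 match the standard genetic code, so that only the set of permitted start codons differs, and that the spaces in the sequences are display grouping only. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool.

```python
# Codon table built from the conventional TCAG ordering.
# Assumption: table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace used to group the sequence into 10-letter blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
prefix = "ATGTCCGGCA GCACCCTCGG CCACTTGTTT"
print(translate(prefix))  # MSGSTLGHLF
print(gc_fraction(prefix))
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein Length fields. Because this gene begins with ATG, the alternative start codons permitted by table 11 need no special handling here.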