Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A2628 |
Symbol | |
ID | 4783649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 2802802 |
End bp | 2804673 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640091199 |
Product | chorismate binding enzyme |
Protein accession | YP_001021817 |
Protein GI | 124267813 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.530164 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGATGA GCGACGACGT TGACGCCTTT GCGCTGCTGG ACGACCGCGC GGCGACGGCC GAGCGGCCGA GCAGCCGGCT CTACACGGAC CATGTGCGGA CGCACCGCTG CGAGGATCCG GCCGCGCTCG ACGCGGTGTG GCGCGCGGTC GAGGCCGACC AGCGCGCCGG CCTGCACGCG GTGCTGCTGG CCGACTACGA ATGGGGCGCG AAGCTGCTGC AGGCCGGTCG CCGCGCCGGC GACGCGCGGG CCGCGCTGCG CGTGCTGATG TTCCGCGAAC TGCGGCGGCT GTCGGCCGAC GAGGCCGGGC ACTGGCTGGC CACGCGGGAG GGTTCGGCCG AGCCTGGCCC GGCCGGCGCG CTGGACGTGC AGGCCAGCGT GGACCGCACC GCCTTCCACG CCGCGATCGG CCGCATCCAC GAGGCCATCC GCGCCGGCGA GACCTACCAG GTCAACTACA CCTACCGGCT GGACTTCCAG GCCCACGGCA CGCCGGTCGC GCTCTACCGC CGGCTGCGTG CGCGCCAGCC GGTGCCCTAT GGCGCCTTCA TCGCGCTGCC GGCCGACGAG GTGGCCGCGA GCGGCGTCGC ACACGTGCTG TCGTGCTCGC CGGAGCTGTT CGTCCGCCAT GCGCAGGGCC GGCTCGTCGC CAAGCCGATG AAGGGCACGG CGCCGCGCCG TGCGGTGCTC GAAGGCGACA GCGAGACCGC CCGGCTGCTG AGTCTGGACA CCAAGAACCG CGCCGAGAAC CTGATGATCG TCGACTTGCT GCGCAACGAC CTCGGTCGCA TCGCCCGCAC CGGCTCGGTG AAGGTGCCGG AGCTGTTCGA GGTGGAGTCG CATGCCACCG TGTTCCAGAT GACCTCGACG GTCAGTGCCG AGCTGGCACC CGGCACCGAC CTGCCGGCGG TGCTGCGCGC GGTGTTTCCC TGCGGCTCGA TCACCGGTGC CCCCAAGCAC CACACGGTGG AGCTGATCGC CGGGCTGGAA AGCACGCCGC GCGGCCTGTA CTGTGGCGCC ATCGGCTGGG TCGACCTGCC GACCGCGGCG CCGGACGGGG CCTGCGGCGA CTTCTGCCTG TCGGTGGCGA TCCGCACGCT GACCCTGGGC CCGCCCGCGC CCGACGGCCT GCGCGCGGGC CGGCTGGGCA TCGGCGCCGG CATCGTGCTG GACAGCGAGG CCGAGGACGA ATACGGGGAA TGCCAGCTGA AGGGGCGTTT CCTGACCGGC CTCGATCCGG GCTTCTCGAT CTTCGAGACC CTGCACGCGA CGCGCGAGCA GGGCGTGCGC CATCTCGACC GTCATCTCGC GCGCCTGCAG CGCAGTGCCA CCGCGCTGGG TTTTCGCTGG GACGAGATCG AACTGCGCGA GGCGCTGCAG GAGCAGGTCG GCCGCCTGCC CGCCGGCACG CCCTGCCGGC TGCGCCTGTC GCTCGACCGC GCCGGCCGCT GCGAGTTCAC CGGCGCGCCG CTCACCGCCC TGCCCGCCGG CCCGGTGACG CTGCGGCTGG AGCCGCAGCC GCTGCCCGAG CCGCGGCCGC TGGCGGGCCA CAAGACCACG CTGCGCGCCG CCTACGACGC CGGCGTGCGG GCCGCCGAGG CAGTCGGCGC CTTCGACAGC CTGTTCTTCA CGGCCGACGG CCGACTGGTC GAGGGCGGCC GCAGCAACGT GTTCCTGCAA CTCGACGGCC GCTGGTGGAC ACCGCCGCTC GCCGACGGCG CGCTGCCCGG CGTGATGCGT GGCCTGCTGC TCGAGGATCC GGCCTGGGCT GCCGCGGAGC GCCCGCTGAC GCGCGCCGAC CTGGCGCGGG CCGAGGCCGT GGTCGTGTGC AACGCGCTGC GCGGCGCCGT GCCGGCCCGG TTGGCGACAT GA
|
Protein sequence | MPMSDDVDAF ALLDDRAATA ERPSSRLYTD HVRTHRCEDP AALDAVWRAV EADQRAGLHA VLLADYEWGA KLLQAGRRAG DARAALRVLM FRELRRLSAD EAGHWLATRE GSAEPGPAGA LDVQASVDRT AFHAAIGRIH EAIRAGETYQ VNYTYRLDFQ AHGTPVALYR RLRARQPVPY GAFIALPADE VAASGVAHVL SCSPELFVRH AQGRLVAKPM KGTAPRRAVL EGDSETARLL SLDTKNRAEN LMIVDLLRND LGRIARTGSV KVPELFEVES HATVFQMTST VSAELAPGTD LPAVLRAVFP CGSITGAPKH HTVELIAGLE STPRGLYCGA IGWVDLPTAA PDGACGDFCL SVAIRTLTLG PPAPDGLRAG RLGIGAGIVL DSEAEDEYGE CQLKGRFLTG LDPGFSIFET LHATREQGVR HLDRHLARLQ RSATALGFRW DEIELREALQ EQVGRLPAGT PCRLRLSLDR AGRCEFTGAP LTALPAGPVT LRLEPQPLPE PRPLAGHKTT LRAAYDAGVR AAEAVGAFDS LFFTADGRLV EGGRSNVFLQ LDGRWWTPPL ADGALPGVMR GLLLEDPAWA AAERPLTRAD LARAEAVVVC NALRGAVPAR LAT
|
| |