Gene Mpe_A2628 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2628 
Symbol 
ID4783649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2802802 
End bp2804673 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content74% 
IMG OID640091199 
Productchorismate binding enzyme 
Protein accessionYP_001021817 
Protein GI124267813 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.530164 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGATGA GCGACGACGT TGACGCCTTT GCGCTGCTGG ACGACCGCGC GGCGACGGCC 
GAGCGGCCGA GCAGCCGGCT CTACACGGAC CATGTGCGGA CGCACCGCTG CGAGGATCCG
GCCGCGCTCG ACGCGGTGTG GCGCGCGGTC GAGGCCGACC AGCGCGCCGG CCTGCACGCG
GTGCTGCTGG CCGACTACGA ATGGGGCGCG AAGCTGCTGC AGGCCGGTCG CCGCGCCGGC
GACGCGCGGG CCGCGCTGCG CGTGCTGATG TTCCGCGAAC TGCGGCGGCT GTCGGCCGAC
GAGGCCGGGC ACTGGCTGGC CACGCGGGAG GGTTCGGCCG AGCCTGGCCC GGCCGGCGCG
CTGGACGTGC AGGCCAGCGT GGACCGCACC GCCTTCCACG CCGCGATCGG CCGCATCCAC
GAGGCCATCC GCGCCGGCGA GACCTACCAG GTCAACTACA CCTACCGGCT GGACTTCCAG
GCCCACGGCA CGCCGGTCGC GCTCTACCGC CGGCTGCGTG CGCGCCAGCC GGTGCCCTAT
GGCGCCTTCA TCGCGCTGCC GGCCGACGAG GTGGCCGCGA GCGGCGTCGC ACACGTGCTG
TCGTGCTCGC CGGAGCTGTT CGTCCGCCAT GCGCAGGGCC GGCTCGTCGC CAAGCCGATG
AAGGGCACGG CGCCGCGCCG TGCGGTGCTC GAAGGCGACA GCGAGACCGC CCGGCTGCTG
AGTCTGGACA CCAAGAACCG CGCCGAGAAC CTGATGATCG TCGACTTGCT GCGCAACGAC
CTCGGTCGCA TCGCCCGCAC CGGCTCGGTG AAGGTGCCGG AGCTGTTCGA GGTGGAGTCG
CATGCCACCG TGTTCCAGAT GACCTCGACG GTCAGTGCCG AGCTGGCACC CGGCACCGAC
CTGCCGGCGG TGCTGCGCGC GGTGTTTCCC TGCGGCTCGA TCACCGGTGC CCCCAAGCAC
CACACGGTGG AGCTGATCGC CGGGCTGGAA AGCACGCCGC GCGGCCTGTA CTGTGGCGCC
ATCGGCTGGG TCGACCTGCC GACCGCGGCG CCGGACGGGG CCTGCGGCGA CTTCTGCCTG
TCGGTGGCGA TCCGCACGCT GACCCTGGGC CCGCCCGCGC CCGACGGCCT GCGCGCGGGC
CGGCTGGGCA TCGGCGCCGG CATCGTGCTG GACAGCGAGG CCGAGGACGA ATACGGGGAA
TGCCAGCTGA AGGGGCGTTT CCTGACCGGC CTCGATCCGG GCTTCTCGAT CTTCGAGACC
CTGCACGCGA CGCGCGAGCA GGGCGTGCGC CATCTCGACC GTCATCTCGC GCGCCTGCAG
CGCAGTGCCA CCGCGCTGGG TTTTCGCTGG GACGAGATCG AACTGCGCGA GGCGCTGCAG
GAGCAGGTCG GCCGCCTGCC CGCCGGCACG CCCTGCCGGC TGCGCCTGTC GCTCGACCGC
GCCGGCCGCT GCGAGTTCAC CGGCGCGCCG CTCACCGCCC TGCCCGCCGG CCCGGTGACG
CTGCGGCTGG AGCCGCAGCC GCTGCCCGAG CCGCGGCCGC TGGCGGGCCA CAAGACCACG
CTGCGCGCCG CCTACGACGC CGGCGTGCGG GCCGCCGAGG CAGTCGGCGC CTTCGACAGC
CTGTTCTTCA CGGCCGACGG CCGACTGGTC GAGGGCGGCC GCAGCAACGT GTTCCTGCAA
CTCGACGGCC GCTGGTGGAC ACCGCCGCTC GCCGACGGCG CGCTGCCCGG CGTGATGCGT
GGCCTGCTGC TCGAGGATCC GGCCTGGGCT GCCGCGGAGC GCCCGCTGAC GCGCGCCGAC
CTGGCGCGGG CCGAGGCCGT GGTCGTGTGC AACGCGCTGC GCGGCGCCGT GCCGGCCCGG
TTGGCGACAT GA
 
Protein sequence
MPMSDDVDAF ALLDDRAATA ERPSSRLYTD HVRTHRCEDP AALDAVWRAV EADQRAGLHA 
VLLADYEWGA KLLQAGRRAG DARAALRVLM FRELRRLSAD EAGHWLATRE GSAEPGPAGA
LDVQASVDRT AFHAAIGRIH EAIRAGETYQ VNYTYRLDFQ AHGTPVALYR RLRARQPVPY
GAFIALPADE VAASGVAHVL SCSPELFVRH AQGRLVAKPM KGTAPRRAVL EGDSETARLL
SLDTKNRAEN LMIVDLLRND LGRIARTGSV KVPELFEVES HATVFQMTST VSAELAPGTD
LPAVLRAVFP CGSITGAPKH HTVELIAGLE STPRGLYCGA IGWVDLPTAA PDGACGDFCL
SVAIRTLTLG PPAPDGLRAG RLGIGAGIVL DSEAEDEYGE CQLKGRFLTG LDPGFSIFET
LHATREQGVR HLDRHLARLQ RSATALGFRW DEIELREALQ EQVGRLPAGT PCRLRLSLDR
AGRCEFTGAP LTALPAGPVT LRLEPQPLPE PRPLAGHKTT LRAAYDAGVR AAEAVGAFDS
LFFTADGRLV EGGRSNVFLQ LDGRWWTPPL ADGALPGVMR GLLLEDPAWA AAERPLTRAD
LARAEAVVVC NALRGAVPAR LAT