Gene Mpe_A1425 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1425 
Symbol 
ID4783938 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1533405 
End bp1534511 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content70% 
IMG OID640089991 
Productchorismate synthase 
Protein accessionYP_001020622 
Protein GI124266618 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0082] Chorismate synthase 
TIGRFAM ID[TIGR00033] chorismate synthase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0865922 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGGCA GCACCCTCGG CCACTTGTTT TGCGTCACCA ACTTCGGTGA ATCCCACGGA 
CCCGCCATCG GTTGCGTGAT CGACGGCTGT CCGCCCGGCA TGAGCCTGAG CGAGGCCGAC
ATCCAGCCCG AGCTGGACCG CCGCCGTCCC GGCACCTCGC GCCACGTGAC CCAGCGCAAT
GAGCCCGACG CGGTGGAGAT CCTGTCCGGC GTCCACGAGG GCCGCACCAC CGGCACACCG
ATCTGCCTGC TGATCCGCAA CACCGACCAG CGCAGCAAGG ACTACGGCAA CATCGTCCAG
ACCTTCCGCC CCGGCCACGC CGACTACACC TACTGGCACA AGTACGGCCT GCGCGACCCG
CGCGGCGGTG GTCGCAGCTC GGCGCGCCTC ACCGCGCCGA TGGTCGGTGC CGGCGCGGTG
GCGAAGAAGT GGCTCAAGGA ACACCACGGC ATCGCCTTTC GCGGCGGCAT GGCGGCGCTG
GGCGAGATCG ACATCGCCTT CGAGGGCTGG CAGCATGTGC CGGACAACCC CTTCTTCGCG
CCCAACGCCA GCCAGATCGG CCAGCTCGAG GACTTCATGG ACGCCTTGCG CAAGGAGGGC
GATTCGGTCG GCGCGCGCAT CGTTGTCGAG GCCACCGGCG TGCCGGTCGG CTGGGGCGAG
CCGCTGTTCG ACAAGCTCGA CGCCGACATC GCCCACGTGA TGATGGGGCT CAATGCCGTC
AAGGGCGTCG AGATCGGTGC GGGCTTCGCG AGCGTCGCGC ACCGCGGCTC GATGCACGGC
GACGAACTCA CGCCCCAGGG CTTTCGCAGC AACCACGCCG GCGGTGTGCT CGGCGGCATC
AGCACCGGGC AGGATATCCG CGTATCGATC GCGATCAAGC CCACCAGCTC GATCCGCACG
CCACGCCAGT CGATCGACCT GCAGGGCCAG CCGGCGACGG TCGAGACCTT CGGCCGCCAC
GACCCCTGCG TCGGCATCCG CGCCACGCCG ATCGCCGAGG CGCTGCTGGC GCTGGTGCTG
ATGGACCATG CACTGCGCCA CCGCGCGCAG TGCGGCGACG TGCGCCTGCC GGTGGCGCCG
ATCGCGGCGC ACCTGCCGGA CGCCTGA
 
Protein sequence
MSGSTLGHLF CVTNFGESHG PAIGCVIDGC PPGMSLSEAD IQPELDRRRP GTSRHVTQRN 
EPDAVEILSG VHEGRTTGTP ICLLIRNTDQ RSKDYGNIVQ TFRPGHADYT YWHKYGLRDP
RGGGRSSARL TAPMVGAGAV AKKWLKEHHG IAFRGGMAAL GEIDIAFEGW QHVPDNPFFA
PNASQIGQLE DFMDALRKEG DSVGARIVVE ATGVPVGWGE PLFDKLDADI AHVMMGLNAV
KGVEIGAGFA SVAHRGSMHG DELTPQGFRS NHAGGVLGGI STGQDIRVSI AIKPTSSIRT
PRQSIDLQGQ PATVETFGRH DPCVGIRATP IAEALLALVL MDHALRHRAQ CGDVRLPVAP
IAAHLPDA