Gene Mpe_A1697 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1697 
Symbolmcp 
ID4785487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1820198 
End bp1821823 
Gene Length1626 bp 
Protein Length541 aa 
Translation table11 
GC content74% 
IMG OID640090268 
Productmethyl-accepting chemotaxis protein 
Protein accessionYP_001020892 
Protein GI124266888 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.796497 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.294844 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGACAC GGCGCCGCCG CGTCGCCAAC GAATCAGAAC CTGCCGCGCC CAAGGAGGGC 
GCCACCATGA ACTGGATCCA GCGCCTGACC ATCGCCCGCC GCATCGGCGT CGGCTTCGGC
ACCCTGCTCG TCCTGGCCGC CGTGATGGCC GGCCTGGCGC TGCTGGCCCT GCAACGCACC
GGCGACGCCG TCGACCGCAT CGTGCAGGGT GAATGGGTCA AGGCCGGCGC CGCCGCCAGC
ATCGACACGC TCACCCGCGC CAACGCCCGG CGCACGATGG AGCTGTTCTT CGTCGAGGGC
TCCGCCGCCG CGGCCGTGCG CGAGCGCATC GCCGCCAACC GCCAGGGCAT CGACGAGGCC
ATGGCCACGC TCGAGCGTCT GGTCACGCTG CCCGACGGGC GCGAGAAGCT GGCCGCGATG
AAGACAGCGC GCGTGGCCTA CGTGACCGCG TTCTCCGAGG TCGACCGCCT GCTCAAGGCC
GGCCAGCGTG AGGCCGCCCA GGCGCAGCTG CTGGGCAGCA CCCTGCCGGC GCTCGACGCG
CTGCAGCAGC GCGTGCTGGA CATGTCGCAG TTCCAGGCCC GGCTGGCCCG CGAGACGGGG
GCCGAGGTCG CGGCCCGCAT CCACCAGGCG CTGATCGGCC TGGGCGTGCT CGGCCTGGGC
ATGCTGGCGG CCGCGGCGGG GCTGGGCACC TGGCTGGCCC GCGCCATCGC GCGGCCCATC
GAACAGGCCG TGGCCGTGGC CGAACGCGTG GCCTCGGGCG AGCTGGGCCA TCGCCTGGAG
TCCCGCCAGG GCGGCGAGCC CGGCCGCCTG ATGCACGCCC TGGCGCAGAT GGACCAGATC
CTCACGCGGC TGGTGGGCCG CGTGCGCGAG GCCAGCGAGA GCATCGCCAC GGGCTCGTCG
CAGATCGCCA CCGGCACCAC CGACCTCTCG CAGCGCACCG AGGAGCAGGC CTCCAACCTG
CAGCAGACCG CGGCCTCGAT GGAGCAGCTG GCCGGCACCG TGCGCCACAG CGCCGACGCC
GCGCGCAGCG CGTCCAGCCT CGCCCAGCAG GCGCGCGACG AGGCCGCGCA GGGTGGCGAG
CTGATGCAGA CCGTGGGCCA GACCATGCAG CGCATCAGCG AGGCCAGCCG CCGCATCGGC
GAGATCAACG CCGTCATCGA CGGCATCGCC TTCCAGACCA ACATCCTGGC GCTCAACGCC
GCGGTGGAAG CGGCGCGCGC CGGCGAGCAC GGCCGCGGCT TCGCGGTGGT GGCGTCGGAG
GTGCGGGCGC TGGCACAACG CAGCGCCGAG GCGGCGCGCT CGATCAAGGA GCTGGTGGCC
GGCAGCGTGG AAAGCGTGAG CCACGGCCAG GCCAGCGTGG CGCAGGCCAC GCAGCAGATC
GACGCCATCG TGGCCCGCGT GCGCAGCGTG AGCGACACCG TGGCGCAGAT CAGCGGCGCG
GCCGACGAGC AGTCGCGCGG CATCGAGCAG GTCAACCAGG CGGTGACGCA GCTCGACCAG
GTGACGCAAC AGAACGCGGC GCTGGTAGAG GAGAGCGCCG CGGCCTCCGA CAGCCTGAAG
CACCAGGCCG AGCAGATGGT GGCGGCGGTG GGCGTGTTCC GGGTGGGTGC TGCGGTCGCG
GCCTGA
 
Protein sequence
MPTRRRRVAN ESEPAAPKEG ATMNWIQRLT IARRIGVGFG TLLVLAAVMA GLALLALQRT 
GDAVDRIVQG EWVKAGAAAS IDTLTRANAR RTMELFFVEG SAAAAVRERI AANRQGIDEA
MATLERLVTL PDGREKLAAM KTARVAYVTA FSEVDRLLKA GQREAAQAQL LGSTLPALDA
LQQRVLDMSQ FQARLARETG AEVAARIHQA LIGLGVLGLG MLAAAAGLGT WLARAIARPI
EQAVAVAERV ASGELGHRLE SRQGGEPGRL MHALAQMDQI LTRLVGRVRE ASESIATGSS
QIATGTTDLS QRTEEQASNL QQTAASMEQL AGTVRHSADA ARSASSLAQQ ARDEAAQGGE
LMQTVGQTMQ RISEASRRIG EINAVIDGIA FQTNILALNA AVEAARAGEH GRGFAVVASE
VRALAQRSAE AARSIKELVA GSVESVSHGQ ASVAQATQQI DAIVARVRSV SDTVAQISGA
ADEQSRGIEQ VNQAVTQLDQ VTQQNAALVE ESAAASDSLK HQAEQMVAAV GVFRVGAAVA
A