Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A1697 |
Symbol | mcp |
ID | 4785487 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | - |
Start bp | 1820198 |
End bp | 1821823 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 640090268 |
Product | methyl-accepting chemotaxis protein |
Protein accession | YP_001020892 |
Protein GI | 124266888 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.796497 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.294844 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGACAC GGCGCCGCCG CGTCGCCAAC GAATCAGAAC CTGCCGCGCC CAAGGAGGGC GCCACCATGA ACTGGATCCA GCGCCTGACC ATCGCCCGCC GCATCGGCGT CGGCTTCGGC ACCCTGCTCG TCCTGGCCGC CGTGATGGCC GGCCTGGCGC TGCTGGCCCT GCAACGCACC GGCGACGCCG TCGACCGCAT CGTGCAGGGT GAATGGGTCA AGGCCGGCGC CGCCGCCAGC ATCGACACGC TCACCCGCGC CAACGCCCGG CGCACGATGG AGCTGTTCTT CGTCGAGGGC TCCGCCGCCG CGGCCGTGCG CGAGCGCATC GCCGCCAACC GCCAGGGCAT CGACGAGGCC ATGGCCACGC TCGAGCGTCT GGTCACGCTG CCCGACGGGC GCGAGAAGCT GGCCGCGATG AAGACAGCGC GCGTGGCCTA CGTGACCGCG TTCTCCGAGG TCGACCGCCT GCTCAAGGCC GGCCAGCGTG AGGCCGCCCA GGCGCAGCTG CTGGGCAGCA CCCTGCCGGC GCTCGACGCG CTGCAGCAGC GCGTGCTGGA CATGTCGCAG TTCCAGGCCC GGCTGGCCCG CGAGACGGGG GCCGAGGTCG CGGCCCGCAT CCACCAGGCG CTGATCGGCC TGGGCGTGCT CGGCCTGGGC ATGCTGGCGG CCGCGGCGGG GCTGGGCACC TGGCTGGCCC GCGCCATCGC GCGGCCCATC GAACAGGCCG TGGCCGTGGC CGAACGCGTG GCCTCGGGCG AGCTGGGCCA TCGCCTGGAG TCCCGCCAGG GCGGCGAGCC CGGCCGCCTG ATGCACGCCC TGGCGCAGAT GGACCAGATC CTCACGCGGC TGGTGGGCCG CGTGCGCGAG GCCAGCGAGA GCATCGCCAC GGGCTCGTCG CAGATCGCCA CCGGCACCAC CGACCTCTCG CAGCGCACCG AGGAGCAGGC CTCCAACCTG CAGCAGACCG CGGCCTCGAT GGAGCAGCTG GCCGGCACCG TGCGCCACAG CGCCGACGCC GCGCGCAGCG CGTCCAGCCT CGCCCAGCAG GCGCGCGACG AGGCCGCGCA GGGTGGCGAG CTGATGCAGA CCGTGGGCCA GACCATGCAG CGCATCAGCG AGGCCAGCCG CCGCATCGGC GAGATCAACG CCGTCATCGA CGGCATCGCC TTCCAGACCA ACATCCTGGC GCTCAACGCC GCGGTGGAAG CGGCGCGCGC CGGCGAGCAC GGCCGCGGCT TCGCGGTGGT GGCGTCGGAG GTGCGGGCGC TGGCACAACG CAGCGCCGAG GCGGCGCGCT CGATCAAGGA GCTGGTGGCC GGCAGCGTGG AAAGCGTGAG CCACGGCCAG GCCAGCGTGG CGCAGGCCAC GCAGCAGATC GACGCCATCG TGGCCCGCGT GCGCAGCGTG AGCGACACCG TGGCGCAGAT CAGCGGCGCG GCCGACGAGC AGTCGCGCGG CATCGAGCAG GTCAACCAGG CGGTGACGCA GCTCGACCAG GTGACGCAAC AGAACGCGGC GCTGGTAGAG GAGAGCGCCG CGGCCTCCGA CAGCCTGAAG CACCAGGCCG AGCAGATGGT GGCGGCGGTG GGCGTGTTCC GGGTGGGTGC TGCGGTCGCG GCCTGA
|
Protein sequence | MPTRRRRVAN ESEPAAPKEG ATMNWIQRLT IARRIGVGFG TLLVLAAVMA GLALLALQRT GDAVDRIVQG EWVKAGAAAS IDTLTRANAR RTMELFFVEG SAAAAVRERI AANRQGIDEA MATLERLVTL PDGREKLAAM KTARVAYVTA FSEVDRLLKA GQREAAQAQL LGSTLPALDA LQQRVLDMSQ FQARLARETG AEVAARIHQA LIGLGVLGLG MLAAAAGLGT WLARAIARPI EQAVAVAERV ASGELGHRLE SRQGGEPGRL MHALAQMDQI LTRLVGRVRE ASESIATGSS QIATGTTDLS QRTEEQASNL QQTAASMEQL AGTVRHSADA ARSASSLAQQ ARDEAAQGGE LMQTVGQTMQ RISEASRRIG EINAVIDGIA FQTNILALNA AVEAARAGEH GRGFAVVASE VRALAQRSAE AARSIKELVA GSVESVSHGQ ASVAQATQQI DAIVARVRSV SDTVAQISGA ADEQSRGIEQ VNQAVTQLDQ VTQQNAALVE ESAAASDSLK HQAEQMVAAV GVFRVGAAVA A
|
| |