Gene Information |
Locus tag | Mpe_B0133 |
Symbol | mcp |
ID | 4787736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008826 |
Strand | + |
Start bp | 117635 |
End bp | 119215 |
Gene Length | 1581 bp |
Protein Length | 526 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640092542 |
Product | methyl-accepting chemotaxis protein |
Protein accession | YP_001023147 |
Protein GI | 124262677 |
COG category | [N] Cell motility [T] Signal transduction mechanisms |
COG ID | [COG0840] Methyl-accepting chemotaxis protein |
TIGRFAM ID | |
|
|
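The coordinate and length fields above are mutually consistent. As a minimal sketch (the function names are hypothetical and not part of the record), assume 1-based inclusive coordinates and a protein length that excludes the stop codon:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumption: the CDS length includes the terminal stop codon, which
    does not contribute an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(117635, 119215)
print(length)                     # 1581
print(protein_length_aa(length))  # 526
```

Both results match the Gene Length (1581 bp) and Protein Length (526 aa) fields listed in the record.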
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.845076 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.000223015 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATCTCT CTCGCACGAG CATCAAGACC AAGCTCATCC TCGCCTTTGC CCTGCTGGCC GCCATCGTCG CGATGGTGTC GCTCCAGGCG CTTTCCTCCC TCTCCGGCGC CAGTCACCAG TTCCGCGTGT TCGTCGACGG CGTGTCGCAG CGCGAGGCGC TGGCCAACCA GGTCATGGAC GCCGCTGCGC AGCGCGCGAT CGCCGCGCGC AACCTGGTGC TGGTGACCTC GGATCAGGAC CGCGAGATCG AGAAGGCCGC GGTGGTTCGC GCCCACGAAG AGACCGGCGC CGCGCTGAAA CGCCTCAACG GCGCCGTGCA CGCCAAGGGC TCGGACGCCA CCGCGCGCGA TCGCGAGCTG CTGGCAAAGA TCGAGGAGGT CGAGCAGCGC TACGGCCCGG TGGCGCTGGC CATCGTGGAC ATGGCGCTGC ACGGCCGCAA CCAGGACGCC ATCGAGAAGA TGAACGCCGA GTGCCGGCCG CTGCTGGCCG AGCTGTTGAA GGCGGCATCG AGCTACATCC AGTACAGCGC CGAGCGCAGC GATGCGGCGG CGAAGGCTGC CGAGGCGGCA GTCGCGCAGG ACCGTCTCGT GCTGACGCTG GCCAGCGTGG CCGCGGCTGC CGGCGCCATC TTCCTGGGCT GGATGCTGAG CCGCTCCATC ACCAGCCCGT TGCGCCGCGC CGTCGCGCTG GCCGAGGCTG TTGCCTCCGG CGACCTGACC ACGCGCATCG ATGTGGATCG CCAGGACGAG ACCGGCCAGC TGCTGGCGGC GTTGCGTCGC ATGAACGAGA GTCTCGCGGA CATGGTGTCG CGCGTGCGGG CCTCGGCCGA CGGCATCGTC ACTGCCTCGG AGCAGATCGC CAGCGGCAAC CAGGACCTGT CGACCCGCAC GGAGCATCAG GCCGGCGCCC TGCAGCAGAC CGCTTCGTCG ATGCAGGAGA TGACCGACGC GGTGCAGGTC ACTGCCGTGA GTTCCGGCCA GGCCAGCCAA CTCGCCCACG ACGCCGCGCA GACCGCGGGC CTCGGTGGCG AAGCGGTGCA GCGTGTGGTG TCGACCATGG GCGAGATCAC CGAGTCCAGC AGGCAGATCG CCGAGATCGT TGGCGTGATC GACAGCCTGG CCTTCCAGAC GAACATCTTG GCGCTCAACG CCGCTGTCGA GGCAGCGCGC GCCGGCGAGC AGGGCCGCGG CTTCGCGGTG GTTGCCGCCG AGGTGCGCAG CCTCGCCCAG CGCAGCGCCC AGGCCGCCAA GGAGATCAAG GTGCTGATCG GCCGCAGCGT CGAGAAGGTG GAGGCGGGCG AGGCGCAGGT GGCCGAGGCC GGCCGCACGA TGACCGGCCT GGTCTCCGGC GTGCGCCGGG TCAGCGACTT GATCGCGCAG ATCAACGAGT CCGCGCGCGA GCAAAGCGGC GGCATCCGTC AGGTGAACCA GGCCGTGGCC TCGCTGGACA GCGGCACGCA GCAGAACGCC GCGCTGGTCG AGGAAAGCAG CGCCGCGTCG AGCAGCCTGC TCCAGCAAGC CGGCGCGCTG CAGCAGGCGA TGGCCTTCTT CCGCACGAAC GCCGAGCCAG CCGCGGCCTG A
|
Protein sequence | MNLSRTSIKT KLILAFALLA AIVAMVSLQA LSSLSGASHQ FRVFVDGVSQ REALANQVMD AAAQRAIAAR NLVLVTSDQD REIEKAAVVR AHEETGAALK RLNGAVHAKG SDATARDREL LAKIEEVEQR YGPVALAIVD MALHGRNQDA IEKMNAECRP LLAELLKAAS SYIQYSAERS DAAAKAAEAA VAQDRLVLTL ASVAAAAGAI FLGWMLSRSI TSPLRRAVAL AEAVASGDLT TRIDVDRQDE TGQLLAALRR MNESLADMVS RVRASADGIV TASEQIASGN QDLSTRTEHQ AGALQQTASS MQEMTDAVQV TAVSSGQASQ LAHDAAQTAG LGGEAVQRVV STMGEITESS RQIAEIVGVI DSLAFQTNIL ALNAAVEAAR AGEQGRGFAV VAAEVRSLAQ RSAQAAKEIK VLIGRSVEKV EAGEAQVAEA GRTMTGLVSG VRRVSDLIAQ INESAREQSG GIRQVNQAVA SLDSGTQQNA ALVEESSAAS SSLLQQAGAL QQAMAFFRTN AEPAAA
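The sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before any analysis. The following is a minimal sketch, not the database's own method, of how the GC content and basic CDS framing could be checked. All function names are hypothetical, the toy input is illustrative rather than taken from the record, and only ATG is treated as a start codon even though translation table 11 also permits alternatives such as GTG and TTG.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS_TABLE_11 = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Strip the whitespace used for display and normalize to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Rough framing check: multiple of 3, ATG start, table 11 stop codon at the end.

    Assumption: alternative start codons are ignored for simplicity.
    """
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS_TABLE_11
    )


# Toy input in the same blocked layout as the record (illustrative only).
toy = clean_sequence("ATGGCCGCGG CCTGA")
print(len(toy), looks_like_complete_cds(toy), round(gc_fraction(toy), 2))
```

Passing the full Gene sequence field through `clean_sequence` first would allow the same checks to be compared against the reported Gene Length and GC content.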