Gene Mpe_B0133 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_B0133 
Symbolmcp 
ID4787736 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008826 
Strand
Start bp117635 
End bp119215 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content70% 
IMG OID640092542 
Productmethyl-accepting chemotaxis protein 
Protein accessionYP_001023147 
Protein GI124262677 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.845076 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.000223015 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATCTCT CTCGCACGAG CATCAAGACC AAGCTCATCC TCGCCTTTGC CCTGCTGGCC 
GCCATCGTCG CGATGGTGTC GCTCCAGGCG CTTTCCTCCC TCTCCGGCGC CAGTCACCAG
TTCCGCGTGT TCGTCGACGG CGTGTCGCAG CGCGAGGCGC TGGCCAACCA GGTCATGGAC
GCCGCTGCGC AGCGCGCGAT CGCCGCGCGC AACCTGGTGC TGGTGACCTC GGATCAGGAC
CGCGAGATCG AGAAGGCCGC GGTGGTTCGC GCCCACGAAG AGACCGGCGC CGCGCTGAAA
CGCCTCAACG GCGCCGTGCA CGCCAAGGGC TCGGACGCCA CCGCGCGCGA TCGCGAGCTG
CTGGCAAAGA TCGAGGAGGT CGAGCAGCGC TACGGCCCGG TGGCGCTGGC CATCGTGGAC
ATGGCGCTGC ACGGCCGCAA CCAGGACGCC ATCGAGAAGA TGAACGCCGA GTGCCGGCCG
CTGCTGGCCG AGCTGTTGAA GGCGGCATCG AGCTACATCC AGTACAGCGC CGAGCGCAGC
GATGCGGCGG CGAAGGCTGC CGAGGCGGCA GTCGCGCAGG ACCGTCTCGT GCTGACGCTG
GCCAGCGTGG CCGCGGCTGC CGGCGCCATC TTCCTGGGCT GGATGCTGAG CCGCTCCATC
ACCAGCCCGT TGCGCCGCGC CGTCGCGCTG GCCGAGGCTG TTGCCTCCGG CGACCTGACC
ACGCGCATCG ATGTGGATCG CCAGGACGAG ACCGGCCAGC TGCTGGCGGC GTTGCGTCGC
ATGAACGAGA GTCTCGCGGA CATGGTGTCG CGCGTGCGGG CCTCGGCCGA CGGCATCGTC
ACTGCCTCGG AGCAGATCGC CAGCGGCAAC CAGGACCTGT CGACCCGCAC GGAGCATCAG
GCCGGCGCCC TGCAGCAGAC CGCTTCGTCG ATGCAGGAGA TGACCGACGC GGTGCAGGTC
ACTGCCGTGA GTTCCGGCCA GGCCAGCCAA CTCGCCCACG ACGCCGCGCA GACCGCGGGC
CTCGGTGGCG AAGCGGTGCA GCGTGTGGTG TCGACCATGG GCGAGATCAC CGAGTCCAGC
AGGCAGATCG CCGAGATCGT TGGCGTGATC GACAGCCTGG CCTTCCAGAC GAACATCTTG
GCGCTCAACG CCGCTGTCGA GGCAGCGCGC GCCGGCGAGC AGGGCCGCGG CTTCGCGGTG
GTTGCCGCCG AGGTGCGCAG CCTCGCCCAG CGCAGCGCCC AGGCCGCCAA GGAGATCAAG
GTGCTGATCG GCCGCAGCGT CGAGAAGGTG GAGGCGGGCG AGGCGCAGGT GGCCGAGGCC
GGCCGCACGA TGACCGGCCT GGTCTCCGGC GTGCGCCGGG TCAGCGACTT GATCGCGCAG
ATCAACGAGT CCGCGCGCGA GCAAAGCGGC GGCATCCGTC AGGTGAACCA GGCCGTGGCC
TCGCTGGACA GCGGCACGCA GCAGAACGCC GCGCTGGTCG AGGAAAGCAG CGCCGCGTCG
AGCAGCCTGC TCCAGCAAGC CGGCGCGCTG CAGCAGGCGA TGGCCTTCTT CCGCACGAAC
GCCGAGCCAG CCGCGGCCTG A
 
Protein sequence
MNLSRTSIKT KLILAFALLA AIVAMVSLQA LSSLSGASHQ FRVFVDGVSQ REALANQVMD 
AAAQRAIAAR NLVLVTSDQD REIEKAAVVR AHEETGAALK RLNGAVHAKG SDATARDREL
LAKIEEVEQR YGPVALAIVD MALHGRNQDA IEKMNAECRP LLAELLKAAS SYIQYSAERS
DAAAKAAEAA VAQDRLVLTL ASVAAAAGAI FLGWMLSRSI TSPLRRAVAL AEAVASGDLT
TRIDVDRQDE TGQLLAALRR MNESLADMVS RVRASADGIV TASEQIASGN QDLSTRTEHQ
AGALQQTASS MQEMTDAVQV TAVSSGQASQ LAHDAAQTAG LGGEAVQRVV STMGEITESS
RQIAEIVGVI DSLAFQTNIL ALNAAVEAAR AGEQGRGFAV VAAEVRSLAQ RSAQAAKEIK
VLIGRSVEKV EAGEAQVAEA GRTMTGLVSG VRRVSDLIAQ INESAREQSG GIRQVNQAVA
SLDSGTQQNA ALVEESSAAS SSLLQQAGAL QQAMAFFRTN AEPAAA