Gene Mpe_A2920 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2920 
Symbolaer 
ID4784550 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp3104066 
End bp3105715 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content69% 
IMG OID640091491 
Productaerotaxis sensor receptor 
Protein accessionYP_001022108 
Protein GI124268104 
COG category[N] Cell motility
[T] Signal transduction mechanisms 
COG ID[COG0840] Methyl-accepting chemotaxis protein
[COG2202] FOG: PAS/PAC domain 
TIGRFAM ID[TIGR00229] PAS domain S-box 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCTGA ACACCCCCGT GACAGCCAGC GAGTACCCCT TCCCCGCCGG CGAAACCCTG 
GTCTCGACGA CCGACCTGAA GGGGCGCATC ACCTACTGCA ACCCGGCCTT CATCACGGTC
AGCGGCTACG CACGGGAAGA ACTGCTGGGC CAGCCGCACA ACATGATCCG CCACCCCGAC
ATGCCCGAGG AGGCCTTCCG CGACATGTGG GCCACCATCG CCTCCGGCCA ACCGTGGTCG
GCGCCGGTCA AGAACCGTCG CAAGGACGGT AGCTGCTACT GGGTGATCGC GAACGTCACG
CCGCTCATGA GCGGCGACCA GCCGACCGGC TACATGTCGG TGCGGACGGC CCCGGATCGC
GCCGATGTCG AGCGTGCCGA GGAACTCTAT GCAGTGATGC GTACCGAGAA GGCCGCCGGA
CAACTGGTCC ACCGTTTCGA CGCCGGCCGC CTGCGGGTCG ACACCTGGCT GGGCCGCCTC
TCGAATGTGC TGCGCCTGCC GGTTCAGGTG AAGCTCGGTG TCGTCGCCGC GCTGCTGGGA
TCGGCCGGCT TCGTGACGGG TGCGGCCGTC ACCGGCGGTA GCCTGCTGCA GTGGCCTGCG
CTGGGCCCGG CGCTGACGGC CATCGCCACG CTGCTGCTCT GCTCGGTGGC CGCCGCCGCC
TACTTGAACC GGATGACGAT CGCTCCGCCC GCGCAGCTGG TGCAGTTCGC CAATCGCATG
GCCGCCGGCG ACCTGACGCA GAAGATCGAT ATCGACCGCC ACGACCTCGT CGGCCAGCTC
ACGAAAGGAC TGAACCAGTT GAACGTGAAC CTGCAGTCCA TCGTGCGCGA CGCGCGCAAC
GAGATCGAGC AGATGCGCGT CGTCACCGGC GAGATCGCCT CGGGCAACCA GGATCTCTCG
GGACGGACCG AGTCGCAGGC GAGCAGCCTG CAGGAGACCG CCGCCTCGAT GGAACAGATC
ACCGGCACCG TGCGCCAGAG CGCCGATTCC GCCGAGCAGG CCACGCGCCT GGCCACCCAG
GCCACGGCGG TGACCCAGCG CAGCAGCGAC GCGGTGCAGG ACGTCACCCG CACGATGGGC
GAGATCAGCG CATCGTCGCA GCGCATCGGC GAGATCATCC AGGTGATCGA CGGGATCGCA
TTCCAGACCA ACATCCTCGC GCTCAACGCC GCGGTCGAGG CCGCCCGGGC CGGTGAACAG
GGACGCGGCT TCTCGGTCGT GGCCTCCGAG GTGCGGGCGC TCGCGCAGCG CACTTCGTCG
GCGGCGAAGG AGGTCAAGCA ACTGATCGAG GACTCGGCCG CCAAGGTCGA CACCGGCAGC
CGCCTCACCG ACGCGGCCCG GTCGACGATG GACGACGCCT TGCGCACGGT GCAGCAGGTC
GGCCAACTGA TCAGCGAGAT CAGCCACGGC GCGCGCGAGC AGCTGACCGG CATCTCGCAG
ATCAACGAAG CGGTCACGCA GCTCGACACG ATCACGCAGC AGAACGCCGC GCTGGTCGAG
CAGATGGCGG CATCGGCGGT GTCGCTGTCG CAGCAGTCGG GCACGCTGGC GGAAACGGTG
AAGGTGTTCC GCCTCGATGG CAGCGCATCG GCCACGCCCG ACGCGGTGGC GTTGCGCCGC
TCGATGAAGC AAGTCACCCA CGCTGCCTGA
 
Protein sequence
MRLNTPVTAS EYPFPAGETL VSTTDLKGRI TYCNPAFITV SGYAREELLG QPHNMIRHPD 
MPEEAFRDMW ATIASGQPWS APVKNRRKDG SCYWVIANVT PLMSGDQPTG YMSVRTAPDR
ADVERAEELY AVMRTEKAAG QLVHRFDAGR LRVDTWLGRL SNVLRLPVQV KLGVVAALLG
SAGFVTGAAV TGGSLLQWPA LGPALTAIAT LLLCSVAAAA YLNRMTIAPP AQLVQFANRM
AAGDLTQKID IDRHDLVGQL TKGLNQLNVN LQSIVRDARN EIEQMRVVTG EIASGNQDLS
GRTESQASSL QETAASMEQI TGTVRQSADS AEQATRLATQ ATAVTQRSSD AVQDVTRTMG
EISASSQRIG EIIQVIDGIA FQTNILALNA AVEAARAGEQ GRGFSVVASE VRALAQRTSS
AAKEVKQLIE DSAAKVDTGS RLTDAARSTM DDALRTVQQV GQLISEISHG AREQLTGISQ
INEAVTQLDT ITQQNAALVE QMAASAVSLS QQSGTLAETV KVFRLDGSAS ATPDAVALRR
SMKQVTHAA