Gene Mpe_A0335 details

Gene Information

Locus tag: Mpe_A0335
Symbol:
ID: 4786885
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 365220
End bp: 366404
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 67%
IMG OID: 640088890
Product: methyl-accepting chemotaxis protein I
Protein accession: YP_001019532
Protein GI: 124265528
COG category: [N] Cell motility; [T] Signal transduction mechanisms
COG ID: [COG0840] Methyl-accepting chemotaxis protein
TIGRFAM ID:
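
The coordinate and length fields above agree with one another, and the arithmetic can be sketched in a few lines of Python. This is an illustrative check, not part of the record. It assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a stop codon that is not counted in the protein length. The function names gene_length and protein_length are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(365220, 366404)
print(length, protein_length(length))  # 1185 394
```

Both results match the Gene Length and Protein Length fields in the record.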


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.809135
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.684623
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACTTGC TGAACTACCG ACTGACCACT CGCATCGCCG CAGGCTTCAC CATCATGGTG 
GTGCTGACGC TGGCGCTCGG GGCCCTCAGC GTCTGGAACG TGCGCTCGGC CACCGGCGAG
GCCTATGCCC TGCTCGCCGC CGTGAACGAT CCGGCCGTAT CGACCAAGCT GGGCGACCTG
CAGAAGAACG CGCAGCGCAC GCTGTGGACG ATCGCCGGCC TGGTGCTGGC GACCGCGGTG
TTCGGCGCCT GGCTGGGCTG GGCGATCCGC CAGAGCGTGC GGGGGCCCGT CGAAGACGTG
GTGGCGTCGG TGTCGCGCAT CGCCGCCGGC GACCTGGCGA CGAAGATCTC GTCCAGCGGC
CGCGACGAGA TCGCCTGGCT GAACCACGAA CTCAACCAGA TGCGCAAGAA GCTGCTGAGT
ACGATCGCCC AGGTGCGGGA ATCGGCCGAG CAGGTGTCCG TGGCCTCGAA CGAGATCGCC
TCGGGCAACA CCGACCTGAG CACCCGCACC GAGACCCAGG CGAGCGGCCT GCAGCAGACC
GCCAGCTCGA TGGAGCAGCT CACGTCGACC GTGCGTCAGA ACGCCGACAA CGCGCAGCAG
GCCAACCAGC TGGTGGTCAG CGCCAGCGAC GTGGCCAGCC GCGGCGGCGA GGTGATGACG
CAGGTGGTCT CGACCATGAA CGACATCAAC AGCAGCGCCC GCAAGATCGC CGACATCATC
GGCGTGATCG ACGGCATTGC CTTCCAGACC AACATCCTGG CGCTCAACGC GGCGGTGGAA
GCCGCTCGCG CCGGCGAGCA AGGGCGCGGC TTTGCTGTGG TGGCCGGTGA GGTGCGCAAC
CTGGCGCAGC GCAGCGCCGC TGCCGCCAAG GAAATCAAGA CGTTGATCGG CGACTCGGTC
GACAAGGTGG AAACCGGCAC GCGGCTGGTC GATCAGGCCG GCTCGACGAT GGACGAGATC
CTCGCCAGCG TGCGCCAGGT AACGCACATC ATGAGCGAGA TCAGCGTCGC CAGCCGCGAG
CAGAGCGCAG GCATCGAGCA GGTGAACCGC TCGATCGAGC AGATGGACAG CTCGACCCAG
CAGAACGCCG CGCTGGTGGA ACAGGCCGCC GCCGCATCGC ATTCGCTGCG CGATCAGTCG
CACAAGCTGA CCGAAGCGGT GAAGGTGTTC AAGCTGGCGG CCTGA
 
Protein sequence
MNLLNYRLTT RIAAGFTIMV VLTLALGALS VWNVRSATGE AYALLAAVND PAVSTKLGDL 
QKNAQRTLWT IAGLVLATAV FGAWLGWAIR QSVRGPVEDV VASVSRIAAG DLATKISSSG
RDEIAWLNHE LNQMRKKLLS TIAQVRESAE QVSVASNEIA SGNTDLSTRT ETQASGLQQT
ASSMEQLTST VRQNADNAQQ ANQLVVSASD VASRGGEVMT QVVSTMNDIN SSARKIADII
GVIDGIAFQT NILALNAAVE AARAGEQGRG FAVVAGEVRN LAQRSAAAAK EIKTLIGDSV
DKVETGTRLV DQAGSTMDEI LASVRQVTHI MSEISVASRE QSAGIEQVNR SIEQMDSSTQ
QNAALVEQAA AASHSLRDQS HKLTEAVKVF KLAA
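
The sequences above are printed in blocks of ten characters, so the spaces have to be removed before any analysis. The following is a minimal Python sketch of how the GC content and the protein translation could be recomputed from the gene sequence. It is not the tool that produced this record. It assumes the standard codon assignments, which translation table 11 shares with table 1, and it ignores the alternative start codons of table 11 (this gene starts with ATG, so that does not matter here). The names clean, gc_content and translate are hypothetical.

```python
from itertools import product

# Standard codon assignments (shared by NCBI tables 1 and 11),
# listed in TCAG order with the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAACTTGC TGAACTACCG ACTGACCACT"
print(translate(first_blocks))  # MNLLNYRLTT
```

Passing the full gene sequence to translate and gc_content allows the result to be compared with the protein sequence and the GC content field listed in the record.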