Gene Mpe_A1861 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1861 
Symbol 
ID4786741 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1989567 
End bp1990580 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content71% 
IMG OID640090431 
Producthypothetical protein 
Protein accessionYP_001021054 
Protein GI124267050 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0824281 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGCG CACTGAAGCG AGGGCTGTTC CTGCTGTTGG TGCTGGCGTT CGCTGGCGGC 
GGTCTGGCGG CCTGGTGGCT GACGCAGCCC CTGGCGCTGG CCTCGCCCAG CGTCGAGCTG
TCGGTGGAGC CGGGCACGTC GCCACGCGAG ATCGCCCAGG GGTGGGTCGA CGCAGGGGTC
CGCGCGCCGC CGCTGCTGCT CTACGAGTGG TTCCGCTGGT CCGGCCAAGC GCGCAAGATC
CGCGCGGGCA GCTACGAGAT CGGCCCGGGC ACCACGCCGC TCGCGCTGCT GAACAAGATG
GTGCGCGGCG ACGAGGCCCA GGCCACCGTG CGGCTGATCG AAGGCTGGAC CTTCCGGCAG
TTCCGCGCCG AGCTCGCGAA GGCCGAGGCG CTGAAGCCGG ACACAGCCTC GATGAACGAT
GCCGAAGTGA TGGCGGCGCT GGGCTCACCG GGCCGATCGC CGGAGGGCTG GTTCTTCCCC
GACACCTACG CCTACAGCAA GGGGGCGAGC GACCTCGCCG TGCTGCAGCG TGCCCACCGC
GCGATGCAGC GCCGCCTCGA GGCGGCCTGG CTCGAGCGGA TGCCCGACAC GCCGCTGAAG
AGCCCCGAAG AGGCCCTGAC GCTGGCGTCG ATCATCGAGA AGGAGACGGG CCAGACGGCA
GACCGCGGCA AGGTCGCCAG CGTGTTCGTG AACCGGCTGC GCATCGGCAT GCCGCTGCAG
ACCGACCCCA CGGTGATCTA CGGGCTCGGC GAGGCCTTCG ACGGCAACCT GCGGCGGCGC
GACCTGCAGG CCGACACGCC CTACAACACC TACCTGCGCA CGGGCCTGAC CCCAACACCG
ATCTCGATGC CGGGCAAGGC CTCGTTGATC GCGGCGGTGC GGCCGGAGAC GAGCCGGGCG
CTGTACTTCG TCGCCCGGGG CGACGGCAGC AGCCAGTTCA GCGAAAACCT CGCCGACCAT
AATCGGGCCG TGAACCGATA CCAGCGCGGC GGCGGCCGCG GCGCCGCGCC ATGA
 
Protein sequence
MKRALKRGLF LLLVLAFAGG GLAAWWLTQP LALASPSVEL SVEPGTSPRE IAQGWVDAGV 
RAPPLLLYEW FRWSGQARKI RAGSYEIGPG TTPLALLNKM VRGDEAQATV RLIEGWTFRQ
FRAELAKAEA LKPDTASMND AEVMAALGSP GRSPEGWFFP DTYAYSKGAS DLAVLQRAHR
AMQRRLEAAW LERMPDTPLK SPEEALTLAS IIEKETGQTA DRGKVASVFV NRLRIGMPLQ
TDPTVIYGLG EAFDGNLRRR DLQADTPYNT YLRTGLTPTP ISMPGKASLI AAVRPETSRA
LYFVARGDGS SQFSENLADH NRAVNRYQRG GGRGAAP