Gene Mpe_A0113 details


Gene Information

Locus tag: Mpe_A0113
Symbol: ssuA
ID: 4784515
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylibium petroleiphilum PM1
Kingdom: Bacteria
Replicon accession: NC_008825
Strand:
Start bp: 117100
End bp: 118137
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 70%
IMG OID: 640088660
Product: sulfonate binding protein
Protein accession: YP_001019310
Protein GI: 124265306
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
            [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family
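
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive (which is consistent with the stated gene length) and that the final codon is an untranslated stop codon. The function names are illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
gene_len = span_length(117100, 118137)
print(gene_len)                           # 1038
print(expected_protein_length(gene_len))  # 345
```

Both results agree with the Gene Length and Protein Length fields of the record.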


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00282136
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCCATT CGGATTCTTC TTCCTCGTCC CGCTGGAGAC GGCTGCCGCA GTCGGCCCGT 
CGCCACCTGC TGCAGGCCTT CGGCGCCGCC GCCGGCGCCG CGGCGCTCGG CACCTGGCCG
CTGGCGCGTG CACAGTCCTC GGGCGCGGCG CGCGAGCCGC TGCGCGTGGG CTACCAGAAG
TCGGCCAGCC TGTTCGTGCT GCAGAAGGCG CAGGGCTCGC TGGAGAAGAA GCTCGCGCCG
CTGGGCGTGG GTGTGAAGTG GATCGAGTTC CCGGCCGGGC CGCAGTTGCT GGAAGGCCTG
AACGTCGGCT CGGTCGACAT CGGCCATGTG GGCGAGGCGC CGCCGATCTT CGCGCAGGCG
GCCGGCGCCG ACTTCGTCTA CATCGGCCAC GACCCGGCCG CGCCGGAGGC CGAGGCCATC
GTCGTGCCGC AGGGCTCGGC GATCAGGAGC GTGGCCGAGC TCAACGGCAG GAAGGTCGCG
CTGAACAAGG GCTCGAACGT GCACTACCTG CTGGTGCGCG CGCTCGAGAA GGCCGGCCTG
AAGTACGCCG ACATCCAGCC GGTCTTCCTG CCGCCGGCCG ATGCGCGTGC CGCCTTCGAG
AAGGGCGCGG TCGATGCCTG GGCGATCTGG GATCCCTTCC TCGCCGCGGT CGAGAAGCAG
ACCGGTGCGC GCGTGCTGGT CGACGGCCGC AACGGCGTCG CCAACAACTA CCTGTTCTAC
CTGGCCGAGC GCAAGTTCGT GCAGAAGAAC GGCGACGTGA TCCAGGCGCT GTTCGCCGAT
TCGCAGGAGC AGGGCCGCTG GCTGAAGGCC GACTTGAAGC GCGCCGCGGC GATCATTGCG
CCACTGCAGG GCCTGGACCC GGAGATCGTC GAGCTCGCGC TGCGCCGCTA CAACTTCAAT
GTCACGCCGC TCAGCGAGCA GGTCGCGGCG CAGCAGCAGC AGATCGCCGA CGTGTTCCAC
GAGCTCAAGC TGATCCCCAA GCCGATCCGC GTGGCCGACG CGCTGCCCGC GGTGCGCGTC
GCGCAGAAGC AGCCCTGA
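
The gene sequence above is listed in space-separated blocks of 10 bases, so it needs cleaning before any computation. The following is a minimal sketch of how the listing could be joined and its length and GC content checked against the Gene Length and GC content fields. The helper names `clean_sequence` and `gc_percent` are hypothetical, not part of any tool referenced by this page.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Only the first listing line is pasted here for brevity; paste all lines
# of the gene sequence to check the record's 1038 bp and 70% GC figures.
first_line = "ATGAGCCATT CGGATTCTTC TTCCTCGTCC CGCTGGAGAC GGCTGCCGCA GTCGGCCCGT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```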
 
Protein sequence
MSHSDSSSSS RWRRLPQSAR RHLLQAFGAA AGAAALGTWP LARAQSSGAA REPLRVGYQK 
SASLFVLQKA QGSLEKKLAP LGVGVKWIEF PAGPQLLEGL NVGSVDIGHV GEAPPIFAQA
AGADFVYIGH DPAAPEAEAI VVPQGSAIRS VAELNGRKVA LNKGSNVHYL LVRALEKAGL
KYADIQPVFL PPADARAAFE KGAVDAWAIW DPFLAAVEKQ TGARVLVDGR NGVANNYLFY
LAERKFVQKN GDVIQALFAD SQEQGRWLKA DLKRAAAIIA PLQGLDPEIV ELALRRYNFN
VTPLSEQVAA QQQQIADVFH ELKLIPKPIR VADALPAVRV AQKQP
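
The protein sequence can be reproduced from the gene sequence using translation table 11, as stated in the record. The sketch below builds the codon table by hand rather than relying on a library. Table 11 assigns the same residues to codons as the standard code and differs only in which codons may act as start codons; since this gene begins with ATG, alternative starts are not handled here. The function name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace in the grouped listing is ignored. Alternative start
    codons are NOT converted to Met (a simplification of this sketch).
    """
    seq = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and the final line of the gene sequence above.
print(translate("ATGAGCCATT CGGATTCTTC TTCCTCGTCC"))  # MSHSDSSSSS
print(translate("GCGCAGAAGC AGCCCTGA"))               # AQKQP
```

The two fragments match the first and last residues of the protein sequence listed above.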