Gene Mpe_A0114 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A0114 
SymbolssuA 
ID4784516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp118148 
End bp119143 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content71% 
IMG OID640088661 
Productsulfonate binding protein 
Protein accessionYP_001019311 
Protein GI124265307 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.802065 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00280198 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCTTCC ATCGACATGC GTTGTTCGCC GGCGTGCTGG CGCTGGCCAC CGGGCTGCTG 
GGGTTCGCGC CGGGGGCGCA CAGCCAGCCG GCCGCACCGA AGGAGATCCG CATCGGCTTC
CAGAAGAGCG CCGTCAACCT GGTGATCCTC AAGCAGCAGG GTGCGCTGGA GAAGCGCTTT
CCCGACAGCA AGGTGTCGTG GATCGAGTTC CCGGCCGGGC CGCAGCTGCT GGAGGCACTG
GCGGTCGGCA GCCTGGAGAT CGGTCTGACC GGCGACTCGC CGCCGGTGTT CGCGCAGGCG
GCCGGCAAGG ACCTGCGCTA CGTCGGCGCC GAGCCGCCCA AGCCGCAGAG TTCGGCCATC
CTCGTGAAGC CCGACTCGCC GCTGCGCACG CTGGCCGACC TGAAGGGCAG GAAGGTCGCG
TTCCAGAAGG GCTCCAGCGC GCATTACCTC GTGGTGCGCG CGCTGGCGCA GGCCGGGCTG
CAGTGGAGCG ACATCACGCC GATCTACCTG CCGCCGTCGG ACGCGCGTGC CGCCTTCGAG
CGCGGCAGCG TCGACGCCTG GGCCATCTGG GACCCCTACT ACGCCGCGAC CGAGCTCGAC
ATCCAACCGC GCGTGCTGAG CAATGGCGTG GGCCTGTCGG GCAACAACTC CTTCTACCTG
GCATCGACCG CGTTCACGCA GAACCACCCG CAAGCGGTGC AGGTCCTGCT CGACGAGCTG
ACGCGGGCCG ATGCCTACGT GCAGTCGCAC CGCAAGGAGT CCGCGCAGTT CATCGCCGAC
TTCAGCGGCC TGAGCCTGGC GACGGTGCAC CTGTTCATTT CGCGCCGCCC GCCATCGCCG
GTGAAGCCGC TGTCGCCGGC GCTGGTGGCC GACCAGCAGC GTGTGGCCGA TGCCTTCCAG
CAGCTCGGGC TGATCCCCAA GCCGGTGGCG GTGGCCGAGA TCGTGTGGCA GCCCGGCGCC
CCGGGGGCGG CGCGCCTCGC GAACGCCGCC CGCTGA
 
Protein sequence
MSFHRHALFA GVLALATGLL GFAPGAHSQP AAPKEIRIGF QKSAVNLVIL KQQGALEKRF 
PDSKVSWIEF PAGPQLLEAL AVGSLEIGLT GDSPPVFAQA AGKDLRYVGA EPPKPQSSAI
LVKPDSPLRT LADLKGRKVA FQKGSSAHYL VVRALAQAGL QWSDITPIYL PPSDARAAFE
RGSVDAWAIW DPYYAATELD IQPRVLSNGV GLSGNNSFYL ASTAFTQNHP QAVQVLLDEL
TRADAYVQSH RKESAQFIAD FSGLSLATVH LFISRRPPSP VKPLSPALVA DQQRVADAFQ
QLGLIPKPVA VAEIVWQPGA PGAARLANAA R