Gene EcSMS35_2183 details


Gene Information

Locus tag: EcSMS35_2183
Symbol:
ID: 6144480
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2190501
End bp: 2191460
Gene Length: 960 bp
Protein Length: 319 aa
Translation table: 11
GC content: 53%
IMG OID: 641617059
Product: alkanesulfonate transporter substrate-binding subunit
Protein accession: YP_001744233
Protein GI: 170683457
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family
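
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record. It assumes that the coordinates are 1-based and inclusive, and that the CDS length includes the stop codon; the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2190501
END_BP = 2191460
GENE_LENGTH_BP = 960
PROTEIN_LENGTH_AA = 319


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, both comparisons hold for the values listed in this record.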


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value: 0.680381
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTAAGC TCATTAAACT GGTGCTGGCG GGTTTACTTA GCGTCTCTAC ACTTGCTGTT 
GCCGCGGAGT CTTCGCCAGA GGCGTTACGA ATTGGCTATC AGAAAGGCAG TATTGGTATG
GTGCTGGCAA AAAGCCACCA ATTACTGGAA AAACGCTATC CGCAAACAAA AATCTCCTGG
GTGGAGTTCC CGGCTGGCCC ACAGATGCTG GAGGCGTTAA ACGTTGGCAG TATTGATCTC
GGCAGTACCG GGGATATTCC GCCAATCTTT GCCCAGGCTG CCGGGGCTGA TTTGGTGTAC
GTGGGCGTCG AGCCGCCGAA GCCCAAAGCC GAAGTGATTC TGGTGGCAGA TAACAGCCCA
ATCAAAACCG TAGCCGATCT TAAAGGTCAT AAAGTTGCCT TTCAGAAAGG TTCCAGCTCG
CACAATCTTT TACTACGCGC ACTACGCCAG GCCGGGCTTA AATTCACGGA TATCCAGCCC
ACTTACCTGA CGCCTGCCGA TGCCCGCGCC GCGTTCCAGC AAGGTAACGT TGACGCCTGG
GCTATCTGGG ATCCCTACTA CTCCGCTGCA TTATTACAGG GCGGCGTGCG GGTGTTGAAA
GACGGCACCG ATCTCAATCA AACCGGATCG TTTTATCTGG CAGCCCGTCC GTATGCAGAA
AAAAACGGCG CTTTTATTCA GGGCGTACTG GCAACCTTTA GTGAGGCCGA TGCGTTAACC
CGCAGCCAGC GCGAACAAAG CATCGCTTTA CTGGCAAAAA CGATGGGCTT ACCGGCACCG
GTTATTGCCT CGTATCTGGA TCATCGTCCT CCCACCACCA TCAAACCGTT GAGCGCTGAA
GTTGCCGCCT TACAGCAGCA AACGGCAGAT CTGTTTTATG AAAACCGTCT GGTGCCGAAA
AAAGTCGATA TTCGCCAGCG CATCTGGCAA CCCACTCAAC TGGAAGGAAA ACAATTATGA
 
Protein sequence
MRKLIKLVLA GLLSVSTLAV AAESSPEALR IGYQKGSIGM VLAKSHQLLE KRYPQTKISW 
VEFPAGPQML EALNVGSIDL GSTGDIPPIF AQAAGADLVY VGVEPPKPKA EVILVADNSP
IKTVADLKGH KVAFQKGSSS HNLLLRALRQ AGLKFTDIQP TYLTPADARA AFQQGNVDAW
AIWDPYYSAA LLQGGVRVLK DGTDLNQTGS FYLAARPYAE KNGAFIQGVL ATFSEADALT
RSQREQSIAL LAKTMGLPAP VIASYLDHRP PTTIKPLSAE VAALQQQTAD LFYENRLVPK
KVDIRQRIWQ PTQLEGKQL
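
The relationship between the gene sequence and the protein sequence above can be spot-checked by translating a few codons. The sketch below is illustrative and is not part of the record. It uses the standard codon-to-amino-acid mapping, which translation table 11 shares for internal codons; table 11 differs mainly in the alternative start codons it allows, and those do not matter here because this gene starts with ATG. The helper name `translate` and the two excerpt variables are hypothetical. The excerpts are copied from the first and last lines of the gene sequence.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence (three 10-base blocks).
FIRST_30 = "ATGCGTAAGC TCATTAAACT GGTGCTGGCG"
# Last 18 bases of the gene sequence, trimmed so that they start in frame.
LAST_18 = "GAAGGAAA ACAATTATGA"

if __name__ == "__main__":
    print(translate(FIRST_30))
    print(translate(LAST_18))
```

The two excerpts should correspond to the first ten residues (MRKLIKLVLA) and the last five residues (EGKQL) of the protein sequence listed above. The full sequence can be checked the same way by passing all 960 bases to `translate`.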