Gene Information |
Locus tag | EcSMS35_2183 |
Symbol | |
ID | 6144480 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2190501 |
End bp | 2191460 |
Gene Length | 960 bp |
Protein Length | 319 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617059 |
Product | alkanesulfonate transporter substrate-binding subunit |
Protein accession | YP_001744233 |
Protein GI | 170683457 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family |
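The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the terminal stop codon


# Values taken from the record above (Start bp, End bp).
length = gene_length_bp(2190501, 2191460)
print(length, protein_length_aa(length))  # prints "960 319"
```

Under these assumptions the result matches the Gene Length (960 bp) and Protein Length (319 aa) fields.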
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.680381 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAAGC TCATTAAACT GGTGCTGGCG GGTTTACTTA GCGTCTCTAC ACTTGCTGTT GCCGCGGAGT CTTCGCCAGA GGCGTTACGA ATTGGCTATC AGAAAGGCAG TATTGGTATG GTGCTGGCAA AAAGCCACCA ATTACTGGAA AAACGCTATC CGCAAACAAA AATCTCCTGG GTGGAGTTCC CGGCTGGCCC ACAGATGCTG GAGGCGTTAA ACGTTGGCAG TATTGATCTC GGCAGTACCG GGGATATTCC GCCAATCTTT GCCCAGGCTG CCGGGGCTGA TTTGGTGTAC GTGGGCGTCG AGCCGCCGAA GCCCAAAGCC GAAGTGATTC TGGTGGCAGA TAACAGCCCA ATCAAAACCG TAGCCGATCT TAAAGGTCAT AAAGTTGCCT TTCAGAAAGG TTCCAGCTCG CACAATCTTT TACTACGCGC ACTACGCCAG GCCGGGCTTA AATTCACGGA TATCCAGCCC ACTTACCTGA CGCCTGCCGA TGCCCGCGCC GCGTTCCAGC AAGGTAACGT TGACGCCTGG GCTATCTGGG ATCCCTACTA CTCCGCTGCA TTATTACAGG GCGGCGTGCG GGTGTTGAAA GACGGCACCG ATCTCAATCA AACCGGATCG TTTTATCTGG CAGCCCGTCC GTATGCAGAA AAAAACGGCG CTTTTATTCA GGGCGTACTG GCAACCTTTA GTGAGGCCGA TGCGTTAACC CGCAGCCAGC GCGAACAAAG CATCGCTTTA CTGGCAAAAA CGATGGGCTT ACCGGCACCG GTTATTGCCT CGTATCTGGA TCATCGTCCT CCCACCACCA TCAAACCGTT GAGCGCTGAA GTTGCCGCCT TACAGCAGCA AACGGCAGAT CTGTTTTATG AAAACCGTCT GGTGCCGAAA AAAGTCGATA TTCGCCAGCG CATCTGGCAA CCCACTCAAC TGGAAGGAAA ACAATTATGA
|
Protein sequence | MRKLIKLVLA GLLSVSTLAV AAESSPEALR IGYQKGSIGM VLAKSHQLLE KRYPQTKISW VEFPAGPQML EALNVGSIDL GSTGDIPPIF AQAAGADLVY VGVEPPKPKA EVILVADNSP IKTVADLKGH KVAFQKGSSS HNLLLRALRQ AGLKFTDIQP TYLTPADARA AFQQGNVDAW AIWDPYYSAA LLQGGVRVLK DGTDLNQTGS FYLAARPYAE KNGAFIQGVL ATFSEADALT RSQREQSIAL LAKTMGLPAP VIASYLDHRP PTTIKPLSAE VAALQQQTAD LFYENRLVPK KVDIRQRIWQ PTQLEGKQL
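The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch that translates DNA codon by codon and computes a GC fraction. It relies on translation table 11 assigning the same amino acids to codons as the standard code; the table's alternative start codons are not handled here, which is sufficient for this gene because it begins with ATG. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and this is not the database's own pipeline.

```python
from itertools import product

# Codon table in the conventional T, C, A, G order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCGTAAGC TCATTAAACT GGTGCTGGCG"))  # prints "MRKLIKLVLA"
```

The printed residues match the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be checked in the same way.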
|
| |