Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3017 |
Symbol | |
ID | 6146626 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3105922 |
End bp | 3107289 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617886 |
Product | sulfate permease family inorganic anion transporter |
Protein accession | YP_001745037 |
Protein GI | 170680723 |
COG category | [R] General function prediction only |
COG ID | [COG2252] Permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGGAG ACATCCTACA AACACCGGAC GCACCAAAGC CACAGGGCGC GCTGGATAAT TATTTTAAAA TTACCGCTCG TGGCAGTACC GTTCGTCAGG AAGTACTGGC TGGCTTAACG ACCTTTCTGG CCATGGTTTA TTCCGTTATC GTCGTTCCGG GAATGCTGGG CAAAGCAGGT TTTCCTCCCG CAGCTGTGTT TGTTGCCACC TGTCTGGTCG CGGGCTTCGG CTCGTTGCTG ATGGGATTAT GGGCCAATTT GCCAATGGCG ATTGGTTGCG CGATTTCCTT GACGGCGTTT ACCGCATTCA GTCTGGTACT CGGGCAACAA ATTAGCGTTC CTGTCGCACT GGGCGCGGTC TTTCTGATGG GAGTCATCTT CACCGCCATT TCCGTAACCG GTGTGCGTAC CTGGATCTTA CGTAATTTAC CGATGGGGAT CGCTCACGGT ACAGGTATCG GTATCGGGCT GTTTCTGCTG CTGATTGCTG CTAACGGTGT GGGAATGGTT ATCAAAAACC CGATTGAAGG CTTGCCAGTG GCGCTCGGTG CGTTTACCTC CTTCCCGGTG ATGATGAGCT TACTAGGGCT GGCGGTCATC TTCGGTCTGG AGAAGTGTCG CGTACCAGGC GGGATCTTGT TGGTGATTAT TGCAATTTCG ATCATCGGCT TAATCTTTGA CCCAGCGGTG AAGTACCACG GTCTGGTGGC GATGCCAAGC CTGACTGGCG AAGATGGTAA GTCTCTGATT TTCAGCCTCG ATATTATGGG CGCACTCCAG CCAACTGTAC TTCCGAGTGT ACTGGCATTG GTGATGACCG CAGTGTTCGA CGCCACTGGC ACCATCCGTG CCGTCGCCGG TCAGGCGAAT TTGTTGGATA AAGACAACCA GATCATCAAC GGCGGCAAAG CCCTGACCAG TGACTCAGTA AGTTCAATAT TCTCCGGCCT GGTGGGCGCA GCGCCCGCAG CGGTTTATAT CGAATCAGCG GCAGGAACCG CCGCCGGGGG TAAAACAGGG TTAACCGCAA CCGTAGTGGG GGCGTTATTC CTGCTGATTC TGTTCTTATC ACCGCTGTCA TTTTTGATCC CTGGTTACGC CACTGCACCC GCTCTGATGT ACGTAGGTTT GCTGATGTTA AGTAACGTCT CGAAGCTGGA TTTCAATGAT TTTATTGACG CTATGGCTGG CCTGGTGTGT GCCGTGTTCA TCGTTCTGAC TTGTAATATC GTTACCGGTA TTATGCTGGG CTTTGTGACA CTGGTCGTAG GCCGCGTCTT TGCACGCGAA TGGCAAAAGC TGAATATTGG TACGGTGATC ATTACTGCCG CACTGGTCGC ATTTTACGCG GGTGGTTGGG CAATCTAA
|
Protein sequence | MSGDILQTPD APKPQGALDN YFKITARGST VRQEVLAGLT TFLAMVYSVI VVPGMLGKAG FPPAAVFVAT CLVAGFGSLL MGLWANLPMA IGCAISLTAF TAFSLVLGQQ ISVPVALGAV FLMGVIFTAI SVTGVRTWIL RNLPMGIAHG TGIGIGLFLL LIAANGVGMV IKNPIEGLPV ALGAFTSFPV MMSLLGLAVI FGLEKCRVPG GILLVIIAIS IIGLIFDPAV KYHGLVAMPS LTGEDGKSLI FSLDIMGALQ PTVLPSVLAL VMTAVFDATG TIRAVAGQAN LLDKDNQIIN GGKALTSDSV SSIFSGLVGA APAAVYIESA AGTAAGGKTG LTATVVGALF LLILFLSPLS FLIPGYATAP ALMYVGLLML SNVSKLDFND FIDAMAGLVC AVFIVLTCNI VTGIMLGFVT LVVGRVFARE WQKLNIGTVI ITAALVAFYA GGWAI
|
| |