Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4527 |
Symbol | |
ID | 6145780 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4626066 |
End bp | 4627415 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641619343 |
Product | sulfate permease family inorganic anion transporter |
Protein accession | YP_001746455 |
Protein GI | 170681500 |
COG category | [R] General function prediction only |
COG ID | [COG2252] Permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTACGC CATCAGCGCG TACCGGCGGT TCACTCGACG CCTGGTTTAA AATTTCACAA CGTGGAAGCA CTGTCCGTCA GGAAGTGGTT GCCGGGTTAA CAACGTTTCT GGCGATGGTC TACTCGGTTA TCGTCGTTCC AGGCATGTTG GGTAAAGCGG GCTTCCCGCC TGCGGCAGTT TTCGTTGCGA CCTGTCTGGT TGCCGGACTC GGTTCTATCG TGATGGGTCT GTGGGCTAAT CTGCCGTTGG CGATTGGTTG CGCCATCTCC CTGACGGCGT TTACCGCATT CAGTCTGGTG CTGGGGCAAC ATATTAGCGT ACCTGTCGCG CTGGGTGCCG TGTTCCTGAT GGGTGTGCTG TTTACGGTCA TTTCTGCCAC GGGTATCCGT AGCTGGATTT TGCGCAACTT GCCGCACGGT GTGGCGCACG GCACGGGGAT CGGTATCGGT CTGTTCCTGC TGCTGATTGC TGCGAATGGT GTCGGTCTGG TGATTAAAAA CCCGCTTGAT GGTCTGCCCG TTGCGCTGGG TGATTTCACG ACCTTTCCGG TGATGATGTC ACTGGTAGGT CTGGCGGTGA TCATCGGCCT GGAAAAACTG AAAGTCCCTG GTGGCATTCT GCTGACCATT ATCGGTATCT CAATTGTCGG TTTGATCTTC GATCCTAACG TCCATTTCTC CGGCGTTTTC GCTATGCCTT CATTGAGCGA TGAAAACGGC AATTCACTGA TTGGCAGCCT GGATATTATG GGCGCGCTGA ATCCTGTAGT CCTGCCAAGC GTTCTGGCGC TGGTGATGAC GGCAGTATTT GATGCCACCG GAACTATCCG CGCCGTCGCC GGTCAGGCGA ACCTGCTGGA TAAAGACGGG CAGATCATCG ACGGCGGGAA AGCACTGACC ACTGACTCCA TGAGCAGCGT TTTCTCTGGC CTGGTGGGTG CAGCTCCGGC AGCGGTATAC ATCGAGTCTG CGGCGGGTAC GGCGGCGGGC GGTAAAACCG GGTTGACGGC TATCACCGTT GGCGTGCTGT TCCTGCTGAT TCTGTTCCTC TCTCCGCTCT CTTACCTCGT TCCGGGGTAC GCAACGGCTC CGGCGTTGAT GTACGTTGGC CTGCTGATGC TGAGCAACGT GGCGAAAATC GACTTTGCTG ATTTTGTTGA TGCGATGGCG GGTCTGGTTA CGGCGGTATT CATCGTGCTG ACCTGTAACA TCGTAACAGG CATCATGATC GGCTTCGCGA CTCTGGTGAT TGGTCGTCTG GTTTCCGGCG AATGGCGCAA GTTGAACATC GGTACGGTCG TTATTGCCGT GGCGCTGGTG ACCTTCTATG CGGGTGGCTG GGCTATCTAA
|
Protein sequence | MSTPSARTGG SLDAWFKISQ RGSTVRQEVV AGLTTFLAMV YSVIVVPGML GKAGFPPAAV FVATCLVAGL GSIVMGLWAN LPLAIGCAIS LTAFTAFSLV LGQHISVPVA LGAVFLMGVL FTVISATGIR SWILRNLPHG VAHGTGIGIG LFLLLIAANG VGLVIKNPLD GLPVALGDFT TFPVMMSLVG LAVIIGLEKL KVPGGILLTI IGISIVGLIF DPNVHFSGVF AMPSLSDENG NSLIGSLDIM GALNPVVLPS VLALVMTAVF DATGTIRAVA GQANLLDKDG QIIDGGKALT TDSMSSVFSG LVGAAPAAVY IESAAGTAAG GKTGLTAITV GVLFLLILFL SPLSYLVPGY ATAPALMYVG LLMLSNVAKI DFADFVDAMA GLVTAVFIVL TCNIVTGIMI GFATLVIGRL VSGEWRKLNI GTVVIAVALV TFYAGGWAI
|
| |