Gene Information |
Locus tag | EcHS_A3044 |
Symbol | |
ID | 5594218 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 3054635 |
End bp | 3056002 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640922161 |
Product | sulfate permease family inorganic anion transporter |
Protein accession | YP_001459663 |
Protein GI | 157162345 |
COG category | [R] General function prediction only |
COG ID | [COG2252] Permeases |
TIGRFAM ID | |
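The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information table.
START_BP = 3054635
END_BP = 3056002


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1368
print(protein_length(length_bp))  # 455
```

Both results agree with the Gene Length (1368 bp) and Protein Length (455 aa) rows.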
|
|
Plasmid Coverage information |
Num covering plasmid clones | 51 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGGAG ACATCCTACA AACACCGGAC GCACCAAAGC CACAGGGCGC GCTGGATAAT TATTTTAAAA TTACCGCTCG TGGCAGTACC GTTCGTCAGG AAGTACTGGC TGGCTTAACG ACCTTTCTGG CCATGGTTTA TTCCGTTATC GTCGTTCCGG GAATGCTGGG CAAAGCAGGT TTTCCTCCCG CAGCTGTGTT TGTTGCCACC TGTCTGGTCG CGGGCTTCGG CTCGTTGCTG ATGGGATTAT GGGCTAATTT GCCAATGGCG ATTGGTTGCG CGATTTCCTT GACGGCGTTT ACCGCATTCA GTCTGGTACT CGGGCAACAA ATTAGCGTTC CTGTCGCACT GGGCGCGGTA TTTCTGATGG GCGTCATCTT CACCGCCATT TCCGTAACCG GTGTGCGTAC CTGGATCTTA CGTAATTTGC CGATGGGTAT CGCTCACGGT ACAGGTATCG GTATCGGGCT GTTTCTGCTG CTGATTGCTG CTAACGGTGT GGGTATGGTT ATCAAAAACC CGATTGAAGG CTTGCCAGTT GCGCTCGGTG CGTTTACCTC CTTCCCGGTG ATGATGAGCT TGCTGGGGCT GGCGGTCATC TTCGGCCTGG AGAAGTGTCG CGTACCCGGC GGGATCTTGT TGGTGATTAT TGCAATTTCG ATCATCGGCT TAATCTTTGA CCCAGCGGTG AAATACCACG GTCTGGTGGC GATGCCAAGC CTGACTGGCG AAGATGGTAA GTCTCTGATT TTCAGCCTCG ATATTATGGG TGCACTCCAG CCAACTGTAC TTCCGAGTGT ACTGGCATTG GTGATGACCG CAGTGTTCGA CGCTACTGGC ACCATCCGTG CCGTCGCCGG TCAGGCGAAT TTGTTGGATA AAGACAACCA GATCATCAAC GGCGGCAAAG CCCTGACCAG TGACTCAGTA AGTTCAATAT TCTCCGGCCT GGTGGGCGCA GCGCCCGCAG CGGTTTATAT CGAATCAGCG GCAGGAACCG CCGCCGGGGG TAAAACAGGT TTAACCGCAA CCGTAGTGGG GGCGTTATTC CTGCTGATTC TGTTCTTATC ACCGCTGTCA TTTTTGATCC CTGGTTACGC CACTGCACCC GCTCTGATGT ACGTAGGTTT GCTGATGTTA AGTAACGTCT CGAAGCTGGA TTTCAATGAT TTTATTGACG CTATGGCTGG CCTGGTGTGT GCCGTGTTCA TCGTTCTGAC TTGTAATATC GTTACCGGTA TTATGCTGGG CTTTGTGACA CTGGTCGTAG GCCGCGTCTT TGCACGCGAA TGGCAAAAGC TGAATATTGG TACGGTGATC ATTACTGCCG CACTGGTCGC ATTTTACGCG GGTGGTTGGG CAATCTAA
|
Protein sequence | MSGDILQTPD APKPQGALDN YFKITARGST VRQEVLAGLT TFLAMVYSVI VVPGMLGKAG FPPAAVFVAT CLVAGFGSLL MGLWANLPMA IGCAISLTAF TAFSLVLGQQ ISVPVALGAV FLMGVIFTAI SVTGVRTWIL RNLPMGIAHG TGIGIGLFLL LIAANGVGMV IKNPIEGLPV ALGAFTSFPV MMSLLGLAVI FGLEKCRVPG GILLVIIAIS IIGLIFDPAV KYHGLVAMPS LTGEDGKSLI FSLDIMGALQ PTVLPSVLAL VMTAVFDATG TIRAVAGQAN LLDKDNQIIN GGKALTSDSV SSIFSGLVGA APAAVYIESA AGTAAGGKTG LTATVVGALF LLILFLSPLS FLIPGYATAP ALMYVGLLML SNVSKLDFND FIDAMAGLVC AVFIVLTCNI VTGIMLGFVT LVVGRVFARE WQKLNIGTVI ITAALVAFYA GGWAI
|
| |
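The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The following is a minimal sketch, assuming the GC content field is the percentage of G and C bases in the gene sequence; the helper names are hypothetical, and the demo input is only the first three blocks of the gene sequence, not the full gene.

```python
def clean(seq: str) -> str:
    """Remove the whitespace that splits the sequence into 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    gc = dna.count("G") + dna.count("C")
    return 100.0 * gc / len(dna)


# First three blocks of the gene sequence, used only as a short demo input.
demo = "ATGTCTGGAG ACATCCTACA AACACCGGAC"
print(len(clean(demo)))   # 30
print(gc_percent(demo))   # 50.0
```

Passing the complete gene sequence string to `gc_percent` in the same way gives a value to compare against the GC content row.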