Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4307 |
Symbol | |
ID | 5591293 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 4310244 |
End bp | 4311593 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640923407 |
Product | sulfate permease family inorganic anion transporter |
Protein accession | YP_001460852 |
Protein GI | 157163534 |
COG category | [R] General function prediction only |
COG ID | [COG2252] Permeases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 46 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTACGC CATCAGCGCG TACCGGCGGT TCACTCGACG CCTGGTTTAA AATTTCACAA CGTGGAAGCA CTGTCCGTCA GGAAGTGGTT GCCGGGTTAA CAACGTTTCT GGCGATGGTC TACTCGGTCA TCGTCGTTCC AGGTATGTTG GGTAAAGCAG GCTTCCCGCC TGCGGCAGTT TTCGTTGCAA CCTGTCTGGT TGCCGGACTC GGTTCTATCG TGATGGGTCT GTGGGCTAAT CTGCCGTTGG CGATTGGTTG CGCCATCTCC CTGACGGCGT TTACCGCATT CAGCCTGGTG CTGGGGCAAC ATATTAGCGT ACCTGTCGCG CTGGGTGCCG TGTTCCTGAT GGGTGTGCTG TTTACGGTAA TTTCTGCCAC GGGTATCCGT AGCTGGATTT TGCGCAACTT GCCTCACGGT GTGGCGCACG GCACGGGGAT CGGTATCGGT CTGTTCCTGC TGCTCATTGC CGCTAATGGT GTCGGTCTGG TGATTAAAAA CCCGCTTGAT GGTCTGCCCG TTGCGCTGGG TGATTTCGCG ACCTTCCCGG TGATTATGTC ACTGGTAGGT CTGGCGGTGA TCATCGGCCT GGAAAAACTG AAAGTCCCTG GTGGCATTCT GCTGACCATT ATCGGTATCT CAATTGTCGG TTTGATCTTC GATCCTAACG TCCATTTCTC CGGCGTTTTC GCCATGCCTT CATTGAGCGA TGAAAACGGC AATTCACTGA TTGGCAGCCT GGACATTATG GGCGCGCTGA ATCCTGTAGT CCTGCCAAGC GTTCTGGCGC TGGTGATGAC GGCAGTATTT GATGCCACCG GAACTATCCG TGCCGTCGCC GGCCAGGCGA ACCTGCTGGA TAAAGATGGG CAGATCATCG ACGGTGGGAA AGCACTGACC ACTGACTCCA TGAGCAGCGT TTTCTCTGGC CTGGTGGGTG CTGCTCCGGC AGCGGTATAC ATCGAGTCTG CGGCGGGTAC GGCGGCGGGC GGTAAAACCG GTTTGACGGC TATCACCGTT GGCGTGCTGT TCCTGCTGAT TCTGTTCCTC TCTCCGCTCT CTTACCTCGT TCCGGGGTAT GCAACGGCTC CGGCGCTGAT GTACGTTGGC CTGCTGATGC TGAGCAACGT GGCGAAAATC GACTTTGCTG ATTTTGTTGA TGCGATGGCG GGTCTGGTTA CGGCGGTATT CATCGTGCTG ACCTGTAACA TCGTAACAGG CATCATGATC GGCTTCGCGA CTCTGGTGAT TGGTCGTCTG GTTTCCGGCG AATGGCGCAA GTTGAACATC GGTACGGTCG TTATCGCCGT GGCGCTGGTG ACCTTCTATG CGGGTGGCTG GGCTATCTAA
|
Protein sequence | MSTPSARTGG SLDAWFKISQ RGSTVRQEVV AGLTTFLAMV YSVIVVPGML GKAGFPPAAV FVATCLVAGL GSIVMGLWAN LPLAIGCAIS LTAFTAFSLV LGQHISVPVA LGAVFLMGVL FTVISATGIR SWILRNLPHG VAHGTGIGIG LFLLLIAANG VGLVIKNPLD GLPVALGDFA TFPVIMSLVG LAVIIGLEKL KVPGGILLTI IGISIVGLIF DPNVHFSGVF AMPSLSDENG NSLIGSLDIM GALNPVVLPS VLALVMTAVF DATGTIRAVA GQANLLDKDG QIIDGGKALT TDSMSSVFSG LVGAAPAAVY IESAAGTAAG GKTGLTAITV GVLFLLILFL SPLSYLVPGY ATAPALMYVG LLMLSNVAKI DFADFVDAMA GLVTAVFIVL TCNIVTGIMI GFATLVIGRL VSGEWRKLNI GTVVIAVALV TFYAGGWAI
|
| |