Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0663 |
Symbol | |
ID | 5595372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 681197 |
End bp | 682660 |
Gene Length | 1464 bp |
Protein Length | 487 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640919844 |
Product | sodium:sulfate symporter family protein |
Protein accession | YP_001457426 |
Protein GI | 157160108 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0471] Di- and tricarboxylate transporters |
TIGRFAM ID | [TIGR00785] anion transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 55 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTTAG CAAAAGATAA TATATGGAAA CTATTGGCCC CACTGGTGGT GATGGGTGTC ATGTTTCTTA TCCCTGTCCC CGACGGTATG CCACCACAGG CATGGCATTA CTTCGCTGTG TTTGTGGCAA TGATTGTCGG CATGATCCTC GAGCCAATTC CGGCAACAGC GATCAGTTTT ATTGCGGTTA CTATTTGCGT TATTGGCAGT AATTACCTGC TCTTTGATGC CAAAGAATTA GCTGACCCAG CGTTTAATGC GCAAAAACAG GCGCTGAAAT GGGGTCTGGC TGGTTTTTCC AGCACCACGG TATGGCTGGT ATTTGGCGCA TTTATTTTTG CATTAGGGTA TGAAGTTTCC GGGTTAGGTC GTCGCATTGC CCTTTTCCTG GTGAAATTCA TGGGCAAACG CACGCTGACG TTGGGTTATG CGATTGTCAT TATCGACATT CTGCTGGCAC CGTTTACACC GTCCAACACC GCGCGTACCG GGGGTACGGT TTTTCCGGTC ATTAAAAACC TGCCGCCGCT GTTTAAATCA TTCCCCAACG ATCCGTCCGC GCGTCGTATT GGCGGCTATT TGATGTGGAT GATGGTCATT AGTACCAGTC TGAGTTCGTC CATGTTTGTC ACCGGTGCGG CACCAAACGT GCTGGGTCTG GAGTTCGTCA GCAAAATTGC CGGTATCCAG ATTAGCTGGT TGCAGTGGTT CCTCTGCTTC CTGCCGGTTG GGGTTATCTT GCTTATCATT GCGCCGTGGC TTTCCTACGT GCTGTACAAA CCGGAAATCA CACACAGTGA AGAAGTGGCA ACCTGGGCGG GTGATGAACT AAAAACCATG GGTGCGCTGA CACGCAGAGA GTGGACGCTG ATTGGCCTTG TATTGCTCAG CTTAGGTTTG TGGGTATTTG GCAGTGAAGT CATTAATGCT ACTGCGGTTG GTCTGCTGGC AGTTTCGCTA ATGCTGGCTC TGCACGTTGT GCCGTGGAAA GACATTACCC GCTATAACAG CGCATGGAAC ACGCTGGTCA ACCTGGCAAC TCTGGTTGTG ATGGCTAACG GCCTGACTCG TTCTGGTTTT ATTGACTGGT TCGCCGGTAC CATGAGTACG CACCTGGAAG GATTCTCACC AAACGCAACG GTGATTGTAC TGGTTCTGGT GTTCTACTTT GCACACTACC TGTTTGCCAG CCTGTCTGCG CACACCGCAA CCATGCTGCC GGTTATTCTG GCCGTCGGTA AAGGTATTCC GGGCGTACCA ATGGAACAAC TGTGTATCCT GCTGGTGCTG TCTATCGGTA TCATGGGCTG TCTGACGCCG TATGCAACCG GTCCTGGGGT GATTATTTAC GGCTGTGGCT ATGTGAAATC AAAAGATTAC TGGCGTCTTG GCGCAATCTT CGGGGTGATT TACATCTCTA TGTTGCTGTT GGTTGGCTGG CCGATTCTCG CCATGTGGAA CTAA
|
Protein sequence | MSLAKDNIWK LLAPLVVMGV MFLIPVPDGM PPQAWHYFAV FVAMIVGMIL EPIPATAISF IAVTICVIGS NYLLFDAKEL ADPAFNAQKQ ALKWGLAGFS STTVWLVFGA FIFALGYEVS GLGRRIALFL VKFMGKRTLT LGYAIVIIDI LLAPFTPSNT ARTGGTVFPV IKNLPPLFKS FPNDPSARRI GGYLMWMMVI STSLSSSMFV TGAAPNVLGL EFVSKIAGIQ ISWLQWFLCF LPVGVILLII APWLSYVLYK PEITHSEEVA TWAGDELKTM GALTRREWTL IGLVLLSLGL WVFGSEVINA TAVGLLAVSL MLALHVVPWK DITRYNSAWN TLVNLATLVV MANGLTRSGF IDWFAGTMST HLEGFSPNAT VIVLVLVFYF AHYLFASLSA HTATMLPVIL AVGKGIPGVP MEQLCILLVL SIGIMGCLTP YATGPGVIIY GCGYVKSKDY WRLGAIFGVI YISMLLLVGW PILAMWN
|
| |