Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5323 |
Symbol | |
ID | 6970834 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4964719 |
End bp | 4966104 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643388984 |
Product | sugar transporter family protein |
Protein accession | YP_002273393 |
Protein GI | 209399281 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2211] Na+/melibiose symporter and related transporters |
TIGRFAM ID | [TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.333242 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCACA TCACAACGGA AGATCCGGCA ACTTTGCGCC TGCCCTTTAA AGAGAAACTC TCTTACGGTA TCGGCGATCT GGCCTCTAAC ATCCTGCTGG ATATTGGTAC GCTTTATCTT TTGAAGTTTT ATACCGACGT TCTGGGGCTA CCTGGCACCT ATGGCGGCAT TATCTTTTTG ATCTCGAAAT TCTTTACCGC CTTTACCGAT ATGGGAACCG GCATCATGTT GGATTCCCGG CGTAAGATTG GCCCGAAAGG TAAGTTCCGC CCTTTCATTT TGTACGCGTC ATTCCCGGTC ACCTTATTGG CAATCGCTAA CTTTATCGGC ACACCGTTCG ATGTCACTGG TAAAACGGTG ATGGCCACTA TTCTGTTTAT GCTCTACGGG CTGTTTTTCA GCATGATGAA CTGCTCCTAT GGCGCGATGG TGCCTGCTAT TACCAAAAAC CCCAACGAGC GCGCATCACT GGCGGCATGG CGTCAGGGAG GCGCTACGCT GGGCCTGCTG CTGTGCACGG TGGGATTCGT GCCGGTTATG AATCTTATCG AAGGTAATCA GCAACTTGGC TATATCTTCG CCGCCACGCT GTTTTCACTG TTCGGCCTGC TGTTTATGTG GATCTGCTAC TCGGGCGTGA AAGAGCGTTA TGTCGAAACC CAACCAGCCA ATCCGGCGCA AAAGCCTGGC CTGCTGCAAT CTTTCCGCGC AATTGCCGGT AACCGCCCAC TGTTCATTCT GTGCATTGCC AACCTCTGCA CTTTAGGGGC GTTTAACGTC AAGCTCGCCA TCCAGGTCTA TTACACCCAG TACGTACTTA ACGATCCCAT CCTGTTGTCA TATATGGGAT TTTTCAGCAT GGGCTGTATT TTCATCGGTG TGTTCCTGAT GCCTGGCGCA GTCAGGCGTT TTGGTAAGAA GAAGGTCTAT ATCGGCGGCC TACTGATTTG GGTGCTGGGC GATCTGCTCA ACTATTTCTT CAGCGGCGGT TCGGTCAGCT TCGTGGCGTT CTCCTGCCTG GCATTCTTCG GCTCAGCGTT TGTTAACAGC CTGAACTGGG CGCTGGTTTC CGACACCGTC GAGTACGGCG AGTGGCGTAC CGGCGTTCGT TCGGAAGGAA CGGTCTACAC CGGCTTCACC TTCTTTCGCA AAGTGTCTCA GGCGCTGGCT GGTTTCTTCC CCGGCTGGAT GCTGACGCAA ATCGGTTATG TGCCGAACGT GGCGCAGGCT GACCACACCA TTGAAGGGTT GCGCCAGCTG ATCTTCATCT ACCCAAGCGC ACTGGCGGTA GTCACCATTG TGGCGATGGG CTGCTTCTAC AGCCTGAACG AGAAGATGTA TGTCCGCATT GTTGAAGAAA TAGAAGCCCG TAAACGCACG GCGTAA
|
Protein sequence | MSHITTEDPA TLRLPFKEKL SYGIGDLASN ILLDIGTLYL LKFYTDVLGL PGTYGGIIFL ISKFFTAFTD MGTGIMLDSR RKIGPKGKFR PFILYASFPV TLLAIANFIG TPFDVTGKTV MATILFMLYG LFFSMMNCSY GAMVPAITKN PNERASLAAW RQGGATLGLL LCTVGFVPVM NLIEGNQQLG YIFAATLFSL FGLLFMWICY SGVKERYVET QPANPAQKPG LLQSFRAIAG NRPLFILCIA NLCTLGAFNV KLAIQVYYTQ YVLNDPILLS YMGFFSMGCI FIGVFLMPGA VRRFGKKKVY IGGLLIWVLG DLLNYFFSGG SVSFVAFSCL AFFGSAFVNS LNWALVSDTV EYGEWRTGVR SEGTVYTGFT FFRKVSQALA GFFPGWMLTQ IGYVPNVAQA DHTIEGLRQL IFIYPSALAV VTIVAMGCFY SLNEKMYVRI VEEIEARKRT A
|
| |