Gene ECH74115_5323 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5323 
Symbol 
ID6970834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4964719 
End bp4966104 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content53% 
IMG OID643388984 
Productsugar transporter family protein 
Protein accessionYP_002273393 
Protein GI209399281 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID[TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.333242 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCACA TCACAACGGA AGATCCGGCA ACTTTGCGCC TGCCCTTTAA AGAGAAACTC 
TCTTACGGTA TCGGCGATCT GGCCTCTAAC ATCCTGCTGG ATATTGGTAC GCTTTATCTT
TTGAAGTTTT ATACCGACGT TCTGGGGCTA CCTGGCACCT ATGGCGGCAT TATCTTTTTG
ATCTCGAAAT TCTTTACCGC CTTTACCGAT ATGGGAACCG GCATCATGTT GGATTCCCGG
CGTAAGATTG GCCCGAAAGG TAAGTTCCGC CCTTTCATTT TGTACGCGTC ATTCCCGGTC
ACCTTATTGG CAATCGCTAA CTTTATCGGC ACACCGTTCG ATGTCACTGG TAAAACGGTG
ATGGCCACTA TTCTGTTTAT GCTCTACGGG CTGTTTTTCA GCATGATGAA CTGCTCCTAT
GGCGCGATGG TGCCTGCTAT TACCAAAAAC CCCAACGAGC GCGCATCACT GGCGGCATGG
CGTCAGGGAG GCGCTACGCT GGGCCTGCTG CTGTGCACGG TGGGATTCGT GCCGGTTATG
AATCTTATCG AAGGTAATCA GCAACTTGGC TATATCTTCG CCGCCACGCT GTTTTCACTG
TTCGGCCTGC TGTTTATGTG GATCTGCTAC TCGGGCGTGA AAGAGCGTTA TGTCGAAACC
CAACCAGCCA ATCCGGCGCA AAAGCCTGGC CTGCTGCAAT CTTTCCGCGC AATTGCCGGT
AACCGCCCAC TGTTCATTCT GTGCATTGCC AACCTCTGCA CTTTAGGGGC GTTTAACGTC
AAGCTCGCCA TCCAGGTCTA TTACACCCAG TACGTACTTA ACGATCCCAT CCTGTTGTCA
TATATGGGAT TTTTCAGCAT GGGCTGTATT TTCATCGGTG TGTTCCTGAT GCCTGGCGCA
GTCAGGCGTT TTGGTAAGAA GAAGGTCTAT ATCGGCGGCC TACTGATTTG GGTGCTGGGC
GATCTGCTCA ACTATTTCTT CAGCGGCGGT TCGGTCAGCT TCGTGGCGTT CTCCTGCCTG
GCATTCTTCG GCTCAGCGTT TGTTAACAGC CTGAACTGGG CGCTGGTTTC CGACACCGTC
GAGTACGGCG AGTGGCGTAC CGGCGTTCGT TCGGAAGGAA CGGTCTACAC CGGCTTCACC
TTCTTTCGCA AAGTGTCTCA GGCGCTGGCT GGTTTCTTCC CCGGCTGGAT GCTGACGCAA
ATCGGTTATG TGCCGAACGT GGCGCAGGCT GACCACACCA TTGAAGGGTT GCGCCAGCTG
ATCTTCATCT ACCCAAGCGC ACTGGCGGTA GTCACCATTG TGGCGATGGG CTGCTTCTAC
AGCCTGAACG AGAAGATGTA TGTCCGCATT GTTGAAGAAA TAGAAGCCCG TAAACGCACG
GCGTAA
 
Protein sequence
MSHITTEDPA TLRLPFKEKL SYGIGDLASN ILLDIGTLYL LKFYTDVLGL PGTYGGIIFL 
ISKFFTAFTD MGTGIMLDSR RKIGPKGKFR PFILYASFPV TLLAIANFIG TPFDVTGKTV
MATILFMLYG LFFSMMNCSY GAMVPAITKN PNERASLAAW RQGGATLGLL LCTVGFVPVM
NLIEGNQQLG YIFAATLFSL FGLLFMWICY SGVKERYVET QPANPAQKPG LLQSFRAIAG
NRPLFILCIA NLCTLGAFNV KLAIQVYYTQ YVLNDPILLS YMGFFSMGCI FIGVFLMPGA
VRRFGKKKVY IGGLLIWVLG DLLNYFFSGG SVSFVAFSCL AFFGSAFVNS LNWALVSDTV
EYGEWRTGVR SEGTVYTGFT FFRKVSQALA GFFPGWMLTQ IGYVPNVAQA DHTIEGLRQL
IFIYPSALAV VTIVAMGCFY SLNEKMYVRI VEEIEARKRT A