Gene ECH74115_4951 details


Gene Information

Locus tag: ECH74115_4951
Symbol:
ID: 6968228
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4592055
End bp: 4593455
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 54%
IMG OID: 643388634
Product: sugar transporter, glycoside-pentoside-hexuronide family
Protein accession: YP_002273061
Protein GI: 209400085
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2211] Na+/melibiose symporter and related transporters
TIGRFAM ID: [TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter
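
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the database.

```python
# Values copied from the record above.
START_BP = 4592055
END_BP = 4593455


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int, has_stop: bool = True) -> int:
    """Number of residues, assuming the CDS is a whole number of codons."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len // 3
    # The stop codon is not translated, so it does not count as a residue.
    return codons - 1 if has_stop else codons


length = gene_length(START_BP, END_BP)
print(length)                  # 1401
print(protein_length(length))  # 466
```

Both results agree with the Gene Length and Protein Length fields of the record.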


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 52
Fosmid unclonability p-value: 0.841107
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTCTA CACCGATTAC TACCGCTGAT ATCGCTAAAG GTAAAATTGA CGATGCGTTA 
TCTGTACGGG AAAAAATAGG CTACGGCCTG GGTGACGCAG GCGGCACCGT AATAACTTGC
CTGATCATGA ATTTTCTCAC CTTTTTCTAC ACCGACGTTT TTGGATTAAC TCCGGCGCTG
GTTGGCACGC TGTTTATTGC ACTGCGCGTG TTTGATGCCA TCTCCGACCC GGTGATGGGC
GTCATTGCCG ACCGGACGCA AAGCCGATGG GGGCGCTTTC GTCCGTGGCA GCTATGGATT
GCCATTCCCA TCGGCATTAT CGGCATCCTG ACGTTCACCG TGCCAGATGC CAGCATGGGA
GTAAAAATCG CCTGGGCGTT CGGTACTTAC CTGCTCCTTT CAGTCGGTTA TACCGCCATC
AACGTACCGT ACTGCGCGCG GATCAACACC ATGACCACCC GCCACAATGA AGTGATCTCC
TGCCAGTCCT GGCGATTCGT TCTCTGCGGC GTAGCGGGAT TTCTGGTTTC GGTAGGCTTA
CCGTGGATGG TAGCTCTCTT CGGTCAGGGC AACGCTGCAC GCGGCTATCA ACTGGGCGTC
GGGGTATTGT GCGCCATTGC CGTGGTGATG TTCCTGTGCT GTTTCTTCTG GGTTCGTGAA
CGGGTGCCGC TCTCCACAAT GGGGAAATTT ACCCTGCGCG AACATCTTGC CGGGCTGCGG
AACAACGACC AACTGCTGCT GATGCTGGTC ATGTCTTTCC TGCTGATTAA CGTCTTTAAC
ATTCGCGGCG GTGGGTATAT GTACTTCATT ACCTACGTCT TACAAGGCAG CACGGGCTAC
ACGTCGCTGT TCTTCACCAT GGTCACCTTC GCCTCCATTA TCGGCTCGGT GATTGTCAGC
CCGTTAACGC GGCGTTTCGA TACCGTCAAA ATTTATTACT ACACCAACCT GCTCCTCGCT
GCACTGGCGG TGTTGATGTG GTTCCTGCCC TCCGGCCCGG CTTATCAAAC GCTGTGGCTG
GCGGTGATCC TCGGTAATGG CGTGATTCTT GGCTTCACAT TGCCACTGCA CTTCTCATTG
ATGGCCTTTG CCGATGACTA CGGCGAGTGG AAAACCCACG TACGTTCTTC CGGCATGAAC
TTCGCCTTCA ATCTGTTTTT CATCAAGCTG GCCTGGGCCT CCAGCGCCGG GATCATCAGC
CTGCTGTTTA TTTTTGTCGC CTACCAGCCT GGCGTGGAAA ACCAGACCGC CAGTTCGCTT
GGCGGGATCG CGGCGATGGA AACATTACTG CCTGCGCTAT TCCACCTGCT GCTGGCAGGG
GCGATCCGCT TTTGCAAACT CAATAATCCT ATGATGTCAC GCATAGCTAC CGACCTGCGT
CAGCGTCATG TACAGCCTTA A
 
Protein sequence
MTSTPITTAD IAKGKIDDAL SVREKIGYGL GDAGGTVITC LIMNFLTFFY TDVFGLTPAL 
VGTLFIALRV FDAISDPVMG VIADRTQSRW GRFRPWQLWI AIPIGIIGIL TFTVPDASMG
VKIAWAFGTY LLLSVGYTAI NVPYCARINT MTTRHNEVIS CQSWRFVLCG VAGFLVSVGL
PWMVALFGQG NAARGYQLGV GVLCAIAVVM FLCCFFWVRE RVPLSTMGKF TLREHLAGLR
NNDQLLLMLV MSFLLINVFN IRGGGYMYFI TYVLQGSTGY TSLFFTMVTF ASIIGSVIVS
PLTRRFDTVK IYYYTNLLLA ALAVLMWFLP SGPAYQTLWL AVILGNGVIL GFTLPLHFSL
MAFADDYGEW KTHVRSSGMN FAFNLFFIKL AWASSAGIIS LLFIFVAYQP GVENQTASSL
GGIAAMETLL PALFHLLLAG AIRFCKLNNP MMSRIATDLR QRHVQP
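
The relationship between the two sequences can be illustrated by translating the gene sequence codon by codon. This is a minimal sketch, not the tool the database used: it builds the codon table from the compact NCBI layout (bases in TCAG order), and it relies on the fact that translation table 11 assigns the same amino acids as the standard code and differs only in which codons may act as start codons. The helper name `translate` is an assumption made for this example.

```python
from itertools import product

# Compact NCBI layout: codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon.

    Alternative start codons allowed by table 11 (for example GTG or TTG)
    are NOT converted to M here; this gene starts with ATG, so the
    simplification does not matter for this record.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence shown above.
first_row = "ATGACCTCTA CACCGATTAC TACCGCTGAT ATCGCTAAAG GTAAAATTGA CGATGCGTTA"
print(translate(first_row))  # MTSTPITTADIAKGKIDDAL
```

The output matches the first two blocks of the protein sequence (MTSTPITTAD IAKGKIDDAL). Passing the full gene sequence, with its spaces and line breaks left in, should work the same way.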