Gene ECH74115_5745 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5745 
Symbol 
ID6972200 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp5380625 
End bp5381581 
Gene Length957 bp 
Protein Length318 aa 
Translation table11 
GC content50% 
IMG OID643389378 
Productputative sugar ABC transporter, periplasmic sugar-binding protein 
Protein accessionYP_002273771 
Protein GI209397917 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones76 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGAAAC GCTTACTTGT AGTCTCTGCA GTCTCGGCAG CCATGTCGTC TATGGCGTTG 
GCCGCTCCAT TAACCGTAGG ATTTTCGCAG GTCGGATCGG AATCCGGCTG GCGCGCCGCA
GAAACCAATG TGGCGAAAAG TGAGGCCGAA AAACGCGGAA TTACGCTGAA AATTGCCGAT
GGTCAGCAAA AGCAGGAAAA CCAGATTAAA GCGGTACGTT CCTTCGTTGC ACAAGGGGTG
GATGCGATCT TTATCGCTCC GGTGGTCGCG ACAGGTTGGG AACCGGTATT AAAAGAGGCG
AAAGATGCCG AAATCCCGGT CTTCTTGCTC GACCGTTCCA TCGATGTGAA AGACAAATCT
CTCTATATGA CCACCGTTAC TGCCGACAAC ATCCTCGAAG GTAAGTTGAT CGGTGACTGG
CTGGTAAAAG AAGTGAATGG CAAACCATGC AACGTGGTGG AGCTGCAGGG CACCGTTGGG
GCCAGCGTCG CCATTGACCG TAAGAAAGGC TTTGCCGAAG CCATTAAGAA TGCGCCAAAT
ATCAAAATCA TCCGCTCGCA GTCAGGTGAC TTCACCCGCA GTAAAGGCAA AGAAGTCATG
GAGAGCTTTA TCAAAGCGGA AAACAACGGC AAAAACATCT GCATGGTTTA CGCCCATAAC
GACGACATGG TGATTGGTGC AATTCAGGCA ATTAAAGAAG CGGGCCTGAA ACCGGGCAAA
GATATCCTCA CGGGTTCCAT TGACGGTGTA CCGGACATCT ACAAAGCGAT GATGGATGGC
GAAGCGAACG CCAGTGTTGA ACTGACGCCG AACATGGCGG GACCCGCTTT CGATGCGCTG
GAGAAATACA AAAAAGACGG CACCATGCCT GAAAAGCTGA CGTTAACCAA ATCCACCCTT
TACCTGCCTG ATACCGCAAA AGAAGAGTTA GAGAAGAAGA AAAATATGGG GTATTGA
 
Protein sequence
MWKRLLVVSA VSAAMSSMAL AAPLTVGFSQ VGSESGWRAA ETNVAKSEAE KRGITLKIAD 
GQQKQENQIK AVRSFVAQGV DAIFIAPVVA TGWEPVLKEA KDAEIPVFLL DRSIDVKDKS
LYMTTVTADN ILEGKLIGDW LVKEVNGKPC NVVELQGTVG ASVAIDRKKG FAEAIKNAPN
IKIIRSQSGD FTRSKGKEVM ESFIKAENNG KNICMVYAHN DDMVIGAIQA IKEAGLKPGK
DILTGSIDGV PDIYKAMMDG EANASVELTP NMAGPAFDAL EKYKKDGTMP EKLTLTKSTL
YLPDTAKEEL EKKKNMGY