Gene ECH74115_2099 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2099 
Symbol 
ID6970214 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1996774 
End bp1998324 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content51% 
IMG OID643386000 
Productputative ABC transporter periplasmic-binding protein yddS precursor 
Protein accessionYP_002270489 
Protein GI209398371 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAGAT CGATATCGTT TCGTCCCACA TTGCTCGCGC TCGTCCTTGC CACAACTTTC 
CCGGTTGCGC ACGCCGCCGT ACCGAAAGAT ATGCTGGTGA TTGGTAAGGC CGCCGATCCA
CAAACCCTCG ACCCGGCGGT AACAATAGAT AATAACGACT GGACAGTGAC CTACCCGTCT
TATCAGCGAC TGGTTCAGTA CAAAACGGAC GGTGATAAAG GCTCAACCGA CGTTGAAGGC
GATCTGGCAA GTAGCTGGAA AGCGTCTAAC GATCAAAAAG AGTGGACGTT CACCCTGAAA
GATAATGCTA AATTTGCCGA TGGCACACCT GTCACTGCCG AAGCAGTAAA ACTCTCTTTT
GAGCGGTTAC TAAAAATCGG CCAGGGGCCA GCAGAAGCAT TTCCCAAAGA TTTAAAGATT
GATGCTCCCG ACGAACATAC GGTGAAGTTT ACCCTTAGCC AGCCATTTGC ACCGTTCCTC
TACACGCTGG CGAATGACGG TGCATCCATT ATCAATCCGG CGGTATTAAA GGAACATGCG
GCGGATGATG CCCGCGGCTT CCTCGCGCAA AATACCGCCG GTTCCGGACC ATTTATGCTG
AAAAGCTGGC AAAAAGGTCA GCAATTAGTT CTGGTGCCAA ATCCGCATTA CCCCGGCAAT
AAACCGAACT TTAAGCGAGT ATCGGTAAAA ATTATCGGTG AAAGTGCCTC CCGTCGCCTG
CAGCTCTCCC GTGGCGACAT TGACATTGCC GATGCGCTGC CGGTGGATCA ACTCAACGCC
CTGAAGCAGG AAAACAAAGT CAATGTGGCA GAGTATCCGT CACTGCGCGT CACCTATCTG
TATCTGAATA ACAGCAAAGC GCCACTTAAT CAGGCGGATC TGCGGCGGGC CATTTCCTGG
TCTACCGATT ATCAGGGCAT GGTTAACGGC ATTCTGAGTG GTAACGGAAA ACAAATGCGC
GGCCCGATTC CGGAAGGCAT GTGGGGCTAC GATGCGACGG CAATGCAATA CAACCATGAC
GAAACGAAAG CCAAAGCCGA ATGGGATAAA GTGACGAGCA AACCCACCAG CCTGACGTTT
CTCTACTCCG ATAACGATCC GAACTGGGAA CCGATCGCTC TGGCAACACA ATCCAGCCTC
AACAAGCTGG GCATCAATGT GAAGCTGGAA AAGCTGGCGA ACGCCACCAT GCGCGACAGA
GTGGGTAAAG GTGATTACGA CATTGCGATT GGCAACTGGA GTCCGGATTT TGCCGACCCG
TATATGTTTA TGAATTACTG GTTTGAGTCA GACAAAAAAG GTCTGCCGGG TAACCGCTCG
TTCTATGAAA ACAGTGAGGT CGATAAGTTA CTGCGCAATG CGCTTGCGAC CACCGACCAG
ACGCAGCGTA CCCGAGACTA CCAGCAGGCA CAGAAAATCG TCATTGATGA CGCTGCTTAT
GTGTACCTGT TCCAGAAAAA CTACCAACTG GCGATGAACA AAGAGGTGAA AGGCTTTGTG
TTCAATCCCA TGCTGGAACA GGTCTTCAAT ATCAATACCA TGAGTAAATA A
 
Protein sequence
MKRSISFRPT LLALVLATTF PVAHAAVPKD MLVIGKAADP QTLDPAVTID NNDWTVTYPS 
YQRLVQYKTD GDKGSTDVEG DLASSWKASN DQKEWTFTLK DNAKFADGTP VTAEAVKLSF
ERLLKIGQGP AEAFPKDLKI DAPDEHTVKF TLSQPFAPFL YTLANDGASI INPAVLKEHA
ADDARGFLAQ NTAGSGPFML KSWQKGQQLV LVPNPHYPGN KPNFKRVSVK IIGESASRRL
QLSRGDIDIA DALPVDQLNA LKQENKVNVA EYPSLRVTYL YLNNSKAPLN QADLRRAISW
STDYQGMVNG ILSGNGKQMR GPIPEGMWGY DATAMQYNHD ETKAKAEWDK VTSKPTSLTF
LYSDNDPNWE PIALATQSSL NKLGINVKLE KLANATMRDR VGKGDYDIAI GNWSPDFADP
YMFMNYWFES DKKGLPGNRS FYENSEVDKL LRNALATTDQ TQRTRDYQQA QKIVIDDAAY
VYLFQKNYQL AMNKEVKGFV FNPMLEQVFN INTMSK