Gene Information |
Locus tag | ECH74115_2099 |
Symbol | |
ID | 6970214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 1996774 |
End bp | 1998324 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643386000 |
Product | putative ABC transporter periplasmic-binding protein yddS precursor |
Protein accession | YP_002270489 |
Protein GI | 209398371 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
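The coordinate and length fields in this table can be cross-checked against each other. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not. The function names `span_length` and `expected_protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1996774
END_BP = 1998324
GENE_LENGTH_BP = 1551
PROTEIN_LENGTH_AA = 516


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # 1551
    print(expected_protein_length(GENE_LENGTH_BP))  # 516
```

Under these assumptions both derived values agree with the Gene Length and Protein Length rows.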
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAGAGAT CGATATCGTT TCGTCCCACA TTGCTCGCGC TCGTCCTTGC CACAACTTTC CCGGTTGCGC ACGCCGCCGT ACCGAAAGAT ATGCTGGTGA TTGGTAAGGC CGCCGATCCA CAAACCCTCG ACCCGGCGGT AACAATAGAT AATAACGACT GGACAGTGAC CTACCCGTCT TATCAGCGAC TGGTTCAGTA CAAAACGGAC GGTGATAAAG GCTCAACCGA CGTTGAAGGC GATCTGGCAA GTAGCTGGAA AGCGTCTAAC GATCAAAAAG AGTGGACGTT CACCCTGAAA GATAATGCTA AATTTGCCGA TGGCACACCT GTCACTGCCG AAGCAGTAAA ACTCTCTTTT GAGCGGTTAC TAAAAATCGG CCAGGGGCCA GCAGAAGCAT TTCCCAAAGA TTTAAAGATT GATGCTCCCG ACGAACATAC GGTGAAGTTT ACCCTTAGCC AGCCATTTGC ACCGTTCCTC TACACGCTGG CGAATGACGG TGCATCCATT ATCAATCCGG CGGTATTAAA GGAACATGCG GCGGATGATG CCCGCGGCTT CCTCGCGCAA AATACCGCCG GTTCCGGACC ATTTATGCTG AAAAGCTGGC AAAAAGGTCA GCAATTAGTT CTGGTGCCAA ATCCGCATTA CCCCGGCAAT AAACCGAACT TTAAGCGAGT ATCGGTAAAA ATTATCGGTG AAAGTGCCTC CCGTCGCCTG CAGCTCTCCC GTGGCGACAT TGACATTGCC GATGCGCTGC CGGTGGATCA ACTCAACGCC CTGAAGCAGG AAAACAAAGT CAATGTGGCA GAGTATCCGT CACTGCGCGT CACCTATCTG TATCTGAATA ACAGCAAAGC GCCACTTAAT CAGGCGGATC TGCGGCGGGC CATTTCCTGG TCTACCGATT ATCAGGGCAT GGTTAACGGC ATTCTGAGTG GTAACGGAAA ACAAATGCGC GGCCCGATTC CGGAAGGCAT GTGGGGCTAC GATGCGACGG CAATGCAATA CAACCATGAC GAAACGAAAG CCAAAGCCGA ATGGGATAAA GTGACGAGCA AACCCACCAG CCTGACGTTT CTCTACTCCG ATAACGATCC GAACTGGGAA CCGATCGCTC TGGCAACACA ATCCAGCCTC AACAAGCTGG GCATCAATGT GAAGCTGGAA AAGCTGGCGA ACGCCACCAT GCGCGACAGA GTGGGTAAAG GTGATTACGA CATTGCGATT GGCAACTGGA GTCCGGATTT TGCCGACCCG TATATGTTTA TGAATTACTG GTTTGAGTCA GACAAAAAAG GTCTGCCGGG TAACCGCTCG TTCTATGAAA ACAGTGAGGT CGATAAGTTA CTGCGCAATG CGCTTGCGAC CACCGACCAG ACGCAGCGTA CCCGAGACTA CCAGCAGGCA CAGAAAATCG TCATTGATGA CGCTGCTTAT GTGTACCTGT TCCAGAAAAA CTACCAACTG GCGATGAACA AAGAGGTGAA AGGCTTTGTG TTCAATCCCA TGCTGGAACA GGTCTTCAAT ATCAATACCA TGAGTAAATA A
Protein sequence | MKRSISFRPT LLALVLATTF PVAHAAVPKD MLVIGKAADP QTLDPAVTID NNDWTVTYPS YQRLVQYKTD GDKGSTDVEG DLASSWKASN DQKEWTFTLK DNAKFADGTP VTAEAVKLSF ERLLKIGQGP AEAFPKDLKI DAPDEHTVKF TLSQPFAPFL YTLANDGASI INPAVLKEHA ADDARGFLAQ NTAGSGPFML KSWQKGQQLV LVPNPHYPGN KPNFKRVSVK IIGESASRRL QLSRGDIDIA DALPVDQLNA LKQENKVNVA EYPSLRVTYL YLNNSKAPLN QADLRRAISW STDYQGMVNG ILSGNGKQMR GPIPEGMWGY DATAMQYNHD ETKAKAEWDK VTSKPTSLTF LYSDNDPNWE PIALATQSSL NKLGINVKLE KLANATMRDR VGKGDYDIAI GNWSPDFADP YMFMNYWFES DKKGLPGNRS FYENSEVDKL LRNALATTDQ TQRTRDYQQA QKIVIDDAAY VYLFQKNYQL AMNKEVKGFV FNPMLEQVFN INTMSK
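The two sequence rows can be related by translating the gene sequence codon by codon. The following is a minimal sketch, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and it assumes the gene sequence is listed in coding orientation, which is consistent with it starting with ATG and ending with TAA. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Standard codon-to-amino-acid assignments, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks and last 21 bases of the gene sequence above.
    print(translate("ATGAAGAGAT CGATATCGTT TCGTCCCACA"))  # MKRSISFRPT
    print(translate("ATCAATACCA TGAGTAAATA A"))           # INTMSK
```

Both fragments reproduce the corresponding ends of the listed protein sequence. Passing the full gene sequence to `gc_content` gives the fraction to compare with the GC content row.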