Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0435 |
Symbol | |
ID | 6970779 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 444943 |
End bp | 445974 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643384487 |
Product | extracellular solute-binding protein, family 1 |
Protein accession | YP_002269001 |
Protein GI | 209399886 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1840] ABC-type Fe3+ transport system, periplasmic component |
TIGRFAM ID | [TIGR01254] ABC transporter periplasmic binding protein, thiB subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATGA CCTTTACCTC TACTCTGATT GCTTCTGCCG TTGCGCTGGC GACTTTAACG GGTGCCGCGC AGGCAAAAGG CCGGCTGGTG GTCTATTGCA GCGCCACCAA TGAAATGTGT GAGGCCGAAA CCAAAGCGTT TGGCGAGAAA TATGATGTAA AAACCTCGTT TATCCGCAAC GGTTCCGGAA GCACCCTGGC AAAAGTCGAT GCTGAAAAGA AAAACCCCCA GGCTGACGTC TGGTACGGCG GCACTTTAGA CCCGCAATCG CAGGCTGGCG AGATGGGCCT GCTACAGGCG TATAAATCCC CTAACCTTGA GCAGATCATG GAGAAATTCC GCGATCCGGC AAAAGTGAAA GGGAATCTGT CCTCTGCCGT TTATGTGGGG ATCCTCGGTT TTGGCGTAAA CACCGATCGT CTGAAAGAGA AAAACCTGCC GGTGCCGAAG TGCTGGAAAG ATCTCACCAA ACCGGAATAT AAAGGCGAAA TTCAAATTGC CGATCCGCAA AGTTCCGGTA CTGCCTACAC CGCCCTGGCT ACTTTTGCGC AGCTGTGGGG GGAAGATCAG GCCTTTGATT ATCTAAAACA ACTGAACGCC AACGTCTCTC AGTACACCAA ATCAGGCATA GCCCCGGCAC GTAACGCCGC CCGTGGCGAA ACGGCGATTG GTATTGGCTT CCTGCATGAC TACTCGCTGG AAAAAGAACA GGGCGCGCCG CTGGAGTTGA TTTCCCCGTG TGAAGGTACG GGCTACGAAA TTGGCGGCGT CAGCATTCTG AAAGGCGCGC GCAACCTCGA CAACGCCAAA CTGTTTGTGG ACTGGGTATT ATCAAAAGAA GCTCAGGAAC TGGCGTGGAA GAAAGGGAAG TCTTATCAGA TCCTGACTAA TACCACCGCC GAAACGTCGC CAAATTCGCT GAAGCTCGAC GACCTGAAAT TAATCAACTA CGACATGGAT AAATATGGTT CCACGGATGT TCGTAAGGCA TTGATTAATA AATGGGTCAG CGAAGTGAAG ATGGGTAAAT AA
|
Protein sequence | MKMTFTSTLI ASAVALATLT GAAQAKGRLV VYCSATNEMC EAETKAFGEK YDVKTSFIRN GSGSTLAKVD AEKKNPQADV WYGGTLDPQS QAGEMGLLQA YKSPNLEQIM EKFRDPAKVK GNLSSAVYVG ILGFGVNTDR LKEKNLPVPK CWKDLTKPEY KGEIQIADPQ SSGTAYTALA TFAQLWGEDQ AFDYLKQLNA NVSQYTKSGI APARNAARGE TAIGIGFLHD YSLEKEQGAP LELISPCEGT GYEIGGVSIL KGARNLDNAK LFVDWVLSKE AQELAWKKGK SYQILTNTTA ETSPNSLKLD DLKLINYDMD KYGSTDVRKA LINKWVSEVK MGK
|
| |