Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0531 |
Symbol | |
ID | 6967412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 535262 |
End bp | 536962 |
Gene Length | 1701 bp |
Protein Length | 566 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643384578 |
Product | extracellular solute-binding protein |
Protein accession | YP_002269092 |
Protein GI | 209398118 |
COG category | [R] General function prediction only |
COG ID | [COG4533] ABC-type uncharacterized transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.585387 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 0.74128 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACTGC TCAACCGCCT TAACCAGTAT CAACGTCTGT GGCAACCTTC CGCCGGAAAG CCGCAAACCG TCACCGTCAG CGAACTGGCC GAACGCTGTT TTTGCAGCGA ACGCCATGTT CGTACGCTGT TGCGTCAGGC ACAGGAAGCG GGATGGCTGG AGTGGCAGGC GCAGTCAGGA CGCGGAAAGC GCGGACAATT ACGCTTTCTG GTCACGCCAG AATCGCTACG CAATGCGATG ATGGAAAAGG CACTGGAAAC CGGAAAGCAG CAAGATGTGC TGGAGCTGGC GCAACTGGCC CCAGGTGAGC TGCGCACTCT GTTACAGCCG TTTATGGGCG GACAATGGCA AAACGATACA CCCACGTTGC GTATTCCCTA CTATCGCCCG CTCGAACCGC TACAACCGGG CTTTTTGCCC GGCCGTGCCG AGCAGCATCT CGCCGGACAG ATATTTTCCG GCCTGACCCG CTTCGATAAT AATACCCAGC GCCCGATTGG CGATTTAGCG CATCACTGGG AAACCTCTGC TGACGGGTTA CGCTGGGACT TTTATCTTCG TTCAACCCTA CACTGGCATA ACGGCGATGC AGTAAAAGCC TCACACTTAC ACCAGCGATT ATTGATGCTG TTACAACTGC CAGCACTGGA TCAATTATTT ATTAGCGTGA AGCGTATTGA GGTCACCCAT CCGCAGTGTC TGACCTTCTT TTTACATCGC CCCGATTACT GGCTTGCGCA CCGGCTGGCG AGCTATTGCA GCCATCTGGC GCATCCGCAA TTCCCACTGA TTGGCACGGG TCCTTTTCGC TTAACACAAT TCACAGCTGA ACTGGTGCGC CTGGAAAGCC ATGATTATTA CCATTTACGT CATCCGCTGC TTAAAGCGGT TGAGTACTGG ATAACTCCGC CGCTTTTCGA AAAAGATTTG GGAACCAGTT GTCGGCATCC CGTGCAAATC ACCATCGGCA AACCGGAGGA GCTGCAACGG GTCAGCCAGG TCAGTAGTGG CATCAGTTTA GGTTTTTGCT ATTTAACGTT GCGCAAAAGT CCCCGACTCT CCCTCTGGCA GGCGCGAAAA GTGATCTCCA TTATTCATCA ATCCGGTTTA TTACAAACGT TAGAAGTCGG AGAAAACCTG ATCACCGCCA GTCATGCATT ACTGCCAGGC TGGACTATTC CGCATTGGCA AGTACCGGAT GAAGTCAAAC TACCGAAAAC CTTGACGCTG GTTTATCACC TCCCGATAGA ACTTCATACC ATGGCAGAAC GCCTACAGGC GACACTGGCA GCGGAAGGCT GTGAACTCAC AATTATTTTT CATAACGCAA AAAACTGGGA CGACACGACC CTACTGGCAC ACGCAGACCT CATGATGGGC GACAGATTAA TTGGCGAAGC ACCGGAATAT ACTCTGGAGC AATGGCTGCG CTGCGATCCG CTGTGGCCAC ATGTTTTCGA CGCTCCAGCA TACGCACATC TACAATCGAC ACTGGACGCG GTGCAAGTAA TGCCTGATGA GGAAAACCGA TTTAATGCCC TGAAAGCGGT TTTTAGCCAG TTAATGACAG ATGCGACGCT GACGCCGCTG TTCAACTATC ACTATCGCAT TAGTGCCCCT CCCGGCGTGA ACGGTGTGCG ACTGACACCG CGCGGCTGGT TTGAATTTAC CGAAGCCTGG CTTCCCGCGC CGTCGCAATG A
|
Protein sequence | MRLLNRLNQY QRLWQPSAGK PQTVTVSELA ERCFCSERHV RTLLRQAQEA GWLEWQAQSG RGKRGQLRFL VTPESLRNAM MEKALETGKQ QDVLELAQLA PGELRTLLQP FMGGQWQNDT PTLRIPYYRP LEPLQPGFLP GRAEQHLAGQ IFSGLTRFDN NTQRPIGDLA HHWETSADGL RWDFYLRSTL HWHNGDAVKA SHLHQRLLML LQLPALDQLF ISVKRIEVTH PQCLTFFLHR PDYWLAHRLA SYCSHLAHPQ FPLIGTGPFR LTQFTAELVR LESHDYYHLR HPLLKAVEYW ITPPLFEKDL GTSCRHPVQI TIGKPEELQR VSQVSSGISL GFCYLTLRKS PRLSLWQARK VISIIHQSGL LQTLEVGENL ITASHALLPG WTIPHWQVPD EVKLPKTLTL VYHLPIELHT MAERLQATLA AEGCELTIIF HNAKNWDDTT LLAHADLMMG DRLIGEAPEY TLEQWLRCDP LWPHVFDAPA YAHLQSTLDA VQVMPDEENR FNALKAVFSQ LMTDATLTPL FNYHYRISAP PGVNGVRLTP RGWFEFTEAW LPAPSQ
|
| |