Gene ECH74115_0531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0531 
Symbol 
ID6967412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp535262 
End bp536962 
Gene Length1701 bp 
Protein Length566 aa 
Translation table11 
GC content53% 
IMG OID643384578 
Productextracellular solute-binding protein 
Protein accessionYP_002269092 
Protein GI209398118 
COG category[R] General function prediction only 
COG ID[COG4533] ABC-type uncharacterized transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.585387 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.74128 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACTGC TCAACCGCCT TAACCAGTAT CAACGTCTGT GGCAACCTTC CGCCGGAAAG 
CCGCAAACCG TCACCGTCAG CGAACTGGCC GAACGCTGTT TTTGCAGCGA ACGCCATGTT
CGTACGCTGT TGCGTCAGGC ACAGGAAGCG GGATGGCTGG AGTGGCAGGC GCAGTCAGGA
CGCGGAAAGC GCGGACAATT ACGCTTTCTG GTCACGCCAG AATCGCTACG CAATGCGATG
ATGGAAAAGG CACTGGAAAC CGGAAAGCAG CAAGATGTGC TGGAGCTGGC GCAACTGGCC
CCAGGTGAGC TGCGCACTCT GTTACAGCCG TTTATGGGCG GACAATGGCA AAACGATACA
CCCACGTTGC GTATTCCCTA CTATCGCCCG CTCGAACCGC TACAACCGGG CTTTTTGCCC
GGCCGTGCCG AGCAGCATCT CGCCGGACAG ATATTTTCCG GCCTGACCCG CTTCGATAAT
AATACCCAGC GCCCGATTGG CGATTTAGCG CATCACTGGG AAACCTCTGC TGACGGGTTA
CGCTGGGACT TTTATCTTCG TTCAACCCTA CACTGGCATA ACGGCGATGC AGTAAAAGCC
TCACACTTAC ACCAGCGATT ATTGATGCTG TTACAACTGC CAGCACTGGA TCAATTATTT
ATTAGCGTGA AGCGTATTGA GGTCACCCAT CCGCAGTGTC TGACCTTCTT TTTACATCGC
CCCGATTACT GGCTTGCGCA CCGGCTGGCG AGCTATTGCA GCCATCTGGC GCATCCGCAA
TTCCCACTGA TTGGCACGGG TCCTTTTCGC TTAACACAAT TCACAGCTGA ACTGGTGCGC
CTGGAAAGCC ATGATTATTA CCATTTACGT CATCCGCTGC TTAAAGCGGT TGAGTACTGG
ATAACTCCGC CGCTTTTCGA AAAAGATTTG GGAACCAGTT GTCGGCATCC CGTGCAAATC
ACCATCGGCA AACCGGAGGA GCTGCAACGG GTCAGCCAGG TCAGTAGTGG CATCAGTTTA
GGTTTTTGCT ATTTAACGTT GCGCAAAAGT CCCCGACTCT CCCTCTGGCA GGCGCGAAAA
GTGATCTCCA TTATTCATCA ATCCGGTTTA TTACAAACGT TAGAAGTCGG AGAAAACCTG
ATCACCGCCA GTCATGCATT ACTGCCAGGC TGGACTATTC CGCATTGGCA AGTACCGGAT
GAAGTCAAAC TACCGAAAAC CTTGACGCTG GTTTATCACC TCCCGATAGA ACTTCATACC
ATGGCAGAAC GCCTACAGGC GACACTGGCA GCGGAAGGCT GTGAACTCAC AATTATTTTT
CATAACGCAA AAAACTGGGA CGACACGACC CTACTGGCAC ACGCAGACCT CATGATGGGC
GACAGATTAA TTGGCGAAGC ACCGGAATAT ACTCTGGAGC AATGGCTGCG CTGCGATCCG
CTGTGGCCAC ATGTTTTCGA CGCTCCAGCA TACGCACATC TACAATCGAC ACTGGACGCG
GTGCAAGTAA TGCCTGATGA GGAAAACCGA TTTAATGCCC TGAAAGCGGT TTTTAGCCAG
TTAATGACAG ATGCGACGCT GACGCCGCTG TTCAACTATC ACTATCGCAT TAGTGCCCCT
CCCGGCGTGA ACGGTGTGCG ACTGACACCG CGCGGCTGGT TTGAATTTAC CGAAGCCTGG
CTTCCCGCGC CGTCGCAATG A
 
Protein sequence
MRLLNRLNQY QRLWQPSAGK PQTVTVSELA ERCFCSERHV RTLLRQAQEA GWLEWQAQSG 
RGKRGQLRFL VTPESLRNAM MEKALETGKQ QDVLELAQLA PGELRTLLQP FMGGQWQNDT
PTLRIPYYRP LEPLQPGFLP GRAEQHLAGQ IFSGLTRFDN NTQRPIGDLA HHWETSADGL
RWDFYLRSTL HWHNGDAVKA SHLHQRLLML LQLPALDQLF ISVKRIEVTH PQCLTFFLHR
PDYWLAHRLA SYCSHLAHPQ FPLIGTGPFR LTQFTAELVR LESHDYYHLR HPLLKAVEYW
ITPPLFEKDL GTSCRHPVQI TIGKPEELQR VSQVSSGISL GFCYLTLRKS PRLSLWQARK
VISIIHQSGL LQTLEVGENL ITASHALLPG WTIPHWQVPD EVKLPKTLTL VYHLPIELHT
MAERLQATLA AEGCELTIIF HNAKNWDDTT LLAHADLMMG DRLIGEAPEY TLEQWLRCDP
LWPHVFDAPA YAHLQSTLDA VQVMPDEENR FNALKAVFSQ LMTDATLTPL FNYHYRISAP
PGVNGVRLTP RGWFEFTEAW LPAPSQ