Gene ECH74115_5111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5111 
Symbol 
ID6970232 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4751981 
End bp4753696 
Gene Length1716 bp 
Protein Length571 aa 
Translation table11 
GC content53% 
IMG OID643388783 
Productputative symporter YidK 
Protein accessionYP_002273209 
Protein GI209399812 
COG category[R] General function prediction only 
COG ID[COG4146] Predicted symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.393136 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATTCGT TACAAATCTT GAGTTTTGTC GGTTTTACGC TGCTGGTGGC GATCATCACC 
TGGTGGAAGG TTCGCAAAAC AGATACCGGA TCGCAACAAG GCTATTTTCT TGCCGGACGT
TCACTAAAAG CGCCGGTTAT TGCCGCTTCG TTAATGTTAA CCAACCTTTC CACGGAACAA
CTGGTTGGTC TTTCCGGGCA GGCCTACAAA AGCGGCATGT CGGTGATGGG CTGGGAAGTG
ACTTCAGCGG TGACGCTGAT CTTCCTCGCG CTAATCTTTT TACCGCGCTA TCTGAAGCGC
GGCATTGCCA CCATCCCCGA TTTTCTGGAG GAACGTTATG ATAAAACGAC GCGTATGATC
ATCGACTTCT GCTTCCTCAT TGCCACCGGC GTCTGCTTTC TGCCGATTGT TCTCTACTCC
GGCGCGTTGG CGCTCAACAG CCTGTTTCAC GTCGGGGAAT CGCTACAGAT TTCTCACGGT
GCGGCTATCT GGCTACTGGT AATTTTGCTT GGTCTGGCGG GAATTTTGTA TGCGGTGATC
GGCGGACTGC GCGCAATGGC AGTGGCGGAC TCCATCAACG GTATTGGGCT GGTTATCGGC
GGGTTGATGG TGCCGGTATT TGGCTTGATC GCGATGGGCA AGGGCAGCTT TATGCAGGGC
ATTGAGCAAA TTACCACCGT TCACGCCGAG AAATTAAACT CAGTCGGTGG CCCGACCGAT
CCCTTGCCGA TTGGCGCGGC ATTTACCGGT TTAATTCTGG TGAACACCTT TTACTGGTGT
ACAAATCAGG GCATCGTGCA ACGCACGCTG GCGTCAAAAA GCCTGGCGGA AGGGCAAAAG
GGGGCGCTGT TAACGGCGGT GCTGAAAATG CTCGACCCGC TGGTACTGGT GCTGCCAGGG
TTGATTGCGT TTCATCTGTA TCAGGATCTA CCGAAAGCCG ACATGGCCTA CCCGACGCTG
GTCAATAACG TTCTGCCAGT GCCACTGGTG GGTTTCTTCG GCGCGGTGTT ATTTGGTGCA
GTGATCAGTA CCTTCAACGG CTTTCTGAAT AGCGCCAGTA CGTTATTCAG TATGGGTATT
TACCGTCGCA TCATTAACCA GAATGCCGAG CCGCAGCAGC TGGTCACCGT CGGGCGCAAA
TTTGGTTTCT TTATCGCTAT CGTTTCGGTG CTGGTAGCGC CGTGGATCGC CAACGCGCCG
CAGGGGCTGT ATAGCTGGAT GAAACAGCTC AACGGCATTT ACAACGTGCC GCTGGTTACC
ATCATCATTA TGGGCTTTTT CTTCCTGCGC ATCCCGGCGC TGGCGGCAAA AGTGGCGATG
GGGATTGGCA TAATCAGCTA CATCACCATC AACTATCTGG TGAAGTTCGA CTTCCATTTC
CTCTATGTGC TGGCCTGTAC GTTCTGCATC AACGTGGTCG TGATGCTGGT GATCGGTTTT
ATCAAACCGC GCGCCACGCC GTTCACCTTC AAAGATGCGT TTGCGGTGGA CATGAAACCG
TGGAAAAACG TCAAGATCGC GTCAATTGGC ATCCTGTTCG CGATGATTGG CGTCTATGCC
GGGCTGGCTG AATTCGGCGG CTACGGTACG CGCTGGTTAG CGATGATCAG TTATTTCATT
GCCGCCGTAG TGATTGTCTA CCTGATTTTT GACAGCTGGC GGCATCGTCA CGACCCAGCC
GTAACCTTTA CTCCCGACGC GAAGGATAGC CTATGA
 
Protein sequence
MNSLQILSFV GFTLLVAIIT WWKVRKTDTG SQQGYFLAGR SLKAPVIAAS LMLTNLSTEQ 
LVGLSGQAYK SGMSVMGWEV TSAVTLIFLA LIFLPRYLKR GIATIPDFLE ERYDKTTRMI
IDFCFLIATG VCFLPIVLYS GALALNSLFH VGESLQISHG AAIWLLVILL GLAGILYAVI
GGLRAMAVAD SINGIGLVIG GLMVPVFGLI AMGKGSFMQG IEQITTVHAE KLNSVGGPTD
PLPIGAAFTG LILVNTFYWC TNQGIVQRTL ASKSLAEGQK GALLTAVLKM LDPLVLVLPG
LIAFHLYQDL PKADMAYPTL VNNVLPVPLV GFFGAVLFGA VISTFNGFLN SASTLFSMGI
YRRIINQNAE PQQLVTVGRK FGFFIAIVSV LVAPWIANAP QGLYSWMKQL NGIYNVPLVT
IIIMGFFFLR IPALAAKVAM GIGIISYITI NYLVKFDFHF LYVLACTFCI NVVVMLVIGF
IKPRATPFTF KDAFAVDMKP WKNVKIASIG ILFAMIGVYA GLAEFGGYGT RWLAMISYFI
AAVVIVYLIF DSWRHRHDPA VTFTPDAKDS L