Gene ECH74115_4952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4952 
Symbol 
ID6971265 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4593466 
End bp4595436 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content55% 
IMG OID643388635 
Producthypothetical protein 
Protein accessionYP_002273062 
Protein GI209399472 
COG category[S] Function unknown 
COG ID[COG3533] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATTT CGGAAGTCGA TCTGCATAAA CTGACGGTCA GCGATCCGTT CCTCGGTCAG 
TACCAACAAC TGGTCCGCGA CGTGGTGATT CCTTATCAGT GGGATGCCTT GAACGATCGT
ATCCCAGAAG CGGAACCCAG CCATGCGATT GAAAACTTTC GCATTGCCGC CGGACTTCAG
GAGGGTGAAT TTTACGGGAT GGTGTTTCAG GACAGCGACG TCGCCAAATG GCTGGAAGCG
GTAGCCTGGT CGCTGTGCCA GAAGCCGGAC GCCGAACTGG AAAAAACCGC CGACGAGGTA
ATCGAACTGA TCGCCTCCGC CCAATGCGAA GACGGCTATC TCAATACTTA CTTTACGGTA
AAAGCACCCG AAGAACGCTG GAGCAATCTG GCGGAGTGTC ATGAACTTTA CTGCGCAGGT
CATCTGATTG AAGCCGGAGT CGCCTTCTTC CAGGCCACGG GAAAACGGCG CTTGCTGGGG
GTCGTTTGCC GCCTGGCCGA TCATATCGAC AGCGTATTTG GTCCAGATGA AAGTAAGTTA
CACGGTTATC CTGGTCACCC GGAAATTGAA CTGGCACTAA TGCGCCTGTA TGAAGTGACT
GAAGAGCCGC GCTACCTGGC GCTGACGAAC TATTTTGTCG AACAGCGTGG TGCGCAACCG
CACTATTACG ACCAGGAATA TGAAAAGCGC GGGCAGACAT CGCACTGGCA CACCTACGGC
CCGGCGTGGA TGGTGAAAGA CAAAGCCTAC AGCCAGGCAC ATTTGCCCCT TGCGCAACAG
CAAACCGCCA TCGGTCACGC GGTACGTTTT GTCTATCTGA TGACCGGCGT CGCGCATCTC
GCGCGTTTAA GTCACGATGA CAGCAAGCGT CAGGACTGCC TGCGGCTGTG GAACAATATG
GCCCAGCGTC AGTTATATAT TACCGGCGGC ATCGGCTCAC AAAGCAGCGG CGAAGCGTTC
AGCAGCGATT ACGATCTGCC GAATGACACG GTTTACGCCG AAAGTTGTGC TTCCATCGGC
CTGATGATGT TCGCCCGACG AATGCTGGAA ATGGAAGGCG ACAGTCAATA TGCCGATGTG
ATGGAGCGCG CACTGTACAA CACTGTGCTC GGCGGCATGG CATTGGATGG CAAACATTTC
TTCTATGTGA ATCCACTGGA AGTACATCCA AAATCGCTGA AATTCAACCA TATCTACGAT
CACGTTAAAC CGATCCGCCA GCGTTGGTTT GGCTGCGCTT GTTGTCCGCC AAATATCGCC
CGCGTGCTGA CCTCGATTGG TCATTATCTC TACACGCCGC GTGAAGATGC GTTGTATATC
AACATCTACG CAGGAAACAG CATGGAAGTG CCGGTAGAAA ATGGCACGCT GCGCCTGCGG
GTTAGCGGGA ACTATCCGTG GCAGGAGCAG GTGACGATTG CGGTTGAATC GCCCCAGCCG
GTACGTCATA CGCTGGCTTT ACGTCTGCCG GACTGGTGCA CACAGCCGCA GATCATATTG
AATGGGGAAG AGGTCGAGCA GGATATTCGT AAAGGGTATT TGCACATTAC CCGCGAATGG
CAGGAGGGCG ATACGCTGAA TCTGACTTTG CCGATGCCGG TACGCCGCGT TTACGGTAAC
CCGCTGGTGC GTCACGTCGC CGGAAAAGTG GCGATTCAGC GCGGCCCGCT GGTGTATTGC
CTGGAAAAGG CCGACAACGG CGAGTCACTG CATAATCTGT GGCTGCCCAC CGATGCGCCA
TTTACGACAT TTGAAGGCAA GGGATTGTTT AGCCATAAGA TCTTAATCCA GGCACCGGGT
TACCGGTATG AACAGAGCAA TCCAGAGCAG CAACCGCTGT GGCATTACGA CAGCGCGCCA
GCCAAACGCC AGACGCAAAC TCTGACCTTT ATCCCGTGGT TTAGCTGGGC CAACCGGGGT
GAAGGCGAAA TGCGGATCTG GGTGAATGAG GAAAAGCATT GCCATCCGTA G
 
Protein sequence
MNISEVDLHK LTVSDPFLGQ YQQLVRDVVI PYQWDALNDR IPEAEPSHAI ENFRIAAGLQ 
EGEFYGMVFQ DSDVAKWLEA VAWSLCQKPD AELEKTADEV IELIASAQCE DGYLNTYFTV
KAPEERWSNL AECHELYCAG HLIEAGVAFF QATGKRRLLG VVCRLADHID SVFGPDESKL
HGYPGHPEIE LALMRLYEVT EEPRYLALTN YFVEQRGAQP HYYDQEYEKR GQTSHWHTYG
PAWMVKDKAY SQAHLPLAQQ QTAIGHAVRF VYLMTGVAHL ARLSHDDSKR QDCLRLWNNM
AQRQLYITGG IGSQSSGEAF SSDYDLPNDT VYAESCASIG LMMFARRMLE MEGDSQYADV
MERALYNTVL GGMALDGKHF FYVNPLEVHP KSLKFNHIYD HVKPIRQRWF GCACCPPNIA
RVLTSIGHYL YTPREDALYI NIYAGNSMEV PVENGTLRLR VSGNYPWQEQ VTIAVESPQP
VRHTLALRLP DWCTQPQIIL NGEEVEQDIR KGYLHITREW QEGDTLNLTL PMPVRRVYGN
PLVRHVAGKV AIQRGPLVYC LEKADNGESL HNLWLPTDAP FTTFEGKGLF SHKILIQAPG
YRYEQSNPEQ QPLWHYDSAP AKRQTQTLTF IPWFSWANRG EGEMRIWVNE EKHCHP