Gene ECH74115_3057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3057 
Symbol 
ID6968913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2832149 
End bp2833372 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content35% 
IMG OID643386889 
Producthypothetical protein 
Protein accessionYP_002271357 
Protein GI209400285 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000245353 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.000000000123111 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGCATGG TTGAATGGAT AAGCCACGAG AAAAAAGATA GCCTGATACT ATTTGTACAT 
GGATTAAATG GTGGTTTGGA AACATGGAAC TTTAATAAAG AAACTTCATT TCCAAAACTA
TTAGCAGAAG ATGAAGACAT TGGCAATGTA TTTGATATTG CATGTTTCAA TTATTTCACC
AAATTTACAC AAACGTATGC TAAAACGTCA GGATTATGGG CTCGTATTTT CTCTAAGAAA
AATAAGTTAG AGAGAAATTT ACCAACTGAT GAAATAGCTG AGTTATTGTA TACTGAGATT
AGGGTAACGC TAAGTGATTA CTCTCGAATT ATAATAATCG CCCATAGTAT GGGGGGGGTA
ATATCAAAAA ATCTAATTCT AAAAAAAGTT GAGCATGAGG AACAATCTAA TATAATTGGA
TTCATTTCGT TAGCAGTCCC TCACTTTGGT GCTAAATTGG CGAATATAAC TTCAATGGTA
TCTTCAAATG CTCAGTTAGT AGATCTTGGA TTGTTAAGTG AAGCGACCGA TACGTTGAAT
AGGCGCTGGA TAAATTGTAG CAAAAAACTT CCTATCACTA GATATGTTTA TGGTTCTCAT
GACACTATAG TTGATAAAAA AAGCGCATTA CCGATGGATA GTGAACGAAA TAACTCAATT
GCAGTTAATG AAGGGCATAG CTCAATTTGC AAGCCCGAAA ACAGCTCGTC TACAGTGTTT
GTTGCGGTAA AGCAATTTAT TCAGCAGATA AATTTAGAAG CACCCAAGAA GATATGTGTT
GAGCGTTTTT CTGATGAGAA ACAATATGAT AATGAGTATT TCGTCTTAAA AATGATTGTC
GCAGATATAC ATCAGGATAT CGCTATGCAT GCAAAGGAAT ATTATTATAA TGCTGAGCTT
GCGAGAAATA TTTTTACGAG TGATTACGAT AGGAAACTAC TTGGTCATTT GTATTCTAAA
ATAAGGGAGA TTTACCAGGA AGAGTATGAG CAATATATCG CGAATTCAAT TTCCCCAGAT
AAATTTATTG CTGCTGTTCA TAGACGAATA GCCCAAGAGG ATAAGTCGTC GTTAGATTCT
CTTATAAAAA GTTTAGAAAC GATACATAAG AAAGGAATGT TGCATCAACT GGCTAACAAA
AATGACAGGG ATATTGTCTG GTCATCTGAA ACCAGCGTAG AAACTTTGGA GCAATTACGG
CGAGGTAATC ATGAAGAAAA ATAA
 
Protein sequence
MSMVEWISHE KKDSLILFVH GLNGGLETWN FNKETSFPKL LAEDEDIGNV FDIACFNYFT 
KFTQTYAKTS GLWARIFSKK NKLERNLPTD EIAELLYTEI RVTLSDYSRI IIIAHSMGGV
ISKNLILKKV EHEEQSNIIG FISLAVPHFG AKLANITSMV SSNAQLVDLG LLSEATDTLN
RRWINCSKKL PITRYVYGSH DTIVDKKSAL PMDSERNNSI AVNEGHSSIC KPENSSSTVF
VAVKQFIQQI NLEAPKKICV ERFSDEKQYD NEYFVLKMIV ADIHQDIAMH AKEYYYNAEL
ARNIFTSDYD RKLLGHLYSK IREIYQEEYE QYIANSISPD KFIAAVHRRI AQEDKSSLDS
LIKSLETIHK KGMLHQLANK NDRDIVWSSE TSVETLEQLR RGNHEEK