Gene ECH74115_2708 details


Gene Information

Locus tag: ECH74115_2708
Symbol:
ID: 6969526
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 2541349
End bp: 2542488
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 35%
IMG OID: 643386569
Product: hypothetical protein
Protein accession: YP_002271048
Protein GI: 209397909
COG category: [S] Function unknown
COG ID: [COG4886] Leucine-rich repeat (LRR) protein
TIGRFAM ID:
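
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names gene_length and protein_length are hypothetical helpers introduced here for illustration, not part of any database API.

```python
# Values copied from the record above.
START_BP = 2541349
END_BP = 2542488


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Length in aa, assuming a complete CDS made of whole codons."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon is not translated, so it does not count as a residue.
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1140
print(protein_length(length_bp))  # 379
```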


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.659455
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.29624
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTCCAT TAAATGATCT GTCGTTAAAA ACTCAGTCGG TTCAATTAAA TAAAATCACA 
TCGAATACTG AGTCTACGAT AAAACAACAC GAGTTAGTAT CTGATGATGC AATCATAAAT
GAATTATCAA GTGAGTTAGT CAGTTGTCTT GGAAATGATA AGTTTACACC AGTTAGTGAA
GACAGCAACT TACTGAATAT GCTGTCTGAA TTTAAGTTAT TGAGAGAGCA ATGTTTCAGG
TGGGGTAATT ATACTCTATT GTTTGAAAAT TATGGGGCTT ATGATAAGAC GGGATCTATC
ACGATAGAAA AAAGTCAGGG GGAGGGGACT TTACCCATTC GGCATAAATT AGAGTTTATA
TCGACCAATA TTGCAGAGTT GCTGGACAAG TTAACCAAAA TTACAGATGC CAGGCTTTGC
AAAGGTTTCA GTGACTGGGC TAGTTCAGTC AAAGAAGGCG CATCGAATGA CTTGAAAGAA
AATGTGGATA GAGCATTGGT GAGAATGTTT AAATGTGTTA AGCTTCACAG TAATGAACTT
AACTTATCAA GCCTTTCTTT GGGTTCTGTG CCGCCTCTTC CTGAGTGGAT TGAAATGCTT
AGCCTTGTTT ATAATGAACT TGATTCAATA CAGGTGCCCG AATCGTGCAA AGAATTAGAA
CTCGATTTCA ATAACCTTAC AGAATTTCCA CAAGTACCTG ATGGAATTAC CCTGATCTCC
GTAAATAATA ACCTGATATC GTATATTGAC TCATTTCCGC CAAAGGCTAA GAAAATTTTT
ATTTGTCACA ATAAGCTATC GGAAATACCA GCACTACCAG ACACCGCTAA GGTTTTTGAT
TGTAGTGAGA ATAATATTAA AGAAATTAGA TGGTTCCCCA AAAATTTGAA AGAAGCGTAT
ATTGAATATA ATAAGATTGA GGTTATTCCT GCGATACCTG GCAATTTAAA ATTACTTTGT
ATGAAATGTA ATCCTATTAA AGAGGCATTT TTAATGCCAT GGACCCTTAC AGGGATTCGC
TATGAAATAT CGCAGCGAAA ATATATTGTT ATGAATCCCG CCGATTATGA TAAATATTCC
GATATGGTTA AAAAGCATGT AATAGATGGT GAGGAATTCA TAATTAAATA TTATATGTAA
 
Protein sequence
MFPLNDLSLK TQSVQLNKIT SNTESTIKQH ELVSDDAIIN ELSSELVSCL GNDKFTPVSE 
DSNLLNMLSE FKLLREQCFR WGNYTLLFEN YGAYDKTGSI TIEKSQGEGT LPIRHKLEFI
STNIAELLDK LTKITDARLC KGFSDWASSV KEGASNDLKE NVDRALVRMF KCVKLHSNEL
NLSSLSLGSV PPLPEWIEML SLVYNELDSI QVPESCKELE LDFNNLTEFP QVPDGITLIS
VNNNLISYID SFPPKAKKIF ICHNKLSEIP ALPDTAKVFD CSENNIKEIR WFPKNLKEAY
IEYNKIEVIP AIPGNLKLLC MKCNPIKEAF LMPWTLTGIR YEISQRKYIV MNPADYDKYS
DMVKKHVIDG EEFIIKYYM
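
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is an illustrative sketch, not the pipeline that produced this record. It builds the standard codon table by hand, which is an assumption that suffices here because translation table 11 shares its elongation codons with the standard code and this gene begins with ATG; alternative start codons are not handled. The helper names clean, gc_fraction, and translate are hypothetical.

```python
# Standard genetic code in TCAG order (64 codons, '*' marks a stop).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence shown above.
first_bases = "ATGTTTCCAT TAAATGATCT GTCGTTAAAA"
print(translate(first_bases))  # MFPLNDLSLK
```

Passing the full gene sequence to translate should reproduce the protein sequence listed above, and gc_fraction applied to it can be compared against the GC content field.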