Gene ECH74115_2080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2080 
Symbol 
ID6970802 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1978202 
End bp1979461 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content33% 
IMG OID643385985 
Productleucine-rich repeat protein 
Protein accessionYP_002270474 
Protein GI209397006 
COG category[S] Function unknown 
COG ID[COG4886] Leucine-rich repeat (LRR) protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTCC CTTCAATATT TAACAAAATA AAACCACAAT CCATACAGCA ACATCCAGAA 
AAAAATCAAC TTAACTGGAT GCTCGAATTA AATAAATGGA AAGAAGAACG TATACTTACA
GGTGAAATCC ATCGTCCGGA ATGTCGAAAC GAAGCCGCTA AAAGGATAAG CTGTGCTTTT
TTGTCGAAAC AGAATGACAT TGATTTATCA GGACTTAATT TATCTACTCA ACCACCAGGG
CTGCAAAACT TCACCTCTAT CAATCTTGAT AATAACCAAC TCACACATTT TGATGCAACC
AACTACGATA GACTCGTAAA ACTTAGTCTG AATAGTAACA CTCTTGAGTC AATAAATATT
CATCAAGGCA GAAATGTAAG CATTACACAT ATATCTATGA ATAATAATTG TCTCAGAAAT
ATTGATATAG ATAGGCTTTC ATCAATTACT TATTTTAGTG CGGCACATAA TAAACTAGAG
TTTGTGCAAT TAGAATCTTG CGAATGGCTG CAATACCTGA ATCTCAGCCA TAATCAATTA
ACTGATATTG TTACAGGAAA TAAAGAAGAA CTCTTACTGC TGGATCTATC CCATAATAAA
CTAGCAAGTT TACACAATGC CTTATTTCCC AACTTAAATA CGTTACTTAT CAACAACAAC
TTGCTTTCTG AAATTAAAAT GTTTTATAGC AACTTCTGCA AAGTTCAGAC ATTAAACGCT
GCTAACAATC AGTTGGAAAA AATAAACCTT CATTTCCTGA CTTATCTTTC ATCTATCAAA
AGTTTAAGGC TGGACAATAA TAAAATAACT CGCATTGATA CTGAGAACAC ATCCGATATT
AGAAGTTTAT TCCCCATAAT AAAGAAGAGC GAAAGCTTAA ATTTTTTAAA TATTTCTGGC
GAGAACAATT GCCCTACTAT CCAGCTCATG TTATTTAATT TGTTTTCCCC AGCACTTAAG
CTTAATACTG GCCTGGCAAT TCTTTCGCCT GGTGCATTTG AAGATCACTC TGACGGATTA
GATGTGGATA ACGAATTGTT TCACTATACT ATTAATAAAG CATATACCCC ATATAATATA
CATACTTATA AAACAGAAGA AGTTGTAAAC CAGAGGAATA TAAAAATTAA AAATATGACC
TTAGATGAAA TAAACAATAC TTATTGTAAT AACGATTATT ACAATGAGGC AATAAGAGAG
GAACCGATAG ACTTTCTGGA CAGATCGTTT TCCTCCAGCT CATGGCCTTT TTATCACTAA
 
Protein sequence
MKFPSIFNKI KPQSIQQHPE KNQLNWMLEL NKWKEERILT GEIHRPECRN EAAKRISCAF 
LSKQNDIDLS GLNLSTQPPG LQNFTSINLD NNQLTHFDAT NYDRLVKLSL NSNTLESINI
HQGRNVSITH ISMNNNCLRN IDIDRLSSIT YFSAAHNKLE FVQLESCEWL QYLNLSHNQL
TDIVTGNKEE LLLLDLSHNK LASLHNALFP NLNTLLINNN LLSEIKMFYS NFCKVQTLNA
ANNQLEKINL HFLTYLSSIK SLRLDNNKIT RIDTENTSDI RSLFPIIKKS ESLNFLNISG
ENNCPTIQLM LFNLFSPALK LNTGLAILSP GAFEDHSDGL DVDNELFHYT INKAYTPYNI
HTYKTEEVVN QRNIKIKNMT LDEINNTYCN NDYYNEAIRE EPIDFLDRSF SSSSWPFYH