Gene ECH74115_3892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3892 
Symbol 
ID6967250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3600133 
End bp3601446 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content31% 
IMG OID643387670 
Productsite-specific recombinase, phage integrase family protein 
Protein accessionYP_002272119 
Protein GI209398276 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAACA AATGTATTGA AAGTGAGCAA ATCTTTTTTG CTAAGATGAA CAGGTATAGT 
TTCAAACTGT CAGATAAGAA ATGGCAACTG GATAAAGAAA ACTGCGTATA CCCTCATAAA
GTTGTAGATA GAATGCCTAC AAAAATGAAA CTTAGCTACT TAAAAACATT GGCTTACTAT
GCGTCTGAAT ATAGTTCTTT TTATATTCAA AGTATTAACA ATCTATTTTA TGAGTGGTTT
GGTGCGATGA CTATCGATAC TATTGATGAC AAAGCAATAT ATCAATTGAA TGTTTATTTA
GGTTCAGAAA GAAACTACAA ACTAAACTTA ATTAAGGCTT TCATCATTAA ATGGAAAAAT
CTCAATTACC CTGGGGTAGA AGCGACTGCC ATTAGAATGC TGGAGAAAAT AAAAATCATT
CCAAACCAAA CAGGAGATGC AGTTAAAAGA CGAGATCCAA ATAAAGGACC TTTAACTGAA
GCGGAATTCA ATAACATCAT TAACGCCGTT GGAAAATTTT ATCATGAGAA GAAAATTCAA
TGCTTTTTGT ATTGTTATAT CCTTTTGCTG GCAATAACAG GAAGAAGGCC ATTACAATTA
ATATCTCTAA AAGCTAAAGA TCTCATTAAA AATGAGAGAG GGTGCTTTTT GAATGTACCA
AAAGTAAAAC AAAGAAAATG TTTCAGAAAA GAATTTAACA TGGTTATGAT AGAGCCGTTC
TTATATGACA GCTTATCAAT GCTAATTAAT CAAAATCAGG CGTTTGTTGA AGATAAATTC
AGTGTTGGGA TTAGTAACTA TAGAGGCGAA TTACCAATCT TCATGAATTT AGATAAGATT
ACGGAAACAA AAAGGATTGA GGATTTTTTA TATGATTTAA CAACAGATTT TTTCCATATG
AAAAATTCAG TTATGTCAAA ACTATTAAAA CACTTTCCGT CAAAATTCGA TGTTAGGTCA
GAAAGGACTA ACAGCTATAT AGAACTTAAT GCTAGAAGAT TCAGATATAC GTTAGGAAGT
CGACTGGCTA ATGAAGGAGC CTCAATTGAG GTGATTGCTA AAGCGTTAGA TCATAAATCA
GTAAACTCTT CTATAATATA TATAAAAAAT AATCCTGACA ACGTTTATGA CATCGATAAG
AGACTAAGTG CGTTTTTTAA CCCCTTATCT AATATACTTA TGGGCATAGA GATTGAAGAA
AACAAGAACT TTTTTATCAA GTTTGTTTCA GATGCATTTT TCTTATTGGA AGATACGAAA
GAGGATTTGA AATGTTTAAC GTGTAAAAAA TTCAATCCCT GGAGAGCATT ATGA
 
Protein sequence
MENKCIESEQ IFFAKMNRYS FKLSDKKWQL DKENCVYPHK VVDRMPTKMK LSYLKTLAYY 
ASEYSSFYIQ SINNLFYEWF GAMTIDTIDD KAIYQLNVYL GSERNYKLNL IKAFIIKWKN
LNYPGVEATA IRMLEKIKII PNQTGDAVKR RDPNKGPLTE AEFNNIINAV GKFYHEKKIQ
CFLYCYILLL AITGRRPLQL ISLKAKDLIK NERGCFLNVP KVKQRKCFRK EFNMVMIEPF
LYDSLSMLIN QNQAFVEDKF SVGISNYRGE LPIFMNLDKI TETKRIEDFL YDLTTDFFHM
KNSVMSKLLK HFPSKFDVRS ERTNSYIELN ARRFRYTLGS RLANEGASIE VIAKALDHKS
VNSSIIYIKN NPDNVYDIDK RLSAFFNPLS NILMGIEIEE NKNFFIKFVS DAFFLLEDTK
EDLKCLTCKK FNPWRAL