Gene ECH74115_5789 details

Gene Information

Locus tag: ECH74115_5789
Symbol:
ID: 6972350
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 5422570
End bp: 5424240
Gene Length: 1671 bp
Protein Length: 556 aa
Translation table: 11
GC content: 46%
IMG OID: 643389419
Product: site-specific recombinase, phage integrase family
Protein accession: YP_002273811
Protein GI: 209400288
COG category:
COG ID:
TIGRFAM ID:
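
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the gene length counts the stop codon while the protein length does not. The record does not state either convention; they are assumptions that happen to agree with the listed numbers.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5422570
end_bp = 5424240
gene_length_bp = 1671
protein_length_aa = 556

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS is a whole number of codons. Assumption: the final codon is
# the stop codon, which is not counted in the protein length.
codons, remainder = divmod(span, 3)
coding_codons = codons - 1

print(span, codons, remainder, coding_codons)  # 1671 557 0 556
```

Under these assumptions the span matches the listed gene length, and 557 codons minus one stop codon gives the listed 556 aa.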


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.171322
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCTCGC TTTCTAACTT TGCTCAGTCA ATCGAGTTGC CTATATCGCA ATTGATTCGA 
GAGGTGGTTA ATCGTAACCT CCCGGTATTC TGGCTGGCGA CTGGTCAGTT CGGTTTCTAT
GTTGATGAAT TTAATGCAGT AGAGCGGGAA CCGGGTGCAA AACGAGAAAA ACAGTCTGAT
GATGAAAAGG ATCAACCTAA AGAAGTCATC ATTCTCAATA GCGCGTTTGA GCTGGGTATC
GAGAGCTTCG CAAATGGTTA TCTCCGCCCC TTCAATCCCC GGCATACTTT AGATTGTCTG
TTGAGCGCTG GAGTATCCGA AGGAGAGGCT GCATTTCGAA CTAGTGGTGA TAACCAAAGT
GGAGGTTGGT TCTTCGATTT ACCCGGCGTA GATATAACTG CTGATAGCCT GTTGATTAGC
AAAGTTCATG CTGAAGGCCT TCGACTTACA TGGCTGGTTA AGACCACGCC ACCAGCAGTT
AGCATTCACC CTGCCGTGCC TCTTGTCGCT CCTGTTATCG CTAATGAATA TGTTCACCGC
AAACATTACA ATGAAAACTT GTCATGGCTT CGTGAAGAGT ATTTGAAACA TCGACGTAAG
GGCAAGGTAT CAGAAGCGGC GCTCCGCGAT ATTCGCTATT ACTTCGATTT GATGATTGAA
GTGATGGGGG ATATTCAGTT GGAAGATTTC GACCGTGATT TCCTCCGGGC TTATGAGAGC
AAGTTGCGCA CAATTCCTGC TAACCGTAAT TTGATGAAAG GTAAGCACGG GGTTAAGACG
CTGGATGAGT TAATCGCCAA AGCGGCAGAA TGTGGCGATA AACTGATGAC AGAAGAGTCT
GTCAAAAAGT ATATCAACGG CCTTTATGGT GCAATGGAGT GGGCTGTTGA TGATGGTAAG
TTTCTGAAAT CGCCATGCGA CAACTTTTTC CCTCCCGATG ACAAAGGTGA GCGAGAGCAG
GATCACACTG ACATATTTGA ACCGCATGAA ATTAAGGCAA TTTTTTCGCA ACCGTGGTTT
GTCGCTGGAA CTGTTGAACG TAATGCGCAA GGGCGATTCC ATCAATATTG CCCGTTTCAC
TATTGGGCGC CGTTGTTGGG CTTGATGACG GGGGCAAGGG TTAACGAGAT TGCACAGTTA
ATGCTGGACG ATGTTCTGGC AGATGACGGC GTTTATTACC TGAACCTTGA AAGCGATAGC
GAAAACGGAA AGAAACTAAA AAACGCCAAT TCCCGCCGCA AGATTCCGGT TCATTCTACG
CTGATTGAAC TCGGTTTTAT CGAGTATGTG GATGCGTTGA AAGCTGCCGG GTATGACCGT
CTTTTTCCCG AGCTTAAACC ACATAAAACC AAAGGCTATG GTAGGCCGGT TTCCGCATGG
TTCAATGAAT CATTGCTTGC GGGTCGATTA AAACTTGAAA GAGACAGAAG CAAATCTTTC
CACTCTTTCC GGCATTCTGT TTCAACTTTG CTTAAAGAGA AGGGTGTTAG TTCGGAACTG
CGTGGGCAGC TACTTGGGCA TGTGCGAGGC AAAACAGAAA CTGAAGTGCG ATACAGCAAA
GATTTAAAAC CGGTTCACAT GGTTGAGGTT GTCGAAAAGA TTGATTTTTC TTTGCCCGAG
ATAGCGAGAT TCAACATTCC TGATGGGCTG GATGCTGTAG AATTGATCTG A
 
Protein sequence
MISLSNFAQS IELPISQLIR EVVNRNLPVF WLATGQFGFY VDEFNAVERE PGAKREKQSD 
DEKDQPKEVI ILNSAFELGI ESFANGYLRP FNPRHTLDCL LSAGVSEGEA AFRTSGDNQS
GGWFFDLPGV DITADSLLIS KVHAEGLRLT WLVKTTPPAV SIHPAVPLVA PVIANEYVHR
KHYNENLSWL REEYLKHRRK GKVSEAALRD IRYYFDLMIE VMGDIQLEDF DRDFLRAYES
KLRTIPANRN LMKGKHGVKT LDELIAKAAE CGDKLMTEES VKKYINGLYG AMEWAVDDGK
FLKSPCDNFF PPDDKGEREQ DHTDIFEPHE IKAIFSQPWF VAGTVERNAQ GRFHQYCPFH
YWAPLLGLMT GARVNEIAQL MLDDVLADDG VYYLNLESDS ENGKKLKNAN SRRKIPVHST
LIELGFIEYV DALKAAGYDR LFPELKPHKT KGYGRPVSAW FNESLLAGRL KLERDRSKSF
HSFRHSVSTL LKEKGVSSEL RGQLLGHVRG KTETEVRYSK DLKPVHMVEV VEKIDFSLPE
IARFNIPDGL DAVELI
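
The two sequence blocks above are grouped in runs of ten letters, so the spaces and line breaks must be stripped before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked for GC content. The function names clean, gc_fraction, and translate are illustrative and do not come from the source database. The codon map is the standard one, which translation table 11 shares for codon-to-amino-acid assignments. Table 11 also permits alternative start codons, which this sketch does not handle; that is not an issue here because the sequence starts with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; first base
# varies slowest, third base fastest). "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used for 10-letter grouping."""
    return "".join(seq_block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record, used as a short example.
first_line = "ATGATCTCGC TTTCTAACTT TGCTCAGTCA ATCGAGTTGC CTATATCGCA ATTGATTCGA"
dna = clean(first_line)
print(translate(dna))  # MISLSNFAQSIELPISQLIR
```

The printed peptide matches the first twenty residues of the protein sequence listed above. Passing the full gene sequence through clean and then gc_fraction would give a value to compare against the GC content field.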