Gene ECH74115_5032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_5032 
Symbol 
ID6972422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4684091 
End bp4685344 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content48% 
IMG OID643388713 
Productintegrase 
Protein accessionYP_002273139 
Protein GI209400784 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGGGGTAT CAGGTTGGTT ACTGGCGGAA ATTCCCCCAG GAGGATGGAT TCATCAAAAG 
CTGGAGGATG TCATGGCGCT AACCGATGTT AAAGTGAAAA CCGCGAAGCC AAAAGAAAGG
CCCTATAAGC TGGCTGATGG CGGAGGTATG TATCTGCTGA TTAACGCTAA TGGTTCCAAA
TACTGGCGTA TGAAGTACCG CTTCGCCGGT AAAGAAAAGA TGCTGTCTAT TGGCGTATAC
CCCGATGTAA CGCTTGCCGA CGCACGTGAG AAACGCAGTG AAGCCAGGAA AATACTTGCG
GCTGGTGGCG ATCCGGGTGA GGCAAAAAAA GAAGAAAAAA TAGCGCTGCA AATGAGTTTG
AAAAACACTT TCGAGGCCGT TGCTCGCGAG TGGCATCAGA CGAAAGCCGA TCGCTGGTCA
CTGCGCTACC GCGATGAAAT AATCGACACC TTTGAGAAAG ATATCTTCCC CTACATCGGC
AAGCGACCTA TTGCCGAAAT CAAGCCGATG GAGCTATTGG AAGCACTACG TAAGATGGAG
AAACGTGGAG CACTGGAGAA AATGCGCAAG GTCCGCCAAC GTTGTGGTGA GGTGTTTCGC
TATGCAATTG TTACTGGTCG AGCCGACTAT AACCCTGCTC CCGATCTTGC CAGTGCTTTA
GCTACACCGA AAAAAGTACA TTTTCCCTTT CTTACTGCCA ATGAACTTCC CCACTTCCTT
ACTGATCTGG CGGGTTATAC CGGAAGTATC ATCACTAAGA CAGCTACTCA GATCATTATG
TTAACCGGTG TACGAACGCA AGAGTTGCGT TTCGCGCATT GGGAGGATAT TGATTTTGAG
GCAAAATTAT GGGAGATCCC GGCAGAAGTA ATGAAGATGA AACGGCCTCA TATTGTGCCG
CCCTCTGAGC AGGTTATTGC GCTATTTAAG CAACTTGAAC CAATCTCAAA ACATCATCCT
CTGGTCTTTA TCGGTAGGAA CGATCCTCGC AAGCCAATTA GTAAGGAGAG CATTAACCAG
GTCATTGAAT TGCTGGGGTA TAAGGGAAGA CTCACAGGGC ACGGTTTCAG GCATACCATG
AGCACAATTC TGCACGAACA AGGCTTTAAT TCTGCCTGGA TTGAAATGCA GTTGGCTCAT
GTGGATAAAA ACTCCATCAG GGGTACCTAT AACCACGCCC AGTATCTCGA TGGTCGCCGT
GAAATGATGC AATGGTACGC TGACTACATT GATTCACTTT CTGAGCTGGC CTGA
 
Protein sequence
MGVSGWLLAE IPPGGWIHQK LEDVMALTDV KVKTAKPKER PYKLADGGGM YLLINANGSK 
YWRMKYRFAG KEKMLSIGVY PDVTLADARE KRSEARKILA AGGDPGEAKK EEKIALQMSL
KNTFEAVARE WHQTKADRWS LRYRDEIIDT FEKDIFPYIG KRPIAEIKPM ELLEALRKME
KRGALEKMRK VRQRCGEVFR YAIVTGRADY NPAPDLASAL ATPKKVHFPF LTANELPHFL
TDLAGYTGSI ITKTATQIIM LTGVRTQELR FAHWEDIDFE AKLWEIPAEV MKMKRPHIVP
PSEQVIALFK QLEPISKHHP LVFIGRNDPR KPISKESINQ VIELLGYKGR LTGHGFRHTM
STILHEQGFN SAWIEMQLAH VDKNSIRGTY NHAQYLDGRR EMMQWYADYI DSLSELA