Gene ECH74115_0306 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0306 
Symbol 
ID6968335 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp314636 
End bp315868 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content45% 
IMG OID643384371 
Productsite-specific recombinase, phage integrase family 
Protein accessionYP_002268886 
Protein GI209397533 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.644103 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGTATAG GATTGTGTAT ATGTTCCTGT TCGGTCTGGA TTCCTATACA CATGCCTTTA 
AACGATATGC AGATTCGCCG CGCTAAGCCT GAAGCTAAAG CCTATACATT TGGAGATGGG
CTAGGGTTGT CATTACTTAT AGAACCTAAT GGAAGCAAGA GTTGGCGGTT CCGCTATCGC
TATGCCGGCA AACCCAAAAT GATCTCGCTT GGTGTTTACC CAACGATCAC CCTTGCCGAT
GCTCGTTCCC GTCGTGATGA AGCTCGAAAA CTTGTGGCAG AAGGAAAGAA CCCTAGTGAG
GTTCGAAAAG AGCAAAAGCT AGCTATGCAA ACAGAGTCAG AGAACGCCTT CGAAAAGATA
GCCAGAGAGT GGCATCAACT TAAATCTGCT AAATGGTCGG CGGGATATGC ATCAGACATC
ATGGAAGCGT TTAAGAACGA CATTTTTCCT TATGTCGGAA CAAGGCCTGT GGGAGAGATT
AAACCGCTAG AGCTGCTGAA CGTTCTGCGT AAAATTGAGA AACGTGGTGC GTTGGAGAAA
ATGCGCAAAG TGCGGCAGCG TTGCTCCGAA GTGAACCGCC CCGCAATTGC AACGGGTAGG
GCGGAGTACA ATCCTGCGGC TGATCTCTCC AGCGCTCTCG AAGTACACCA ATCCAATCAT
TTCCCATTCC TAAAAGCTGA TGAGATACCT GATTTTCTAC GTGCCTTAGA GGGTTACTCC
GGGAGTAAGC TTGTCCAGAT AGCCACGAAA TTACTGATGA TTACGGGTGT GAGAACCATC
GAATTACGCG CGGCATTATG GCAAGAATTT GATCTGGATA ACGCTATTTG GGAAATTCCT
GCTGAAAGGA TGAAAATGCG TAGGCCACAT CTTGTGCCCT TATCATCTCA AGCGGTAGAT
TTACTCAATG AACTCAAGAT CATGACAGGG AACTATCGTT ATGTTTTTCC AGGGCGGAAC
GATCCGAATA GGCCAATGAG CGAAGCGAGT ATAAATCAAG CCATTAAGCG TATTGGGTAT
GGAGGAAAAG TCACTGGACA TGGTTTTCGT CATACCCTTT CTACAATCCT GCATGAGCAA
GGTTTTGAGA GTGCTTGGAT TGAAATCCAG TTGGCTCATG TAGATAAAAA TTCTATTAGG
GGGACTTATA ACCATGCTCA ATATTTTAGT GGAAGGAAGT CTATGATGGA CTGGTACAGT
AATTTGATAT TTGAAAGACT AAAAAGGAGT TAA
 
Protein sequence
MCIGLCICSC SVWIPIHMPL NDMQIRRAKP EAKAYTFGDG LGLSLLIEPN GSKSWRFRYR 
YAGKPKMISL GVYPTITLAD ARSRRDEARK LVAEGKNPSE VRKEQKLAMQ TESENAFEKI
AREWHQLKSA KWSAGYASDI MEAFKNDIFP YVGTRPVGEI KPLELLNVLR KIEKRGALEK
MRKVRQRCSE VNRPAIATGR AEYNPAADLS SALEVHQSNH FPFLKADEIP DFLRALEGYS
GSKLVQIATK LLMITGVRTI ELRAALWQEF DLDNAIWEIP AERMKMRRPH LVPLSSQAVD
LLNELKIMTG NYRYVFPGRN DPNRPMSEAS INQAIKRIGY GGKVTGHGFR HTLSTILHEQ
GFESAWIEIQ LAHVDKNSIR GTYNHAQYFS GRKSMMDWYS NLIFERLKRS