Gene ECH74115_3582 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3582 
Symbol 
ID6972415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3297224 
End bp3298417 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content47% 
IMG OID643387380 
Productintegrase 
Protein accessionYP_002271839 
Protein GI209397288 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0000000000010448 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGATAAAA TAATTTTACC CACCGGATTT TTACCCATGC TCACCGTTAA GCAGATTGAA 
GCAGCAAAGC CGAAAGAAAA ACCATACCGC CTACTCGATG GTAATGGCCT GTACCTTTAT
GTCCCTGTAT CAGGGAAAAA GGTATGGCAG CTTCGCTACA AGATTGACGG TAAGGAGAAA
ATCCTGACCG TCGGAAAATA TCCGCTTATG ACTTTGCAAG AGGCAAGGGA TAAAGCATGG
ACTGCGAGGA AAGACATCTC GGTTGGCATC GATCCGGTAA AGGCGAAAAA GGCTTCGTCT
AACAACAATT CCTTTAGTGC GATTTACAAG GAATGGTACG AGCATAAGAG GCAAGTCTGG
TCAGCCGCCT ATGCGACTGA ACTTGCAAAA ATGTTTGATG ACGACATTTT ACCTATCATC
GGCGGCCTTG AAATTCAGGA TATTGAGCCG ATGCAACTGC TGGAAGTAAT CCGCAGATTT
GAAGATCGCG GGGCAATGGA GCGAGCCAAC AAAGCACGCA GAAGATGCGG CGAGGTTTTC
CGTTACGCTA TTGTCACCGG AAGGGCTAAA TATAACCCGG CACCTGACCT TGCTGAAGCC
ATGAAGGGAT ACCGCAAGAA GAACTTCCCG TTTTTACCTG CCGACCAGAT CCCGGCATTC
AACAAAGCAC TTGCAACATT TTCAGGAAGT ATCGTATCTC TCATTGCGAC CAAAGTTTTA
CGCTACACAG CACTAAGAAC GAAAGAGCTT CGTTCCATGC AATGGAAGAA CGTCGATTTT
GAAAACAGGA TTATCACCAT CGAGGCCAGT GTGATGAAGG GACGCAAGAT TCATGTGGTT
CCGATGTCGG ACCAGGTTGT TGAACTTCTC ACTACGCTAA GCTCCATCAC TAAACCAGTA
TCAGAGTTTG TTTTTGCCGG GCGCAACGAT AAGAAGAAGT CAATCTGTGA GAACGCTGTA
CTGCTTGTGA TCAAACAAAT CGGCTATGAA GGTCTGGAAA GCGGTCACGG ATTCAGGCAT
GAATTCAGCA CGATTATGAA CGAGCACGAA TGGCCTGCTG ACGCTATTGA AGTGCAACTA
GCACATGCCA ACGGCGGATC TGTGCGTGGG ATTTACAACC ATGCTCAGTA TCTCGATAAG
CGCAGAGAAA TGATGCAGTG GTGGGCGGAC TGGCTTGATG GGAAGGTGGA GTAG
 
Protein sequence
MDKIILPTGF LPMLTVKQIE AAKPKEKPYR LLDGNGLYLY VPVSGKKVWQ LRYKIDGKEK 
ILTVGKYPLM TLQEARDKAW TARKDISVGI DPVKAKKASS NNNSFSAIYK EWYEHKRQVW
SAAYATELAK MFDDDILPII GGLEIQDIEP MQLLEVIRRF EDRGAMERAN KARRRCGEVF
RYAIVTGRAK YNPAPDLAEA MKGYRKKNFP FLPADQIPAF NKALATFSGS IVSLIATKVL
RYTALRTKEL RSMQWKNVDF ENRIITIEAS VMKGRKIHVV PMSDQVVELL TTLSSITKPV
SEFVFAGRND KKKSICENAV LLVIKQIGYE GLESGHGFRH EFSTIMNEHE WPADAIEVQL
AHANGGSVRG IYNHAQYLDK RREMMQWWAD WLDGKVE