Gene Information |
Locus tag | ECH74115_3582 |
Symbol | |
ID | 6972415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3297224 |
End bp | 3298417 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 643387380 |
Product | integrase |
Protein accession | YP_002271839 |
Protein GI | 209397288 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0582] Integrase |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0000000000010448 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGATAAAA TAATTTTACC CACCGGATTT TTACCCATGC TCACCGTTAA GCAGATTGAA GCAGCAAAGC CGAAAGAAAA ACCATACCGC CTACTCGATG GTAATGGCCT GTACCTTTAT GTCCCTGTAT CAGGGAAAAA GGTATGGCAG CTTCGCTACA AGATTGACGG TAAGGAGAAA ATCCTGACCG TCGGAAAATA TCCGCTTATG ACTTTGCAAG AGGCAAGGGA TAAAGCATGG ACTGCGAGGA AAGACATCTC GGTTGGCATC GATCCGGTAA AGGCGAAAAA GGCTTCGTCT AACAACAATT CCTTTAGTGC GATTTACAAG GAATGGTACG AGCATAAGAG GCAAGTCTGG TCAGCCGCCT ATGCGACTGA ACTTGCAAAA ATGTTTGATG ACGACATTTT ACCTATCATC GGCGGCCTTG AAATTCAGGA TATTGAGCCG ATGCAACTGC TGGAAGTAAT CCGCAGATTT GAAGATCGCG GGGCAATGGA GCGAGCCAAC AAAGCACGCA GAAGATGCGG CGAGGTTTTC CGTTACGCTA TTGTCACCGG AAGGGCTAAA TATAACCCGG CACCTGACCT TGCTGAAGCC ATGAAGGGAT ACCGCAAGAA GAACTTCCCG TTTTTACCTG CCGACCAGAT CCCGGCATTC AACAAAGCAC TTGCAACATT TTCAGGAAGT ATCGTATCTC TCATTGCGAC CAAAGTTTTA CGCTACACAG CACTAAGAAC GAAAGAGCTT CGTTCCATGC AATGGAAGAA CGTCGATTTT GAAAACAGGA TTATCACCAT CGAGGCCAGT GTGATGAAGG GACGCAAGAT TCATGTGGTT CCGATGTCGG ACCAGGTTGT TGAACTTCTC ACTACGCTAA GCTCCATCAC TAAACCAGTA TCAGAGTTTG TTTTTGCCGG GCGCAACGAT AAGAAGAAGT CAATCTGTGA GAACGCTGTA CTGCTTGTGA TCAAACAAAT CGGCTATGAA GGTCTGGAAA GCGGTCACGG ATTCAGGCAT GAATTCAGCA CGATTATGAA CGAGCACGAA TGGCCTGCTG ACGCTATTGA AGTGCAACTA GCACATGCCA ACGGCGGATC TGTGCGTGGG ATTTACAACC ATGCTCAGTA TCTCGATAAG CGCAGAGAAA TGATGCAGTG GTGGGCGGAC TGGCTTGATG GGAAGGTGGA GTAG
|
Protein sequence | MDKIILPTGF LPMLTVKQIE AAKPKEKPYR LLDGNGLYLY VPVSGKKVWQ LRYKIDGKEK ILTVGKYPLM TLQEARDKAW TARKDISVGI DPVKAKKASS NNNSFSAIYK EWYEHKRQVW SAAYATELAK MFDDDILPII GGLEIQDIEP MQLLEVIRRF EDRGAMERAN KARRRCGEVF RYAIVTGRAK YNPAPDLAEA MKGYRKKNFP FLPADQIPAF NKALATFSGS IVSLIATKVL RYTALRTKEL RSMQWKNVDF ENRIITIEAS VMKGRKIHVV PMSDQVVELL TTLSSITKPV SEFVFAGRND KKKSICENAV LLVIKQIGYE GLESGHGFRH EFSTIMNEHE WPADAIEVQL AHANGGSVRG IYNHAQYLDK RREMMQWWAD WLDGKVE
|
| |
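The length fields in the Gene Information table can be cross-checked against each other. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the stop codon (the listed gene sequence ends in TAG). The helper names `span_length` and `expected_protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the record above.
start_bp = 3297224
end_bp = 3298417
gene_length_bp = 1194
protein_length_aa = 397


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Assumption: the record's coordinates are 1-based and inclusive.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Under these assumptions both checks agree with the record: 3298417 - 3297224 + 1 = 1194 bp, and 1194 / 3 = 398 codons, which is 397 residues plus one stop codon.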