Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5789 |
Symbol | |
ID | 6972350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 5422570 |
End bp | 5424240 |
Gene Length | 1671 bp |
Protein Length | 556 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 643389419 |
Product | site-specific recombinase, phage integrase family |
Protein accession | YP_002273811 |
Protein GI | 209400288 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.171322 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 58 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCTCGC TTTCTAACTT TGCTCAGTCA ATCGAGTTGC CTATATCGCA ATTGATTCGA GAGGTGGTTA ATCGTAACCT CCCGGTATTC TGGCTGGCGA CTGGTCAGTT CGGTTTCTAT GTTGATGAAT TTAATGCAGT AGAGCGGGAA CCGGGTGCAA AACGAGAAAA ACAGTCTGAT GATGAAAAGG ATCAACCTAA AGAAGTCATC ATTCTCAATA GCGCGTTTGA GCTGGGTATC GAGAGCTTCG CAAATGGTTA TCTCCGCCCC TTCAATCCCC GGCATACTTT AGATTGTCTG TTGAGCGCTG GAGTATCCGA AGGAGAGGCT GCATTTCGAA CTAGTGGTGA TAACCAAAGT GGAGGTTGGT TCTTCGATTT ACCCGGCGTA GATATAACTG CTGATAGCCT GTTGATTAGC AAAGTTCATG CTGAAGGCCT TCGACTTACA TGGCTGGTTA AGACCACGCC ACCAGCAGTT AGCATTCACC CTGCCGTGCC TCTTGTCGCT CCTGTTATCG CTAATGAATA TGTTCACCGC AAACATTACA ATGAAAACTT GTCATGGCTT CGTGAAGAGT ATTTGAAACA TCGACGTAAG GGCAAGGTAT CAGAAGCGGC GCTCCGCGAT ATTCGCTATT ACTTCGATTT GATGATTGAA GTGATGGGGG ATATTCAGTT GGAAGATTTC GACCGTGATT TCCTCCGGGC TTATGAGAGC AAGTTGCGCA CAATTCCTGC TAACCGTAAT TTGATGAAAG GTAAGCACGG GGTTAAGACG CTGGATGAGT TAATCGCCAA AGCGGCAGAA TGTGGCGATA AACTGATGAC AGAAGAGTCT GTCAAAAAGT ATATCAACGG CCTTTATGGT GCAATGGAGT GGGCTGTTGA TGATGGTAAG TTTCTGAAAT CGCCATGCGA CAACTTTTTC CCTCCCGATG ACAAAGGTGA GCGAGAGCAG GATCACACTG ACATATTTGA ACCGCATGAA ATTAAGGCAA TTTTTTCGCA ACCGTGGTTT GTCGCTGGAA CTGTTGAACG TAATGCGCAA GGGCGATTCC ATCAATATTG CCCGTTTCAC TATTGGGCGC CGTTGTTGGG CTTGATGACG GGGGCAAGGG TTAACGAGAT TGCACAGTTA ATGCTGGACG ATGTTCTGGC AGATGACGGC GTTTATTACC TGAACCTTGA AAGCGATAGC GAAAACGGAA AGAAACTAAA AAACGCCAAT TCCCGCCGCA AGATTCCGGT TCATTCTACG CTGATTGAAC TCGGTTTTAT CGAGTATGTG GATGCGTTGA AAGCTGCCGG GTATGACCGT CTTTTTCCCG AGCTTAAACC ACATAAAACC AAAGGCTATG GTAGGCCGGT TTCCGCATGG TTCAATGAAT CATTGCTTGC GGGTCGATTA AAACTTGAAA GAGACAGAAG CAAATCTTTC CACTCTTTCC GGCATTCTGT TTCAACTTTG CTTAAAGAGA AGGGTGTTAG TTCGGAACTG CGTGGGCAGC TACTTGGGCA TGTGCGAGGC AAAACAGAAA CTGAAGTGCG ATACAGCAAA GATTTAAAAC CGGTTCACAT GGTTGAGGTT GTCGAAAAGA TTGATTTTTC TTTGCCCGAG ATAGCGAGAT TCAACATTCC TGATGGGCTG GATGCTGTAG AATTGATCTG A
|
Protein sequence | MISLSNFAQS IELPISQLIR EVVNRNLPVF WLATGQFGFY VDEFNAVERE PGAKREKQSD DEKDQPKEVI ILNSAFELGI ESFANGYLRP FNPRHTLDCL LSAGVSEGEA AFRTSGDNQS GGWFFDLPGV DITADSLLIS KVHAEGLRLT WLVKTTPPAV SIHPAVPLVA PVIANEYVHR KHYNENLSWL REEYLKHRRK GKVSEAALRD IRYYFDLMIE VMGDIQLEDF DRDFLRAYES KLRTIPANRN LMKGKHGVKT LDELIAKAAE CGDKLMTEES VKKYINGLYG AMEWAVDDGK FLKSPCDNFF PPDDKGEREQ DHTDIFEPHE IKAIFSQPWF VAGTVERNAQ GRFHQYCPFH YWAPLLGLMT GARVNEIAQL MLDDVLADDG VYYLNLESDS ENGKKLKNAN SRRKIPVHST LIELGFIEYV DALKAAGYDR LFPELKPHKT KGYGRPVSAW FNESLLAGRL KLERDRSKSF HSFRHSVSTL LKEKGVSSEL RGQLLGHVRG KTETEVRYSK DLKPVHMVEV VEKIDFSLPE IARFNIPDGL DAVELI
|
| |