Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2649 |
Symbol | |
ID | 6972236 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2499422 |
End bp | 2500423 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 643386512 |
Product | integrase family protein |
Protein accession | YP_002270994 |
Protein GI | 209395701 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.413706 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.000000656425 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCGGTTC GTAAGATTCC ATCAGGTAAA TGGCTTTGTG AATGTTATCC CTACGGGGCA TCGGGAAAAC GCATTCGTAA ACAGTTTGCG ACAAAAAGTG AGGCGCTCTC TTATGAGCGC CGTTTAATGA ATAGTAGAGT TGGAGACGAG TTTCAAGATG GTTCTGGTCC TCGTCTTTCT GAGTTGATTG CTCGTTGGTT TGAGATGTAC GGTAAAACCT TGTCCTCTGG TGCAGAGCGC AAAGTCAAAC TTGAGGCGAT TTGTTCCAGG CTGGGAGATC CATTTGCTTC TCAGTTTGAC AAAAATATGT TTGCTACTTA TCGGGAAAGA AGGCTATCAG GAGAATGGAA TCCCAAGGGG AAGAAAAAAC TTAGTGAAGC AACCGTTAAT CGCGAGCAGT CATATCTACA TGCTGTTTTT GCCGAACTGA AGCGCCTTGG GGAGTGGTCT GGTGAAAACC CCCTGACTGG TATTCGCAAG TTTCGTGAGG AAGAAAAGGA ACTGGCGTTT CTGTATGTAG ATGAGATTGA ACGCCTTCTG ATTGCGTGTG ATGAGTCACG GAATAAAGAT TTGGGGGTTG TTGTCCGTAT TGGGCTTGCG ACTGGTGCTC GGTGGAGTGA AGCAGAAGGA TTAAAGCAAT CTCAAGTACT GCCCGGTCGA ATCACATTTG TTAAAACTAA AGGAAAGAAG AACCGCACTG TACCGATTTC ACCTCAATTG CAAGCTATGC TTCCTAAAAA ACGAGGAGCG CTATTTTCAC CATGTTATGA GGCTTTTGAC GCTGCAATTA AGAGAGCGAA GATCGAGCTT CCTGATGGGC AATTAACTCA TGTGCTACGT CACACGTTTG CCAGTCATTT TATGATGCGG GGTGGAAATA TTCTTGTGTT GCAAAAAATA CTGGGGCATA GCGATATAAA AATGACTATG CGTTATGCGC ATTTTGCTCC AGGTCATTTA GAGGCTGCTG TTGAATTGAA CCCTTTTGAC AATAGAGGGT AA
|
Protein sequence | MSVRKIPSGK WLCECYPYGA SGKRIRKQFA TKSEALSYER RLMNSRVGDE FQDGSGPRLS ELIARWFEMY GKTLSSGAER KVKLEAICSR LGDPFASQFD KNMFATYRER RLSGEWNPKG KKKLSEATVN REQSYLHAVF AELKRLGEWS GENPLTGIRK FREEEKELAF LYVDEIERLL IACDESRNKD LGVVVRIGLA TGARWSEAEG LKQSQVLPGR ITFVKTKGKK NRTVPISPQL QAMLPKKRGA LFSPCYEAFD AAIKRAKIEL PDGQLTHVLR HTFASHFMMR GGNILVLQKI LGHSDIKMTM RYAHFAPGHL EAAVELNPFD NRG
|
| |