Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1298 |
Symbol | |
ID | 6968240 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1312903 |
End bp | 1314105 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643385286 |
Product | integrase |
Protein accession | YP_002269781 |
Protein GI | 209398563 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0582] Integrase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.64566 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.14409 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGTAT TGACGGATAC GAAAGCAAGA CATATCAAAC CTGATGACAA ACCATTGCCC CATGGGGGAA TTACCGGACT GACCCTTCAT CCTTCTTCAG TAAAGGGGCG GGGGAAATGG GTTTTTCGTT ATGTAAGTCC GGTGACACAA AAAAGACGTA ATGCTGGATT GGGAACTTAC CCAGAGGTCA GTATTGCTGA AGCTGCACGT ACTGCCCGGA TAATGCGAGA GCAACTTGCT GCAGGTGATG ATCCTCTGGA GATTAAAAAG GCTGAATCTG AGAAAGTCGC TATCCCAACA TTTGCCGATG CAGCCAGGCG TGTACATGCA GAACTGTCTC CTGGATGGGA AAATCCAAAG CATGTAAGGC AGTGGTTATC GACGCTTGAG AATTACGCGT TTCCTCAACT GGGAGCAAAA ACGCTGGATT CGATTACGGC TGCGGACGTG GCAGAAACAC TGCGTCCAGT CTGGTTAACC TTGTCAGAAA CGGCAAGCCG GGTTAAACAG CGCATTCATG TTGTTATGCA GTGGGGATGG GCGCACGGTT TTTGTGTAGC AAATCCTGTT GATGTGGTTG ACCATTTGCT TCCTCAGCAG ACAAGAGGAC GTGATGAACA CCAACCCGCA ATGCCCTGGA GGCAGTTACC GCTTTTTGTG GCGACCAGTG TGTATACCGA TGAACCTTAT AATGTTACCC GCGCACTGTT ATTAATGGTG ATACTTACAG CAACTCGCTC GGGCGAAGCC AGGGGAATGC GCTGGGCTGA AATTGATTTT CATAAGCGGG TATGGACTAT ACCTGCAGAA AGAATGAAAG CCAGGATACA GCATCGTGTT CCTTTATCCC GGCAGGCTAT TTACATTCTG GAAAATATAC GTGGCCTGCA TGATGAACTG GTGTTCCCTT CACCCAGAAA GCAGCAGATC CTTTCCGATA TGGTGTTGAC AAGTTTTCTG CGTAAAAAGA AAGCCGTCAG TGACATTCCG GGGCGAGTTG CCACGGCACA TGGTTTTCGC TCAACATTCA GGGACTGGTG TAGCGAACAG GGGTATTCGC GGGATCTGGC GGAAAGGGCG CTCGCTCATA CGCTGAAAAA TAAGGTTGAG GCGGCATATC ATCGTACTGA TCTACTGGAG CAGCGTGTAC CGATGATGCA GGCATGGGCG GATTATGTGA TGTCTCAAAT TGTGAATAAA TAA
|
Protein sequence | MAVLTDTKAR HIKPDDKPLP HGGITGLTLH PSSVKGRGKW VFRYVSPVTQ KRRNAGLGTY PEVSIAEAAR TARIMREQLA AGDDPLEIKK AESEKVAIPT FADAARRVHA ELSPGWENPK HVRQWLSTLE NYAFPQLGAK TLDSITAADV AETLRPVWLT LSETASRVKQ RIHVVMQWGW AHGFCVANPV DVVDHLLPQQ TRGRDEHQPA MPWRQLPLFV ATSVYTDEPY NVTRALLLMV ILTATRSGEA RGMRWAEIDF HKRVWTIPAE RMKARIQHRV PLSRQAIYIL ENIRGLHDEL VFPSPRKQQI LSDMVLTSFL RKKKAVSDIP GRVATAHGFR STFRDWCSEQ GYSRDLAERA LAHTLKNKVE AAYHRTDLLE QRVPMMQAWA DYVMSQIVNK
|
| |