Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1571 |
Symbol | |
ID | 6968162 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1535679 |
End bp | 1536908 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 643385536 |
Product | site-specific recombinase, phage integrase family |
Protein accession | YP_002270030 |
Protein GI | 209397964 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0359454 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.000000698249 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGCAGAG CACTTAACAA ACTGAGCGAT ACACAGCTGA GGAAAATCAA CGGCACACCC GCCCAAAAAA CAGCCTTTCT TAATGACGGT GGAAACCTGA GCGTCAGGCA TTCAACCAGT GGCCTTTTAA CCTGGTATTT CACTTACAGG GCCGGAACGG GAAGGGGGGC ACCACCGGAA CGCATTAAGC TGGGAAATTA TCCTGATCTG AGCCTGAAAT CAGCCAGGGA AAAAGCCGCC CAGTGTCGCG CATGGCTGGC AGAGGGGAAA AATCCACGTC ATGAGCTTAA TTACACCGTA CAGGAAGCGT TAAAGCCGGT AACGGTTGGC GATGCGCTCA CCTACTGGCT TGAGTCGTAC GCAAAGGAAA ACCGCGTGGA TTATGCCGCC CTGAAAAAGC GCCTTAATAA TCACGTAATA CAGCACATTG GTGCTATGCC GCTGGATAAA TGCGAGCTAC GGCACTGGCT GGCCTGTTTT GACCAGGTGG CAAAGCGAAC GCCTGTTACT GCCGGATTCT TGCTACAGAC GTGCAAACAG GCGCTTAAGT TCTGCCGGAG GCGGCGCTAT GCAATCAGCA ACGTTCTTGA TGATATGAGT GTGGCGGACG TTGGGAAAAA ACCGGATATA AGCGAGCGTG TCTTAAGCAC CAAAGAACTG GGCGAATTAT TGCAGGCACT GGACAAAAAA ATATTCTCCC CCTACTACAT CGCGTTAATC CGCCTCCTGA TTGTGTTCGG ATGCCGGACG GTAGAACTGA GGTTATCGGA GATCAGCGAG TGGGATTTTA CCGAAATGCT CTGGACCGTT CCGAAGGAGC ACAGCAAAAC GAAGGTAGCC ATATTCCGGC CTATACCGGA AGCAATACTG CCGTTCATCA CGCAGCTGGT GGAGCAGAAC AGGCACACGG GCTTATTGCT GGGGGAAGTG AAACAGGAGG CCAGCGTATC GCAGTATGGA AGATTAGCGC ACAGGAGGCT TAATCACCCT CACTGGTCAC TGCATGACAT CCGGCGCACC TTTACAACCA TGCTGAACGA TTTAGGCGTT GATCCGCATG TCGTGGAGCA GCTTACAGGC CACCAGATGC CAGGAATGCA GCGAGTTTAT AATCATTCCC GTTATCTGGA TGCGAAACGC AATGCGCTGG ATATGTGGAC GGAGCGGTTA GGGATACTGG CGGGAACACA TGAAAACGTA ACCACGCTAC CAGTAGCCAG AAGAAAATAA
|
Protein sequence | MSRALNKLSD TQLRKINGTP AQKTAFLNDG GNLSVRHSTS GLLTWYFTYR AGTGRGAPPE RIKLGNYPDL SLKSAREKAA QCRAWLAEGK NPRHELNYTV QEALKPVTVG DALTYWLESY AKENRVDYAA LKKRLNNHVI QHIGAMPLDK CELRHWLACF DQVAKRTPVT AGFLLQTCKQ ALKFCRRRRY AISNVLDDMS VADVGKKPDI SERVLSTKEL GELLQALDKK IFSPYYIALI RLLIVFGCRT VELRLSEISE WDFTEMLWTV PKEHSKTKVA IFRPIPEAIL PFITQLVEQN RHTGLLLGEV KQEASVSQYG RLAHRRLNHP HWSLHDIRRT FTTMLNDLGV DPHVVEQLTG HQMPGMQRVY NHSRYLDAKR NALDMWTERL GILAGTHENV TTLPVARRK
|
| |