Gene Information |
Locus tag | ECH74115_A0037 |
Symbol | |
ID | 6966545 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011351 |
Strand | - |
Start bp | 26460 |
End bp | 29657 |
Gene Length | 3198 bp |
Protein Length | 1065 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643384067 |
Product | relaxase |
Protein accession | YP_002268546 |
Protein GI | 209395673 |
COG category | |
COG ID | |
TIGRFAM ID | |
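The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the reported gene length includes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 26460
end_bp = 29657
reported_gene_length = 3198      # bp
reported_protein_length = 1065   # aa

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS length includes the terminal stop codon,
# which is not translated into an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, gene_length % 3, protein_length)
```

Under these assumptions the computed values agree with the reported 3198 bp and 1065 aa, and the gene length is a whole number of codons.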
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.204416 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGATAATAC GCTATGGCGG TGGCAATCAT GGTATTGCAG AATACCTTGA AAATGGAAGA AAAGCGGAGC GTGAGTATAC CCGTGATGAA CTTGATCATA GATTTATCAT TGACGGCGAT CTGGCTACAA CAAATAAAAT AATTGAATCA ATAGAAGATA AAGGGCAGGA GCGATACCTG CATATCACAT TGTCATTTGC AGAAAATGAA ATTTCAGAAG AAACTCTTAA AAACGTGACT TCTGATTTCA AGAAGATGTT CATGAATGCC TATCATACGG ATGAGTATTC ATTTTATGCA GAAGCCCATC TGCCAAAAAT TAAAAATATT GTTGATAACA AAACAGGTGA ACTTGTGGAG CGTAAACCAC ATATTCACAT TGTCATTCCA AGAACAAATC TTGTTACAGG TAGGTCATTA AATCCACGAG GCGATCTGAC AAATCTTAAA ACTCAGACCC AACTTGATTC TGTTCAGGAA TATCTTAACA ACAAATATAA CCTGATAAGC CCAAAGGATT CCGTTCGTGT AAGTGATGAA AACTATGCAA ATGTGCTGTC ACGAGTAAAA GGCGATTTAT ACCGTGAACG GAACAATGAA GTAAAAAAGG AAGTCTTCGC CAGAGTTACA AATGAAAATA TTAACTCGAC AGATGCATTC AAAAATTTAT TAATGCAATA TGGTGATGTA AAGGTTCGCA ATGAGGGGAA AACTAATCAA TACTTTGCTG TGCGTCTTAA AGATGACAAA AAATATACTA ATTTTAAAAA CCCATTATTT AGTTCATCTT TCATAGAAAA ACGTGAACTA CCGTTAGTAA AACCGACCCC AGCACAGATT AACAATAACG TTAGTACCTG GCTTGATAAA ACCAGTCACG AAATAAAACA TATTTATTCA AAATCGAGTA AAACGCGTGA GCACTATAAG GCGCTTAGTG AGCCAGAAAA GAAAAGTTTT CTAAAAAACA GGATTGATGA CTATGACAAA AAATACAAAC TTAACACAAA AAATTCTGAA AGAACGCAAG GACGGTCTTG TGGTAACAAG TTCAGTATTG AATCAACTTC CAGATTCAGT AGAACTAAAA CAAGAGTCGG CGTGCCACGT TTGCCCCAAC GCGGTTTGGT TTACGGAATC CACGGGAGAG GAAAACCGCC CGAATCTGTC CGTATTTTGC AGGATAATGA ACAGTGTAAT CTGGCAAACA GAATGGAAAA CAGATCGCAT TCTGATTCAG CAATGCGACG GAATGCTGAT AGACGATTCT CAGAACGAGG AATAAAAAAT ATCGGTATAT CAACGCCATT ACATGAAGCA TTATTTCAGA AGTTAAATAA TGATGCAGAA CGTAATGAGT TAACCACTAT GGCAGAAATC AGAAAGAAAA TTGATCCTGA ACGTTTTCTT TCTGCGACTG CCCGTGAATT TAATATTGTA CCTGGAGAAC ATGCAATCAG TAAGGCAAAG GATGGTTCTC CGCGTTTTGC AGTTGGCAAC AGAAATCTTA ATGCGTCTGA TTTTTTGACT AAATATATTA ATTTATCATG GTCTGATGCT AAAAATTTCT TGCTTAAAAC ATATGGGGAG CAATTGACTG AAAAAACCTT TGAGCCAGTA GCAACAACAA GAAAACTAAG TTATGAAGAA TCCAGAGAAA GATATTCTTC CTTTAAAAAA ACGGACATTT CTTTAAGAAG TATTATTCGT GCAGAAAAGA AAAACATGTA TAATGAATTG CGCGAAATGC GGCAACAGAT TTATTCTTTA CCTAAAGAAA ATCGTGATAT AGCTAAAGGT 
ATTCTGGTTT ATAAGAAAAT AACCACTCTT GAGCGACTGG ATGATATGTA TTCTGCGGGA CGTAGTTTTA TCAACCAGTA TCATAAAGAC TGGAATGAGG ATAAAAATGC TATGAAGGCA ATTGATAAAC TCAAAAATTA TCTCAATAAA GAAAATGAAA ACAGCATTTC TGGTGCTGAA ATTGAGCTAT CACTGGAAAA AGCTGTTCAG GTACAAAAGC GATTGCAGGA ACTGCAACGA ACAAATACCA GGTTGAAAGA TCTTGTAATG GATAAACAAG AAACCAAGAT TGTTTATCGT GACCAAAAAA CAGAGTCCCC CGTATTTACT GATAAGGGTG ATTTTGTTGT TGCAGGAAAA AATCCTTCTA AAGAAGAGAT CGGCATCATG CTGGAGTATT CGAAGGAGAA GTTTGGCGGT GTGCTCAAAC TTACCGGAAG CGAAGAGTTT AAGAAAGAAT GTGCTCTTGT AGCCGCAGAA CGGGATATGA ATATTATTCT GCGCCCTGAG AAATACCAAC AGATGATGCT GGAGCATAAA GCGGAGCTTG AGTCTAAAGC AATGCAGCAT ACTGAACAGG AATCCCAGCA ACAGACTCAG GTTGATAATG AAATCAGTAA GGCAGATGTT CATCAGAACC AGGCGCCAGC GGAAGAGGCT AAACAAGAAA TTTATCTGGT CAGCCATACA AACGTACACG AACAGAACGA AGGGATCATT TTCTATTCAA AAGAGGCTGC TTATAGCTAC TTTGAAGAAG ACAAGAAAAT GGCAATCGAG AAATACGAAG CGAACCCACA TTTGGGAAAT GGATTTGAAG GTGCGTTGCT TGTTTCCAAG ACCATAGCGG CCAGCGAGCT GTCCTCTTAT CCAGAAAAAC TGACAGCAGA ATTTGAGAAA CCCTACGAAA TCCTGGCTGA TTCATACAAG GAATACACCC GTACACCTGT ATATGTTGTT TCGTTCCCGA AAGATGAACT CAACCAGCCT GTTAAGAAGT TCGAATCGCT TTCTGAAGCT ATTGAGTATA AAAATAATAC TGCAATGCTT CATGATCTGG ATAAAAGGGA AGTTATTGTT AAATCTGTAA CCAGAGAAGA ATTAGGGATG ATGGGAGAGC AACTTGCCGT TAAAGAAGCG GAACCTGTTC CACGACATGA ACTTGAAAAA GCACAAGGCC GTGATGTTAG TGAGCAGGAT GGTTTGATAC TGGATGCCAT TGATCGTTAT CAATCTAAAT TTGAAGCCGA GGGATTGGCC TTTAACCGTC AGGAGACTGA AGCCGAACTG TTACATCACG ATTTTACTCG CGAAGTTGCA GAAGATCGTC TTGAACAACA GTTCGTACAG GAAAAAGCAG AACAGCAAAA TCAGCAGCAA TCTGAACAGG AACGATAA
|
Protein sequence | MIIRYGGGNH GIAEYLENGR KAEREYTRDE LDHRFIIDGD LATTNKIIES IEDKGQERYL HITLSFAENE ISEETLKNVT SDFKKMFMNA YHTDEYSFYA EAHLPKIKNI VDNKTGELVE RKPHIHIVIP RTNLVTGRSL NPRGDLTNLK TQTQLDSVQE YLNNKYNLIS PKDSVRVSDE NYANVLSRVK GDLYRERNNE VKKEVFARVT NENINSTDAF KNLLMQYGDV KVRNEGKTNQ YFAVRLKDDK KYTNFKNPLF SSSFIEKREL PLVKPTPAQI NNNVSTWLDK TSHEIKHIYS KSSKTREHYK ALSEPEKKSF LKNRIDDYDK KYKLNTKNSE RTQGRSCGNK FSIESTSRFS RTKTRVGVPR LPQRGLVYGI HGRGKPPESV RILQDNEQCN LANRMENRSH SDSAMRRNAD RRFSERGIKN IGISTPLHEA LFQKLNNDAE RNELTTMAEI RKKIDPERFL SATAREFNIV PGEHAISKAK DGSPRFAVGN RNLNASDFLT KYINLSWSDA KNFLLKTYGE QLTEKTFEPV ATTRKLSYEE SRERYSSFKK TDISLRSIIR AEKKNMYNEL REMRQQIYSL PKENRDIAKG ILVYKKITTL ERLDDMYSAG RSFINQYHKD WNEDKNAMKA IDKLKNYLNK ENENSISGAE IELSLEKAVQ VQKRLQELQR TNTRLKDLVM DKQETKIVYR DQKTESPVFT DKGDFVVAGK NPSKEEIGIM LEYSKEKFGG VLKLTGSEEF KKECALVAAE RDMNIILRPE KYQQMMLEHK AELESKAMQH TEQESQQQTQ VDNEISKADV HQNQAPAEEA KQEIYLVSHT NVHEQNEGII FYSKEAAYSY FEEDKKMAIE KYEANPHLGN GFEGALLVSK TIAASELSSY PEKLTAEFEK PYEILADSYK EYTRTPVYVV SFPKDELNQP VKKFESLSEA IEYKNNTAML HDLDKREVIV KSVTREELGM MGEQLAVKEA EPVPRHELEK AQGRDVSEQD GLILDAIDRY QSKFEAEGLA FNRQETEAEL LHHDFTREVA EDRLEQQFVQ EKAEQQNQQQ SEQER
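The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first and last codons. This is a minimal sketch, not the database's own pipeline: it assumes the gene sequence is shown in the coding (5' to 3') orientation, and it uses the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in which codons may act as start codons). The `translate` helper is hypothetical.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest),
# paired with the amino acid assignments used by translation table 11.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"   # first base T
    "LLLLPPPPHHQQRRRR"   # first base C
    "IIIMTTTTNNKKSSRR"   # first base A
    "VVVVAAAADDEEGGGG"   # first base G
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence, copied from the record.
n_terminus = translate("ATGATAATAC GCTATGGCGG TGGCAATCAT")
c_terminus = translate("TCTGAACAGG AACGATAA")
print(n_terminus, c_terminus)
```

The two fragments translate to MIIRYGGGNH and SEQER, matching the first ten and last five residues of the protein sequence; the final codon, TAA, is a stop codon and contributes no residue.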