Gene ECH74115_A0037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_A0037 
Symbol 
ID6966545 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011351 
Strand
Start bp26460 
End bp29657 
Gene Length3198 bp 
Protein Length1065 aa 
Translation table11 
GC content38% 
IMG OID643384067 
Productrelaxase 
Protein accessionYP_002268546 
Protein GI209395673 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.204416 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATAATAC GCTATGGCGG TGGCAATCAT GGTATTGCAG AATACCTTGA AAATGGAAGA 
AAAGCGGAGC GTGAGTATAC CCGTGATGAA CTTGATCATA GATTTATCAT TGACGGCGAT
CTGGCTACAA CAAATAAAAT AATTGAATCA ATAGAAGATA AAGGGCAGGA GCGATACCTG
CATATCACAT TGTCATTTGC AGAAAATGAA ATTTCAGAAG AAACTCTTAA AAACGTGACT
TCTGATTTCA AGAAGATGTT CATGAATGCC TATCATACGG ATGAGTATTC ATTTTATGCA
GAAGCCCATC TGCCAAAAAT TAAAAATATT GTTGATAACA AAACAGGTGA ACTTGTGGAG
CGTAAACCAC ATATTCACAT TGTCATTCCA AGAACAAATC TTGTTACAGG TAGGTCATTA
AATCCACGAG GCGATCTGAC AAATCTTAAA ACTCAGACCC AACTTGATTC TGTTCAGGAA
TATCTTAACA ACAAATATAA CCTGATAAGC CCAAAGGATT CCGTTCGTGT AAGTGATGAA
AACTATGCAA ATGTGCTGTC ACGAGTAAAA GGCGATTTAT ACCGTGAACG GAACAATGAA
GTAAAAAAGG AAGTCTTCGC CAGAGTTACA AATGAAAATA TTAACTCGAC AGATGCATTC
AAAAATTTAT TAATGCAATA TGGTGATGTA AAGGTTCGCA ATGAGGGGAA AACTAATCAA
TACTTTGCTG TGCGTCTTAA AGATGACAAA AAATATACTA ATTTTAAAAA CCCATTATTT
AGTTCATCTT TCATAGAAAA ACGTGAACTA CCGTTAGTAA AACCGACCCC AGCACAGATT
AACAATAACG TTAGTACCTG GCTTGATAAA ACCAGTCACG AAATAAAACA TATTTATTCA
AAATCGAGTA AAACGCGTGA GCACTATAAG GCGCTTAGTG AGCCAGAAAA GAAAAGTTTT
CTAAAAAACA GGATTGATGA CTATGACAAA AAATACAAAC TTAACACAAA AAATTCTGAA
AGAACGCAAG GACGGTCTTG TGGTAACAAG TTCAGTATTG AATCAACTTC CAGATTCAGT
AGAACTAAAA CAAGAGTCGG CGTGCCACGT TTGCCCCAAC GCGGTTTGGT TTACGGAATC
CACGGGAGAG GAAAACCGCC CGAATCTGTC CGTATTTTGC AGGATAATGA ACAGTGTAAT
CTGGCAAACA GAATGGAAAA CAGATCGCAT TCTGATTCAG CAATGCGACG GAATGCTGAT
AGACGATTCT CAGAACGAGG AATAAAAAAT ATCGGTATAT CAACGCCATT ACATGAAGCA
TTATTTCAGA AGTTAAATAA TGATGCAGAA CGTAATGAGT TAACCACTAT GGCAGAAATC
AGAAAGAAAA TTGATCCTGA ACGTTTTCTT TCTGCGACTG CCCGTGAATT TAATATTGTA
CCTGGAGAAC ATGCAATCAG TAAGGCAAAG GATGGTTCTC CGCGTTTTGC AGTTGGCAAC
AGAAATCTTA ATGCGTCTGA TTTTTTGACT AAATATATTA ATTTATCATG GTCTGATGCT
AAAAATTTCT TGCTTAAAAC ATATGGGGAG CAATTGACTG AAAAAACCTT TGAGCCAGTA
GCAACAACAA GAAAACTAAG TTATGAAGAA TCCAGAGAAA GATATTCTTC CTTTAAAAAA
ACGGACATTT CTTTAAGAAG TATTATTCGT GCAGAAAAGA AAAACATGTA TAATGAATTG
CGCGAAATGC GGCAACAGAT TTATTCTTTA CCTAAAGAAA ATCGTGATAT AGCTAAAGGT
ATTCTGGTTT ATAAGAAAAT AACCACTCTT GAGCGACTGG ATGATATGTA TTCTGCGGGA
CGTAGTTTTA TCAACCAGTA TCATAAAGAC TGGAATGAGG ATAAAAATGC TATGAAGGCA
ATTGATAAAC TCAAAAATTA TCTCAATAAA GAAAATGAAA ACAGCATTTC TGGTGCTGAA
ATTGAGCTAT CACTGGAAAA AGCTGTTCAG GTACAAAAGC GATTGCAGGA ACTGCAACGA
ACAAATACCA GGTTGAAAGA TCTTGTAATG GATAAACAAG AAACCAAGAT TGTTTATCGT
GACCAAAAAA CAGAGTCCCC CGTATTTACT GATAAGGGTG ATTTTGTTGT TGCAGGAAAA
AATCCTTCTA AAGAAGAGAT CGGCATCATG CTGGAGTATT CGAAGGAGAA GTTTGGCGGT
GTGCTCAAAC TTACCGGAAG CGAAGAGTTT AAGAAAGAAT GTGCTCTTGT AGCCGCAGAA
CGGGATATGA ATATTATTCT GCGCCCTGAG AAATACCAAC AGATGATGCT GGAGCATAAA
GCGGAGCTTG AGTCTAAAGC AATGCAGCAT ACTGAACAGG AATCCCAGCA ACAGACTCAG
GTTGATAATG AAATCAGTAA GGCAGATGTT CATCAGAACC AGGCGCCAGC GGAAGAGGCT
AAACAAGAAA TTTATCTGGT CAGCCATACA AACGTACACG AACAGAACGA AGGGATCATT
TTCTATTCAA AAGAGGCTGC TTATAGCTAC TTTGAAGAAG ACAAGAAAAT GGCAATCGAG
AAATACGAAG CGAACCCACA TTTGGGAAAT GGATTTGAAG GTGCGTTGCT TGTTTCCAAG
ACCATAGCGG CCAGCGAGCT GTCCTCTTAT CCAGAAAAAC TGACAGCAGA ATTTGAGAAA
CCCTACGAAA TCCTGGCTGA TTCATACAAG GAATACACCC GTACACCTGT ATATGTTGTT
TCGTTCCCGA AAGATGAACT CAACCAGCCT GTTAAGAAGT TCGAATCGCT TTCTGAAGCT
ATTGAGTATA AAAATAATAC TGCAATGCTT CATGATCTGG ATAAAAGGGA AGTTATTGTT
AAATCTGTAA CCAGAGAAGA ATTAGGGATG ATGGGAGAGC AACTTGCCGT TAAAGAAGCG
GAACCTGTTC CACGACATGA ACTTGAAAAA GCACAAGGCC GTGATGTTAG TGAGCAGGAT
GGTTTGATAC TGGATGCCAT TGATCGTTAT CAATCTAAAT TTGAAGCCGA GGGATTGGCC
TTTAACCGTC AGGAGACTGA AGCCGAACTG TTACATCACG ATTTTACTCG CGAAGTTGCA
GAAGATCGTC TTGAACAACA GTTCGTACAG GAAAAAGCAG AACAGCAAAA TCAGCAGCAA
TCTGAACAGG AACGATAA
 
Protein sequence
MIIRYGGGNH GIAEYLENGR KAEREYTRDE LDHRFIIDGD LATTNKIIES IEDKGQERYL 
HITLSFAENE ISEETLKNVT SDFKKMFMNA YHTDEYSFYA EAHLPKIKNI VDNKTGELVE
RKPHIHIVIP RTNLVTGRSL NPRGDLTNLK TQTQLDSVQE YLNNKYNLIS PKDSVRVSDE
NYANVLSRVK GDLYRERNNE VKKEVFARVT NENINSTDAF KNLLMQYGDV KVRNEGKTNQ
YFAVRLKDDK KYTNFKNPLF SSSFIEKREL PLVKPTPAQI NNNVSTWLDK TSHEIKHIYS
KSSKTREHYK ALSEPEKKSF LKNRIDDYDK KYKLNTKNSE RTQGRSCGNK FSIESTSRFS
RTKTRVGVPR LPQRGLVYGI HGRGKPPESV RILQDNEQCN LANRMENRSH SDSAMRRNAD
RRFSERGIKN IGISTPLHEA LFQKLNNDAE RNELTTMAEI RKKIDPERFL SATAREFNIV
PGEHAISKAK DGSPRFAVGN RNLNASDFLT KYINLSWSDA KNFLLKTYGE QLTEKTFEPV
ATTRKLSYEE SRERYSSFKK TDISLRSIIR AEKKNMYNEL REMRQQIYSL PKENRDIAKG
ILVYKKITTL ERLDDMYSAG RSFINQYHKD WNEDKNAMKA IDKLKNYLNK ENENSISGAE
IELSLEKAVQ VQKRLQELQR TNTRLKDLVM DKQETKIVYR DQKTESPVFT DKGDFVVAGK
NPSKEEIGIM LEYSKEKFGG VLKLTGSEEF KKECALVAAE RDMNIILRPE KYQQMMLEHK
AELESKAMQH TEQESQQQTQ VDNEISKADV HQNQAPAEEA KQEIYLVSHT NVHEQNEGII
FYSKEAAYSY FEEDKKMAIE KYEANPHLGN GFEGALLVSK TIAASELSSY PEKLTAEFEK
PYEILADSYK EYTRTPVYVV SFPKDELNQP VKKFESLSEA IEYKNNTAML HDLDKREVIV
KSVTREELGM MGEQLAVKEA EPVPRHELEK AQGRDVSEQD GLILDAIDRY QSKFEAEGLA
FNRQETEAEL LHHDFTREVA EDRLEQQFVQ EKAEQQNQQQ SEQER