Gene ECH74115_0736 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0736 
Symbol 
ID6970630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp762050 
End bp763477 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content47% 
IMG OID643384768 
ProductDnaJ domain protein 
Protein accessionYP_002269281 
Protein GI209396284 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.726449 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAATT GCTGGAAGAT CCTCAACATA GAGGAAACGA CTGATGTCGA TATTATCCGC 
CGCGCTTATC TGGCGCTGTT ACCGTCCTTT CATCCAGAAA CCGATCCGCA GGGTTTTAAA
CAACTTCGTC AGGCGTATGA GGAAGCGCTC CGGATTGCGC AGTCGCCTGC TAAATCTGTT
TGGCAACCAG AAGAATATGA GGTAGCAGAA CATGAAATTC TGCTCGCCTT TCGTGCGTTA
CTTGCCTCTG ATAGTGAACG TTTTCTGCCC TCCGCCTGGC AGCGATTCAT TCAGCAATTA
AATTATTGCT CGATGAATGA GATTGATGAA TTACGCTGGT CGCTGTGCAC AATAGCCATG
AACACTGCCC ATTTATCCTT CGAGTGCGTG GTGTTATTAG CAGAAAGATT GCGGTGGTTG
CAGGAGGAAA ACGTCGGGGA AATAGACGAA GAAGAACTGG AATCCTTTTT ATATGCCATT
GCGAAGGGAA ATGTTTTTAA CTTCCAGATC ATTCTGCATC TGCCCGTTGC CGTGCAAAAT
GACACCATCG ATTTTTACCA AATGTTCGCC CGGATTTGGT CATCGCATCC AGAATGGCTG
ACATTGTATT TAGCGCAACA TCGCGCAGTG ATTATCCCCG ATGATGCAAA ACTGCACAGA
AATTTACTCC GCTGGTATAG TGCAGGTCGC CTGGGAATCC CCGAACTCCT GGATTACGCC
CGGTCGTGGC GGGAAGCTGA ACCTGATAAT GAAGATGCGC GTTATTATGA ATACGCGCAA
CGCGTCTATT GTGGAGAAGG CGAAAGCCTG CTGGCAGAAC TTTGTGACTA CTGGCACGAG
TATCCCTCCA CTCAGGCGGA TGCTTTAATG TTGCAATGGT GCCGTCAGCA TCGAGTCGAT
TATTACCCAT TAGTGGTGAT GATGATTGAA GCTCGTGAAC TGGTTAACGA CCAGGGAAAA
CCGCTACTTT ATGTCCCCGG CGACAGCGCC CGTACGCGTT TTCATTTATA CGAAATACTC
AGCGATGAAA AACTCTCTGC TCTGGGGCGT TCACTGGTCG AGATGGTTTT GCACAAAGGA
TGTAAGCCGC GGATCTCACT CACGCGTGAT ACAGAACATA CCTTATGGCC ATTATATCTA
GTTGCCAAAC AATTAGTGCA GGCCAGCCAA CCTACAGAAG AATCATTAAT GCCGATCGTA
AGCCGTCTTG ATGCAGAAAA TCGTTGTCCA CTGGAAGCAT TAATTATTCG TCGATTATTA
ATTCAGGCGG CGAATTTTAC CGAGAAGCAA ACTGTTGAAC CGGAGCCGCA ACCGCAGCCA
ATGCCCGTTG ACGATGGTGG GCCAGGCTGT CTGGGCATCA TTAAAATTAT TTTCTATATT
TTTATCTTTG CTGGTTTGAT AGGGAAAATA CTCCATCTGT TCGGGTGA
 
Protein sequence
MKNCWKILNI EETTDVDIIR RAYLALLPSF HPETDPQGFK QLRQAYEEAL RIAQSPAKSV 
WQPEEYEVAE HEILLAFRAL LASDSERFLP SAWQRFIQQL NYCSMNEIDE LRWSLCTIAM
NTAHLSFECV VLLAERLRWL QEENVGEIDE EELESFLYAI AKGNVFNFQI ILHLPVAVQN
DTIDFYQMFA RIWSSHPEWL TLYLAQHRAV IIPDDAKLHR NLLRWYSAGR LGIPELLDYA
RSWREAEPDN EDARYYEYAQ RVYCGEGESL LAELCDYWHE YPSTQADALM LQWCRQHRVD
YYPLVVMMIE ARELVNDQGK PLLYVPGDSA RTRFHLYEIL SDEKLSALGR SLVEMVLHKG
CKPRISLTRD TEHTLWPLYL VAKQLVQASQ PTEESLMPIV SRLDAENRCP LEALIIRRLL
IQAANFTEKQ TVEPEPQPQP MPVDDGGPGC LGIIKIIFYI FIFAGLIGKI LHLFG