Gene ECH74115_2016 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2016 
SymbolhrpA 
ID6971329 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1913481 
End bp1917383 
Gene Length3903 bp 
Protein Length1300 aa 
Translation table11 
GC content54% 
IMG OID643385933 
ProductATP-dependent RNA helicase HrpA 
Protein accessionYP_002270422 
Protein GI209400718 
COG category[L] Replication, recombination and repair 
COG ID[COG1643] HrpA-like helicases 
TIGRFAM ID[TIGR01967] ATP-dependent helicase HrpA 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.0120087 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGAAC AACAAAAATT GACTTTTACG GCCTTGCAGC AGCGGCTGGA TTCGCTGATG 
CTGCGTGACA GACTGCGTTT TTCTCGCCGT CTGCACGGCG TGAAGAAGGT TAAAAATCCT
GATGCACAAC AGGCCATTTT CCAGGAGATG GCGAAAGAGA TTGACCAGGC GGCAGGGAAA
GTCCTGCTGC GTGAAGCGGC ACGACCGGAA ATTACTTATC CTGACAATTT GCCGGTTAGT
CAGAAAAAAC AGGACATTCT CGAAGCGATT CGTGATCACC AGGTGGTGAT AGTCGCCGGG
GAAACGGGTT CTGGTAAAAC GACTCAGTTA CCGAAAATCT GTATGGAGCT GGGGCGCGGG
ATTAAAGGGC TGATCGGCCA TACCCAGCCG CGTCGCCTGG CGGCAAGAAC GGTAGCGAAC
CGTATTGCGG AAGAGCTGAA AACGGAGCCG GGCGGTTGCA TCGGTTACAA AGTGCGTTTC
AGCGATCACG TAAGTGATAA CACGATGGTC AAGCTAATGA CCGACGGTAT CCTGCTGGCG
GAGATCCAGC AAGACCGCCT GCTGATGCAG TACGACACTA TCATTATTGA CGAAGCGCAC
GAACGTAGCC TGAATATCGA TTTTCTGCTC GGTTATTTGA AAGAGTTGCT GCCGCGGCGT
CCTGACCTAA AAATCATTAT CACTTCCGCG ACTATCGACC CGGAACGCTT TTCGCGCCAC
TTTAATAATG CGCCGATTAT TGAAGTCTCC GGTCGGACCT ATCCGGTGGA AGTGCGCTAT
CGCCCGATTG TTGAAGAAGC CGACGACACT GAGCGCGACC AGTTGCAGGC AATTTTCGAT
GCCGTGGACG AACTAAGCCA GGAAAGCCCC GGCGACATTC TGATCTTTAT GAGCGGCGAG
CGGGAAATTC GCGATACCGC CGATGCGCTG AACAAGCTGA ACTTACGCCA TACCGAAATC
TTGCCGCTTT ATGCGCGGCT TTCGAATAGC GAACAGAACC GCGTGTTCCA GTCGCACAGC
GGGCGACGCA TTGTGCTGGC GACCAACGTC GCGGAAACCT CGCTGACTGT ACCGGGGATT
AAATACGTTA TCGACCCCGG TACGGCGCGT ATCAGCCGCT ACAGCTATCG CACCAAAGTG
CAGCGTTTGC CGATTGAGCC CATTTCGCAG GCGTCTGCCA ATCAGCGTAA AGGCCGCTGT
GGTCGTGTGT CCGAAGGGAT CTGTATTCGT CTCTATTCCG AAGACGATTT CCTCTCGCGC
CCGGAGTTTA CCGATCCGGA GATTCTGCGT ACCAACCTGG CCTCGGTTAT TTTGCAGATG
ACCGCGCTGG GGCTAGGCGA TATCGCTGCG TTCCCGTTTG TCGAAGCACC GGATAAACGC
AATATCCAGG ATGGCGTGCG TCTGCTCGAA GAACTGGGCG CGATCACCAC GGATGAACAG
GCCAGCGCCT ATAAACTGAC GCCGCTCGGT CGCCAGCTCT CGCAGTTGCC TGTCGACCCA
CGTCTGGCGC GTATGGTGCT GGAAGCGCAA AAACATGGCT GCGTGCGTGA GGCGATGATT
ATCACGTCTG CGCTCTCCAT TCAGGATCCG CGCGAACGTC CGATGGACAA ACAGCAGGCA
TCGGACGAAA AACATCGTCG CTTCCACGAC AAAGAGTCTG ACTTCCTCGC GTTTGTGAAT
CTGTGGAATT ATCTTGGCGA GCAGCAAAAG GCGCTTTCTT CCAACGCCTT CCGTCGCCTG
TGTCGTACCG ATTATCTCAA CTATCTGCGT GTGCGCGAAT GGCAGGATAT CTACACCCAG
TTGCGCCAGG TGGTGAAAGA ACTCGGCATT CCGGTGAACA GCGAACCGGC GGAGTATCGT
GAAATTCACA TTGCGTTGCT GACGGGGTTG CTTTCCCATA TCGGCATGAA AGATGCCGAT
AAACAAGAAT ATACCGGCGC ACGTAACGCG CGTTTCTCCA TCTTCCCCGG TTCTGGTTTA
TTCAAAAAAC CGCCGAAATG GGTAATGGTG GCGGAACTGG TAGAAACCAG CCGCCTGTGG
GGGCGCATTG CTGCGCGTAT CGACCCGGAA TGGGTGGAGC CTGTCGCTCA GCATTTGATT
AAACGTACCT ACAGCGAACC GCACTGGGAA CGGGCGCAGG GCGCGGTGAT GGCAACGGAA
AAAGTCACTG TTTATGGTTT GCCGATTGTT GCCGCGCGCA AGGTCAACTA CAGCCAGATC
GATCCGGCGT TATGTCGTGA ACTCTTTATT CGCCACGCGC TGGTGGAAGG TGACTGGCAG
ACGCGTCACG CATTCTTCCG TGAAAACCTG AAACTGCGGG CCGAAGTGGA AGAGCTTGAA
CACAAATCAC GTCGCCGCGA TATTCTGGTT GATGACGAAA CGCTGTTTGA GTTCTACGAC
CAGCGCATCG GCCACGATGT AATCTCCGCT CGTCACTTCG ACAGCTGGTG GAAAAAAGTC
AGCCGCGAAA CGCCTGATTT GCTCAACTTT GAAAAGAGCA TGTTGATCAA AGAAGGGGCG
GAAAAGATCA GCAAGCTGGA TTACCCGAAC TTCTGGCATC AGGGCAATCT CAAGCTGCGT
TTGAGCTATC AGTTTGAGCC CGGCGCGGAT GCTGACGGTG TGACCGTACA TATTCCGCTG
CCGTTACTTA ACCAGGTTGA GGAAAGCGGG TTTGAATGGC AGATCCCTGG TCTGCGCCGC
GAACTGGTGA TTGCTCTGAT TAAATCGTTG CCGAAACCGG TACGCCGTAA TTTTGTACCT
GCGCCAAACT ATGCCGAAGC GTTTTTAGGC CGCGTCACAC CGCTGGAGTT ACCGTTGCTC
GACAGCCTTG AGCGCGAGTT ACGGCGGATG ACCGGCGTTA CCGTTGACCG CGAAGACTGG
CACTGGGATC AGGTGCCCGA TCACCTGAAA ATCACCTTCC GCGTGGTGGA TGACAAAAAC
AAGAAGCTAA AAGAAGGGCG CTCGCTACAA GATCTGAAAG ATGCGCTGAA AGGCAAAGTG
CAGGAAACGC TATCTGCGGT GGCGGATGAC GGTATCGAGC AGAGCGGCTT ACATATCTGG
AGTTTTGGTC AGTTGCCGGA AAGCTACGAA CAGAAGCGTG GTAACTACAA AGTGAAGGCG
TGGCCGGCGC TGGTGGATGA GCGCGACAGT GTGGCGATCA AACTGTTTGA TAACCCGCTG
GAGCAAAAAC AGGCAATGTG GAACGGTCTT CGCCGTCTAC TGCTGCTGAA TATTCCCTCA
CCAATCAAGT ATTTACATGA GAAGTTACCA AACAAAGCCA AGCTGGGGCT GTACTTTAAC
CCGTATGGCA AAGTGCTGGA GCTGATCGAC GACTGTATCT CATGCGGTGT GGATCAACTG
ATCGACGCCA ATGGAGGCCC GGTCTGGACG GAAGAAGGCT TTGCTGCGCT GCATGAAAAA
GTGCGTGCCG AACTGAACGA CACGGTGGTG GATATTGCGA AGCAGGTCGA GCAAATCCTT
ACGGCAGTGT TCAATATCAA CAAACGTCTG AAAGGGCGGG TGGATATGAC CATGGCGCTG
GGGCTTTCTG ACATTAAAGC GCAGATGGGC GGGTTGGTAT ATCGCGGTTT TGTCACTGGT
AACGGCTTCA AACGGCTGGG CGACACGCTG CGTTATTTGC AGGCGATTGA AAAACGGCTG
GAAAAACTGG CGGTTGATCC ACATCGCGAT CGTGCACAGA TGCTGAAAGT CGAAAACGTC
CAGCAGGCGT GGCAGCAATG GTTCAACAAA CTCCCGCCTG CACGTCGTGA GGATGAAGAC
GTGAAAGAGA TCCGTTGGAT GATAGAAGAG CTGCGCGTTA GTTACTTCGC TCAACAACTT
GGTACGCCTT ATCCGATTTC AGATAAGCGT ATTTTGCAGG CAATGGAGCA GATTAGCGGT
TAA
 
Protein sequence
MTEQQKLTFT ALQQRLDSLM LRDRLRFSRR LHGVKKVKNP DAQQAIFQEM AKEIDQAAGK 
VLLREAARPE ITYPDNLPVS QKKQDILEAI RDHQVVIVAG ETGSGKTTQL PKICMELGRG
IKGLIGHTQP RRLAARTVAN RIAEELKTEP GGCIGYKVRF SDHVSDNTMV KLMTDGILLA
EIQQDRLLMQ YDTIIIDEAH ERSLNIDFLL GYLKELLPRR PDLKIIITSA TIDPERFSRH
FNNAPIIEVS GRTYPVEVRY RPIVEEADDT ERDQLQAIFD AVDELSQESP GDILIFMSGE
REIRDTADAL NKLNLRHTEI LPLYARLSNS EQNRVFQSHS GRRIVLATNV AETSLTVPGI
KYVIDPGTAR ISRYSYRTKV QRLPIEPISQ ASANQRKGRC GRVSEGICIR LYSEDDFLSR
PEFTDPEILR TNLASVILQM TALGLGDIAA FPFVEAPDKR NIQDGVRLLE ELGAITTDEQ
ASAYKLTPLG RQLSQLPVDP RLARMVLEAQ KHGCVREAMI ITSALSIQDP RERPMDKQQA
SDEKHRRFHD KESDFLAFVN LWNYLGEQQK ALSSNAFRRL CRTDYLNYLR VREWQDIYTQ
LRQVVKELGI PVNSEPAEYR EIHIALLTGL LSHIGMKDAD KQEYTGARNA RFSIFPGSGL
FKKPPKWVMV AELVETSRLW GRIAARIDPE WVEPVAQHLI KRTYSEPHWE RAQGAVMATE
KVTVYGLPIV AARKVNYSQI DPALCRELFI RHALVEGDWQ TRHAFFRENL KLRAEVEELE
HKSRRRDILV DDETLFEFYD QRIGHDVISA RHFDSWWKKV SRETPDLLNF EKSMLIKEGA
EKISKLDYPN FWHQGNLKLR LSYQFEPGAD ADGVTVHIPL PLLNQVEESG FEWQIPGLRR
ELVIALIKSL PKPVRRNFVP APNYAEAFLG RVTPLELPLL DSLERELRRM TGVTVDREDW
HWDQVPDHLK ITFRVVDDKN KKLKEGRSLQ DLKDALKGKV QETLSAVADD GIEQSGLHIW
SFGQLPESYE QKRGNYKVKA WPALVDERDS VAIKLFDNPL EQKQAMWNGL RRLLLLNIPS
PIKYLHEKLP NKAKLGLYFN PYGKVLELID DCISCGVDQL IDANGGPVWT EEGFAALHEK
VRAELNDTVV DIAKQVEQIL TAVFNINKRL KGRVDMTMAL GLSDIKAQMG GLVYRGFVTG
NGFKRLGDTL RYLQAIEKRL EKLAVDPHRD RAQMLKVENV QQAWQQWFNK LPPARREDED
VKEIRWMIEE LRVSYFAQQL GTPYPISDKR ILQAMEQISG