Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2016 |
Symbol | hrpA |
ID | 6971329 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1913481 |
End bp | 1917383 |
Gene Length | 3903 bp |
Protein Length | 1300 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643385933 |
Product | ATP-dependent RNA helicase HrpA |
Protein accession | YP_002270422 |
Protein GI | 209400718 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1643] HrpA-like helicases |
TIGRFAM ID | [TIGR01967] ATP-dependent helicase HrpA |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.0120087 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGAAC AACAAAAATT GACTTTTACG GCCTTGCAGC AGCGGCTGGA TTCGCTGATG CTGCGTGACA GACTGCGTTT TTCTCGCCGT CTGCACGGCG TGAAGAAGGT TAAAAATCCT GATGCACAAC AGGCCATTTT CCAGGAGATG GCGAAAGAGA TTGACCAGGC GGCAGGGAAA GTCCTGCTGC GTGAAGCGGC ACGACCGGAA ATTACTTATC CTGACAATTT GCCGGTTAGT CAGAAAAAAC AGGACATTCT CGAAGCGATT CGTGATCACC AGGTGGTGAT AGTCGCCGGG GAAACGGGTT CTGGTAAAAC GACTCAGTTA CCGAAAATCT GTATGGAGCT GGGGCGCGGG ATTAAAGGGC TGATCGGCCA TACCCAGCCG CGTCGCCTGG CGGCAAGAAC GGTAGCGAAC CGTATTGCGG AAGAGCTGAA AACGGAGCCG GGCGGTTGCA TCGGTTACAA AGTGCGTTTC AGCGATCACG TAAGTGATAA CACGATGGTC AAGCTAATGA CCGACGGTAT CCTGCTGGCG GAGATCCAGC AAGACCGCCT GCTGATGCAG TACGACACTA TCATTATTGA CGAAGCGCAC GAACGTAGCC TGAATATCGA TTTTCTGCTC GGTTATTTGA AAGAGTTGCT GCCGCGGCGT CCTGACCTAA AAATCATTAT CACTTCCGCG ACTATCGACC CGGAACGCTT TTCGCGCCAC TTTAATAATG CGCCGATTAT TGAAGTCTCC GGTCGGACCT ATCCGGTGGA AGTGCGCTAT CGCCCGATTG TTGAAGAAGC CGACGACACT GAGCGCGACC AGTTGCAGGC AATTTTCGAT GCCGTGGACG AACTAAGCCA GGAAAGCCCC GGCGACATTC TGATCTTTAT GAGCGGCGAG CGGGAAATTC GCGATACCGC CGATGCGCTG AACAAGCTGA ACTTACGCCA TACCGAAATC TTGCCGCTTT ATGCGCGGCT TTCGAATAGC GAACAGAACC GCGTGTTCCA GTCGCACAGC GGGCGACGCA TTGTGCTGGC GACCAACGTC GCGGAAACCT CGCTGACTGT ACCGGGGATT AAATACGTTA TCGACCCCGG TACGGCGCGT ATCAGCCGCT ACAGCTATCG CACCAAAGTG CAGCGTTTGC CGATTGAGCC CATTTCGCAG GCGTCTGCCA ATCAGCGTAA AGGCCGCTGT GGTCGTGTGT CCGAAGGGAT CTGTATTCGT CTCTATTCCG AAGACGATTT CCTCTCGCGC CCGGAGTTTA CCGATCCGGA GATTCTGCGT ACCAACCTGG CCTCGGTTAT TTTGCAGATG ACCGCGCTGG GGCTAGGCGA TATCGCTGCG TTCCCGTTTG TCGAAGCACC GGATAAACGC AATATCCAGG ATGGCGTGCG TCTGCTCGAA GAACTGGGCG CGATCACCAC GGATGAACAG GCCAGCGCCT ATAAACTGAC GCCGCTCGGT CGCCAGCTCT CGCAGTTGCC TGTCGACCCA CGTCTGGCGC GTATGGTGCT GGAAGCGCAA AAACATGGCT GCGTGCGTGA GGCGATGATT ATCACGTCTG CGCTCTCCAT TCAGGATCCG CGCGAACGTC CGATGGACAA ACAGCAGGCA TCGGACGAAA AACATCGTCG CTTCCACGAC AAAGAGTCTG ACTTCCTCGC GTTTGTGAAT CTGTGGAATT ATCTTGGCGA GCAGCAAAAG GCGCTTTCTT CCAACGCCTT CCGTCGCCTG TGTCGTACCG ATTATCTCAA CTATCTGCGT GTGCGCGAAT GGCAGGATAT CTACACCCAG TTGCGCCAGG TGGTGAAAGA ACTCGGCATT CCGGTGAACA GCGAACCGGC GGAGTATCGT GAAATTCACA TTGCGTTGCT GACGGGGTTG CTTTCCCATA TCGGCATGAA AGATGCCGAT AAACAAGAAT ATACCGGCGC ACGTAACGCG CGTTTCTCCA TCTTCCCCGG TTCTGGTTTA TTCAAAAAAC CGCCGAAATG GGTAATGGTG GCGGAACTGG TAGAAACCAG CCGCCTGTGG GGGCGCATTG CTGCGCGTAT CGACCCGGAA TGGGTGGAGC CTGTCGCTCA GCATTTGATT AAACGTACCT ACAGCGAACC GCACTGGGAA CGGGCGCAGG GCGCGGTGAT GGCAACGGAA AAAGTCACTG TTTATGGTTT GCCGATTGTT GCCGCGCGCA AGGTCAACTA CAGCCAGATC GATCCGGCGT TATGTCGTGA ACTCTTTATT CGCCACGCGC TGGTGGAAGG TGACTGGCAG ACGCGTCACG CATTCTTCCG TGAAAACCTG AAACTGCGGG CCGAAGTGGA AGAGCTTGAA CACAAATCAC GTCGCCGCGA TATTCTGGTT GATGACGAAA CGCTGTTTGA GTTCTACGAC CAGCGCATCG GCCACGATGT AATCTCCGCT CGTCACTTCG ACAGCTGGTG GAAAAAAGTC AGCCGCGAAA CGCCTGATTT GCTCAACTTT GAAAAGAGCA TGTTGATCAA AGAAGGGGCG GAAAAGATCA GCAAGCTGGA TTACCCGAAC TTCTGGCATC AGGGCAATCT CAAGCTGCGT TTGAGCTATC AGTTTGAGCC CGGCGCGGAT GCTGACGGTG TGACCGTACA TATTCCGCTG CCGTTACTTA ACCAGGTTGA GGAAAGCGGG TTTGAATGGC AGATCCCTGG TCTGCGCCGC GAACTGGTGA TTGCTCTGAT TAAATCGTTG CCGAAACCGG TACGCCGTAA TTTTGTACCT GCGCCAAACT ATGCCGAAGC GTTTTTAGGC CGCGTCACAC CGCTGGAGTT ACCGTTGCTC GACAGCCTTG AGCGCGAGTT ACGGCGGATG ACCGGCGTTA CCGTTGACCG CGAAGACTGG CACTGGGATC AGGTGCCCGA TCACCTGAAA ATCACCTTCC GCGTGGTGGA TGACAAAAAC AAGAAGCTAA AAGAAGGGCG CTCGCTACAA GATCTGAAAG ATGCGCTGAA AGGCAAAGTG CAGGAAACGC TATCTGCGGT GGCGGATGAC GGTATCGAGC AGAGCGGCTT ACATATCTGG AGTTTTGGTC AGTTGCCGGA AAGCTACGAA CAGAAGCGTG GTAACTACAA AGTGAAGGCG TGGCCGGCGC TGGTGGATGA GCGCGACAGT GTGGCGATCA AACTGTTTGA TAACCCGCTG GAGCAAAAAC AGGCAATGTG GAACGGTCTT CGCCGTCTAC TGCTGCTGAA TATTCCCTCA CCAATCAAGT ATTTACATGA GAAGTTACCA AACAAAGCCA AGCTGGGGCT GTACTTTAAC CCGTATGGCA AAGTGCTGGA GCTGATCGAC GACTGTATCT CATGCGGTGT GGATCAACTG ATCGACGCCA ATGGAGGCCC GGTCTGGACG GAAGAAGGCT TTGCTGCGCT GCATGAAAAA GTGCGTGCCG AACTGAACGA CACGGTGGTG GATATTGCGA AGCAGGTCGA GCAAATCCTT ACGGCAGTGT TCAATATCAA CAAACGTCTG AAAGGGCGGG TGGATATGAC CATGGCGCTG GGGCTTTCTG ACATTAAAGC GCAGATGGGC GGGTTGGTAT ATCGCGGTTT TGTCACTGGT AACGGCTTCA AACGGCTGGG CGACACGCTG CGTTATTTGC AGGCGATTGA AAAACGGCTG GAAAAACTGG CGGTTGATCC ACATCGCGAT CGTGCACAGA TGCTGAAAGT CGAAAACGTC CAGCAGGCGT GGCAGCAATG GTTCAACAAA CTCCCGCCTG CACGTCGTGA GGATGAAGAC GTGAAAGAGA TCCGTTGGAT GATAGAAGAG CTGCGCGTTA GTTACTTCGC TCAACAACTT GGTACGCCTT ATCCGATTTC AGATAAGCGT ATTTTGCAGG CAATGGAGCA GATTAGCGGT TAA
|
Protein sequence | MTEQQKLTFT ALQQRLDSLM LRDRLRFSRR LHGVKKVKNP DAQQAIFQEM AKEIDQAAGK VLLREAARPE ITYPDNLPVS QKKQDILEAI RDHQVVIVAG ETGSGKTTQL PKICMELGRG IKGLIGHTQP RRLAARTVAN RIAEELKTEP GGCIGYKVRF SDHVSDNTMV KLMTDGILLA EIQQDRLLMQ YDTIIIDEAH ERSLNIDFLL GYLKELLPRR PDLKIIITSA TIDPERFSRH FNNAPIIEVS GRTYPVEVRY RPIVEEADDT ERDQLQAIFD AVDELSQESP GDILIFMSGE REIRDTADAL NKLNLRHTEI LPLYARLSNS EQNRVFQSHS GRRIVLATNV AETSLTVPGI KYVIDPGTAR ISRYSYRTKV QRLPIEPISQ ASANQRKGRC GRVSEGICIR LYSEDDFLSR PEFTDPEILR TNLASVILQM TALGLGDIAA FPFVEAPDKR NIQDGVRLLE ELGAITTDEQ ASAYKLTPLG RQLSQLPVDP RLARMVLEAQ KHGCVREAMI ITSALSIQDP RERPMDKQQA SDEKHRRFHD KESDFLAFVN LWNYLGEQQK ALSSNAFRRL CRTDYLNYLR VREWQDIYTQ LRQVVKELGI PVNSEPAEYR EIHIALLTGL LSHIGMKDAD KQEYTGARNA RFSIFPGSGL FKKPPKWVMV AELVETSRLW GRIAARIDPE WVEPVAQHLI KRTYSEPHWE RAQGAVMATE KVTVYGLPIV AARKVNYSQI DPALCRELFI RHALVEGDWQ TRHAFFRENL KLRAEVEELE HKSRRRDILV DDETLFEFYD QRIGHDVISA RHFDSWWKKV SRETPDLLNF EKSMLIKEGA EKISKLDYPN FWHQGNLKLR LSYQFEPGAD ADGVTVHIPL PLLNQVEESG FEWQIPGLRR ELVIALIKSL PKPVRRNFVP APNYAEAFLG RVTPLELPLL DSLERELRRM TGVTVDREDW HWDQVPDHLK ITFRVVDDKN KKLKEGRSLQ DLKDALKGKV QETLSAVADD GIEQSGLHIW SFGQLPESYE QKRGNYKVKA WPALVDERDS VAIKLFDNPL EQKQAMWNGL RRLLLLNIPS PIKYLHEKLP NKAKLGLYFN PYGKVLELID DCISCGVDQL IDANGGPVWT EEGFAALHEK VRAELNDTVV DIAKQVEQIL TAVFNINKRL KGRVDMTMAL GLSDIKAQMG GLVYRGFVTG NGFKRLGDTL RYLQAIEKRL EKLAVDPHRD RAQMLKVENV QQAWQQWFNK LPPARREDED VKEIRWMIEE LRVSYFAQQL GTPYPISDKR ILQAMEQISG
|
| |