Gene Information |
Locus tag | EcSMS35_1759 |
Symbol | hrpA |
ID | 6145514 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1763965 |
End bp | 1767867 |
Gene Length | 3903 bp |
Protein Length | 1300 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616635 |
Product | ATP-dependent RNA helicase HrpA |
Protein accession | YP_001743813 |
Protein GI | 170684173 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1643] HrpA-like helicases |
TIGRFAM ID | [TIGR01967] ATP-dependent helicase HrpA |
|
|
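The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the reported protein length excludes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1763965
END_BP = 1767867
REPORTED_GENE_LENGTH = 3903      # bp
REPORTED_PROTEIN_LENGTH = 1300   # aa


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)

print("gene length matches:", computed_gene == REPORTED_GENE_LENGTH)
print("protein length matches:", computed_protein == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions both checks agree with the table: 1767867 - 1763965 + 1 = 3903 bp, and 3903 / 3 - 1 = 1300 aa. The strand (-) does not affect the length arithmetic, only the orientation in which the sequence is read.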
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.00000010565 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACAGAAC AACAAAAATT GACCTTTACG GCCTTGCAGC AACGGTTGGA TTCGCTGATG CTGCGTGACA GACTGCGTTT TTCCCGCCGT CTGCACGGCG TGAAGAAGGT TAAAAATCCT GATGCACAAC AGGCCATTTT CCAGGAGATG GCGAAAGAGA TTGACCAGGC GGCAGGGAAA GTCCTGCTGC GTGAAGCAGT ACGACCGGAA ATCACTTATC CAGACAATTT GCCGGTTAGT CAGAAAAAGC AGGACATTCT CGAAGCAATC CGTGATCACC AGGTGGTGAT CGTCGCCGGG GAAACGGGTT CCGGTAAAAC GACTCAGCTA CCGAAAATCT GTATGGAACT GGGCCGCGGG ATTAAAGGGC TGATCGGCCA TACCCAGCCG CGTCGTCTGG CGGCACGAAC GGTGGCGAAC CGTATTGCGG AAGAGCTGAA AACGGAGCCG GGCGGTTGCA TCGGGTATAA AGTGCGTTTC AGCGATCACG TAAGTGATAA CACGATGGTC AAGCTGATGA CCGACGGTAT CCTGCTGGCG GAGATCCAGC AAGACCGCCT GCTGATGCAG TACGACACTA TCATTATTGA CGAAGCGCAC GAACGCAGCC TGAATATCGA TTTTCTGCTC GGCTATTTGA AAGAGTTGCT GCCGCGGCGT CCTGACTTAA AAATCATTAT CACTTCCGCG ACTATCGACC CGGAACGCTT TTCGCGCCAC TTTAATAATG CGCCGATTAT CGAAGTCTCC GGTCGGACCT ATCCGGTGGA AGTGCGCTAT CGCCCGATCG TTGAAGAAGC TGATGACACC GAGCGCGATC AGTTGCAGGC GATTTTTGAC GCCGTAGACG AACTGAGCCA GGAAAGCCCT GGCGACATTC TGATCTTTAT GAGCGGTGAG CGGGAAATTC GCGATACCGC CGATGCGCTG AACAAGCTAA ATCTGCGGCA TACCGAAATC TTGCCGCTTT ATGCGCGGCT TTCGAACAGC GAACAGAACC GCGTGTTCCA GTCGCACAGT GGACGGCGCA TTGTGCTGGC GACCAACGTC GCGGAAACCT CGCTGACCGT ACCGGGTATT AAATACGTTA TCGACCCCGG TACAGCGCGT ATCAGCCGCT ACAGCTATCG CACCAAAGTG CAGCGTTTGC CGATTGAGCC CATTTCGCAG GCTTCAGCTA ATCAGCGTAA AGGCCGCTGT GGTCGTGTGT CCGAAGGGAT CTGTATTCGT CTCTATTCCG AAGACGATTT CCTTTCGCGC CCGGAATTTA CCGATCCGGA GATTCTGCGT ACCAACCTGG CCTCGGTTAT TTTGCAGATG ACTGCGCTGG GGCTGGGCGA TATCGCTGCG TTCCCGTTTG TTGAAGCACC GGATAAACGC AATATCCAGG ATGGCGTGCG TCTGCTCGAA GAACTGGGCG CGATCACCAC GGATGAACAG GCCAGCGCCT ATAAACTGAC GCCGCTCGGT CGCCAGCTTT CGCAATTGCC TGTCGATCCT CGTCTGGCGC GTATGGTGCT GGAAGCGCAG AAACATGGCT GCGTGCGTGA GGCGATGATT ATCACGTCCG CGCTCTCCAT TCAGGATCCG CGCGAGCGTC CGATGGATAA ACAGCAGGCA TCGGACGAAA AACATCGTCG CTTCCACGAC AAAGAATCCG ACTTCCTCGC GTTTGTAAAT CTGTGGAATT ATCTTGGCGA GCAGCAAAAG GCGCTTTCTT CCAACGCCTT CCGTCGCCTG TGTCGTACCG ATTATCTCAA CTATCTGCGC GTGCGCGAAT GGCAGGATAT CTACACCCAG 
TTGCGTCAGG TAGTGAAAGA ACTTGGCATT CCGGTTAACA GCGAACCGGC GGAGTATCGC GAAATTCACA TCGCCTTACT GACCGGTTTG CTTTCCCATA TCGGCATGAA AGATGCCGAT AAACAAGAAT ATACCGGCGC ACGTAACGCG CGTTTCTCCA TTTTCCCCGG TTCCGGCTTA TTTAAAAAGC CGCCGAAATG GGTAATGGTG GCGGAACTGG TAGAAACCAG CCGCCTGTGG GGGCGCATTG CTGCGCGTAT CGACCCGGAA TGGGTAGAAC CCGTTGCTCA GCATTTGATT AAACGCACCT ACAGCGAACC GCACTGGGAA CGGGCGCAGG GCGCGGTGAT GGCAACGGAA AAAGTCACTG TTTATGGTTT GCCGATTGTT GCTGCGCGCA AGGTCAACTA CAGCCAGATC GATCCGGCGT TATGTCGTGA ACTCTTTATT CGCCACGCTC TGGTGGAAGG TGACTGGCAG ACGCGTCACG CATTCTTCCG TGAAAACCTG AAACTGCGGG CCGAAGTGGA AGAGCTGGAA CACAAATCAC GTCGCCGCGA TATTCTGGTT GATGACGAAA CGTTGTTTGA GTTCTACGAC CAGCGCATCC GCCACGATGT AATCTCCGCT CGTCACTTCG ATAGCTGGTG GAAAAAAGTC AGCCGCGAAA CGCCTGATTT GCTCAACTTT GAAAAGAGCA TGTTGATCAA AGAAGGGGCG GAAAAAATCA GCAAGCTGGA TTATCCGAAC TTCTGGCATC AGGGCAATCT CAAGCTGCGT TTGAGCTATC AGTTTGAGCC CGGCGCGGAT GCTGACGGTG TGACTGTGCA TATCCCGCTG CCGCTGCTTA ATCAGGTAGA AGAGAGCGGG TTTGAATGGC AAATCCCCGG CCTGCGCCGC GAACTGGTGA TTGCTCTGAT TAAATCGTTG CCGAAACCGG TACGCCGTAA TTTTGTACCT GCGCCAAACT ATGCCGAAGC GTTTTTAGGC CGCGTCAAAC CGCTGGAGTT ACCGTTGCTC GACAGTCTTG AGCGCGAGTT ACGGCGGATG ACCGGTGTTA CCGTTGACCG CGAAGACTGG CACTGGGATC AGGTGCCTGA TCACCTGAAA ATCACCTTCC GCGTGGTGGA TGACAAAAAC AAGAAGCTAA AAGAAGGGCG CTCATTACAG GATCTGAAAG ATGCGCTGAA AGGCAAAGTG CAGGAAACGC TGTCTGCGGT GGCGGATGAC GGTATCGAGC AGAGCGGCTT ACATATCTGG AGTTTTGGTC AGCTGCCGGA AAGCTACGAA CAGAAGCGTG GCAACTATAA AGTGAAGGCG TGGCCAGCGC TGGTGGATGA ACGCGACAGT GTGGCAATCA AATTGTTTGA TAATCCGCTG GAACAAAAGC AGGCAATGTG GAACGGTCTT CGCCGTCTAC TGCTGCTGAA TATTCCATCG CCGATCAAAT ATTTGCATGA AAAGTTACCG AACAAAGCCA AGCTGGGGCT GTACTTTAAC CCGTATGGCA AAGTGCTGGA GCTGATCGAC GACTGTATCT CCTGCGGGGT GGATCAACTG ATCGACGCCA ATGGTGGCCC GGTCTGGACG GAAGAAGGCT TTGCTGCGCT GCATGAAAAA GTCCGTGCTG AACTGAACGA CACGGTGGTG GATATTGCGA AGCAGGTCGA GCAAATCCTC ACTGCGGTGT TCAATATCAA TAAACGTCTG AAAGGGCGGG TGGATATGAC TATGGCGCTG GGGCTTTCTG ACATTAAAGC GCAGATGGGC GGGTTGGTAT ATCGCGGTTT TGTCACTGGT AACGGCTTCA 
AACGGTTGGG CGACACGCTG CGTTATTTGC AGGCGATTGA AAAACGACTG GAAAAACTGG CGGTTGATCC GCATCGTGAC CGTGCTCAGA TGCTGAAAGT CGAAAACGTC CAGCAGGCGT GGCAGCAATG GATCAACAAA CTGCCGCCCG CACGTCGTGA GGATGAAGAC GTGAAAGAGA TCCGTTGGAT GATAGAAGAG TTGCGCGTTA GTTACTTCGC TCAACAACTT GGTACGCCTT ATCCGATTTC AGATAAGCGT ATTTTGCAGG CGATGGAGCA GATTAGCGGT TAA
|
Protein sequence | MTEQQKLTFT ALQQRLDSLM LRDRLRFSRR LHGVKKVKNP DAQQAIFQEM AKEIDQAAGK VLLREAVRPE ITYPDNLPVS QKKQDILEAI RDHQVVIVAG ETGSGKTTQL PKICMELGRG IKGLIGHTQP RRLAARTVAN RIAEELKTEP GGCIGYKVRF SDHVSDNTMV KLMTDGILLA EIQQDRLLMQ YDTIIIDEAH ERSLNIDFLL GYLKELLPRR PDLKIIITSA TIDPERFSRH FNNAPIIEVS GRTYPVEVRY RPIVEEADDT ERDQLQAIFD AVDELSQESP GDILIFMSGE REIRDTADAL NKLNLRHTEI LPLYARLSNS EQNRVFQSHS GRRIVLATNV AETSLTVPGI KYVIDPGTAR ISRYSYRTKV QRLPIEPISQ ASANQRKGRC GRVSEGICIR LYSEDDFLSR PEFTDPEILR TNLASVILQM TALGLGDIAA FPFVEAPDKR NIQDGVRLLE ELGAITTDEQ ASAYKLTPLG RQLSQLPVDP RLARMVLEAQ KHGCVREAMI ITSALSIQDP RERPMDKQQA SDEKHRRFHD KESDFLAFVN LWNYLGEQQK ALSSNAFRRL CRTDYLNYLR VREWQDIYTQ LRQVVKELGI PVNSEPAEYR EIHIALLTGL LSHIGMKDAD KQEYTGARNA RFSIFPGSGL FKKPPKWVMV AELVETSRLW GRIAARIDPE WVEPVAQHLI KRTYSEPHWE RAQGAVMATE KVTVYGLPIV AARKVNYSQI DPALCRELFI RHALVEGDWQ TRHAFFRENL KLRAEVEELE HKSRRRDILV DDETLFEFYD QRIRHDVISA RHFDSWWKKV SRETPDLLNF EKSMLIKEGA EKISKLDYPN FWHQGNLKLR LSYQFEPGAD ADGVTVHIPL PLLNQVEESG FEWQIPGLRR ELVIALIKSL PKPVRRNFVP APNYAEAFLG RVKPLELPLL DSLERELRRM TGVTVDREDW HWDQVPDHLK ITFRVVDDKN KKLKEGRSLQ DLKDALKGKV QETLSAVADD GIEQSGLHIW SFGQLPESYE QKRGNYKVKA WPALVDERDS VAIKLFDNPL EQKQAMWNGL RRLLLLNIPS PIKYLHEKLP NKAKLGLYFN PYGKVLELID DCISCGVDQL IDANGGPVWT EEGFAALHEK VRAELNDTVV DIAKQVEQIL TAVFNINKRL KGRVDMTMAL GLSDIKAQMG GLVYRGFVTG NGFKRLGDTL RYLQAIEKRL EKLAVDPHRD RAQMLKVENV QQAWQQWINK LPPARREDED VKEIRWMIEE LRVSYFAQQL GTPYPISDKR ILQAMEQISG