Gene EcSMS35_1759 details


Gene Information

Locus tag: EcSMS35_1759
Symbol: hrpA
ID: 6145514
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1763965
End bp: 1767867
Gene Length: 3903 bp
Protein Length: 1300 aa
Translation table: 11
GC content: 53%
IMG OID: 641616635
Product: ATP-dependent RNA helicase HrpA
Protein accession: YP_001743813
Protein GI: 170684173
COG category: [L] Replication, recombination and repair
COG ID: [COG1643] HrpA-like helicases
TIGRFAM ID: [TIGR01967] ATP-dependent helicase HrpA
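
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that the coordinates are 1-based and inclusive, and that the coding sequence ends in a single untranslated stop codon. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields.
start_bp = 1763965
end_bp = 1767867

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 3903 1300
```

Under these assumptions the result agrees with the listed Gene Length (3903 bp) and Protein Length (1300 aa).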


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.00000010565
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACAGAAC AACAAAAATT GACCTTTACG GCCTTGCAGC AACGGTTGGA TTCGCTGATG 
CTGCGTGACA GACTGCGTTT TTCCCGCCGT CTGCACGGCG TGAAGAAGGT TAAAAATCCT
GATGCACAAC AGGCCATTTT CCAGGAGATG GCGAAAGAGA TTGACCAGGC GGCAGGGAAA
GTCCTGCTGC GTGAAGCAGT ACGACCGGAA ATCACTTATC CAGACAATTT GCCGGTTAGT
CAGAAAAAGC AGGACATTCT CGAAGCAATC CGTGATCACC AGGTGGTGAT CGTCGCCGGG
GAAACGGGTT CCGGTAAAAC GACTCAGCTA CCGAAAATCT GTATGGAACT GGGCCGCGGG
ATTAAAGGGC TGATCGGCCA TACCCAGCCG CGTCGTCTGG CGGCACGAAC GGTGGCGAAC
CGTATTGCGG AAGAGCTGAA AACGGAGCCG GGCGGTTGCA TCGGGTATAA AGTGCGTTTC
AGCGATCACG TAAGTGATAA CACGATGGTC AAGCTGATGA CCGACGGTAT CCTGCTGGCG
GAGATCCAGC AAGACCGCCT GCTGATGCAG TACGACACTA TCATTATTGA CGAAGCGCAC
GAACGCAGCC TGAATATCGA TTTTCTGCTC GGCTATTTGA AAGAGTTGCT GCCGCGGCGT
CCTGACTTAA AAATCATTAT CACTTCCGCG ACTATCGACC CGGAACGCTT TTCGCGCCAC
TTTAATAATG CGCCGATTAT CGAAGTCTCC GGTCGGACCT ATCCGGTGGA AGTGCGCTAT
CGCCCGATCG TTGAAGAAGC TGATGACACC GAGCGCGATC AGTTGCAGGC GATTTTTGAC
GCCGTAGACG AACTGAGCCA GGAAAGCCCT GGCGACATTC TGATCTTTAT GAGCGGTGAG
CGGGAAATTC GCGATACCGC CGATGCGCTG AACAAGCTAA ATCTGCGGCA TACCGAAATC
TTGCCGCTTT ATGCGCGGCT TTCGAACAGC GAACAGAACC GCGTGTTCCA GTCGCACAGT
GGACGGCGCA TTGTGCTGGC GACCAACGTC GCGGAAACCT CGCTGACCGT ACCGGGTATT
AAATACGTTA TCGACCCCGG TACAGCGCGT ATCAGCCGCT ACAGCTATCG CACCAAAGTG
CAGCGTTTGC CGATTGAGCC CATTTCGCAG GCTTCAGCTA ATCAGCGTAA AGGCCGCTGT
GGTCGTGTGT CCGAAGGGAT CTGTATTCGT CTCTATTCCG AAGACGATTT CCTTTCGCGC
CCGGAATTTA CCGATCCGGA GATTCTGCGT ACCAACCTGG CCTCGGTTAT TTTGCAGATG
ACTGCGCTGG GGCTGGGCGA TATCGCTGCG TTCCCGTTTG TTGAAGCACC GGATAAACGC
AATATCCAGG ATGGCGTGCG TCTGCTCGAA GAACTGGGCG CGATCACCAC GGATGAACAG
GCCAGCGCCT ATAAACTGAC GCCGCTCGGT CGCCAGCTTT CGCAATTGCC TGTCGATCCT
CGTCTGGCGC GTATGGTGCT GGAAGCGCAG AAACATGGCT GCGTGCGTGA GGCGATGATT
ATCACGTCCG CGCTCTCCAT TCAGGATCCG CGCGAGCGTC CGATGGATAA ACAGCAGGCA
TCGGACGAAA AACATCGTCG CTTCCACGAC AAAGAATCCG ACTTCCTCGC GTTTGTAAAT
CTGTGGAATT ATCTTGGCGA GCAGCAAAAG GCGCTTTCTT CCAACGCCTT CCGTCGCCTG
TGTCGTACCG ATTATCTCAA CTATCTGCGC GTGCGCGAAT GGCAGGATAT CTACACCCAG
TTGCGTCAGG TAGTGAAAGA ACTTGGCATT CCGGTTAACA GCGAACCGGC GGAGTATCGC
GAAATTCACA TCGCCTTACT GACCGGTTTG CTTTCCCATA TCGGCATGAA AGATGCCGAT
AAACAAGAAT ATACCGGCGC ACGTAACGCG CGTTTCTCCA TTTTCCCCGG TTCCGGCTTA
TTTAAAAAGC CGCCGAAATG GGTAATGGTG GCGGAACTGG TAGAAACCAG CCGCCTGTGG
GGGCGCATTG CTGCGCGTAT CGACCCGGAA TGGGTAGAAC CCGTTGCTCA GCATTTGATT
AAACGCACCT ACAGCGAACC GCACTGGGAA CGGGCGCAGG GCGCGGTGAT GGCAACGGAA
AAAGTCACTG TTTATGGTTT GCCGATTGTT GCTGCGCGCA AGGTCAACTA CAGCCAGATC
GATCCGGCGT TATGTCGTGA ACTCTTTATT CGCCACGCTC TGGTGGAAGG TGACTGGCAG
ACGCGTCACG CATTCTTCCG TGAAAACCTG AAACTGCGGG CCGAAGTGGA AGAGCTGGAA
CACAAATCAC GTCGCCGCGA TATTCTGGTT GATGACGAAA CGTTGTTTGA GTTCTACGAC
CAGCGCATCC GCCACGATGT AATCTCCGCT CGTCACTTCG ATAGCTGGTG GAAAAAAGTC
AGCCGCGAAA CGCCTGATTT GCTCAACTTT GAAAAGAGCA TGTTGATCAA AGAAGGGGCG
GAAAAAATCA GCAAGCTGGA TTATCCGAAC TTCTGGCATC AGGGCAATCT CAAGCTGCGT
TTGAGCTATC AGTTTGAGCC CGGCGCGGAT GCTGACGGTG TGACTGTGCA TATCCCGCTG
CCGCTGCTTA ATCAGGTAGA AGAGAGCGGG TTTGAATGGC AAATCCCCGG CCTGCGCCGC
GAACTGGTGA TTGCTCTGAT TAAATCGTTG CCGAAACCGG TACGCCGTAA TTTTGTACCT
GCGCCAAACT ATGCCGAAGC GTTTTTAGGC CGCGTCAAAC CGCTGGAGTT ACCGTTGCTC
GACAGTCTTG AGCGCGAGTT ACGGCGGATG ACCGGTGTTA CCGTTGACCG CGAAGACTGG
CACTGGGATC AGGTGCCTGA TCACCTGAAA ATCACCTTCC GCGTGGTGGA TGACAAAAAC
AAGAAGCTAA AAGAAGGGCG CTCATTACAG GATCTGAAAG ATGCGCTGAA AGGCAAAGTG
CAGGAAACGC TGTCTGCGGT GGCGGATGAC GGTATCGAGC AGAGCGGCTT ACATATCTGG
AGTTTTGGTC AGCTGCCGGA AAGCTACGAA CAGAAGCGTG GCAACTATAA AGTGAAGGCG
TGGCCAGCGC TGGTGGATGA ACGCGACAGT GTGGCAATCA AATTGTTTGA TAATCCGCTG
GAACAAAAGC AGGCAATGTG GAACGGTCTT CGCCGTCTAC TGCTGCTGAA TATTCCATCG
CCGATCAAAT ATTTGCATGA AAAGTTACCG AACAAAGCCA AGCTGGGGCT GTACTTTAAC
CCGTATGGCA AAGTGCTGGA GCTGATCGAC GACTGTATCT CCTGCGGGGT GGATCAACTG
ATCGACGCCA ATGGTGGCCC GGTCTGGACG GAAGAAGGCT TTGCTGCGCT GCATGAAAAA
GTCCGTGCTG AACTGAACGA CACGGTGGTG GATATTGCGA AGCAGGTCGA GCAAATCCTC
ACTGCGGTGT TCAATATCAA TAAACGTCTG AAAGGGCGGG TGGATATGAC TATGGCGCTG
GGGCTTTCTG ACATTAAAGC GCAGATGGGC GGGTTGGTAT ATCGCGGTTT TGTCACTGGT
AACGGCTTCA AACGGTTGGG CGACACGCTG CGTTATTTGC AGGCGATTGA AAAACGACTG
GAAAAACTGG CGGTTGATCC GCATCGTGAC CGTGCTCAGA TGCTGAAAGT CGAAAACGTC
CAGCAGGCGT GGCAGCAATG GATCAACAAA CTGCCGCCCG CACGTCGTGA GGATGAAGAC
GTGAAAGAGA TCCGTTGGAT GATAGAAGAG TTGCGCGTTA GTTACTTCGC TCAACAACTT
GGTACGCCTT ATCCGATTTC AGATAAGCGT ATTTTGCAGG CGATGGAGCA GATTAGCGGT
TAA
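
The gene sequence is laid out in blocks of 10 bases, so the spaces and line breaks have to be stripped before any calculation. As a minimal sketch of how a GC content figure like the one in the Gene Information section could be computed, the hypothetical helper `gc_fraction` below counts G and C over the cleaned sequence. It is shown on the first line of the sequence only. A single line is not expected to reproduce the whole-gene value, and the page's exact rounding rule is not stated.

```python
def gc_fraction(seq_text):
    """Return the G+C fraction of a DNA sequence given as blocked text."""
    # Remove the spaces and newlines that separate the 10-base blocks.
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGACAGAAC AACAAAAATT GACCTTTACG GCCTTGCAGC AACGGTTGGA TTCGCTGATG"
print(f"{gc_fraction(first_line):.2f}")
```

Passing the full 3903-base sequence as one string gives the whole-gene fraction for comparison with the listed 53%.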
 
Protein sequence
MTEQQKLTFT ALQQRLDSLM LRDRLRFSRR LHGVKKVKNP DAQQAIFQEM AKEIDQAAGK 
VLLREAVRPE ITYPDNLPVS QKKQDILEAI RDHQVVIVAG ETGSGKTTQL PKICMELGRG
IKGLIGHTQP RRLAARTVAN RIAEELKTEP GGCIGYKVRF SDHVSDNTMV KLMTDGILLA
EIQQDRLLMQ YDTIIIDEAH ERSLNIDFLL GYLKELLPRR PDLKIIITSA TIDPERFSRH
FNNAPIIEVS GRTYPVEVRY RPIVEEADDT ERDQLQAIFD AVDELSQESP GDILIFMSGE
REIRDTADAL NKLNLRHTEI LPLYARLSNS EQNRVFQSHS GRRIVLATNV AETSLTVPGI
KYVIDPGTAR ISRYSYRTKV QRLPIEPISQ ASANQRKGRC GRVSEGICIR LYSEDDFLSR
PEFTDPEILR TNLASVILQM TALGLGDIAA FPFVEAPDKR NIQDGVRLLE ELGAITTDEQ
ASAYKLTPLG RQLSQLPVDP RLARMVLEAQ KHGCVREAMI ITSALSIQDP RERPMDKQQA
SDEKHRRFHD KESDFLAFVN LWNYLGEQQK ALSSNAFRRL CRTDYLNYLR VREWQDIYTQ
LRQVVKELGI PVNSEPAEYR EIHIALLTGL LSHIGMKDAD KQEYTGARNA RFSIFPGSGL
FKKPPKWVMV AELVETSRLW GRIAARIDPE WVEPVAQHLI KRTYSEPHWE RAQGAVMATE
KVTVYGLPIV AARKVNYSQI DPALCRELFI RHALVEGDWQ TRHAFFRENL KLRAEVEELE
HKSRRRDILV DDETLFEFYD QRIRHDVISA RHFDSWWKKV SRETPDLLNF EKSMLIKEGA
EKISKLDYPN FWHQGNLKLR LSYQFEPGAD ADGVTVHIPL PLLNQVEESG FEWQIPGLRR
ELVIALIKSL PKPVRRNFVP APNYAEAFLG RVKPLELPLL DSLERELRRM TGVTVDREDW
HWDQVPDHLK ITFRVVDDKN KKLKEGRSLQ DLKDALKGKV QETLSAVADD GIEQSGLHIW
SFGQLPESYE QKRGNYKVKA WPALVDERDS VAIKLFDNPL EQKQAMWNGL RRLLLLNIPS
PIKYLHEKLP NKAKLGLYFN PYGKVLELID DCISCGVDQL IDANGGPVWT EEGFAALHEK
VRAELNDTVV DIAKQVEQIL TAVFNINKRL KGRVDMTMAL GLSDIKAQMG GLVYRGFVTG
NGFKRLGDTL RYLQAIEKRL EKLAVDPHRD RAQMLKVENV QQAWQQWINK LPPARREDED
VKEIRWMIEE LRVSYFAQQL GTPYPISDKR ILQAMEQISG
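
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below is an illustration, not the tool used to produce this record. It builds the codon-to-amino-acid map from the compact NCBI ordering (bases in the order T, C, A, G), in which table 11 assigns the same amino acids as the standard code and differs in its permitted start codons. The name `translate` is hypothetical, and the function does not handle alternative start codons such as GTG or TTG, which is harmless here because the gene begins with ATG.

```python
from itertools import product

# Codons enumerated as TTT, TTC, TTA, TTG, TCT, ... paired with amino acids.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna_text):
    """Translate blocked DNA text, stopping at the first stop codon."""
    dna = "".join(dna_text.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGACAGAAC AACAAAAATT GACCTTTACG GCCTTGCAGC AACGGTTGGA TTCGCTGATG"
print(translate(first_line))  # MTEQQKLTFTALQQRLDSLM
```

The 20 residues printed match the first two blocks of the protein sequence above (MTEQQKLTFT ALQQRLDSLM).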