Gene EcSMS35_0159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0159 
SymbolhrpB 
ID6143552 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp172465 
End bp174939 
Gene Length2475 bp 
Protein Length824 aa 
Translation table11 
GC content57% 
IMG OID641615060 
ProductATP-dependent RNA helicase HrpB 
Protein accessionYP_001742276 
Protein GI170681199 
COG category[L] Replication, recombination and repair 
COG ID[COG1643] HrpA-like helicases 
TIGRFAM ID[TIGR01970] ATP-dependent helicase HrpB 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.805468 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones55 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTACAAT GTGGCGCGAA GAATGTTAAC CCTCTGGAGC GTTTTGTGTC GTCGTTGCCC 
GTTGCTGCCG TCTTACCTGA GTTACTTGCT GCCCTCGATG GTGCATCGCA GGTGTTATTA
AGTGCGCCGA CCGGGGCCGG GAAATCAACC TGGCTGCCGC TGCAACTGCT GGCGCATCCA
GGCATTAACG GGAAAGTTAT CCTGCTGGAG CCGCGTCGTC TGGCGGCGCG TAACGTTGCG
CAACGGCTGG CGGAGCTGCT TAACGAAAAG CCAGGCGATA CCGTTGGCTA CCGGATGCGT
GCGCAAAACT GCGTCGGGCC GAATACCCGC CTGGAAGTGG TTACCGAAGG CGTGCTGACG
CGCATGATCC AGCGTGACCC GGAACTGAGC GGTGTTGGAT TGGTGATCCT CGATGAGTTT
CATGAGCGCA GTTTGCAGGC GGATTTGGCG TTGGCGCTGT TACTCGATGT GCAACAAGGT
CTGCGTGATG ACCTTAAACT GCTGATTATG TCGGCTACGC TGGACAACGA CCGCTTGCAG
CAAATGCTGC CAGAAGCGCC CGTCATCATC TCAGAAGGGC GCTCGTTTCC GGTTGAACGC
CGTTATTTAC CACTGCCCGC GCATCAGCGT TTTGACGAAG CCGTTGCGGT AGCCACCGCC
GAAATGCTGC GTCAGGAAAG CGGATCATTA CTGTTATTTT TACCTGGCGT CGGAGAAATT
CAGCGTGTGC AGGAACAACT GGCTTCGCGC ATCGGCAGTG ATGTGTTGCT CTGCCCGCTG
TATGGCGCGT TGTCGCTGAA CGATCAGCGA AAAGCGATCC TCCCGGCACC GCAAGGGATG
CGCAAAGTGG TGCTGGCGAC CAATATTGCT GAAACCAGTT TAACCATCGA AGGTATTCGT
CTGGTGGTGG ATTGTGCTCA GGAACGTGTG GCGCGTTTTG ATCCGCGTAC AGGGCTTACG
CGGCTGGTTA CTCAACGCGT TAGTCAGGCG TCGATGACGC AGCGTGCCGG GCGTGCCGGG
CGTCTGGATC CGGGTATCTG CCTGCATTTA ATCGCCAAAG AACAAGCAGA ACGCGCCGCG
GCGCAAAGTG AACCGGAAAT ATTACAAAGC GATCTTTCCT GTTTGCTGAT GGAATTACTG
CAATGGGGAT GCAGCGATCC GGCGCAGATG AGCTGGCTGG ATCAACCGCC AACGGTGAAT
CTACTGGCCG CGAAACGCCT GTTACGGATG CTGGGGGCGC TGGACGGTGA ACGGCTTAGT
GCGCAAGGGC AAAAAATGGC AGCGCTGGGG AACGATCCGC GTTTAGCGGC AATGCTGGTG
AGCGCGAAGA GCGACGACGA AGCTGCTACC GCGGCAAAAA TTGCCGCCAT TCTCGAAGAG
CCGCCACGGA TGGGCAATAG TGACCTGGGC GTGGCGTTTT CGCGCAATCA ACCCGCCTGG
CAGCAACGTA GTCAGCAACT GTTAAAACGC TTAAACGTAC GCGGCGGTGA GGCAGACAGT
TCGCTTATCG CGCCGCTACT TGCCAGAGCG TATGCCGATC GCATTGCTCG TCGCCGTGGG
CAAGATGAAC GCTATCAACT GGCGAACGGC ATGGGGGCGA TGCTCGATGC CGACGATGCG
CTAAGCCGCC ACGAATGGTT GATCGCACCG TTATTATTGC AGGGCAGCGC CTCGCCGGAT
GCGCGGATTT TACTGGCGCT ACCGGTCGAT ATTGATGAGT TAGTACAACG CTGCCCGCAG
CTGGTACAGC AATCGGACAC CGTGGAGTGG GATGACGCGC AAGGTACGCT GAAAGCCTGG
CGTCGGCTGC AAATCGGTCA GTTGATGGTG AAAGTGCAGC CACTGGCGAA ACCCTCGGAA
GACGAGTTGC ATCAGGCGAT GCTTAACGGC ATTCGTGATA AAGGTTTAAG CGTGCTCAAC
TGGACGGCGG AAGCGGAACA GCTACGCTTG CGTTTGTTAT GCGCCGCAAA GTGGTTGCCG
GAATATGACT GGCCAGCGGT TGATGATGAG AGTTTGTTGG CGACGCTGGA AACGTGGCTA
CTGCCGCATA TGACAGGCGT ACATTCGCTA CGCGGCTTGA AATCACTCGA TATTTATCAG
GCACTGCGTG GATTACTTGA TTGGGTAATG CAGCAACGTC TGGATAGTGA ATTGCCTGCG
CATTACACTG TGCCGACGGG AAGCCGGATC GCCATTCGTT ATCATGAAGA TAACCCGCCC
GCGCTGGCGG TGAGAATGCA GGAGATGTTT GGCGCGGCCA CCAATCCGAC GATCGCCCAG
GGGCGCGTGC CGCTGGTGCT GGAGTTACTT TCCCCTGCCC AAAGGCCGCT GCAAATCACG
CGTGATTTGA GCGCCTTCTG GAAAGGAGCG TACCGTGAGG TGCAAAAAGA GATGAAAGGG
CGTTATCCCA AACATGTCTG GCCGGACGAC CCGGCAAACA CCGCACCAAC GCGACGAACG
AAAAAGTATT CGTGA
 
Protein sequence
MLQCGAKNVN PLERFVSSLP VAAVLPELLA ALDGASQVLL SAPTGAGKST WLPLQLLAHP 
GINGKVILLE PRRLAARNVA QRLAELLNEK PGDTVGYRMR AQNCVGPNTR LEVVTEGVLT
RMIQRDPELS GVGLVILDEF HERSLQADLA LALLLDVQQG LRDDLKLLIM SATLDNDRLQ
QMLPEAPVII SEGRSFPVER RYLPLPAHQR FDEAVAVATA EMLRQESGSL LLFLPGVGEI
QRVQEQLASR IGSDVLLCPL YGALSLNDQR KAILPAPQGM RKVVLATNIA ETSLTIEGIR
LVVDCAQERV ARFDPRTGLT RLVTQRVSQA SMTQRAGRAG RLDPGICLHL IAKEQAERAA
AQSEPEILQS DLSCLLMELL QWGCSDPAQM SWLDQPPTVN LLAAKRLLRM LGALDGERLS
AQGQKMAALG NDPRLAAMLV SAKSDDEAAT AAKIAAILEE PPRMGNSDLG VAFSRNQPAW
QQRSQQLLKR LNVRGGEADS SLIAPLLARA YADRIARRRG QDERYQLANG MGAMLDADDA
LSRHEWLIAP LLLQGSASPD ARILLALPVD IDELVQRCPQ LVQQSDTVEW DDAQGTLKAW
RRLQIGQLMV KVQPLAKPSE DELHQAMLNG IRDKGLSVLN WTAEAEQLRL RLLCAAKWLP
EYDWPAVDDE SLLATLETWL LPHMTGVHSL RGLKSLDIYQ ALRGLLDWVM QQRLDSELPA
HYTVPTGSRI AIRYHEDNPP ALAVRMQEMF GAATNPTIAQ GRVPLVLELL SPAQRPLQIT
RDLSAFWKGA YREVQKEMKG RYPKHVWPDD PANTAPTRRT KKYS