Gene EcSMS35_1779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1779 
SymboldbpA 
ID6143287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1795569 
End bp1796942 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content52% 
IMG OID641616655 
ProductATP-dependent RNA helicase DbpA 
Protein accessionYP_001743833 
Protein GI170681328 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0513] Superfamily II DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.184016 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGCTT TTTCTACCCT GAATGTTTTG CCTCCCGCCC AACTCACGAA CCTTAATGAG 
TTGGGTTATT TAACCATGAC GCCGGTGCAG GCCGCCGCGC TTCCGGCGAT CCTTGCCGGA
AAAGATGTTC GCGTGCAGGC GAAAACCGGC AGTGGCAAAA CGGCGGCTTT TGGCCTCGGC
TTGTTACAGC AAATTGATGC GTCGCTATTT CAAACCCAGG CTTTAGTGCT GTGTCCTACG
CGTGAACTGG CGGATCAGGT GGCAGGTGAA TTGCGTCGGC TGGCGCGTTT TCTGCCAAAT
ACCAAAATTT TGACGTTGTG CGGTGGTCAA CCGTTCGGTA TGCAGCGTGA TTCGTTGCAA
CATGCGCCGC ATATTATCGT GGCAACGCCG GGTCGTTTGC TGGATCACTT GCAAAAAGGC
ACGGTATCAC TGGATGCGCT GAATACTTTG GTGATGGATG AGGCCGACCG CATGCTGGAT
ATGGGATTTA GCGACGCCAT TGATGATGTC ATCCGTTTTG CGCCAGCATC TCGACAGACG
CTTCTGTTTT CGGCAACCTG GCCGGAAGCC ATCGCCGCAA TCAGCGGACG AGTGCAACGC
GATCCTTTAG CGATTGAAAT TGACTCAACA GATGCTCTGC CTCCCATTGA ACAACAATTT
TATGAGACAT TCAGCAAAGG CAAAATTCCG TTGTTGCAAC GGTTATTAAG TTTGCATCAG
CCATCCTCTT GCGTGGTGTT TTGCAATACC AAAAAAGATT GCCAGGCTGT CTGTGACGCG
CTGAATGAAG TAGGGCAAAG TGCATTGTCG TTACACGGCG ATCTGGAGCA ACGCGATCGC
GATCAGACTC TGGTACGTTT TGCTAACGGT AGCGCCCGTG TACTGGTCGC GACTGATGTT
GCTGCGCGTG GTCTGGATAT TAAATCGCTT GAGCTGGTAG TGAACTTTGA ACTGGCGTGG
GACCCTGAAG TTCATGTACA TCGCATCGGT CGTACAGCGC GTGCAGGAAA TAGCGGTCTG
GCGATCAGTT TCTGTGCCCC GGAAGAAGCA CAGCGGGCCA ATATCATTTC TGACATGTTG
CAGATAAAAC TTAACTGGCA AACGCCGCCA GCTAATAGTT CCATTGTGCC GCTGGAAGCC
GAAATGGCAA CGTTATGTAT CGATGGCGGG AAAAAAGCCA AAATGCGCCC GGGTGATGTA
TTAGGTGCGC TGACAGGAGA TATCGGGCTT GATGGCGCAG ATATTGGCAA AATCGCCGTG
CATCCGGCGC ATGTCTATGT CGCGGTGCGT CAGGCTGTTG CTCATAAAGC ATGGAAACAG
TTACAGGGCG GGAAGATTAA AGGAAAAACG TGCCGGGTGC GGTTATTAAA ATAA
 
Protein sequence
MTAFSTLNVL PPAQLTNLNE LGYLTMTPVQ AAALPAILAG KDVRVQAKTG SGKTAAFGLG 
LLQQIDASLF QTQALVLCPT RELADQVAGE LRRLARFLPN TKILTLCGGQ PFGMQRDSLQ
HAPHIIVATP GRLLDHLQKG TVSLDALNTL VMDEADRMLD MGFSDAIDDV IRFAPASRQT
LLFSATWPEA IAAISGRVQR DPLAIEIDST DALPPIEQQF YETFSKGKIP LLQRLLSLHQ
PSSCVVFCNT KKDCQAVCDA LNEVGQSALS LHGDLEQRDR DQTLVRFANG SARVLVATDV
AARGLDIKSL ELVVNFELAW DPEVHVHRIG RTARAGNSGL AISFCAPEEA QRANIISDML
QIKLNWQTPP ANSSIVPLEA EMATLCIDGG KKAKMRPGDV LGALTGDIGL DGADIGKIAV
HPAHVYVAVR QAVAHKAWKQ LQGGKIKGKT CRVRLLK