Gene EcSMS35_0820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0820 
Symbol 
ID6145501 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp822568 
End bp823932 
Gene Length1365 bp 
Protein Length454 aa 
Translation table11 
GC content56% 
IMG OID641615708 
ProductATP-dependent RNA helicase RhlE 
Protein accessionYP_001742900 
Protein GI170683710 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0513] Superfamily II DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTTCG ATTCTTTGGG TTTAAGCCCT GATATCCTGC GCGCCGTTGC CGAGCAGGGT 
TACCGTGAAC CCACCCCTAT TCAGCAGCAG GCGATCCCTG CGGTGTTGGA AGGCCGCGAC
CTGATGGCCA GCGCCCAGAC CGGCACCGGC AAAACAGCGG GCTTTACGCT GCCGCTGTTG
CAACACCTGA TCACTCGCCA GCCGCACGCC AAAGGGCGTC GTCCGGTACG TGCGCTCATT
CTTACCCCGA CCCGTGAACT GGCGGCGCAG ATTGGCGAAA ACGTCCGTGA TTACAGCAAA
TACCTGAACA TTCGTTCGCT GGTGGTGTTT GGCGGTGTCA GCATTAACCC ACAAATGATG
AAGTTGCGCG GCGGCGTTGA TGTGCTGGTG GCAACGCCGG GACGTCTGCT GGACCTGGAA
CATCAAAATG CAGTGAAGCT GGATCAGGTT GAAATCCTCG TCCTTGATGA AGCTGACCGC
ATGCTCGACA TGGGCTTTAT CCACGATATC CGTCGCGTGT TAACAAAACT GCCAGCGAAG
CGCCAGAACC TGTTGTTCTC CGCGACCTTC TCTGACGATA TTAAAGCCCT GGCGGAAAAA
CTGTTGCACA ACCCGCTGGA AATCGAAGTG GCACGCCGCA ATACTGCGTC TGATCAGGTG
ACCCAGCACG TTCACTTTGT CGATAAGAAA CGCAAACGCG AATTGCTGTC GCATATGATT
GGGAAAGGGA ACTGGCAGCA GGTGCTGGTG TTTACCCGTA CCAAACATGG CGCTAACCAT
CTGGCTGAAC AGCTTAATAA AGATGGCATC CGCAGTGCGG CGATCCACGG CAATAAATCG
CAAGGTGCGC GTACCCGTGC GCTGGCTGAT TTTAAATCGG GCGATATTCG TGTACTGGTG
GCAACTGACA TCGCTGCGCG CGGCCTGGAT ATTGAAGAGC TGCCGCACGT GGTCAACTAT
GAACTGCCAA ACGTACCGGA AGATTATGTC CACCGTATCG GGCGTACCGG TCGTGCGGCT
GCTACCGGTG AAGCGTTGTC GCTGGTGTGT GTTGATGAAC ACAAACTGCT GCGTGATATC
GAAAAACTGC TGAAAAAAGA GATCCCGCGC ATTGCGATTC CGGGTTATGA GCCGGACCCG
TCAATCAAAG CCGAACCGAT CCAGAACGGT CGCCAGCAAC GTGGCGGCGG CGGTCGTGGG
CAAGGTGGTG GTCGCGGTCA ACAGCAACCA CGCCGTGGAG AAGGTGGTGC GAAATCAGCT
AGCGCGAAGC CCGCAGAAAA AACATCTCGT CGTCTTGGTG ATGCCAAACC GGCAGGCGAA
CAACAACGCC GTCGCCGTCC GCGTAAACCT GCCGCTGCGC AGTAA
 
Protein sequence
MSFDSLGLSP DILRAVAEQG YREPTPIQQQ AIPAVLEGRD LMASAQTGTG KTAGFTLPLL 
QHLITRQPHA KGRRPVRALI LTPTRELAAQ IGENVRDYSK YLNIRSLVVF GGVSINPQMM
KLRGGVDVLV ATPGRLLDLE HQNAVKLDQV EILVLDEADR MLDMGFIHDI RRVLTKLPAK
RQNLLFSATF SDDIKALAEK LLHNPLEIEV ARRNTASDQV TQHVHFVDKK RKRELLSHMI
GKGNWQQVLV FTRTKHGANH LAEQLNKDGI RSAAIHGNKS QGARTRALAD FKSGDIRVLV
ATDIAARGLD IEELPHVVNY ELPNVPEDYV HRIGRTGRAA ATGEALSLVC VDEHKLLRDI
EKLLKKEIPR IAIPGYEPDP SIKAEPIQNG RQQRGGGGRG QGGGRGQQQP RRGEGGAKSA
SAKPAEKTSR RLGDAKPAGE QQRRRRPRKP AAAQ