Gene EcSMS35_2042 details

Gene Information

Locus tag: EcSMS35_2042
Symbol: rne
ID: 6143391
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2062159
End bp: 2065332
Gene Length: 3174 bp
Protein Length: 1057 aa
Translation table: 11
GC content: 56%
IMG OID: 641616918
Product: ribonuclease E
Protein accession: YP_001744094
Protein GI: 170683427
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG1530] Ribonucleases G and E
TIGRFAM ID: [TIGR00757] ribonuclease, Rne/Rng family
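
The Start bp, End bp, Gene Length, and Protein Length fields are related by simple arithmetic. A minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (the record states neither, but both are consistent with the listed values):

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 2062159
end_bp = 2065332

# An inclusive interval contains end - start + 1 positions.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # 3174 bp
print(protein_length, "aa")   # 1057 aa
```

Both results agree with the Gene Length and Protein Length fields of the record.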


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000184616
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value: 0.00765925
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAAGAA TGTTAATCAA CGCAACTCAG CAGGAAGAGT TGCGCGTTGC CCTTGTAGAT 
GGGCAGCGTC TGTATGACCT GGATATCGAA AGTCCAGGGC ACGAGCAGAA AAAGGCAAAC
ATCTACAAAG GTAAAATCAC CCGCATTGAA CCGAGTCTGG AAGCTGCTTT TGTTGATTAC
GGCGCTGAAC GTCACGGTTT CCTCCCACTA AAAGAAATTG CCCGCGAATA TTTCCCTGCT
AACTACAGTG CTCATGGTCG TCCCAACATT AAAGATGTGT TGCGTGAAGG TCAGGAAGTC
ATTGTTCAGA TCGATAAAGA AGAGCGCGGC AACAAAGGCG CGGCATTAAC CACCTTTATC
AGTCTGGCGG GTAGCTATCT GGTTCTGATG CCGAACAACC CGCGCGCGGG TGGCATTTCT
CGTCGTATCG AAGGCGACGA CCGTACCGAA TTAAAAGAAG CACTGGCAAG CCTTGAACTG
CCGGAAGGCA TGGGGCTTAT CGTGCGCACC GCTGGCGTCG GCAAATCTGC TGAGGCGCTG
CAATGGGATT TAAGCTTCCG TCTGAAACAC TGGGAAGCCA TCAAAAAAGC CGCTGAAAGC
CGCCCGGCTC CGTTCCTGAT TCATCAGGAG AGCAACGTAA TCGTTCGCGC ATTCCGCGAT
TACTTACGCC AGGACATCGG TGAAATCCTT ATCGATAACC CGAAAGTGCT CGAACTGGCA
CGTCAGCATA TCGCTGCATT AGGCCGCCCG GATTTCAGCA GCAAAATCAA ACTGTACACC
GGCGAGATCC CGCTGTTCAG CCACTACCAG ATCGAGTCGC AGATCGAGTC CGCCTTCCAG
CGTGAAGTTC GTCTGCCATC CGGTGGTTCG ATTGTTATCG ACAGCACCGA AGCGTTAACG
GCCATCGACA TCAACTCCGC ACGCGCGACC CGCGGCGGCG ATATCGAAGA AACCGCGTTT
AACACTAACC TCGAAGCTGC CGATGAGATT GCTCGTCAGC TGCGCCTGCG TGACCTCGGC
GGCCTGATTG TTATCGACTT CATCGACATG ACGCCTGTAC GCCACCAGCG TGCGGTAGAA
AATCGTCTGC GTGAAGCGGT GCGTCAGGAC CGTGCGCGTA TTCAAATCAG CCATATTTCT
CGCTTTGGCC TGCTGGAAAT GTCCCGTCAG CGTCTGAGCC CATCACTGGG TGAATCCAGC
CATCACGTCT GCCCGCGCTG CTCCGGTACT GGTACCGTGC GTGACAACGA ATCGTTGTCG
CTCTCTATTC TGCGTCTGAT CGAAGAAGAA GCGCTGAAAG AGAACACTCA GGAAGTTCAC
GCCATTGTTC CTGTGCCAAT CGCGTCTTAC CTGCTGAATG AAAAACGTTC TGCAGTGAAT
GCGATTGAAA CTCGTCAGGA CGGCGTGCGC TGTGTGATTG TGCCAAACGA TCAGATGGAA
ACCCCGCACT ACCATGTGCT GCGCGTGCGT AAAGGGGAAG AAACTCCAAC CTTAAGCTAC
ATGCTGCCGA AGCTGCATGA AGAAGCGATG GCGCTGCCGT CTGAAGAAGA GTTCGCTGAA
CGTAAGCGTC CGGAACAACC TGCGCTGGCA ACCTTTGCCA TGCCGGATGT GCCGCCAGCG
CCAACCCCAG CTGAACCTGC CGCGCCTGTT GTAGCCCCAG CACCTAAAGC TGCAACGGCA
ACACCAGCAT CTCCTGCACA ACCAGGGTTG TTGAGCCGCT TCTTCAGCGC ACTGAAAGCG
CTGTTCAGCG GTGGTGAAGA AACCAAACCG GCCGAGCAAC CAGCACCGAA AGCAGAAGCG
AAACCGGAAC GTCAACAGGA TCGTCGCAAG CCTCGTCAGA ACAACCGCCG TGACCGTAAT
GAGCGCCGCG ACTCCCGTAG TGAACGTACA GAAGGCAGCG ATAATCGCGA AGAAAACCGT
CGTAATCGTC GCCAGGCACA GCAGCAGACT GTCGAGACGC GTGAGAGCCG TCAGCAGGCT
GAGGTAACGG AAAAAGCGCG TACCACCGAC GAGCAGCAAG CGCCGCGTCG TGAACGTAGC
CGCCGCCGTA ACGATGATAA ACGTCAGGCG CAACAAGAAG CGAAGGCGCT GAATGTTGAA
GAGCAATCTG TTCAGGAAAC CGAACAGGAA GAACGTGTAC GTCCGGTTCA GCCGCGTCGT
AAACAGCGTC AGCTCAATCA GAAAGTGCGT TACGAGCAAA GCGTAGCCGA AGAAGCGGTA
GTCGCACCGG TGGTTGAAGA AACTGTCGCT GGCGAACCAA TTGTTCAGGA AGCGCCAGCT
CCACGCACAG AACTGGTGAA AGTCCCGCTG CCAGTCGTAG CGCAAGCTGC ACCAGAACAG
CAAGAAGAGA ACAATGCCGA TAACCGTGAC AACGGTGGCA TGCCGCGTCG TTCTCGCCGC
TCGCCTCGTC ACCTGCGCGT GAGTGGTCAG CGTCGTCGTC GCTATCGTGA CGAGCGTTAT
CCAACCCAAT CGCCAATGCC GTTGACCGTA GCGTGCGCGT CTCCGGAACT GGCCTCTGGC
AAAGTCTGGA TCCGCTATCC GATTGTACGT CCACAAGATG TACAGGTTGA AGAGCAGCGC
GAACAGGAAG AAGTACAAGT GCAGCCGATG GTGACTGAGG TCCCTGTCGC CGCCGCTGTC
GAACCGGTTG TTAGCGCACC GGTTGTTGAA GAAGTGGCTG AAGTCGTAGA AGCCCCCGTT
CACGTTGCCG AACCGCAACC GGAAGTGGTT GAAACGACGC ATCCTGAAGT AATTGCCGCC
GCGGTAACTG AACAGCCGCA GGTGATTACC GAGTCTGATG TTGCCGTAGC CCAGGAAGTT
GCAGAACACG CAGAACCGGT AGTTGAACCG CAGGAAGAGA CGGCTGATAT TGAAGAAGTT
GCCGAAACTG CTGAGGTTGT GGTTGCTGAA CCTGAAGTTG TTGCTCAACC TGCCGCACCA
GTCGCAGCAG AAGTTGAAAC GGTAGCCGCG GTTGAACCAG AGATCACCGT TGAGCATAAC
CACGCTACCG CGCCAATGAC GCGCGCTCCG GCACCGGAAT ATGTTCCGGA GGCACCGCGT
CACAGTGACT GGCAGCGCCC TACTTTTGCC TTCGAAGGCA AAGGTGCCGC AGGAGGTCAT
ACGGCAACGC ATCATGCCTC TGCCGCTCCT GCGCGTCCGC AACCTGTTGA GTAA
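
The GC content field in the Gene Information section can in principle be recomputed from the gene sequence. The sketch below assumes the sequence is pasted in the 10-base block layout used here; the helper names `clean_sequence` and `gc_content` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence; paste the full sequence here
# to compare the result against the GC content field of the record.
excerpt = "ATGAAAAGAA TGTTAATCAA CGCAACTCAG CAGGAAGAGT TGCGCGTTGC CCTTGTAGAT"
seq = clean_sequence(excerpt)
print(len(seq), f"{gc_content(seq):.0%}")
```

The value for a 60-base excerpt need not match the figure reported for the whole gene.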
 
Protein sequence
MKRMLINATQ QEELRVALVD GQRLYDLDIE SPGHEQKKAN IYKGKITRIE PSLEAAFVDY 
GAERHGFLPL KEIAREYFPA NYSAHGRPNI KDVLREGQEV IVQIDKEERG NKGAALTTFI
SLAGSYLVLM PNNPRAGGIS RRIEGDDRTE LKEALASLEL PEGMGLIVRT AGVGKSAEAL
QWDLSFRLKH WEAIKKAAES RPAPFLIHQE SNVIVRAFRD YLRQDIGEIL IDNPKVLELA
RQHIAALGRP DFSSKIKLYT GEIPLFSHYQ IESQIESAFQ REVRLPSGGS IVIDSTEALT
AIDINSARAT RGGDIEETAF NTNLEAADEI ARQLRLRDLG GLIVIDFIDM TPVRHQRAVE
NRLREAVRQD RARIQISHIS RFGLLEMSRQ RLSPSLGESS HHVCPRCSGT GTVRDNESLS
LSILRLIEEE ALKENTQEVH AIVPVPIASY LLNEKRSAVN AIETRQDGVR CVIVPNDQME
TPHYHVLRVR KGEETPTLSY MLPKLHEEAM ALPSEEEFAE RKRPEQPALA TFAMPDVPPA
PTPAEPAAPV VAPAPKAATA TPASPAQPGL LSRFFSALKA LFSGGEETKP AEQPAPKAEA
KPERQQDRRK PRQNNRRDRN ERRDSRSERT EGSDNREENR RNRRQAQQQT VETRESRQQA
EVTEKARTTD EQQAPRRERS RRRNDDKRQA QQEAKALNVE EQSVQETEQE ERVRPVQPRR
KQRQLNQKVR YEQSVAEEAV VAPVVEETVA GEPIVQEAPA PRTELVKVPL PVVAQAAPEQ
QEENNADNRD NGGMPRRSRR SPRHLRVSGQ RRRRYRDERY PTQSPMPLTV ACASPELASG
KVWIRYPIVR PQDVQVEEQR EQEEVQVQPM VTEVPVAAAV EPVVSAPVVE EVAEVVEAPV
HVAEPQPEVV ETTHPEVIAA AVTEQPQVIT ESDVAVAQEV AEHAEPVVEP QEETADIEEV
AETAEVVVAE PEVVAQPAAP VAAEVETVAA VEPEITVEHN HATAPMTRAP APEYVPEAPR
HSDWQRPTFA FEGKGAAGGH TATHHASAAP ARPQPVE
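
The protein sequence is the translation of the gene sequence under translation table 11. A minimal sketch, assuming the codon-to-amino-acid assignments of table 11 are the same as those of the standard genetic code (the two tables differ in their permitted start codons, not in codon meanings). `translate` is a hypothetical helper, and the input is the first line of the gene sequence.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order;
# "*" marks a stop codon.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is ignored; a codon containing anything other than
    A, C, G, or T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGAAAAGAA TGTTAATCAA CGCAACTCAG CAGGAAGAGT TGCGCGTTGC CCTTGTAGAT"
print(translate(first_line))  # MKRMLINATQQEELRVALVD
```

The printed residues match the first two blocks of the protein sequence (MKRMLINATQ QEELRVALVD).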