Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2042 |
Symbol | rne |
ID | 6143391 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2062159 |
End bp | 2065332 |
Gene Length | 3174 bp |
Protein Length | 1057 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641616918 |
Product | ribonuclease E |
Protein accession | YP_001744094 |
Protein GI | 170683427 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1530] Ribonucleases G and E |
TIGRFAM ID | [TIGR00757] ribonuclease, Rne/Rng family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000184616 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.00765925 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAAGAA TGTTAATCAA CGCAACTCAG CAGGAAGAGT TGCGCGTTGC CCTTGTAGAT GGGCAGCGTC TGTATGACCT GGATATCGAA AGTCCAGGGC ACGAGCAGAA AAAGGCAAAC ATCTACAAAG GTAAAATCAC CCGCATTGAA CCGAGTCTGG AAGCTGCTTT TGTTGATTAC GGCGCTGAAC GTCACGGTTT CCTCCCACTA AAAGAAATTG CCCGCGAATA TTTCCCTGCT AACTACAGTG CTCATGGTCG TCCCAACATT AAAGATGTGT TGCGTGAAGG TCAGGAAGTC ATTGTTCAGA TCGATAAAGA AGAGCGCGGC AACAAAGGCG CGGCATTAAC CACCTTTATC AGTCTGGCGG GTAGCTATCT GGTTCTGATG CCGAACAACC CGCGCGCGGG TGGCATTTCT CGTCGTATCG AAGGCGACGA CCGTACCGAA TTAAAAGAAG CACTGGCAAG CCTTGAACTG CCGGAAGGCA TGGGGCTTAT CGTGCGCACC GCTGGCGTCG GCAAATCTGC TGAGGCGCTG CAATGGGATT TAAGCTTCCG TCTGAAACAC TGGGAAGCCA TCAAAAAAGC CGCTGAAAGC CGCCCGGCTC CGTTCCTGAT TCATCAGGAG AGCAACGTAA TCGTTCGCGC ATTCCGCGAT TACTTACGCC AGGACATCGG TGAAATCCTT ATCGATAACC CGAAAGTGCT CGAACTGGCA CGTCAGCATA TCGCTGCATT AGGCCGCCCG GATTTCAGCA GCAAAATCAA ACTGTACACC GGCGAGATCC CGCTGTTCAG CCACTACCAG ATCGAGTCGC AGATCGAGTC CGCCTTCCAG CGTGAAGTTC GTCTGCCATC CGGTGGTTCG ATTGTTATCG ACAGCACCGA AGCGTTAACG GCCATCGACA TCAACTCCGC ACGCGCGACC CGCGGCGGCG ATATCGAAGA AACCGCGTTT AACACTAACC TCGAAGCTGC CGATGAGATT GCTCGTCAGC TGCGCCTGCG TGACCTCGGC GGCCTGATTG TTATCGACTT CATCGACATG ACGCCTGTAC GCCACCAGCG TGCGGTAGAA AATCGTCTGC GTGAAGCGGT GCGTCAGGAC CGTGCGCGTA TTCAAATCAG CCATATTTCT CGCTTTGGCC TGCTGGAAAT GTCCCGTCAG CGTCTGAGCC CATCACTGGG TGAATCCAGC CATCACGTCT GCCCGCGCTG CTCCGGTACT GGTACCGTGC GTGACAACGA ATCGTTGTCG CTCTCTATTC TGCGTCTGAT CGAAGAAGAA GCGCTGAAAG AGAACACTCA GGAAGTTCAC GCCATTGTTC CTGTGCCAAT CGCGTCTTAC CTGCTGAATG AAAAACGTTC TGCAGTGAAT GCGATTGAAA CTCGTCAGGA CGGCGTGCGC TGTGTGATTG TGCCAAACGA TCAGATGGAA ACCCCGCACT ACCATGTGCT GCGCGTGCGT AAAGGGGAAG AAACTCCAAC CTTAAGCTAC ATGCTGCCGA AGCTGCATGA AGAAGCGATG GCGCTGCCGT CTGAAGAAGA GTTCGCTGAA CGTAAGCGTC CGGAACAACC TGCGCTGGCA ACCTTTGCCA TGCCGGATGT GCCGCCAGCG CCAACCCCAG CTGAACCTGC CGCGCCTGTT GTAGCCCCAG CACCTAAAGC TGCAACGGCA ACACCAGCAT CTCCTGCACA ACCAGGGTTG TTGAGCCGCT TCTTCAGCGC ACTGAAAGCG CTGTTCAGCG GTGGTGAAGA AACCAAACCG GCCGAGCAAC CAGCACCGAA AGCAGAAGCG AAACCGGAAC GTCAACAGGA TCGTCGCAAG CCTCGTCAGA ACAACCGCCG TGACCGTAAT GAGCGCCGCG ACTCCCGTAG TGAACGTACA GAAGGCAGCG ATAATCGCGA AGAAAACCGT CGTAATCGTC GCCAGGCACA GCAGCAGACT GTCGAGACGC GTGAGAGCCG TCAGCAGGCT GAGGTAACGG AAAAAGCGCG TACCACCGAC GAGCAGCAAG CGCCGCGTCG TGAACGTAGC CGCCGCCGTA ACGATGATAA ACGTCAGGCG CAACAAGAAG CGAAGGCGCT GAATGTTGAA GAGCAATCTG TTCAGGAAAC CGAACAGGAA GAACGTGTAC GTCCGGTTCA GCCGCGTCGT AAACAGCGTC AGCTCAATCA GAAAGTGCGT TACGAGCAAA GCGTAGCCGA AGAAGCGGTA GTCGCACCGG TGGTTGAAGA AACTGTCGCT GGCGAACCAA TTGTTCAGGA AGCGCCAGCT CCACGCACAG AACTGGTGAA AGTCCCGCTG CCAGTCGTAG CGCAAGCTGC ACCAGAACAG CAAGAAGAGA ACAATGCCGA TAACCGTGAC AACGGTGGCA TGCCGCGTCG TTCTCGCCGC TCGCCTCGTC ACCTGCGCGT GAGTGGTCAG CGTCGTCGTC GCTATCGTGA CGAGCGTTAT CCAACCCAAT CGCCAATGCC GTTGACCGTA GCGTGCGCGT CTCCGGAACT GGCCTCTGGC AAAGTCTGGA TCCGCTATCC GATTGTACGT CCACAAGATG TACAGGTTGA AGAGCAGCGC GAACAGGAAG AAGTACAAGT GCAGCCGATG GTGACTGAGG TCCCTGTCGC CGCCGCTGTC GAACCGGTTG TTAGCGCACC GGTTGTTGAA GAAGTGGCTG AAGTCGTAGA AGCCCCCGTT CACGTTGCCG AACCGCAACC GGAAGTGGTT GAAACGACGC ATCCTGAAGT AATTGCCGCC GCGGTAACTG AACAGCCGCA GGTGATTACC GAGTCTGATG TTGCCGTAGC CCAGGAAGTT GCAGAACACG CAGAACCGGT AGTTGAACCG CAGGAAGAGA CGGCTGATAT TGAAGAAGTT GCCGAAACTG CTGAGGTTGT GGTTGCTGAA CCTGAAGTTG TTGCTCAACC TGCCGCACCA GTCGCAGCAG AAGTTGAAAC GGTAGCCGCG GTTGAACCAG AGATCACCGT TGAGCATAAC CACGCTACCG CGCCAATGAC GCGCGCTCCG GCACCGGAAT ATGTTCCGGA GGCACCGCGT CACAGTGACT GGCAGCGCCC TACTTTTGCC TTCGAAGGCA AAGGTGCCGC AGGAGGTCAT ACGGCAACGC ATCATGCCTC TGCCGCTCCT GCGCGTCCGC AACCTGTTGA GTAA
|
Protein sequence | MKRMLINATQ QEELRVALVD GQRLYDLDIE SPGHEQKKAN IYKGKITRIE PSLEAAFVDY GAERHGFLPL KEIAREYFPA NYSAHGRPNI KDVLREGQEV IVQIDKEERG NKGAALTTFI SLAGSYLVLM PNNPRAGGIS RRIEGDDRTE LKEALASLEL PEGMGLIVRT AGVGKSAEAL QWDLSFRLKH WEAIKKAAES RPAPFLIHQE SNVIVRAFRD YLRQDIGEIL IDNPKVLELA RQHIAALGRP DFSSKIKLYT GEIPLFSHYQ IESQIESAFQ REVRLPSGGS IVIDSTEALT AIDINSARAT RGGDIEETAF NTNLEAADEI ARQLRLRDLG GLIVIDFIDM TPVRHQRAVE NRLREAVRQD RARIQISHIS RFGLLEMSRQ RLSPSLGESS HHVCPRCSGT GTVRDNESLS LSILRLIEEE ALKENTQEVH AIVPVPIASY LLNEKRSAVN AIETRQDGVR CVIVPNDQME TPHYHVLRVR KGEETPTLSY MLPKLHEEAM ALPSEEEFAE RKRPEQPALA TFAMPDVPPA PTPAEPAAPV VAPAPKAATA TPASPAQPGL LSRFFSALKA LFSGGEETKP AEQPAPKAEA KPERQQDRRK PRQNNRRDRN ERRDSRSERT EGSDNREENR RNRRQAQQQT VETRESRQQA EVTEKARTTD EQQAPRRERS RRRNDDKRQA QQEAKALNVE EQSVQETEQE ERVRPVQPRR KQRQLNQKVR YEQSVAEEAV VAPVVEETVA GEPIVQEAPA PRTELVKVPL PVVAQAAPEQ QEENNADNRD NGGMPRRSRR SPRHLRVSGQ RRRRYRDERY PTQSPMPLTV ACASPELASG KVWIRYPIVR PQDVQVEEQR EQEEVQVQPM VTEVPVAAAV EPVVSAPVVE EVAEVVEAPV HVAEPQPEVV ETTHPEVIAA AVTEQPQVIT ESDVAVAQEV AEHAEPVVEP QEETADIEEV AETAEVVVAE PEVVAQPAAP VAAEVETVAA VEPEITVEHN HATAPMTRAP APEYVPEAPR HSDWQRPTFA FEGKGAAGGH TATHHASAAP ARPQPVE
|
| |