Gene EcSMS35_1841 details


Gene Information

Locus tag: EcSMS35_1841
Symbol: rnb
ID: 6147184
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1864054
End bp: 1865988
Gene Length: 1935 bp
Protein Length: 644 aa
Translation table: 11
GC content: 53%
IMG OID: 641616717
Product: exoribonuclease II
Protein accession: YP_001743895
Protein GI: 170681733
COG category: [K] Transcription
COG ID: [COG4776] Exoribonuclease II
TIGRFAM ID: [TIGR00358] VacB and RNase II family 3'-5' exoribonucleases
            [TIGR02062] exoribonuclease II
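
The length fields in this record can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
start_bp = 1864054
end_bp = 1865988

# Inclusive span, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is a stop
# codon and does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1935 644
```

Both values agree with the Gene Length and Protein Length fields listed above.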


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.816503
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000000000000293222
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTTTCAGG ACAACCCGCT GCTAGCGCAG CTTAAACAGC AACTGCATTC CCAGACGCCA 
CGCGCTGAAG GGGTGGTAAA AGCCACAGAA AAAGGCTTTG GCTTCCTGGA AGTCGACGCG
CAAAAAAGTT ATTTCATTCC GCCGCCGCAG ATGAAAAAAG TCATGCATGG CGACCGAATT
ATCGCGGTGA TCCACAGTGA AAAAGAACGT GAATCCGCAG AGCCAGAAGA ACTGGTTGAA
CCGTTCCTGA CTCGTTTCGT GGGTAAAGTT CAGGGCAAAA ATGACCGTCT GGCCATCGTT
CCTGATCATC CACTCTTAAA AGACGCCATT CCTTGCCGCG CAGCCCGTGG CCTGAACCAC
GAGTTTAAAG AAGGCGACTG GGCGGTTGCC GAAATGCGCC GTCATCCGCT GAAAGGCGAT
CGTTCTTTCT ATGCTGAACT GACACAATAC ATCACTTTTG GTGACGATCA CTTTGTACCG
TGGTGGGTTA CCCTTGCGCG CCATAATCTG GAAAAAGAAG CACCAGACGG CGTCGCTACC
GAAATGCTCG ATGAAGGTCT GGTTCGTGAA GATCTGACTG CGCTGGATTT TGTCACCATC
GACAGTGCCA GCACAGAAGA TATGGATGAC GCCCTTTTCG CTAAGGCGTT GCCGGATGAC
AAACTTCAGC TGATTGTGGC GATTGCCGAT CCAACTGCGT GGATTGCTGA AGGCAGCAAG
CTGGACAAAG CCGCGAAAAT TCGCGCATTC ACCAACTATC TGCCTGGCTT CAACATCCCT
ATGCTGCCTC GCGAGCTTTC TGACGATCTC TGCTCACTGC GCGCCAATGA AGTCCGCCCG
GTACTGGCAT GCCGCATGAC GCTCTCCGCT GATGGCACCA TTGAAGATAA TATCGAATTC
TTTGCCGCCA CCATCGAATC CAAAGCGAAG CTGGTGTATG ACCAGGTTTC TGACTGGCTG
GAAAATACCG GTGACTGGCA GCCTGAAAGC GAAGCAATTG CCGAACAAGT CCGTTTGCTA
GCGCAAATTT GCCAACGCCG CGGCGAGTGG CGTCATAACC ACGCACTGGT GTTTAAAGAT
CGCCCGGATT ACCGCTTTAT TCTCGGTGAA AAAGGAGAAG TGCTGGATAT CGTCGCCGAG
CCTCGTCGCA TTGCCAACCG TATCGTCGAA GAAGCGATGA TTGCCGCTAA CATTTGTGCG
GCCCGCGTAC TGCGCGATAA GCTCGGTTTT GGCATCTATA ACGTGCATAT GGGCTTTGAT
CCGGCGAATG CCGACGCGCT GGCAGCGTTG CTGAAAACCC ACGGTCTGCA TGTCGATGCC
GAAGAAGTGC TCACGCTGGA CGGTTTCTGC AAACTGCGTC GTGAACTGGA TGCGCAACCA
ACTGGTTTCC TCGACAGCCG CATTCGTCGC TTCCAGTCAT TTGCTGAAAT TAGCACTGAA
CCCGGTCCTC ACTTTGGCCT CGGTCTGGAA GCATACGCCA CCTGGACTTC GCCGATCCGT
AAATATGGCG ACATGATCAA CCACCGTCTG CTGAAAGCGG TTATCAAAGG CGAAACTGCG
ACGCGTCCAC AGGATGAAAT CACTGTCCAA ATGGCCGAGC GTCGCCGTCT CAACCGGATG
GCAGAACGTG ATGTTGGCGA CTGGTTATAC GCACGCTTCC TGAAAGACAA AGCCGGGACC
GACACCCGTT TCGCGGCGGA AATTGTCGAT ATCAGCCGTG GCGGCATGCG TGTGCGTTTG
GTTGATAACG GCGCTATCGC CTTTATTCCG GCACCTTTCT TACACGCTGT GCGCGATGAA
CTGGTTTGCA GCCAGGAAAA CGGCACTGTA CAAATTAAAG GTGAAACGGC TTACAAAGTG
ACTGACGTTA TTGACGTCAC CATTGCCGAA GTCCGCATGG AAACCCGCAG CATTATTGCG
CGCCCGGTCG CGTAA
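
The gene sequence above is printed in blocks of ten bases, so the spacing has to be removed before any computation. As a minimal sketch, the helpers below (`clean_sequence` and `gc_fraction`, both hypothetical names) strip the whitespace and compute the fraction of G and C bases, which is the quantity the GC content field reports. Only the first two blocks of the sequence are used here for brevity; pasting in the full sequence works the same way.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two ten-base blocks of the gene sequence listed above.
excerpt = clean_sequence("ATGTTTCAGG ACAACCCGCT")
print(len(excerpt), gc_fraction(excerpt))
```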
 
Protein sequence
MFQDNPLLAQ LKQQLHSQTP RAEGVVKATE KGFGFLEVDA QKSYFIPPPQ MKKVMHGDRI 
IAVIHSEKER ESAEPEELVE PFLTRFVGKV QGKNDRLAIV PDHPLLKDAI PCRAARGLNH
EFKEGDWAVA EMRRHPLKGD RSFYAELTQY ITFGDDHFVP WWVTLARHNL EKEAPDGVAT
EMLDEGLVRE DLTALDFVTI DSASTEDMDD ALFAKALPDD KLQLIVAIAD PTAWIAEGSK
LDKAAKIRAF TNYLPGFNIP MLPRELSDDL CSLRANEVRP VLACRMTLSA DGTIEDNIEF
FAATIESKAK LVYDQVSDWL ENTGDWQPES EAIAEQVRLL AQICQRRGEW RHNHALVFKD
RPDYRFILGE KGEVLDIVAE PRRIANRIVE EAMIAANICA ARVLRDKLGF GIYNVHMGFD
PANADALAAL LKTHGLHVDA EEVLTLDGFC KLRRELDAQP TGFLDSRIRR FQSFAEISTE
PGPHFGLGLE AYATWTSPIR KYGDMINHRL LKAVIKGETA TRPQDEITVQ MAERRRLNRM
AERDVGDWLY ARFLKDKAGT DTRFAAEIVD ISRGGMRVRL VDNGAIAFIP APFLHAVRDE
LVCSQENGTV QIKGETAYKV TDVIDVTIAE VRMETRSIIA RPVA
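
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below builds the codon table from the conventional TCAG ordering. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons, so the standard assignments are used here. The function name `translate` is illustrative and does not refer to any particular library.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three ten-base blocks of the gene sequence listed above.
print(translate("ATGTTTCAGG ACAACCCGCT GCTAGCGCAG"))  # MFQDNPLLAQ
```

The printed residues match the first block of the protein sequence, and translating the last line of the gene sequence (`CGCCCGGTCG CGTAA`) gives `RPVA`, which matches the end of the protein.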