Gene EcSMS35_4650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4650 
Symbolrnr 
ID6144921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4751436 
End bp4753877 
Gene Length2442 bp 
Protein Length813 aa 
Translation table11 
GC content55% 
IMG OID641619466 
Productexoribonuclease R 
Protein accessionYP_001746574 
Protein GI170681596 
COG category[K] Transcription 
COG ID[COG0557] Exoribonuclease R 
TIGRFAM ID[TIGR00358] VacB and RNase II family 3'-5' exoribonucleases
[TIGR02063] ribonuclease R 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.526461 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCACAAG ATCCTTTCCA GGAACGCGAA GCTGAAAAAT ACGCGAATCC CATCCCTAGT 
CGGGAATTTA TCCTCGAACA TTTAACCAAA CGTGAAAAAC CGGCCAGCCG TGATGAGCTG
GCGGTAGAAC TGCACATTGA AGGCGAAGAG CAGCTTGAAG GCCTGCGTCG CCGCCTGCGC
GCGATGGAGC GCGATGGTCA ACTGGTCTTC ACTCGTCGTC AGTGCTATGC GCTGCCGGAA
CGCCTCGACC TGGTGAAAGG TACCGTTATT GGCCACCGTG ATGGCTACGG CTTTCTGCGG
GTTGAAGGGC GTAAAGATGA TTTGTATCTC TCCAGCGAGC AGATGAAAAC CTGCATTCAT
GGCGATCAGG TGCTGGCGCA GCCGCTGGGC GCTGACCGTA AAGGTCGTCG TGAAGCGCGT
ATTGTCCGCG TACTGGTGCC AAAAACCAGC CAGATTGTTG GTCGCTACTT TACCGAAGCG
GGCGTCGGCT TTGTGGTTCC TGACGACAGT CGTTTGAGCT TCGATATCTT AATCCCGCCC
GATCAGATCA TGGGCGCGCG GATGGGCTTT GTGGTGGTGG TTGAACTGAC CCAACGCCCG
ACTCGCCGTA CCAAAGCGGT GGGTAAAATC GTCGAAGTAC TTGGCGACAA TATGGGCACC
GGCATGGCGG TTGATATCGC TCTGCGTACC CATGAAATTC CGTATATCTG GCCGCAGGCT
GTTGAGCAAC AAGTTGCCGG GCTGAAAGAA GAAGTGCCGG AAGAAGCAAA AGCGGGCCGT
GTCGATTTGC GCGATTTACC GCTGGTCACC ATTGATGGCG AAGACGCCCG TGACTTTGAC
GATGCAGTTT ACTGTGAGAA AAAACGCGGC GGCGGCTGGC GTTTATGGGT CGCGATTGCC
GACGTCAGCT ACTATGTGCG TCCGCCAACA CCGCTGGACA GAGAAGCGCG TAACCGTGGT
ACGTCGGTGT ACTTCCCTTC GCAGGTTATC CCGATGCTGC CGGAAGTGCT CTCTAACGGC
CTGTGCTCGC TCAACCCGCA GGTAGACCGC CTGTGTATGG TGTGCGAGAT GACGGTTTCG
TCGAAAGGCC GCCTGACTGG CTACAAATTC TATGAAGCGG TAATGAGCTC TCACGCACGT
CTGACCTACA CCAAAGTCTG GCATATTCTG CAGGGCGATC AGGATCTGCG CGAGCAGTAC
GCCCCGCTGG TTAAGCATCT CGAAGAGTTG CATAACCTCT ATAAAGTGCT GGATAAAGCC
CGTGAAGAAC GCGGTGGGAT CTCATTTGAG AGCGAAGAAG CGAAGTTCAT TTTCAACGCT
GAACGCCGTA TTGAACGTAT CGAACAGACT CAGCGTAACG ACGCCCACAA GTTAATTGAA
GAGTGCATGA TTCTGGCGAA TATCTCGGCG GCGCGTTTCG TTGAGAAAGC CAAAGAACCG
GCACTGTTCC GTATTCACGA CAAGCCGAGC ACCGAAGCGA TTACCTCTTT CCGTTCAGTG
CTGGCGGAGC TGGGGCTGGA ATTGCCGGGC GGTAACAAGC CGGAACCGCG CGACTATGCA
GAGTTACTGG AATCGGTTGC CGATCGTCCT GATGCAGAAA TGCTGCAAAC TATGCTGCTG
CGCTCGATGA AACAGGCGAT TTACGATCCG GAAAACCGTG GTCACTTCGG TCTGGCATTG
CAGTCCTATG CGCACTTTAC TTCGCCGATT CGTCGTTATC CTGACCTGAC GCTGCACCGC
GCCATTAAAT ATCTGCTGGC GAAAGAGCAG GGGCATCAGG GCAACACCAC TGAAACCGGC
GGCTACCATT ATTCGATGGA AGAGATGCTG CAACTGGGTC AGCACTGTTC GATGGCGGAA
CGTCGTGCCG ACGAAGCAAC ACGCGATGTA GCTGACTGGC TGAAGTGTGA CTTCATGCTC
GACCAGGTAG GTAACGTCTT TAAAGGCGTA ATTTCCAGCG TCACTGGCTT TGGCTTCTTC
GTCCGTCTGG ACGACTTGTT CATTGATGGT CTGGTCCATG TCTCTTCGCT GGACAATGAC
TACTATCGCT TTGACCAGGT AGGGCAACGC CTGATGGGTG AATCCAGCGG CCAGACTTAT
CGCCTGGGCG ATCGCGTGGA AGTTCGCGTC GAAGCGGTTA ATATGGACGA GCGTAAAATC
GACTTTAGCC TGATCTCCAG CGAACGCGCA CCGCGCAACG TCGGTAAAAC GGCGCGCGAG
AAAGCGAAAA AAGGCGATGC AGGTAAAAAA GGCGGCAAGC GTCGTCAGGT CGGTAAAAAG
GTAAACTTTG AGCCAGACAG CGCTTTCCGC GGCGAGAAAA AAACGAAGCC GAAAGCGGCG
AAGAAAGACG CGAGAAAAGC GAAAAAGCCA TCGGCGAAAA CGCAGAAAAT AGCCGCAGCG
ACCAAAGCGA AGCGTGCGGC GAAGAAAAAA GTGGCAGAGT GA
 
Protein sequence
MSQDPFQERE AEKYANPIPS REFILEHLTK REKPASRDEL AVELHIEGEE QLEGLRRRLR 
AMERDGQLVF TRRQCYALPE RLDLVKGTVI GHRDGYGFLR VEGRKDDLYL SSEQMKTCIH
GDQVLAQPLG ADRKGRREAR IVRVLVPKTS QIVGRYFTEA GVGFVVPDDS RLSFDILIPP
DQIMGARMGF VVVVELTQRP TRRTKAVGKI VEVLGDNMGT GMAVDIALRT HEIPYIWPQA
VEQQVAGLKE EVPEEAKAGR VDLRDLPLVT IDGEDARDFD DAVYCEKKRG GGWRLWVAIA
DVSYYVRPPT PLDREARNRG TSVYFPSQVI PMLPEVLSNG LCSLNPQVDR LCMVCEMTVS
SKGRLTGYKF YEAVMSSHAR LTYTKVWHIL QGDQDLREQY APLVKHLEEL HNLYKVLDKA
REERGGISFE SEEAKFIFNA ERRIERIEQT QRNDAHKLIE ECMILANISA ARFVEKAKEP
ALFRIHDKPS TEAITSFRSV LAELGLELPG GNKPEPRDYA ELLESVADRP DAEMLQTMLL
RSMKQAIYDP ENRGHFGLAL QSYAHFTSPI RRYPDLTLHR AIKYLLAKEQ GHQGNTTETG
GYHYSMEEML QLGQHCSMAE RRADEATRDV ADWLKCDFML DQVGNVFKGV ISSVTGFGFF
VRLDDLFIDG LVHVSSLDND YYRFDQVGQR LMGESSGQTY RLGDRVEVRV EAVNMDERKI
DFSLISSERA PRNVGKTARE KAKKGDAGKK GGKRRQVGKK VNFEPDSAFR GEKKTKPKAA
KKDARKAKKP SAKTQKIAAA TKAKRAAKKK VAE