Gene EcolC_3834 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3834 
Symbol 
ID6064737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp4187981 
End bp4190422 
Gene Length2442 bp 
Protein Length813 aa 
Translation table11 
GC content55% 
IMG OID641603246 
Productexoribonuclease R 
Protein accessionYP_001726765 
Protein GI170021811 
COG category[K] Transcription 
COG ID[COG0557] Exoribonuclease R 
TIGRFAM ID[TIGR00358] VacB and RNase II family 3'-5' exoribonucleases
[TIGR02063] ribonuclease R 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000264601 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCACAAG ATCCTTTCCA GGAACGCGAA GCTGAAAAAT ACGCGAATCC CATCCCTAGT 
CGGGAATTTA TCCTCGAACA TTTAACCAAA CGTGAAAAAC CGGCCAGCCG TGATGAGCTG
GCGGTAGAAC TGCACATTGA AGGCGAAGAG CAGCTTGAAG GCCTGCGTCG CCGCCTGCGC
GCGATGGAGC GCGATGGTCA ACTGGTCTTC ACTCGTCGTC AGTGCTATGC GCTGCCGGAA
CGCCTCGACC TGGTGAAAGG TACCGTTATT GGCCACCGTG ATGGCTACGG CTTTCTGCGG
GTTGAAGGGC GTAAAGATGA TTTGTATCTC TCCAGCGAGC AGATGAAAAC CTGCATTCAT
GGCGATCAGG TGCTGGCGCA GCCGCTGGGC GCTGACCGTA AAGGTCGTCG TGAAGCGCGT
ATTGTCCGCG TACTGGTGCC AAAAACCAGC CAGATTGTTG GTCGCTACTT TACTGAAGCG
GGCGTCGGCT TTGTGGTTCC TGACGATAGC CGTCTGAGCT TCGATATCTT AATCCCGCCC
GATCAGATCA TGGGCGCGCG GATGGGCTTT GTGGTCGTAG TCGAACTGAC TCAGCGTCCG
ACTCGCCGCA CCAAAGCGGT GGGTAAAATC GTCGAAGTGC TGGGCGACAA TATGGGCACC
GGCATGGCGG TTGATATCGC TCTGCGTACC CATGAAATTC CGTACATCTG GCCGCAGGCT
GTTGAGCAAC AGGTTGCCGG GCTGAAAGAA GAAGTGCCGG AAGAAGCAAA AGCGGGCCGT
GTTGATCTGC GCGATTTACC GCTGGTCACC ATTGATGGCG AAGACGCCCG TGACTTTGAC
GATGCAGTTT ACTGCGAGAA AAAACGCGGC GGCGGCTGGC GTTTATGGGT CGCGATTGCC
GACGTCAGCT ACTATGTGCG TCCGCCAACG CCGCTGGACA GAGAAGCGCG TAACCGTGGC
ACGTCGGTGT ACTTCCCTTC GCAGGTTATC CCGATGCTGC CGGAAGTGCT CTCTAACGGC
CTGTGTTCGC TCAACCCGCA GGTAGACCGC CTGTGTATGG TGTGCGAGAT GACGGTTTCG
TCGAAAGGCC GCCTGACGGG CTACAAATTC TACGAAGCGG TGATGAGCTC TCACGCGCGT
CTGACCTACA CCAAAGTCTG GCATATTCTG CAGGGCGATC AGGATCTGCG CGAGCAGTAC
GCCCCGCTGG TTAAGCATCT CGAAGAGTTG CATAACCTCT ATAAAGTGCT GGATAAAGCC
CGTGAAGAAC GCGGTGGGAT CTCATTTGAG AGCGAAGAAG CGAAGTTCAT TTTCAACGCT
GAACGCCGTA TTGAACGTAT CGAACAGACC CAGCGTAACG ACGCGCACAA ATTAATTGAA
GAGTGCATGA TTCTGGCGAA TATCTCGGCG GCGCGTTTCG TTGAGAAAGC CAAAGAACCG
GCACTGTTCC GTATTCACGA CAAGCCGAGC ACCGAAGCGA TTACCTCTTT CCGTTCAGTG
CTGGCGGAGC TGGGGCTGGA GCTGCCGGGT GGTAACAAGC CGGAACCGCG TGACTACGCG
GAACTGCTGG AGTCGGTTGC CGACCGTCCT GATGCAGAAA TGCTGCAAAC CATGCTGCTA
CGCTCGATGA AACAGGCGAT TTACGATCCA GAAAACCGTG GTCACTTCGG TCTGGCATTG
CAGTCCTATG CGCACTTTAC TTCACCGATT CGTCGTTATC CTGACCTGAC GCTGCACCGC
GCCATTAAAT ATCTGCTGGC GAAAGAGCAG GGGCATCAGG GCAACACCAC TGAAACCGGC
GGCTACCATT ATTCGATGGA AGAGATGTTG CAACTGGGTC AGCACTGTTC GATGGCGGAA
CGTCGTGCCG ACGAAGCAAC GCGCGATGTC GCTGACTGGC TGAAGTGTGA CTTCATGCTC
GACCAGGTAG GTAACGTCTT TAAAGGCGTA ATTTCCAGCG TCACTGGCTT TGGCTTCTTC
GTCCGTCTGG ACGACTTGTT CATTGATGGT CTGGTCCATG TCTCTTCGCT GGACAATGAC
TACTATCGCT TTGACCAGGT AGGGCAACGC CTGATGGGGG AATCCAGCGG CCAGACTTAT
CGCCTGGGCG ATCGCGTGGA AGTTCGCGTC GAAGCGGTTA ATATGGACGA GCGCAAAATC
GACTTTAGTC TGATCTCCAG TGAACGCGCA CCGCGCAACG TCGGTAAAAC GGCGCGCGAG
AAAGCGAAAA AAGGCGATGC AGGCAAAAAA GGCGGCAAGC GTCGTCAGGT CGGTAAAAAG
GTAAACTTTG AGCCAGACAG CGCCTTCCGC GGTGAGAAAA AAACGAAGCC GAAAGCGGCG
AAGAAAGACG CGAGAAAAGC GAAAAAGCCA TCGGCGAAAA CGCAGAAAAT AGCCGCAGCG
ACCAAAGCGA AGCGTGCGGC GAAGAAAAAA GTGGCAGAGT GA
 
Protein sequence
MSQDPFQERE AEKYANPIPS REFILEHLTK REKPASRDEL AVELHIEGEE QLEGLRRRLR 
AMERDGQLVF TRRQCYALPE RLDLVKGTVI GHRDGYGFLR VEGRKDDLYL SSEQMKTCIH
GDQVLAQPLG ADRKGRREAR IVRVLVPKTS QIVGRYFTEA GVGFVVPDDS RLSFDILIPP
DQIMGARMGF VVVVELTQRP TRRTKAVGKI VEVLGDNMGT GMAVDIALRT HEIPYIWPQA
VEQQVAGLKE EVPEEAKAGR VDLRDLPLVT IDGEDARDFD DAVYCEKKRG GGWRLWVAIA
DVSYYVRPPT PLDREARNRG TSVYFPSQVI PMLPEVLSNG LCSLNPQVDR LCMVCEMTVS
SKGRLTGYKF YEAVMSSHAR LTYTKVWHIL QGDQDLREQY APLVKHLEEL HNLYKVLDKA
REERGGISFE SEEAKFIFNA ERRIERIEQT QRNDAHKLIE ECMILANISA ARFVEKAKEP
ALFRIHDKPS TEAITSFRSV LAELGLELPG GNKPEPRDYA ELLESVADRP DAEMLQTMLL
RSMKQAIYDP ENRGHFGLAL QSYAHFTSPI RRYPDLTLHR AIKYLLAKEQ GHQGNTTETG
GYHYSMEEML QLGQHCSMAE RRADEATRDV ADWLKCDFML DQVGNVFKGV ISSVTGFGFF
VRLDDLFIDG LVHVSSLDND YYRFDQVGQR LMGESSGQTY RLGDRVEVRV EAVNMDERKI
DFSLISSERA PRNVGKTARE KAKKGDAGKK GGKRRQVGKK VNFEPDSAFR GEKKTKPKAA
KKDARKAKKP SAKTQKIAAA TKAKRAAKKK VAE