Gene ECH74115_1923 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1923 
Symbolrnb 
ID6967607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1814692 
End bp1816626 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content53% 
IMG OID643385854 
Productexoribonuclease II 
Protein accessionYP_002270343 
Protein GI209396001 
COG category[K] Transcription 
COG ID[COG4776] Exoribonuclease II 
TIGRFAM ID[TIGR00358] VacB and RNase II family 3'-5' exoribonucleases
[TIGR02062] exoribonuclease II 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0111084 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.000000000153264 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTTCAGG ACAACCCGCT GCTAGCGCAG CTTAAACAGC AACTGCATTC CCAGACGCCA 
CGCGCTGAAG GGGTGGTAAA AGCCACAGAA AAAGGTTTTG GCTTCCTGGA AGTCGACGCG
CAAAAAAGTT ATTTCATTCC GCCGCCGCAG ATGAAAAAAG TCATGCATGG CGACCGAATT
ATCGCGGTGA TCCACAGTGA AAAAGAACGT GAATCCGCAG AGCCAGAAGA ACTGGTTGAA
CCGTTTCTGA CTCGTTTCGT GGGTAAAGTT CAGGGCAAAA ATGACCGTCT GGCCATCGTT
CCTGATCATC CACTCTTAAA AGACGCCATT CCTTGCCGCG CAGCCCGTGG TCTGAACCAC
GAGTTTAAAG AAGGCGACTG GGCAGTTGCC GAAATGCGCC GTCATCCGCT GAAAGGCGAT
CGTTCTTTCT ATGCTGAACT GACACAATAC ATCACTTTTG GTGACGACCA CTTTGTACCG
TGGTGGGTTA CCCTTGCGCG CCATAATCTG GAAAAAGAAG CACCAGACGG CGTCGCTACC
GAAATGCTCG ATGAAGGTCT GGTTCGTAAA GATCTGACCG CGCTGGATTT TGTCACCATC
GACAGTGCCA GCACAGAAGA TATGGATGAC GCCCTTTTCG CTAAGGCGTT GCCGGATGAC
AAACTTCAGC TGATTGTGGC GATTGCCGAT CCAACCGCGT GGATTGCTGA AGGCAGCAAG
CTGGACAAAG CCGCGAAAAT TCGCGCATTC ACCAACTATC TGCCTGGCTT CAACATCCCT
ATGCTGCCTC GCGAGCTTTC TGACGATCTC TGCTCACTGC GCGCCAATGA AGTCCGCCCG
GTACTGGCAT GCCGCATGAC GCTCTCCGCT GATGGCACCA TTGAAGATAA TATCGAATTC
TTTGCCGCCA CCATCGAATC CAAAGCGAAG CTGGTGTATG ACCAGGTTTC TGACTGGCTG
GAGAATACCG GTGACTGGCA GCCTGAAAGT GAAGCAATTG CCGAACAAGT CCGTTTGCTA
GCGCAAATTT GCCAACGCCG CGGCGAGTGG CGTCATAACC ACGCACTGGT GTTTAAAGAT
CGCCCGGATT ACCGCTTTAT TCTCGGTGAA AAAGGTGAAG TGCTGGATAT CGTCGCCGAG
CCTCGTCGCA TTGCCAACCG TATCGTCGAA GAAGCGATGA TTGCCGCTAA CATTTGTGCA
GCTCGCGTAC TGCGCGATAA GCTCGGTTTT GGTATCTATA ACGTGCATAT GGGCTTTGAT
CCGGCGAATG CCGACGCGCT GGCAGCGTTG CTGAAAACGC ACGGTCTGCA TGTCGATGCC
GAAGAAGTGC TCACGCTGGA CGGTTTCTGC AAACTGCGTC GTGAACTGGA TGCGCAACCA
ACTGGTTTCC TCGACAGCCG CATTCGTCGC TTCCAGTCAT TTGCTGAAAT TAGCACTGAA
CCCGGTCCTC ACTTTGGCCT CGGTCTGGAA GCATACGCCA CCTGGACTTC GCCGATCCGT
AAATATGGCG ACATGATCAA CCACCGTCTG CTGAAAGCGG TTATCAAAGG CGAAACTGCG
ACGCGTCCAC AGGATGAAAT CACTGTCCAA ATGGCCGAGC GTCGCCGTCT CAATCGGATG
GCAGAACGTG ATGTTGGTGA CTGGTTATAC GCACGCTTCC TGAAAGACAA AGCCGGGACC
GACACCCGTT TCGCAGCGGA AATTGTCGAT ATCAGCCGTG GCGGCATGCG TGTTCGTTTG
GTTGATAACG GCGCTATCGC CTTTATTCCT GCACCTTTCT TACACGCTGT GCGCGATGAA
CTGGTTTGCA GCCAGGAAAA CGGCACCGTA CAAATTAAAG GTGAAACGGT TTATAAAGTA
ACTGACGTTA TTGACGTCAC CATTGCCGAA GTCCGCATGG AAACCCGCAG CATTATTGCG
CGCCCGGTCG CGTAA
 
Protein sequence
MFQDNPLLAQ LKQQLHSQTP RAEGVVKATE KGFGFLEVDA QKSYFIPPPQ MKKVMHGDRI 
IAVIHSEKER ESAEPEELVE PFLTRFVGKV QGKNDRLAIV PDHPLLKDAI PCRAARGLNH
EFKEGDWAVA EMRRHPLKGD RSFYAELTQY ITFGDDHFVP WWVTLARHNL EKEAPDGVAT
EMLDEGLVRK DLTALDFVTI DSASTEDMDD ALFAKALPDD KLQLIVAIAD PTAWIAEGSK
LDKAAKIRAF TNYLPGFNIP MLPRELSDDL CSLRANEVRP VLACRMTLSA DGTIEDNIEF
FAATIESKAK LVYDQVSDWL ENTGDWQPES EAIAEQVRLL AQICQRRGEW RHNHALVFKD
RPDYRFILGE KGEVLDIVAE PRRIANRIVE EAMIAANICA ARVLRDKLGF GIYNVHMGFD
PANADALAAL LKTHGLHVDA EEVLTLDGFC KLRRELDAQP TGFLDSRIRR FQSFAEISTE
PGPHFGLGLE AYATWTSPIR KYGDMINHRL LKAVIKGETA TRPQDEITVQ MAERRRLNRM
AERDVGDWLY ARFLKDKAGT DTRFAAEIVD ISRGGMRVRL VDNGAIAFIP APFLHAVRDE
LVCSQENGTV QIKGETVYKV TDVIDVTIAE VRMETRSIIA RPVA