Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1399 |
Symbol | rnb |
ID | 5592803 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1394538 |
End bp | 1396472 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640920554 |
Product | exoribonuclease II |
Protein accession | YP_001458113 |
Protein GI | 157160795 |
COG category | [K] Transcription |
COG ID | [COG4776] Exoribonuclease II |
TIGRFAM ID | [TIGR00358] VacB and RNase II family 3'-5' exoribonucleases [TIGR02062] exoribonuclease II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.0060132 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTCAGG ACAACCCGCT GCTAGCGCAG CTTAAACAGC AACTGCATTC CCAGACGCCA CGCGCTGAAG GGGTGGTAAA AGCCACAGAA AAAGGCTTTG GCTTCCTGGA AGTCGACGCG CAAAAAAGTT ATTTCATTCC GCCGCCGCAG ATGAAAAAAG TCATGCATGG CGACCGAATT ATCGCGGTGA TCCACAGTGA AAAAGAACGT GAATCCGCAG AGCCAGAAGA ACTGGTTGAA CCGTTCCTGA CTCGTTTCGT GGGTAAGGTT CAGGGCAAAA ATGACCGTCT GGCCATCGTT CCTGATCATC CACTCTTAAA AGACGCCATT CCTTGCCGCG CAGCCCGTGG CCTGAACCAC GAGTTTAAAG AAGGCGACTG GGCGGTTGCC GAAATGCGCC GTCATCCGCT GAAAGGCGAT CGTTCTTTCT ATGCAGAACT GACACAATAC ATCACTTTTG GTGACGATCA CTTTGTACCG TGGTGGGTTA CCCTTGCACG CCATAATCTG GAAAAAGAAG CACCAGACGG CGTCGCTACC GAAATGCTCG ATGAAGGTCT GGTTCGTGAA GATCTGACCG CGCTGGATTT TGTCACCATC GACAGTGCCA GCACAGAAGA TATGGATGAC GCCCTTTTCG CTAAGGCGTT GCCGGATGAC AAACTTCAGC TGATTGTGGC GATTGCCGAT CCAACCGCGT GGATTGCTGA AGGCAGTAAG CTGGACAAAG CCGCGAAAAT TCGCGCATTC ACCAACTATC TGCCTGGCTT CAACATCCCT ATGCTGCCTC GCGAGCTTTC TGACGATCTC TGCTCACTGC GCGCCAATGA AGTCCGCCCG GTACTGGCAT GCCGCATGAC GCTCTCCGCT GATGGCACCA TTGAAGATAA TATCGAATTC TTTGCCGCCA CCATCGAATC CAAAGCGAAG CTGGTGTATG ACCAGGTTTC TGACTGGCTG GAAAATACCG GTGACTGGAA GCCTGAAAGT GAAGCAATTG CCGAACAAGT CCGTTTGTTA GCGCAAATTT GCCAACGCCG CGGCGAGTGG CGTCATAACC ACGCACTGGT GTTTAAAGAT CGCCCGGATT ACCGCTTTAT TCTCGGTGAA AAAGGTGAAG TGCTGGATAT CGTCGCCGAG CCTCGTCGCA TTGCCAACCG TATCGTCGAA GAAGCGATGA TTGCCGCTAA CATTTGTGCG GCCCGCGTAC TGCGCGATAA GCTCGGTTTT GGCATCTATA ACGTGCATAT GGGCTTTGAT CCGGCGAATG CCGACGCGCT GGCAGCGTTG CTGAAAACGC ACGGTCTGCA TGTCGATGCC GAAGAAGTGC TCACGCTGGA CGGTTTCTGC AAACTGCGTC GTGAACTGGA TGCGCAACCA ACTGGTTTCC TCGACAGCCG CATTCGTCGC TTCCAGTCAT TTGCTGAAAT TAGCACTGAA CCCGGTCCTC ACTTTGGCCT CGGTCTGGAA GCATACGCCA CCTGGACATC GCCGATCCGT AAATATGGCG ACATGATCAA CCACCGTCTG CTGAAAGCGG TTATCAAAGG CGAAACTGCG ACGCGTCCAC AGGATGAGAT CACTGTCCAA ATGGCCGAGC GTCGCCGTCT CAATCGGATG GCAGAACGTG ATGTTGGTGA CTGGTTATAC GCACGCTTCC TGAAAGACAA AGCCGGGACC GACACCCGTT TCGCGGCGGA AATTGTCGAT ATCAGCCGTG GCGGCATGCG TGTTCGTTTG GTTGATAACG GCGCTATCGC CTTTATTCCT GCACCTTTCT TACACGCTGT GCGCGATGAA ATGGTTTGCA GCCAGGAAAA CGGCACCGTA CAAATTAAAG GTGAAACGGT TTATAAAGTA ACTGACGTTA TTGACGTCAC CATTGCCGAA GTCCGCATGG AAACCCGCAG CATTATTGCG CGCCCGGTCG CGTAA
|
Protein sequence | MFQDNPLLAQ LKQQLHSQTP RAEGVVKATE KGFGFLEVDA QKSYFIPPPQ MKKVMHGDRI IAVIHSEKER ESAEPEELVE PFLTRFVGKV QGKNDRLAIV PDHPLLKDAI PCRAARGLNH EFKEGDWAVA EMRRHPLKGD RSFYAELTQY ITFGDDHFVP WWVTLARHNL EKEAPDGVAT EMLDEGLVRE DLTALDFVTI DSASTEDMDD ALFAKALPDD KLQLIVAIAD PTAWIAEGSK LDKAAKIRAF TNYLPGFNIP MLPRELSDDL CSLRANEVRP VLACRMTLSA DGTIEDNIEF FAATIESKAK LVYDQVSDWL ENTGDWKPES EAIAEQVRLL AQICQRRGEW RHNHALVFKD RPDYRFILGE KGEVLDIVAE PRRIANRIVE EAMIAANICA ARVLRDKLGF GIYNVHMGFD PANADALAAL LKTHGLHVDA EEVLTLDGFC KLRRELDAQP TGFLDSRIRR FQSFAEISTE PGPHFGLGLE AYATWTSPIR KYGDMINHRL LKAVIKGETA TRPQDEITVQ MAERRRLNRM AERDVGDWLY ARFLKDKAGT DTRFAAEIVD ISRGGMRVRL VDNGAIAFIP APFLHAVRDE MVCSQENGTV QIKGETVYKV TDVIDVTIAE VRMETRSIIA RPVA
|
| |