Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0672 |
Symbol | rihA |
ID | 6143229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 684133 |
End bp | 685068 |
Gene Length | 936 bp |
Protein Length | 311 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641615562 |
Product | ribonucleoside hydrolase 1 |
Protein accession | YP_001742768 |
Protein GI | 170684012 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1957] Inosine-uridine nucleoside N-ribohydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 66 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACTGC CAATTCTGTT AGATTGCGAC CCAGGTCATG ACGACGCTAT CGCAATCGTT CTCGCCCTCG CCTCACCAGA GCTTGATGTC AAAGCAATTA CGTCTTCCGC CGGAAACCAG ACACCAGAAA AAACCTTACG CAATGTTCTG TGTATGCTGA CCTTGCTTAA TCGCACCGAT ATTCCGGTAG CAAGCGGTGC GGTAAAACCA TTAATGCGTG ATTTGATTAT CGCGGACAAT GTGCACGGCG AAAGCGGTCT CGACGGCCCG GCATTACCGG AACCGACGTT CGCACCGCAA AACTGTACGG CGGTGGAGCT GATGGCGAAA ACGCTGTGTG AAAGTGAGGA ACCTGTCACC ATTGTTTCTA CCGGACCGCA AACTAACGTT GCCTTGCTGC TCAATAGCCA CCCGGAACTG CATAGCAAAA TTGCCCGTAT CGTGATTATG GGCGGCGCAA TGGGGCTTGG TAACTGGACG CCTGCGGCTG AATTTAATAT TTACGTTGAC CCGGAAGCGG CAGAAATTGT CTTCCAGTCA GGGATCCCGG TGGTGATGGC CGGTCTGGAT GTTACTCACA AAGCACAAAT TCACGTTGAG GACACCGAGC GTTTCCGCGC CATTGGCAAC CCTGTTTCAA CCATTGTTGC CGAACTGCTG GATTTCTTCC TCGAATATCA TAAAGACGAA AAATGGGGCT TTGTCGGCGC ACCGCTGCAT GACCCATGCA CCATCGCCTG GCTGTTAAAA CCGGAGTTGT TTACCACTGT TGAGCGCTGG GTTGGCGTGG AAACACAAGG GAAATATACC CAAGGTATGA CGGTTGTTGA TTATTATTAT CTGACTGGCA ATAAACCGAA TGCCACTGTA ATGGTCGATG TTGATCGTCA GGGCTTTGTT GATTTACTGG CCGATCGTCT GAAATTTTAC GCTTAA
|
Protein sequence | MALPILLDCD PGHDDAIAIV LALASPELDV KAITSSAGNQ TPEKTLRNVL CMLTLLNRTD IPVASGAVKP LMRDLIIADN VHGESGLDGP ALPEPTFAPQ NCTAVELMAK TLCESEEPVT IVSTGPQTNV ALLLNSHPEL HSKIARIVIM GGAMGLGNWT PAAEFNIYVD PEAAEIVFQS GIPVVMAGLD VTHKAQIHVE DTERFRAIGN PVSTIVAELL DFFLEYHKDE KWGFVGAPLH DPCTIAWLLK PELFTTVERW VGVETQGKYT QGMTVVDYYY LTGNKPNATV MVDVDRQGFV DLLADRLKFY A
|
| |