Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_0934 |
Symbol | |
ID | 4077562 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | - |
Start bp | 997061 |
End bp | 998002 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638006237 |
Product | inosine/uridine-preferring nucleoside hydrolase |
Protein accession | YP_612929 |
Protein GI | 99080775 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1957] Inosine-uridine nucleoside N-ribohydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0837501 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0202173 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGCCC GCAAGATCAT CATTGATACC GACCCCGGAC AGGACGATGC GGTTGCCATC CTTCTGGCCC TTGCCTCGCC CGAGAACATC GACCTTCTGG GCATCACGGC CGTTGCCGGA AACGTGCCGC TGGAGCTGAC CCAGAAAAAC GCCCGCATCG TCTGCGAGGT CGCCAAACGT CAGGATGTCA AAGTCTTTGC AGGCCGCGCG GCCCCACTCG AACGCCCTCT GGTGACCGCA GAGCATGTCC ACGGGGGCAC TGGCCTCGAC GGCCCGGAGC TGTTTGATCC CGAGATGCCG CTGCAGGATC AAAACGGGGT CGATTTCATT ATCGAGACCC TGCGCCGCGA ACCGGCTGGC ACGGTCACCC TCTGCCCACT TGGCCCGCTT ACAAATATCG CGGCCGCCTT TCGGGCTGCG CCGGATGTGG TGGAGCGTGT GCAGGAAATC GTCCTTATGG GCGGCGCCTA TTTTGAGGTG GGCAACATCA CCCCCGCCGC CGAGTTCAAT ATCTACGTCG ACCCCGAAGC TGCGCATGAG GTGCTCTCTT CCGGGATCAA GATCACGATG ATGCCGCTTG ATGTGACCCA CAAGGCGTTG ACCACTCGCG CCCGCGTCGA AGCCTTTCGC GCGCTGCAAA GCCCCGTGGG CGCGTTCACC GCCGATATGC TCGATTTCTT TGAACGCTTC GACGTGGAGA AATACGGCTC CGAAGGCGGC CCGCTGCACG ACCCTTGCGT GATTGCCTAC CTCATCGACC CTGCTCTCTT CAGTGGTCGC CATATCAACG TCGAGATCGA GACCGCTTCC GCCCTCACCC TTGGCATGAC TGTGGCCGAT TGGTGGGGCG TGACCGATCG ACCGGCCAAT GCTCTTTTCA TCGGCGACAT AGATGCCGAT GGGTTCTTTT CGCTCCTCAC CAACAGGCTG GCCCGCCTAT GA
|
Protein sequence | MPARKIIIDT DPGQDDAVAI LLALASPENI DLLGITAVAG NVPLELTQKN ARIVCEVAKR QDVKVFAGRA APLERPLVTA EHVHGGTGLD GPELFDPEMP LQDQNGVDFI IETLRREPAG TVTLCPLGPL TNIAAAFRAA PDVVERVQEI VLMGGAYFEV GNITPAAEFN IYVDPEAAHE VLSSGIKITM MPLDVTHKAL TTRARVEAFR ALQSPVGAFT ADMLDFFERF DVEKYGSEGG PLHDPCVIAY LIDPALFSGR HINVEIETAS ALTLGMTVAD WWGVTDRPAN ALFIGDIDAD GFFSLLTNRL ARL
|
| |