Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_60105 |
Symbol | URH1 |
ID | 4839250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 1055525 |
End bp | 1056571 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 12 |
GC content | 47% |
IMG OID | 640390565 |
Product | uridine nucleosidase (uridine ribohydrolase) |
Protein accession | XP_001384876 |
Protein GI | 150865596 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1957] Inosine-uridine nucleoside N-ribohydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.102312 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGTCG GCGAGAAGAT TCCCATCTGG CTCGACTGCG ATCCAGGAAA CGACGATGCA TTTGCGATCT TATTAGCACT TTTTGACCCT CGGTTTGAAC TCTTGGGAAT ATCTACTGTC CACGGGAACG CTCCTTTGTC GTATACAACT CACAACGCCT TGTCGTTATT GGACAGCTTG GGGGTCGAAC CCGGAACAGT TAAGGTCTAC GCCGGTTCCG AGACTCCTCT TGTCAATGCT CCTCAATCAG CTCCAGAAAT CCACGGCACT ACGGGTATTG GTGGGGTGGA ATTTCCAGAA GTCACGAAAA ACAAAGTTGC TACCGATGTC GGCTACTTGG AGGCGATGAA GCAAGCTATC TTGTCCCACG AGAACGAGCT CTGCTTGGTA TGCACAGGCA CTTTAACCAA CGTCTCGAAA CTCATCACGG AATGTCCTGC CATTATTCCG AAAATTCGCT ACGTATCTAT TATGGGTGGT GCCTTCAATT TGGGAAATGT CACTCCATAT GCCGAGTTCA ACTTCTATGC TGACCCACAT GCTGCTAAGC ATGTGCTTGC TGAGCTTGGC CCTAAAATCA TCTTGTCGCC GCTCAATATC ACCCATAAGG CTACAGCTAC AGAATCAATT CGCAACCAAA TGTACGACAG TGAAGACCCA CATCGCAACT CTGACATCCG CAATATGTTC TACAGTATCC TCATGTTCTT CTCCCATTCG TATATAAAGA AATACGGCAT AACTGAAGGT CCCCCAGTCC ATGACCCTCT CGCATTGTAC TGCCTTTTGC CATTCCTTCA GCAGGACAAA GATTACAAGT ACAAATATTT GAGACGTAAA GTCTCTGTTA TCACGGAAGG AGAGCACTCG GGAGAAAGCA TTCTATTAAA CGGTAACTCG GATCTGTCTG TAGAAGAAGA AGATGGCGTC TACATCGGTC AGGATATCGA CGTAGACCAG TTTTGGCGTA CTGTCCTCAG AGCGGTGAAT GTGGCAGATG TAACCATAAA ACAGGAAATA AATGGTGCTC AAAAAGTGAT GGTTTAA
|
Protein sequence | MTVGEKIPIW LDCDPGNDDA FAILLALFDP RFELLGISTV HGNAPLSYTT HNALSLLDSL GVEPGTVKVY AGSETPLVNA PQSAPEIHGT TGIGGVEFPE VTKNKVATDV GYLEAMKQAI LSHENELCLV CTGTLTNVSK LITECPAIIP KIRYVSIMGG AFNLGNVTPY AEFNFYADPH AAKHVLAELG PKIILSPLNI THKATATESI RNQMYDSEDP HRNSDIRNMF YSILMFFSHS YIKKYGITEG PPVHDPLALY CLLPFLQQDK DYKYKYLRRK VSVITEGEHS GESILLNGNS DSSVEEEDGV YIGQDIDVDQ FWRTVLRAVN VADVTIKQEI NGAQKVMV
|
| |