Gene PICST_60105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_60105 
SymbolURH1 
ID4839250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp1055525 
End bp1056571 
Gene Length1047 bp 
Protein Length348 aa 
Translation table12 
GC content47% 
IMG OID640390565 
Producturidine nucleosidase (uridine ribohydrolase) 
Protein accessionXP_001384876 
Protein GI150865596 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1957] Inosine-uridine nucleoside N-ribohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.102312 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTCG GCGAGAAGAT TCCCATCTGG CTCGACTGCG ATCCAGGAAA CGACGATGCA 
TTTGCGATCT TATTAGCACT TTTTGACCCT CGGTTTGAAC TCTTGGGAAT ATCTACTGTC
CACGGGAACG CTCCTTTGTC GTATACAACT CACAACGCCT TGTCGTTATT GGACAGCTTG
GGGGTCGAAC CCGGAACAGT TAAGGTCTAC GCCGGTTCCG AGACTCCTCT TGTCAATGCT
CCTCAATCAG CTCCAGAAAT CCACGGCACT ACGGGTATTG GTGGGGTGGA ATTTCCAGAA
GTCACGAAAA ACAAAGTTGC TACCGATGTC GGCTACTTGG AGGCGATGAA GCAAGCTATC
TTGTCCCACG AGAACGAGCT CTGCTTGGTA TGCACAGGCA CTTTAACCAA CGTCTCGAAA
CTCATCACGG AATGTCCTGC CATTATTCCG AAAATTCGCT ACGTATCTAT TATGGGTGGT
GCCTTCAATT TGGGAAATGT CACTCCATAT GCCGAGTTCA ACTTCTATGC TGACCCACAT
GCTGCTAAGC ATGTGCTTGC TGAGCTTGGC CCTAAAATCA TCTTGTCGCC GCTCAATATC
ACCCATAAGG CTACAGCTAC AGAATCAATT CGCAACCAAA TGTACGACAG TGAAGACCCA
CATCGCAACT CTGACATCCG CAATATGTTC TACAGTATCC TCATGTTCTT CTCCCATTCG
TATATAAAGA AATACGGCAT AACTGAAGGT CCCCCAGTCC ATGACCCTCT CGCATTGTAC
TGCCTTTTGC CATTCCTTCA GCAGGACAAA GATTACAAGT ACAAATATTT GAGACGTAAA
GTCTCTGTTA TCACGGAAGG AGAGCACTCG GGAGAAAGCA TTCTATTAAA CGGTAACTCG
GATCTGTCTG TAGAAGAAGA AGATGGCGTC TACATCGGTC AGGATATCGA CGTAGACCAG
TTTTGGCGTA CTGTCCTCAG AGCGGTGAAT GTGGCAGATG TAACCATAAA ACAGGAAATA
AATGGTGCTC AAAAAGTGAT GGTTTAA
 
Protein sequence
MTVGEKIPIW LDCDPGNDDA FAILLALFDP RFELLGISTV HGNAPLSYTT HNALSLLDSL 
GVEPGTVKVY AGSETPLVNA PQSAPEIHGT TGIGGVEFPE VTKNKVATDV GYLEAMKQAI
LSHENELCLV CTGTLTNVSK LITECPAIIP KIRYVSIMGG AFNLGNVTPY AEFNFYADPH
AAKHVLAELG PKIILSPLNI THKATATESI RNQMYDSEDP HRNSDIRNMF YSILMFFSHS
YIKKYGITEG PPVHDPLALY CLLPFLQQDK DYKYKYLRRK VSVITEGEHS GESILLNGNS
DSSVEEEDGV YIGQDIDVDQ FWRTVLRAVN VADVTIKQEI NGAQKVMV