Gene PICST_81319 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_81319 
SymbolURK1 
ID4837048 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1850988 
End bp1852526 
Gene Length1539 bp 
Protein Length504 aa 
Translation table12 
GC content42% 
IMG OID640388363 
Producturidine kinase 
Protein accessionXP_001382581 
Protein GI150863930 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0035] Uracil phosphoribosyltransferase
[COG0572] Uridine kinase 
TIGRFAM ID[TIGR00235] uridine kinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0146953 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.484619 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TGGACCGAAA TGACTTCGTT GGAAGAGCCT AGGCCACGTC GGTTCAGCCG AATTGCGCCA 
GACAGCGACG AGTCGACATC ATTTTTCATG TCGTCAGAAA CATTACCCAC CGAGTCTGCT
ATCATGACTC CAGTAGGATC TTTGCATCAC GATCTGGATA CACCCAGAGC TTCTTATCTT
CCTCCTTGGA CAGAACCGTA TATCATTGGA GTCGCAGGGA ACTCTGGATC TGGAAAAACC
TCCATTTCGC AGAAAGTTAT CCAGGAATTG AACCAACCAT GGACGATTTT GCTTTCGTTT
GATAACTTCT ACAATCCTTT GAACGAAGAA GAAAGAAAGC AAGCCTTCAA CAACAATTTT
GATTTTGATA CCCCAGCCTC TTTGGATTTG GATTTGTTAG TGAAAACGGT GAAATCTTTG
AAAAGCGGTG AGAAAACACA AATTCCGGTG TACTCGTTCC AGCACCATAA TCGTACCAAT
AAGTCTACGA CCATCTACGG AGCCAATGTG ATCATTATTG AAGGTATTTA TGCCTTGTAT
GACCAGAGAT TGCTTGACTT GATGGACTTG AAGATTTACG TCGACACGGA CTTGGATATC
TGTTTGTCTC GAAGATTGAC CAGAGACATC TTGTATCGTG GTCGTGACTT GGCAGGTGCC
ATCAAACAAT GGGAGACGTT TGTCAAACCT AACGCCGTCA AACACGTCAA CCCGACTATG
AACAACGCCG ACTTAGTGAT TCCACGAGGC TTGGACAATC TGATTGCCAT CAACTTGATG
ATAAAACATA TTCAGATCCA ACTAGCACTT AAAAGTTCAG CGCATTTGAA GTACTTGAAG
GAGTTGGGTG TTAATATCAA CTTCGATGTG TCCAAATACA ACATTAAGGT TTTACCGGCA
AATAACCAGA CGAAAGGAAT CAACTCTTTA CTCTTTGACG TCAATACTGA GAGGTCAGAT
TTCATCTTTT ATTTTAACCG TATCAGTGCA CTTATTATAG AGTTAGCATT AGAGTTGGTT
ACAGACTATG AGCCTGTGCG CATTAACGAC AACTTCAACG GCTTGAGAAT GGTTAACGAG
ATCATGGCAG TTAATATTAT CCGTTCGGGA GATTGCTTTA TGTCTTCGAT CAAAAGGACT
TTTCCAGAAA TCAGCATCGG AAAGCTTTTG ATTCAAAGTG ACTCTAGAAC TGGTGAACCA
CAATTGCATT TTGACTCCTT GTCAAAGGAA ATGAGCGGAG GAAAGAAGAT CTTGTTGTTT
GACTCCCAGA TCATTAGTGG AGCTGCGTCC ATCATGGCTA TCCAGGTATT AATTGACCAC
AAGGTGAACG AAGAGGATAT CATCTTATGT TCGTATCTTT CCACAGAGAT AGGATTGCGT
CGTATCGTTA ACGTTTTCCC CAAGGTCAAC ATTGCAGTTG GTAAATTGTC GTCTATCGAC
GGTAGTGAAA AGAAATGGTA CAATGAGGAA ATGTTTAAGG ATAGCGACTG GCATTTTAGA
AATAGATTTA TAGACAGTTT GTACTTTGGC ACGGACTGA
 
Protein sequence
MTSLEEPRPR RFSRIAPDSD ESTSFFMSSE TLPTESAIMT PVGSLHHDSD TPRASYLPPW 
TEPYIIGVAG NSGSGKTSIS QKVIQELNQP WTILLSFDNF YNPLNEEERK QAFNNNFDFD
TPASLDLDLL VKTVKSLKSG EKTQIPVYSF QHHNRTNKST TIYGANVIII EGIYALYDQR
LLDLMDLKIY VDTDLDICLS RRLTRDILYR GRDLAGAIKQ WETFVKPNAV KHVNPTMNNA
DLVIPRGLDN SIAINLMIKH IQIQLALKSS AHLKYLKELG VNINFDVSKY NIKVLPANNQ
TKGINSLLFD VNTERSDFIF YFNRISALII ELALELVTDY EPVRINDNFN GLRMVNEIMA
VNIIRSGDCF MSSIKRTFPE ISIGKLLIQS DSRTGEPQLH FDSLSKEMSG GKKILLFDSQ
IISGAASIMA IQVLIDHKVN EEDIILCSYL STEIGLRRIV NVFPKVNIAV GKLSEKKWYN
EEMFKDSDWH FRNRFIDSLY FGTD