Gene PICST_33152 details


Gene Information

Locus tag: PICST_33152
Symbol:
ID: 4840216
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 1553687
End bp: 1554784
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 12
GC content: 43%
IMG OID: 640391531
Product: predicted protein
Protein accession: XP_001386000
Protein GI: 150866408
COG category: [A] RNA processing and modification
COG ID: [COG0430] RNA 3'-terminal phosphate cyclase
TIGRFAM ID: [TIGR03400] 18S rRNA biogenesis protein RCL1
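
The gene and protein lengths listed above follow from the coordinates. A minimal sketch of that arithmetic, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of this unspliced CDS is a stop codon (the function names are illustrative and not part of the record):

```python
# Values copied from the record; helper names are hypothetical.
START_BP = 1553687
END_BP = 1554784


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1098 365
```

Both results agree with the Gene Length and Protein Length fields of the record.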


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.58339
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCTCAC CAATAGCGTT CGAGGGTCAT AGAAATTTCC GTTTGCGGTT AATTCTCTCG 
ACTCTTTCTG GAAAAGCCAT CAAGATTTCC AAGATCAGAT CACAAGATGT CAATCCGGGT
CTCAGAGACC ACGAAGTGTC CTTCTTAAGA TTGCTTGAAG CCGTTACCAA TGGTTCTCAC
ATAGAAATTT CCTACACTGG TACTACCATA ATATATCGTC CGGGAATCAT CATTGGTGGG
GAATTGGTGC ACAACTGCCC TGCCAATAAG CCTGTTGGAT ATTTTGTAGA ACCCATGCTT
TATTTGGCTC CCTTTTCTAA GAAGAAGTTT TCCATTATTT TCAAGGGCGT AACTGCTAGC
GAAAAGACCA ACGATGCCAC ACCCGAATCT ATCAAATGGG GCTTGATACC GATAATGGAA
AAGTTCGGAG TCAGAGAAGT ATCTTTACAT ATCCTCAAGA GAGGTTCCGC ACCTTTGGGA
GGAGGTGAAG TACACTTAAT GTGCAACTCG CTCATTCCCC AACCTCTCAC AATGCATGCC
GTAGATATTC CAAAGTTTTC TGCCATCAGA GGTGTTGCTT ACTGTACCAG AGTTTCTCCT
TCCATAGTGA ACAGAATCAT TGATTCTGCC AGAAAGGTCT TAAGACCCAC GGGTGTAGAA
GTCAACATCA CTGCAGATGT ATGGAGAGGA GAAAACTCCG GCAAGTCTCC AGGCTTTGGA
GTTACCTTAG TAGCTGAGCT GAAGAAGGGT TGGAGAATTA TAGCTGAAGG TGTAGGTGCT
GCCGCCATGC TTCCAGAAGA TTTAGGTGAA AAAGTAGCCT TCAATCTTTT GGAAGAGCTC
ACTCACAGTG CTGTAGTTGG CAGAAATCAG TTGATGTTAG CTTTGGTGTT CATGACTATC
AGCAAAGAAG ATATCGGAAG ATTGAAGGTT CACACTCAGC AAATCGACGA AAATTTTGTT
CACATAGTTC GTGATATTAA AGAAATCATG GGTACAGAAG TGTTGTTGAA ACCAACTGAT
GAACCTGGCG AAGAAAACAT ACTCACCATG TCCATTAAGG GTATTGGGTT CACCAGTGCC
TCCAAGAAGA TAGCTTAA
 
Protein sequence
MSSPIAFEGH RNFRLRLILS TLSGKAIKIS KIRSQDVNPG LRDHEVSFLR LLEAVTNGSH 
IEISYTGTTI IYRPGIIIGG ELVHNCPANK PVGYFVEPML YLAPFSKKKF SIIFKGVTAS
EKTNDATPES IKWGLIPIME KFGVREVSLH ILKRGSAPLG GGEVHLMCNS LIPQPLTMHA
VDIPKFSAIR GVAYCTRVSP SIVNRIIDSA RKVLRPTGVE VNITADVWRG ENSGKSPGFG
VTLVAESKKG WRIIAEGVGA AAMLPEDLGE KVAFNLLEEL THSAVVGRNQ LMLALVFMTI
SKEDIGRLKV HTQQIDENFV HIVRDIKEIM GTEVLLKPTD EPGEENILTM SIKGIGFTSA
SKKIA
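
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on one in-frame fragment of the gene sequence above (bases 721 to 750). The table is built from the standard code with that single change, and the function names are illustrative rather than taken from any particular library:

```python
# Standard genetic code in NCBI order (first, second, third base over T, C, A, G).
BASES = "TCAG"
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(ctg_as_serine: bool = True) -> dict:
    """Return a codon -> amino acid map; table 12 differs by CTG -> S."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD_AA))
    if ctg_as_serine:
        table["CTG"] = "S"
    return table


def translate(dna: str, table: dict) -> str:
    """Translate in frame 1, ignoring whitespace and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = table[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


fragment = "GTTACCTTAG TAGCTGAGCT GAAGAAGGGT"
print(translate(fragment, codon_table(True)))   # prints: VTLVAESKKG
print(translate(fragment, codon_table(False)))  # prints: VTLVAELKKG
```

Only the table 12 reading matches residues 241 to 250 of the protein sequence listed above (VTLVAESKKG).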