Gene PICST_59891 details


Gene Information

Locus tag: PICST_59891
Symbol: CYK1
ID: 4839208
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 1673997
End bp: 1675100
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 12
GC content: 46%
IMG OID: 640390523
Product: cysteine synthase (CYSK1)
Protein accession: XP_001385325
Protein GI: 150865916
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0031] Cysteine synthase
TIGRFAM ID:
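
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1673997
end_bp = 1675100

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1104
print(protein_length)  # 367
```

Under these assumptions the result agrees with the listed Gene Length (1104 bp) and Protein Length (367 aa).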


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0439327
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTAGAG CCCATACCAC CAAGAGGTTT ATCTCCATTT CGCTCAACCA GCAGCTGGTC 
AAGCTTCCAT TTGTGCCTCT TGTATCGGAG AACGGTTTTC CAGGAGCTAT TGGAAATACT
CCACTTTTGA GACTTCCACG TATCTCCGAT TCTATTGGTC GTAATATCTA CGCTAAGGCA
GAGTTTATGA ATCCTGGAGG ATCAATTAAA GACAGAGCTG CCTTGTATGT GATCAAAGAT
GCCGAGGAAA AGGGTTTGAT CAAGCCTGGT GGAACGATTG TAGAAGGAAC CGCGGGTAAT
ACTGGGATTG GTTTAGCTCA CGTCTGTAGA GCCAAGGGCT ATAACTGTGT AATCTATATG
CCCAACACCC AGTCAAAAAG CAAAATTGAG ACCTTACGTT TATTGGGAGC TGAGGTTTAT
CCTGTTCCTG CCGTGGCCTT TACTGATCCT ATGAACTACA ACCATCAGGC TAAAAGACAT
GCCGAGTCAT TGGATAACGC TGTGTGGACC AACCAGTTCG ACAACACTGC CAATAGACAG
GCTCATATCG AAACTACGGG TCCGGAAATC TGGGCCCAGT TAGATGGCCA AGTCGATGCC
TTTACCTGTT CCACCGGCAC TGGGGGTACT TTTGCAGGAA CTTCAAGATA CTTGAAGTCT
ATCTCTAATG GTAGGGTCAA AGCTGTTCTT GCAGACCCTC CAGGATCGGT ATTGTACTCC
TACATTAAGA GCAACGGCCA GAACATGGAA AGAGGAGGCT CTTCATTCAC TGAGGGTATT
GGCCAAGGCA GAGTCACCGA CAACTTGAAG CCAGACTTGG ACATCATTGA CGATGCTGTG
AAGATTCCCG ATGAAGACTC TATAGTCATG GTCTATAGAT TGCTAGACGA AGAAGGATTG
TATCTTGGTG GCACTGGAGC GTTGAATGTC GTGGCTGCCA TTGAAGTAGC CAAAACTTTG
CCTGAAGGCA GCAATGTCGT CACCATCTTG GCAGATTCTG CCCACAAGTA CAGCGACCGT
ATCTTTTCCA AGACGTGGTT GAAGGAGAAG AACTTGTACA ATGTGCTTCC AGAACACTTG
AAGAAGTATG CTACCTTGGA TTAA
 
Protein sequence
MFRAHTTKRF ISISLNQQSV KLPFVPLVSE NGFPGAIGNT PLLRLPRISD SIGRNIYAKA 
EFMNPGGSIK DRAALYVIKD AEEKGLIKPG GTIVEGTAGN TGIGLAHVCR AKGYNCVIYM
PNTQSKSKIE TLRLLGAEVY PVPAVAFTDP MNYNHQAKRH AESLDNAVWT NQFDNTANRQ
AHIETTGPEI WAQLDGQVDA FTCSTGTGGT FAGTSRYLKS ISNGRVKAVL ADPPGSVLYS
YIKSNGQNME RGGSSFTEGI GQGRVTDNLK PDLDIIDDAV KIPDEDSIVM VYRLLDEEGL
YLGGTGALNV VAAIEVAKTL PEGSNVVTIL ADSAHKYSDR IFSKTWLKEK NLYNVLPEHL
KKYATLD
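
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. This explains why codon 19 of the gene sequence (CTG) corresponds to S in the protein sequence. The sketch below is a minimal, dependency-free illustration: it builds the standard codon table from the usual TCAG-ordered amino-acid string, derives a table-12 variant by overriding CTG, and translates only the first 60 nucleotides of the gene. The names `build_table`, `translate`, `STANDARD` and `TABLE_12` are hypothetical helpers, not part of any database tool.

```python
BASES = "TCAG"
# Amino acids of the standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(aas):
    """Map each of the 64 codons (TCAG order) to its amino acid letter."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, aas))


STANDARD = build_table(STANDARD_AAS)
# Table 12 differs from the standard amino-acid assignments only at CTG (Ser, not Leu).
TABLE_12 = dict(STANDARD, CTG="S")


def translate(dna, table):
    """Translate a DNA string codon by codon, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence above (60 nt, 20 codons).
first_60 = "ATGTTTAGAG CCCATACCAC CAAGAGGTTT ATCTCCATTT CGCTCAACCA GCAGCTGGTC"

print(translate(first_60, TABLE_12))  # MFRAHTTKRFISISLNQQSV
print(translate(first_60, STANDARD))  # MFRAHTTKRFISISLNQQLV
```

The table-12 output matches the first 20 residues of the protein sequence listed above, while the standard code would give L instead of S at position 19.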