Gene PICST_30734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_30734 
SymbolCYK2 
ID4837877 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp805268 
End bp806446 
Gene Length1179 bp 
Protein Length392 aa 
Translation table12 
GC content47% 
IMG OID640389192 
Productcysteine synthase (O-acetylserine sulfhydrylase) (O-acetylserine (Thiol)-lyase) (CSase) 
Protein accessionXP_001383788 
Protein GI150864811 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0031] Cysteine synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.207324 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTTTCA ACTGGAAAAT TATACTTCAG TCATCAGCTT CTATAGTTTC AGCGCTTATA 
ATTCTACGTG AACTCCATCG CCAGCTCTCG CAATCGAACA ATTCTTCAAC AAAACTTACA
TCTCTCCCTC CTCGCTCAAG AGGAGTAGAG TCCCTCATAG GAAACACTCC GCTCATAGAA
ATCAAGTCTC TTTCTAAACT AACCGGCTGC AAAATCTACG CGAAGCTCGA ACTCGCTAAT
CCTGCTGGCT CCGCCAAAGA TCGTGTCGCT TTGGCTATCA TCCGGGCCAA CGAAAAACTC
GGACACCTTC GCCCACACTC CGGGGACGTT ATCTTTGAAG GTACTTCTGG TTCCACTGGC
ATCTCCTTCG CTGTACTCGC CAACGCTCTC GGATATGATG CTCACATATG CCTCCCAGAT
GACACTTCTC CTGAAAAATT GCAACTCCTC AAGTCACTCG GCGCTACCAT TCATCCAGTC
AAACCGGCTT CCATTGTAGA CCCACAGCAG TATACCAATG CAGCACGGTG TGGCTCACAG
CAGATCAACG AAGACCCTAA CGACAGGCGC AGGGCCATCT TTGCGGACCA GTTTGAGAAC
GATTTTAACT GGAGAATACA TTACGAGACA ACGGGCCCAG AGATCTGGCG CCAGATGGAA
CAAGACGTAG ATGTGTTTAT CAATGGCTCC GGAACTGGAG GTACTATAGC TGGAGTATCT
AAATACTTAC ACGAGCAGAA TAGAGAGATA AAGATCATAC TAGCAGATCC CCAGGGCTCG
GGATTGGCCA ACAGAGTCAA CTACGGAGTT ATGTACGATA CTGTAGAGAA AGAAGGAACT
AGACGTCGAC ACCAGGTAGA CACGTTAGTG GAAGGTATTG GTCTTAACAG ACTTACATGG
AACTTCAAAC AGGCCGAAGC CCATATTACA GAGGCTATAA GGGTGTCAGA CAATCAGGCA
CTTCGTATGG CAAAGTTCTT GTGTATCAAC GATGGGCTAT TCTGGGGTTC GTCTGCTGCT
ATAAACTGTG TTGCAGCCGT GAAGACAGCA TTGAAGAATG GACCAGGTCA AAAAATTGTA
GTGATCGCAT GTGATCTGGG GGCTAGACAT TTGCTGAAGT TCTGGAAACT GGCGGCTGAG
GTGCCTAATG ATATTACCTT GGATGAAGTT TTACAATAG
 
Protein sequence
MSFNWKIILQ SSASIVSALI ILRELHRQLS QSNNSSTKLT SLPPRSRGVE SLIGNTPLIE 
IKSLSKLTGC KIYAKLELAN PAGSAKDRVA LAIIRANEKL GHLRPHSGDV IFEGTSGSTG
ISFAVLANAL GYDAHICLPD DTSPEKLQLL KSLGATIHPV KPASIVDPQQ YTNAARCGSQ
QINEDPNDRR RAIFADQFEN DFNWRIHYET TGPEIWRQME QDVDVFINGS GTGGTIAGVS
KYLHEQNREI KIILADPQGS GLANRVNYGV MYDTVEKEGT RRRHQVDTLV EGIGLNRLTW
NFKQAEAHIT EAIRVSDNQA LRMAKFLCIN DGLFWGSSAA INCVAAVKTA LKNGPGQKIV
VIACDSGARH LSKFWKSAAE VPNDITLDEV LQ