Gene PICST_54239 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_54239 
SymbolSPH1 
ID4837053 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1765295 
End bp1766881 
Gene Length1587 bp 
Protein Length521 aa 
Translation table12 
GC content43% 
IMG OID640388368 
ProductSphingosine kinase, involved in sphingolipid metabolism Lipid transport and metabolism 
Protein accessionXP_001383107 
Protein GI150864335 
COG category[I] Lipid transport and metabolism
[R] General function prediction only 
COG ID[COG1597] Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0415674 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTCC ATAATAATCT CAGCACAGAA AGGCTCATAT ACCACCAGCA GGACTCCATA 
CGCGCGACGC TTCAAGACTT CGGGATCCAG ATTGATTCCC AGGAATTGCT AGCAATTGAG
GATGACTCTG TGGACAGTAC ATATTCATTT TGCAGCTGGA AATCTGCACC CGGTCCGGCT
TACGAGTTGG AAGATAACCG TTCAAGAATT CCATACAGAA ACATCCTCTG GGTACAGCCT
GTCATTGATG AAACGGGCCA GGTCCAAGAA GACGACTTGG AAATCACATA CGTCAAACCA
AAGGGCAAGT TGTCGCTCGA GCCAGTCACC TTGCGTATTC TGATCCAAAA CTACCGTGCT
CTCTATAACA ATCTCCAGGA ATTATCTAAT TCAATACTAG CTAAATCCTA CAAGAACCAT
ATCGTGAAAC CGTCAGTCTT GGTGATTATA AATCCTCACG GTGGTCAAGG AAAGGCTTTG
AAGATCTACA ATACGGAAAT AAAGCCAATT TTAAAGGCGG CCAGAGCCAA GATTACTATT
CAGGAAACAA GTTACCACAA GCATGGAATC GATATCGGGC GTGAACTAGA TATCTCCAAG
TACGATGTAA TAGCATGTTG TTCAGGAGAT GGAATCCCGC ATGAAATCAT TAACGGTTTT
TACGAAAGGC CAGATAAGGG CGTGTCTGCT TTCAACAAAA TAGCCATCAC CCAACTTCCG
TGTGGCTCTG GTAATGCCCT TTCTCTCAGT ACCCATGGAA GTAATGATGC TTCCATGGCT
ACTTTTCATA TGTTAAAGGC AAAGAGAACT AAGCTTGACC TCATGGCTGT GACTCAAGGT
GTAGGTCCTA ACGAGAAAAT CAAGTTATCC TTCTTGACGC AATGTTATGG TGTTATTGCG
GATGCTGATA TTGGCACGGA ACATTTGAGG TGGATGGGGG CGATCAGATT TGATGTTGGG
GTTTTACACG GTATTTTGGC AAGAAGAAAG TTTCCCTGTG AATTGTATGT CGATTTCTTG
ACCAATTCAA AACAAGAACT CTCTGCCCAT TTCGACACTT ATCACCAGAA TTCAAATTCG
ACAGCAGCTC GCATAGAACA TCATTCACAA GATGATGGTG AATTGCCACT ATTGAATGAA
GAGCGGTTGC AAGTCAAGGG ACCAAAATTG AACCATCCAC CACCTGAATC ATGGACTAAA
ATAAGCCAAA ACATATCGGA CAATGTCAAC ATTCTCTACG TAGGTAAGAT GCCATATATT
TCCAACGATG TCCAGTTTTT TCCAGCAGCT CTACCAAATG ACGGATCTAT GGACATGATC
CTCACAGATA CTAAAACCTC TGTAATGGAA ACCGCTTCCA TTCTCATGTC CTTAGACAAG
GGATTGCATG TTCATAACGA AAAAGTACAT CATGCTAAGA TTTCGTCTTA CAGATTGATT
CCAAAGATAC CGCGGAATGA GCAGCATTAT ATTTCAGTGG ATGGAGAAAG TTTTCCATTC
GAACCGTTAC AGGTCGAAGT TCTACCAGGA GTACTCACGG GCTTGCTACA AGGTGGAAAT
TTTGTTGATA CGTGCTTTTC ACGTTAG
 
Protein sequence
MTLHNNLSTE RLIYHQQDSI RATLQDFGIQ IDSQELLAIE DDSVDSTYSF CSWKSAPDNR 
SRIPYRNILW VQPVIDETGQ VQEDDLEITY VKPKGKLSLE PVTLRISIQN YRALYNNLQE
LSNSILAKSY KNHIVKPSVL VIINPHGGQG KALKIYNTEI KPILKAARAK ITIQETSYHK
HGIDIGRELD ISKYDVIACC SGDGIPHEII NGFYERPDKG VSAFNKIAIT QLPCGSGNAL
SLSTHGSNDA SMATFHMLKA KRTKLDLMAV TQGVGPNEKI KLSFLTQCYG VIADADIGTE
HLRWMGAIRF DVGVLHGILA RRKFPCELYV DFLTNSKQEL SAHFDTYHQN SNSTAARIEH
HSQDDGELPL LNEERLQVKG PKLNHPPPES WTKISQNISD NVNILYVGKM PYISNDVQFF
PAALPNDGSM DMILTDTKTS VMETASILMS LDKGLHVHNE KVHHAKISSY RLIPKIPRNE
QHYISVDGES FPFEPLQVEV LPGVLTGLLQ GGNFVDTCFS R