Gene PICST_47968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_47968 
Symbol 
ID4840136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp199177 
End bp202707 
Gene Length3531 bp 
Protein Length1176 aa 
Translation table12 
GC content41% 
IMG OID640391451 
Productpredicted protein 
Protein accessionXP_001385372 
Protein GI150865950 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.594248 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGAAGC TTCTATTTCT ATTGAGCCTC TGTATGATGG CGTACGTTGT GCTCGTAGCT 
GCACTGCCGC GACTAGAGAA GGTCACCACC AGATATCTAC CTGATAGCCT ACTTCAAAAT
GATGCTGCTC CTCAGACTAC AGAACCTCCT TCCAAAAGAA CAACAGACTC GTCCATACTA
CCCTTGGACG TGAGATCACT AAAAGACTTT AAATTAACAG ACTTACTCTT GATCTCAGAC
GTCGACGGTA ACTTGCACGC TGTAGAACGT AAGGAAGGAG CTCTTATCTG GACACTTCCC
AGCGACGAAC CTCTAGTGAA AATCCAGTCC AATAGCTCAA CAGAAGATAG CCAGTCCAAT
ATCCTATGGT TTGTCGAGCC ATACCAGGAT GGATCTCTCT ATTATTTCAC ACCCAAATTT
GGGCTCAACA AACTTCCTAC TAGCATCAGA CAGTTGGTGA TGGAGTCTCC TTTCTCCCTC
AGTGGAGACG ACAAAATCTA CACCGGAAGC AGGAAAACTG CCCTCTATAC TGTCAACATC
TTCACAGGTG AGATTGTCTC ATCGTTCGGA AATGAGGAGA AATGCCCGGT TGCCAATACC
CACTACAAGA TAGACAACTT GTATCTGCGT CTGGATACCA TCAACATAGG GAGAACGACC
TACGAGTTGA CCATCCACTC CAAGTTGAAT ACCAATGTAG TTTGGAACGT GTCCTATTCT
CAATGGGGAC CCAACAATAT AGACAACGAC TTGATCATGC AAAACTCAGA GAGCATAGAT
AAACTCTACT TTACCCCTTT CTACAACAAG TCTATATTGG CTATCAACAA GGATATAAGT
GCACCCAAAT GGTTGACGAA CTTACCTTCT TTTGCTGTCA GTGTGTTTGA TGTCTTCAGC
AATCTCAAGA ACTCGGACTG TGTTCTTTTG CCTCACCCAT CGAAGATCTT CAACCAGCCA
CAACTCGATG ATGACAACGA CAACAATTCC GACCTTGTCT TCATCGACAG AACATCCAAC
AGAAAGGAGT GGTTTGCAAT GAGTTTTAAT AATTATCCTT CATTAATCAA AACGGCTCCA
ATTTCCAATT ACCAGTTGTA TTTGCTTAAG CTTCAGGCTG ACTTCCAAGA CTTGAATGTG
AATGTTGACT ACTTGAAGAA TTTTCAATTA TCTACGAGTC CACCAGAGGA AGTGAAAACT
TTGATTAGTG GCATACATAG AGCTTTTCAA TTGTCTGCCG ACACCTTGTA CCAGCCTTCT
TCTAGATTTG AAAAGAGTGA TGACATCAAA CGTATTGGAC AAAACGAGCA AACAGAACGC
ACAGAACAAG ACGAGCGGAC AGAGCAAGAC GATCAGAATG ACAATTCTTT GAAAACAGAT
TTAGAAGAGA TAAAGCCCAA TATCGCAGAG GGAGTTAATT TCGCTCATGA AACTTCACGT
TCAGAAGTCT TGTATCTACC ACCATCAAAC AATGAGGTGG AAAGTTACAG GCCCTACAAA
GACCCTGAAA CCAGCCATGA AATAATCAAT AATTTGGAAC CAAATTCCAC TACTGCTATC
ATTCGTAGAA TCTTGGAAGA CTTGGTTGTT TTGTTGGTTC TCTTTGTTTT AGTAATGACT
TTTGGAAAGT CCAACAAGTT TGTGCAAGGC TTTTTCAACA ATGGAAGCTT GGTGGGCCCA
GTCTCAATTG AGAAAGACGC GGTGTTCGAT AATGACGTTG TTCTTGATTC GAACTTTTCA
GTTTCGAACG ACAAAGAAAA AGTGTCTATT ACAGAGAAAG ACAAGAGTTC ACAAGTTGTT
GAGCTTGCAA ATCAGGACAA CAAAGATGCG AAAGATTCTA ATGAGGTATT CAAGAAAGTC
AAGATTGTTG TTCCTAACGA ATCATTAGAG AAAGCAAACG ACGATCCCAG TAAATTAGAA
GAAAATGTTG ATGACGATGA AGAAGGCAAC GGCGACGAAA TTACAACTAA GAAGAAGCGC
AAGAGAGGCA GTAGAGGTGG TAAAAGGGGT GGTCGTAGAG TTAATAAGAG CAAGGAAAAT
GCTGGTGATG ACCAAGAAAT CAACGACGAA AGTCCTGAAG AAGAGTTCAT CTCCATTGCT
ACCAAGAGCT TGGTAAAAAC TATCACTACT CCAGTGAAGA TTCCTTCCAA GAAATTACAA
ATCGAAAATA ACCTTGTGAT ATCTGACAAG ATTCTTGGTT ATGGGTCCCA TGGAACCATT
GTATACCAGG GCACGTTTGA GAACAGGCCC GTTGCAGTTA AGCGTATGCT TTTGGATTTC
TACGATGTAG CTAACCATGA AGTTCGCTTA TTGCAGGAAA GTGACGACCA TCCCAATGTC
ATCAGATATT TCTGTTCTCA ATCAAGCGAG TCCGAAAAGT TTTTATATAT TGCTTTGGAA
CTATGTCTCT GCTCGTTGGA AGATATTATT GAAAAGCCCA AGAAGTCTCC TCAACTCAGC
ATTCCTAAAG TTAATGACGT CTTGTATCAA TTGGCCAGTG GTTTACACTA TTTGCATTCA
TTGAAAATCG TTCATCGTGA TTTGAAGCCT CAGAACATAT TAGTTGCAGA TATCAAAAAA
ACATCGTCTT CAAAAGCAAC GACAAAGCCT TCCGAAGAGG AAAACAATGT GAGATTGCTT
ATCTCTGACT TTGGCTTATG TAAGAAGCTA GACTCTGACC AATCTTCGTT CAGAGCAACT
ACACAACATG CTGCTCTGGG AACTTCTGGA TGGAGAGCAC CAGAATTATT GTTGCATCAC
GATTTACTCG AAATATCCCC AGACACCATA TCCTCTGTTG GGTCTGGATC TCGTCACTCA
TTTACAGAGT CGTGGTCGAC TGTTACGAAC TCTTCTTCAG TACAAGCTTC AGGTGGAAAG
AGATTGACCA AGGCAATTGA CATTTTCTCC TTGGGATGTG TTTACTACTA CATTTTGTCT
GGAGGAATGC ATCCTTTTGG TGACAGATAT TTGCGTGAAG GTAATATCAT CAAGGGTGAG
TACGACATTT CCTTGTTAAA ACAGTGCTGC CCCAATGACA AGTATGAAGC CACCGATCTT
ATTGCCAGCA TGATCCATGC GAACCCAAGT AAAAGACGAA GCACATCTAA GATCTTAATT
CATCCATTAT TCTGGTCTTC TAAAAAGCGT TTGGAATTCT TATTAAAGGT CAGCGATAGA
TTCGAAGTGG AAAGAAGGGA TCCTCCTAGT GACTTATTAT TGAAACTTGA GGACCGTGCC
AATGCTGTCC ATGGAGGAAA CTGGCATAAG CAATTCGATG ATGAATTCAT GGACAACTTG
GGTAAATACA GAAAGTACCA CAAAGAGAAA CTAATGGATT TGCTAAGAGC TATTCGTAAT
AAGTACCATC ACTTTAACGA CATGCCTGAA ACATTACAAG CACAAATGAG CCCGCTTCCA
GGTGGGTTCT ACAAGTACTT CAACAACAAG TTCCCTAACT TGCTTATGCA GATATACTTC
TTGATTGAGG AGAATCTCGC AGAAGAACAT GCATTTAAGG ATTTTTACTA G
 
Protein sequence
MRKLLFLLSL CMMAYVVLVA ASPRLEKVTT RYLPDSLLQN DAAPQTTEPP SKRTTDSSIL 
PLDVRSLKDF KLTDLLLISD VDGNLHAVER KEGALIWTLP SDEPLVKIQS NSSTEDSQSN
ILWFVEPYQD GSLYYFTPKF GLNKLPTSIR QLVMESPFSL SGDDKIYTGS RKTALYTVNI
FTGEIVSSFG NEEKCPVANT HYKIDNLYSR SDTINIGRTT YELTIHSKLN TNVVWNVSYS
QWGPNNIDND LIMQNSESID KLYFTPFYNK SILAINKDIS APKWLTNLPS FAVSVFDVFS
NLKNSDCVLL PHPSKIFNQP QLDDDNDNNS DLVFIDRTSN RKEWFAMSFN NYPSLIKTAP
ISNYQLYLLK LQADFQDLNV NVDYLKNFQL STSPPEEVKT LISGIHRAFQ LSADTLYQPS
SRFEKSDDIK RIGQNEQTER TEQDERTEQD DQNDNSLKTD LEEIKPNIAE GVNFAHETSR
SEVLYLPPSN NEVESYRPYK DPETSHEIIN NLEPNSTTAI IRRILEDLVV LLVLFVLVMT
FGKSNKFVQG FFNNGSLVGP VSIEKDAVFD NDVVLDSNFS VSNDKEKVSI TEKDKSSQVV
ELANQDNKDA KDSNEVFKKV KIVVPNESLE KANDDPSKLE ENVDDDEEGN GDEITTKKKR
KRGSRGGKRG GRRVNKSKEN AGDDQEINDE SPEEEFISIA TKSLVKTITT PVKIPSKKLQ
IENNLVISDK ILGYGSHGTI VYQGTFENRP VAVKRMLLDF YDVANHEVRL LQESDDHPNV
IRYFCSQSSE SEKFLYIALE LCLCSLEDII EKPKKSPQLS IPKVNDVLYQ LASGLHYLHS
LKIVHRDLKP QNILVADIKK TSSSKATTKP SEEENNVRLL ISDFGLCKKL DSDQSSFRAT
TQHAASGTSG WRAPELLLHH DLLEISPDTI SSVGSGSRHS FTESWSTVTN SSSVQASGGK
RLTKAIDIFS LGCVYYYILS GGMHPFGDRY LREGNIIKGE YDISLLKQCC PNDKYEATDL
IASMIHANPS KRRSTSKILI HPLFWSSKKR LEFLLKVSDR FEVERRDPPS DLLLKLEDRA
NAVHGGNWHK QFDDEFMDNL GKYRKYHKEK LMDLLRAIRN KYHHFNDMPE TLQAQMSPLP
GGFYKYFNNK FPNLLMQIYF LIEENLAEEH AFKDFY