Gene PICST_32024 details

Gene Information

Locus tag: PICST_32024
Symbol: SDCG1
ID: 4839127
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 501244
End bp: 504423
Gene Length: 3180 bp
Protein Length: 1038 aa
Translation table: 12
GC content: 39%
IMG OID: 640390442
Product: highly conserved hypothetical protein Predicted RNA-binding
Protein accession: XP_001385110
Protein GI: 150865765
COG category: [K] Transcription
COG ID: [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP
TIGRFAM ID:
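
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, that the coding sequence ends in a single stop codon, and that the gene span contains no untranslated regions. None of these assumptions is stated in the record; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 501244
end_bp = 504423
protein_length_aa = 1038

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the coding sequence is one codon per residue plus a single stop codon.
cds_length_bp = (protein_length_aa + 1) * 3

# Under the assumptions above, whatever is left over must be intronic,
# which is consistent with "Is gene spliced: Yes".
non_coding_bp = gene_length_bp - cds_length_bp

print(gene_length_bp, cds_length_bp, non_coding_bp)
```

Under these assumptions the computed span matches the listed gene length of 3180 bp, and the remainder that is not accounted for by the protein is 63 bp.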


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.873906
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.425602
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGTAA GTTTCTTAGA TCTCTCTTGA GCTAATTGGT TAAATTTCTT TTACTAACAT 
TTTGTTCAGC AAAGAATCAC CGGTCTAGAC CTCCAGATCA TCACCCGTGA ATTGGTAGCT
TCTATAGCAA ACTACAGATT ACAGAATATT TACAACCTTG CAGGCTCAAA TAGACAGTAT
GTGTTGAAGT TCTCTGTTCC AGACTCTAAG AAGATCGTAG TTTTAGACTG CGGAAATCGT
GTTCATTTAA CCGATTTTGA TCGTCCTACC ACGCCAGCTC CTTCCAATTT TGTTAGCAAG
TTGAGAAAGC ACTTGAAGAC AAGAAGATTG TCAGGTATCA AACAGGTTGG AAATGACAGA
GTGCTTGTAT TAGAGTTCAG CGATGGATTG TTCTACTTGG TGTTGGAATT TTTCAGTGCT
GGAAACGTTT TGCTCTTGGA TGACAACTTG AAGATACTCT CGTTGCAGAG AAATGTAAAA
GAAAAAGGCG AAAACGACAA ATATGCCGTA AACGAAATCT ACAAAATGTT TGACAAGTCT
CTTTTTAGTG AAGACTTCAA ATACGAGAAG CGTGATTACA ATGTAGACGA AATCAAAGCT
TGGATTAAAG AGCAGAGAAT CAAGGTAGAA AACCAACTGC AAGAGCCATC TTCAGGTAAA
AAGAGTAAAG TATTTTCCAT TCACAAGTTG CTCTTTGTAA ATGTTAGTCA TTTATCTAGT
GATTTGATTT TAAAAAATTT GCAGAATGCT GGCATAAGTG GCTCGTCTAG CTGTTTCGAG
TTTGCAGAAG ACAATGAGAA ACTATCCACG ATTGTCGGTG CCTTGGATAA GTCTGAACAG
GAGTACATTT CATTTATCTC TGCTGGGGAC AACGAGCAAA CTAATGGATT TATTGTATCC
AAAAAGAATC CTTTGTACAA TCCTTCTGAA GAGCATAGTG ACAATGACCT CGAATATGTC
TATGACGAGT TTCATCCCTT CAAGCCTTTC AAGAAGAATT TAGAAGGATA TAAGTTCACT
GAAATAGAAG GTTACAATAA GACTTTGGAT ACATTCTTTT CTGCTCTTGA ATCTACCAAA
TTTGCATTAA AGATCGAACA ACAGAAGCAG AATGCCAATA AAAGATTGGA AAACGCTCGT
AGTGAAAGAA ACAAACAGAT ACAGTCCTTG ATCCAACAGC AGGAGACAAA CTCCAAGAAA
GGGGATACTA TCATTTATCA TGCTGATTTA GTAGCATCGT GTATTTCAGC CATACAGAAA
ATGCTAGACA AGCAAATGGA TTGGGGTAAC ATAGAAGCCA TTGTCAAACA TGAGCAAAGT
AGTGGCAACG AGATTATGTC TACAATCAAG TTGCCTCTCA ATCTTAACGA GAACAAAATT
AACTTGGTAT TGCCAGATCC GGAACATATT TCATATTCCG AAGATGATAA TTCTGACAAT
TCTGATAGCG ATTCTGAATC GGATTCGCAG TCGGAGTCGG AGTCTGAATC TGAGTCCGAA
TCAGAATCAG AAAGCGAAAA TGATTCAGAT AGCGATTCCG ATTTGGAGAT GACTCGTAAG
AAGGCAAAAT CTCCAAGAGA ATCTTCCAAA GAAAAAAAGA AGAAGGTTCC ACATACCCTT
TCCGTTTGGA TTGATTTGTC TTTGTCGCCT TATGCCAATG CAAGATTGTA TTTCGAAAGC
AAGAAGAGTG CTGAAAGCAA GAAAGAAAAG GTGGAGAAGA ATACTGAAAT GGCTCTTAAG
AATGCTGAAA GAAAGATCAA ACAAGATTTG GCTCATAACT TGAAAAATGA ACACGACACT
TTGAAACAGT TGAGACCAAA GTATTGGTTT GAGAAGTTCT ACTGGTTTGT TTCTAGTGAA
GGCTATTTAT GTTTGGCAGG GAGAGACCCT TCTCAAACTG ATATGATTTA CTACAGATTC
TTCAATGACA ATGACTTCTT TGTGTCAGCT GAGATGGAAG GCTCCTTGAA GGTGTTCGTC
AAGAATCCTT TCAAGGGTGA AAGTGTTCCT CCATATACTT TGATGCAAGC CGGTAACTTC
GCCAAGTCTA CATCCACTGC TTGGAGCGGA AAGGTGTCTA CATCTGCTTG GGTATTACAT
GGGTCTGATG TGTCCAAGAA AGATTTCGAT GGCTCATTAT TAGCAGGTGG TGAGTTTAAC
TATAAGTCAA AGAAAGAGTT CTTACCACCA ACTCAATTGA CTATGGGATT TGGTCTCTAC
TTATTGGGTG ACGAAGAAAC TGCTCAAAAG TATACAAAAC TCAGAGTTAA TAAAGAAGTG
GAGCACGGTT TTAAGGTCGT AATGGATAAC AAGAAGAAAG ACTTGGAAGA CTTGATCAAG
CAGTTGGAAA CTTCTGAACT AGGAGAAGAC AATTTAGAAG GTGCAGCTGT AGAAAATGAT
GAAAAAACAG AAACAGCTAA AGATAATAAA AACGGTAGCC TGGAGCCAGA AGAGTCAAAT
GAAAGTGACG ACATTGTTTC AATTCAGTCT TCTGTAAGTG AAGCAGGCAA AAATAATTCT
ACCAGAGTGA GAGGGAAGAA AGCAAAATTG AAGAAAATGG CACAAAAATA TGCGGATCAA
GATGAAGAAG AAAGAAGATT GAGAATGACT GCATTAGGAA CTTTGCACCA AGTCGAGCAG
CAGCAAAAGG AGAAAGAAAT CGAACTTCAG AAAGCTGCTG AAAAAGAAAA GGAAAAGTAC
CGAGAATCGG CAGCTGTCCA AAGGCGCAAG AAAGAACAAC AAAGAGAATT GCAACGTTAC
TTGGAAGATG AAAACGAAGA CGAAGCTAGT GCAATGAATT ATCTTGAGAT CTTGGATTCT
TTCCTTGCTA AGCCACAACC AAACGATAAA TTTTCCGCCA TTGTCCCCGT ATTTGGACCA
TGGTCCGCAT TGCAGAAATT AAAATACAAA GTCAAGATTC AACCCGGTTC AGGAAAGAAA
GGGAAGTGTA TCAACGACTC GATGCATTAC TTCACCACCC GTAAAGAGGA TAGCACTAGT
ACAGATACCG ATTTGGACTG GCCAGCCGAA AGACAATTGA TCAATGAGAG CAAGCCCAAC
GACTTGTTGG GAGTCTTTAC GGTCGGAAAA GTCAAGTTGG TACTTCCTGG TGGACAGGAC
AGTAACAATA AGATGAAGGC TTCCAAAAAG CCGGCCAAAA AAGGTGGGAA AAAGAAGTAG
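
The gene sequence is displayed in space-separated groups of ten bases, so it has to be cleaned before any computation. The sketch below shows one possible way to strip the grouping and compute a GC percentage that could be compared with the "GC content" field. The function names are hypothetical and not part of any database tool, and the sample input is only the first displayed line of the sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Return the percentage of G and C bases in a (possibly grouped) sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# First displayed line of the gene sequence, used only as sample input.
first_line = "ATGAAGGTAA GTTTCTTAGA TCTCTCTTGA GCTAATTGGT TAAATTTCTT TTACTAACAT"

print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

Pasting the whole sequence block into `gc_percent` would give the value for the full gene; how the database rounds its own figure is not stated in the record.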
 
Protein sequence
MKQRITGLDL QIITRELVAS IANYRLQNIY NLAGSNRQYV LKFSVPDSKK IVVLDCGNRV 
HLTDFDRPTT PAPSNFVSKL RKHLKTRRLS GIKQVGNDRV LVLEFSDGLF YLVLEFFSAG
NVLLLDDNLK ILSLQRNVKE KGENDKYAVN EIYKMFDKSL FSEDFKYEKR DYNVDEIKAW
IKEQRIKVEN QSQEPSSGKK SKVFSIHKLL FVNVSHLSSD LILKNLQNAG ISGSSSCFEF
AEDNEKLSTI VGALDKSEQE YISFISAGDN EQTNGFIVSK KNPLYNPSEE HSDNDLEYVY
DEFHPFKPFK KNLEGYKFTE IEGYNKTLDT FFSALESTKF ALKIEQQKQN ANKRLENARS
ERNKQIQSLI QQQETNSKKG DTIIYHADLV ASCISAIQKM LDKQMDWGNI EAIVKHEQSS
GNEIMSTIKL PLNLNENKIN LVLPDPEHIS YSEDDNSDNS DSDSESDSQS ESESESESES
ESESENDSDS DSDLEMTRKK AKSPRESSKE KKKKVPHTLS VWIDLSLSPY ANARLYFESK
KSAESKKEKV EKNTEMALKN AERKIKQDLA HNLKNEHDTL KQLRPKYWFE KFYWFVSSEG
YLCLAGRDPS QTDMIYYRFF NDNDFFVSAE MEGSLKVFVK NPFKGESVPP YTLMQAGNFA
KSTSTAWSGK VSTSAWVLHG SDVSKKDFDG SLLAGGEFNY KSKKEFLPPT QLTMGFGLYL
LGDEETAQKY TKLRVNKEVE HGFKVVMDNK KKDLEDLIKQ LETSELGEDN LEGAAVENDE
KTETAKDNKN GSSEPEESNE SDDIVSIQSS VSEAGKNNST RVRGKKAKLK KMAQKYADQD
EEERRLRMTA LGTLHQVEQQ QKEKEIELQK AAEKEKEKYR ESAAVQRRKK EQQRELQRYL
EDENEDEASA MNYLEILDSF LAKPQPNDKF SAIVPVFGPW SALQKLKYKV KIQPGSGKKG
KCINDSMHYF TTRKEDSTST DTDLDWPAER QLINESKPND LLGVFTVGKV KLVLPGGQDS
NNKMKASKKP AKKGGKKK