Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_32024 |
Symbol | SDCG1 |
ID | 4839127 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 501244 |
End bp | 504423 |
Gene Length | 3180 bp |
Protein Length | 1038 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640390442 |
Product | highly conserved hypothetical protein Predicted RNA-binding |
Protein accession | XP_001385110 |
Protein GI | 150865765 |
COG category | [K] Transcription |
COG ID | [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.873906 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.425602 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGTAA GTTTCTTAGA TCTCTCTTGA GCTAATTGGT TAAATTTCTT TTACTAACAT TTTGTTCAGC AAAGAATCAC CGGTCTAGAC CTCCAGATCA TCACCCGTGA ATTGGTAGCT TCTATAGCAA ACTACAGATT ACAGAATATT TACAACCTTG CAGGCTCAAA TAGACAGTAT GTGTTGAAGT TCTCTGTTCC AGACTCTAAG AAGATCGTAG TTTTAGACTG CGGAAATCGT GTTCATTTAA CCGATTTTGA TCGTCCTACC ACGCCAGCTC CTTCCAATTT TGTTAGCAAG TTGAGAAAGC ACTTGAAGAC AAGAAGATTG TCAGGTATCA AACAGGTTGG AAATGACAGA GTGCTTGTAT TAGAGTTCAG CGATGGATTG TTCTACTTGG TGTTGGAATT TTTCAGTGCT GGAAACGTTT TGCTCTTGGA TGACAACTTG AAGATACTCT CGTTGCAGAG AAATGTAAAA GAAAAAGGCG AAAACGACAA ATATGCCGTA AACGAAATCT ACAAAATGTT TGACAAGTCT CTTTTTAGTG AAGACTTCAA ATACGAGAAG CGTGATTACA ATGTAGACGA AATCAAAGCT TGGATTAAAG AGCAGAGAAT CAAGGTAGAA AACCAACTGC AAGAGCCATC TTCAGGTAAA AAGAGTAAAG TATTTTCCAT TCACAAGTTG CTCTTTGTAA ATGTTAGTCA TTTATCTAGT GATTTGATTT TAAAAAATTT GCAGAATGCT GGCATAAGTG GCTCGTCTAG CTGTTTCGAG TTTGCAGAAG ACAATGAGAA ACTATCCACG ATTGTCGGTG CCTTGGATAA GTCTGAACAG GAGTACATTT CATTTATCTC TGCTGGGGAC AACGAGCAAA CTAATGGATT TATTGTATCC AAAAAGAATC CTTTGTACAA TCCTTCTGAA GAGCATAGTG ACAATGACCT CGAATATGTC TATGACGAGT TTCATCCCTT CAAGCCTTTC AAGAAGAATT TAGAAGGATA TAAGTTCACT GAAATAGAAG GTTACAATAA GACTTTGGAT ACATTCTTTT CTGCTCTTGA ATCTACCAAA TTTGCATTAA AGATCGAACA ACAGAAGCAG AATGCCAATA AAAGATTGGA AAACGCTCGT AGTGAAAGAA ACAAACAGAT ACAGTCCTTG ATCCAACAGC AGGAGACAAA CTCCAAGAAA GGGGATACTA TCATTTATCA TGCTGATTTA GTAGCATCGT GTATTTCAGC CATACAGAAA ATGCTAGACA AGCAAATGGA TTGGGGTAAC ATAGAAGCCA TTGTCAAACA TGAGCAAAGT AGTGGCAACG AGATTATGTC TACAATCAAG TTGCCTCTCA ATCTTAACGA GAACAAAATT AACTTGGTAT TGCCAGATCC GGAACATATT TCATATTCCG AAGATGATAA TTCTGACAAT TCTGATAGCG ATTCTGAATC GGATTCGCAG TCGGAGTCGG AGTCTGAATC TGAGTCCGAA TCAGAATCAG AAAGCGAAAA TGATTCAGAT AGCGATTCCG ATTTGGAGAT GACTCGTAAG AAGGCAAAAT CTCCAAGAGA ATCTTCCAAA GAAAAAAAGA AGAAGGTTCC ACATACCCTT TCCGTTTGGA TTGATTTGTC TTTGTCGCCT TATGCCAATG CAAGATTGTA TTTCGAAAGC AAGAAGAGTG CTGAAAGCAA GAAAGAAAAG GTGGAGAAGA ATACTGAAAT GGCTCTTAAG AATGCTGAAA GAAAGATCAA ACAAGATTTG GCTCATAACT TGAAAAATGA ACACGACACT TTGAAACAGT TGAGACCAAA GTATTGGTTT GAGAAGTTCT ACTGGTTTGT TTCTAGTGAA GGCTATTTAT GTTTGGCAGG GAGAGACCCT TCTCAAACTG ATATGATTTA CTACAGATTC TTCAATGACA ATGACTTCTT TGTGTCAGCT GAGATGGAAG GCTCCTTGAA GGTGTTCGTC AAGAATCCTT TCAAGGGTGA AAGTGTTCCT CCATATACTT TGATGCAAGC CGGTAACTTC GCCAAGTCTA CATCCACTGC TTGGAGCGGA AAGGTGTCTA CATCTGCTTG GGTATTACAT GGGTCTGATG TGTCCAAGAA AGATTTCGAT GGCTCATTAT TAGCAGGTGG TGAGTTTAAC TATAAGTCAA AGAAAGAGTT CTTACCACCA ACTCAATTGA CTATGGGATT TGGTCTCTAC TTATTGGGTG ACGAAGAAAC TGCTCAAAAG TATACAAAAC TCAGAGTTAA TAAAGAAGTG GAGCACGGTT TTAAGGTCGT AATGGATAAC AAGAAGAAAG ACTTGGAAGA CTTGATCAAG CAGTTGGAAA CTTCTGAACT AGGAGAAGAC AATTTAGAAG GTGCAGCTGT AGAAAATGAT GAAAAAACAG AAACAGCTAA AGATAATAAA AACGGTAGCC TGGAGCCAGA AGAGTCAAAT GAAAGTGACG ACATTGTTTC AATTCAGTCT TCTGTAAGTG AAGCAGGCAA AAATAATTCT ACCAGAGTGA GAGGGAAGAA AGCAAAATTG AAGAAAATGG CACAAAAATA TGCGGATCAA GATGAAGAAG AAAGAAGATT GAGAATGACT GCATTAGGAA CTTTGCACCA AGTCGAGCAG CAGCAAAAGG AGAAAGAAAT CGAACTTCAG AAAGCTGCTG AAAAAGAAAA GGAAAAGTAC CGAGAATCGG CAGCTGTCCA AAGGCGCAAG AAAGAACAAC AAAGAGAATT GCAACGTTAC TTGGAAGATG AAAACGAAGA CGAAGCTAGT GCAATGAATT ATCTTGAGAT CTTGGATTCT TTCCTTGCTA AGCCACAACC AAACGATAAA TTTTCCGCCA TTGTCCCCGT ATTTGGACCA TGGTCCGCAT TGCAGAAATT AAAATACAAA GTCAAGATTC AACCCGGTTC AGGAAAGAAA GGGAAGTGTA TCAACGACTC GATGCATTAC TTCACCACCC GTAAAGAGGA TAGCACTAGT ACAGATACCG ATTTGGACTG GCCAGCCGAA AGACAATTGA TCAATGAGAG CAAGCCCAAC GACTTGTTGG GAGTCTTTAC GGTCGGAAAA GTCAAGTTGG TACTTCCTGG TGGACAGGAC AGTAACAATA AGATGAAGGC TTCCAAAAAG CCGGCCAAAA AAGGTGGGAA AAAGAAGTAG
|
Protein sequence | MKQRITGLDL QIITRELVAS IANYRLQNIY NLAGSNRQYV LKFSVPDSKK IVVLDCGNRV HLTDFDRPTT PAPSNFVSKL RKHLKTRRLS GIKQVGNDRV LVLEFSDGLF YLVLEFFSAG NVLLLDDNLK ILSLQRNVKE KGENDKYAVN EIYKMFDKSL FSEDFKYEKR DYNVDEIKAW IKEQRIKVEN QSQEPSSGKK SKVFSIHKLL FVNVSHLSSD LILKNLQNAG ISGSSSCFEF AEDNEKLSTI VGALDKSEQE YISFISAGDN EQTNGFIVSK KNPLYNPSEE HSDNDLEYVY DEFHPFKPFK KNLEGYKFTE IEGYNKTLDT FFSALESTKF ALKIEQQKQN ANKRLENARS ERNKQIQSLI QQQETNSKKG DTIIYHADLV ASCISAIQKM LDKQMDWGNI EAIVKHEQSS GNEIMSTIKL PLNLNENKIN LVLPDPEHIS YSEDDNSDNS DSDSESDSQS ESESESESES ESESENDSDS DSDLEMTRKK AKSPRESSKE KKKKVPHTLS VWIDLSLSPY ANARLYFESK KSAESKKEKV EKNTEMALKN AERKIKQDLA HNLKNEHDTL KQLRPKYWFE KFYWFVSSEG YLCLAGRDPS QTDMIYYRFF NDNDFFVSAE MEGSLKVFVK NPFKGESVPP YTLMQAGNFA KSTSTAWSGK VSTSAWVLHG SDVSKKDFDG SLLAGGEFNY KSKKEFLPPT QLTMGFGLYL LGDEETAQKY TKLRVNKEVE HGFKVVMDNK KKDLEDLIKQ LETSELGEDN LEGAAVENDE KTETAKDNKN GSSEPEESNE SDDIVSIQSS VSEAGKNNST RVRGKKAKLK KMAQKYADQD EEERRLRMTA LGTLHQVEQQ QKEKEIELQK AAEKEKEKYR ESAAVQRRKK EQQRELQRYL EDENEDEASA MNYLEILDSF LAKPQPNDKF SAIVPVFGPW SALQKLKYKV KIQPGSGKKG KCINDSMHYF TTRKEDSTST DTDLDWPAER QLINESKPND LLGVFTVGKV KLVLPGGQDS NNKMKASKKP AKKGGKKK
|
| |