Gene Information |
Locus tag | PICST_54418 |
Symbol | |
ID | 4836885 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 1941210 |
End bp | 1942361 |
Gene Length | 1152 bp |
Protein Length | 346 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640388200 |
Product | predicted protein |
Protein accession | XP_001382601 |
Protein GI | 150863946 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG5333] Cdk activating kinase (CAK)/RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH/TFIIK, cyclin H subunit |
TIGRFAM ID | |
|
|
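The coordinate, length, and splicing fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive and that the gene span includes the stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 1941210
end_bp = 1942361
protein_length_aa = 346

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the coding sequence is the protein plus one stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

# Bases within the gene span not accounted for by coding sequence.
noncoding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp)    # 1152
print(coding_length_bp)  # 1041
print(noncoding_bp)      # 111
```

The computed span matches the listed gene length of 1152 bp. A nonzero remainder is what one would expect for a gene flagged as spliced, although the record itself does not list intron coordinates.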
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGCAG ATTTCTGGTG TTCTTCCCAG CGCAATAAAT GGCAACTCAG CCGGCAGTCG CTTCTAGAAG CTCGAAGAAA AGTCCTTCTT CTTGAGAGGA AAATGATTCA AAACGGCCTT ATTAAAGACT ATCCCAATAT CCACTATGAC TTTAACATGC GGATCTATTT GCACAACTTG TTGATCAAGC TTGGCCGCCG ATTAAACATA CGACAAGTGG CTCTAGCCAC TGCCGAAATT TATCTCAATC GGTTTCTCAC TCGTGTTCTG TTGAAGGAGA TCAATGTGTA TCTCTTGGTT ACTACCTGTT TGTATGTAGC CTGCAAGATA GAAGAGTGCC CTCAGCATAT ACGATTGATC ATTTCGGAAG CCCGTAACTT GTGGCCCGAA TACATACCGC ACGATGTTAC CAAACTCGCT GAATTCGAGT TCTACTTGAT AGAAGAAATG GACCTGTATC TTTTCCTTCA TCATCCGTAC AAGTCTTTAA TTCAAATCAG AGACTTTTTG AACGAGAACA GCGCTGTCTT TGGCTTTACA TTAACAGACG ATGAGCTTCA GAACGCCTGG TCGCTTGTAA ACGACAGCTA TATCACCGAT TTGCATTTGC TCTTGCCGCC TCATATAATT GCGGTGGCTC TGATCTACAT CACTATAGTG TTGAAGAAGA ATCTATCCGC AATTCGTGTG AACAGTAGTG CGGTTAACTC CAATGGAGGT CCAAATTCAA TGATGTTTAA CCGAAATCCG GACCAGAATT CTATGCATAT AGACGACTTG ATGATTCTAG CCAATCCTTC AACAGTCAAT GGCAACCAGA ATGCCTCCAC TGCTTCAGGA GTCGATAGTG GAAACACAGT AGCCGGGTCC GATCCCAATT CTGGTGGACA AAATGTAAAT AGTAACAGCA ATGGTACAGG TTCAGACTTG GTTAACAATC TCGAAAGGAC AAACTTTCAC GACATGAAAT TGGACGAAGA GACTATCAAG ATCAACAAGT TCATGAACTT CTTGGACCAT TCGCACATCA ACTTGGACGA GGTGGTGGAG GCCATGCAAG ACATGATCAA CATCTACGTT CAATGGAATC GGTACAACGA GCAGGGTGTG AAGAAGGCAT TGCAGGTGAT GCTTCTCAAT AGACAGTTAT AA
|
Protein sequence | MSADFWCSSQ RNKWQLSRQS LLEARRKVLL LERKMIQNGL IKDYPNIHYD FNMRIYLHNL LIKLGRRLNI RQVALATAEI YLNRFLTRVS LKEINVYLLV TTCLYVACKI EECPQHIRLI ISEARNLWPE YIPHDVTKLA EFEFYLIEEM DSYLFLHHPY KSLIQIRDFL NENSAVFGFT LTDDELQNAW SLVNDSYITD LHLLLPPHII AVASIYITIV LKKNLSAIRV NSSAVNSNGG PNSMMFNRNP DQNSMHIDDL MILANPSTPG SDLVNNLERT NFHDMKLDEE TIKINKFMNF LDHSHINLDE VVEAMQDMIN IYVQWNRYNE QGVKKALQVM LLNRQL
|
| |
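Translation table 12 is the alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The sketch below builds the standard codon table, overrides that one codon, and translates two excerpts copied from the gene sequence above: the first 30 bases, and an 18-base excerpt that contains a CTG codon. The function name `translate` and the way the table is built are illustrative assumptions, not part of the record. The sketch does not handle introns, so it should not be applied to the whole spliced gene.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI translation table 1), codons in TCAG order.
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Map each codon string to its one-letter amino acid.
STANDARD = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), STANDARD_AA)}

# Table 12 differs from the standard code at one sense codon: CTG -> Ser.
TABLE_12 = dict(STANDARD, CTG="S")


def translate(dna, table):
    """Translate DNA codon by codon, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# Excerpts copied from the Gene sequence field above.
first_30 = "ATGTCTGCAG ATTTCTGGTG TTCTTCCCAG"
ctg_excerpt = "CGTGTTCTGTTGAAGGAG"

print(translate(first_30, TABLE_12))     # MSADFWCSSQ
print(translate(ctg_excerpt, TABLE_12))  # RVSLKE
print(translate(ctg_excerpt, STANDARD))  # RVLLKE
```

The first result matches the opening residues of the listed protein sequence, and the table 12 reading of the second excerpt (RVSLKE) also appears in it, whereas the standard-code reading (RVLLKE) does not.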