Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_55332 |
Symbol | |
ID | 4837068 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 683678 |
End bp | 685075 |
Gene Length | 1398 bp |
Protein Length | 430 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640388383 |
Product | predicted protein |
Protein accession | XP_001382371 |
Protein GI | 150863777 |
COG category | [A] RNA processing and modification |
COG ID | [COG5623] Pre-mRNA cleavage and polyadenylation factor IA/II complex, subunit CLP1 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.692936 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTCGCCATAC CTGAGGGTTC AGAATGGCGA ATTGAAGTAC CGCATAAGAC AATTCTCAAG TTCAAAGTAA CTGAAGGAGT AGCAGAGATT TTTGGCACGG AATTACCAAT AAATGTTGAG CTACAGATAT CTGGAACGAA GACAATGGTG TATGCTCCCA TAAGTGGGGG TGCGAAGTTA GTTTACGAAA CTTTTAAAAA CAAAACGGTG ATGTCGAATG AAAGTGAAGA AATTGTAGAG TATCTCTCTA ATGACTCTGT AATGGCCAAC TATATAAATT TGCACTTAGT TGTCGAAGCC ATGCGACAGC AGGTTTCAGA TAACAACATA TTGAATCCGA CAGAATTACA GCTGGGCCCG CGGGTGTTAA TCGTAGGGAA CGGCAATTCA GGCAAGACGT CATTGGCGAA GTTGTTGAGT GCATATGCTA TAAAACTGGA CAGTACGCCG GTGCTAGTAA ACTTGAATCC TCGGGATGGC GTTTTCTCGT TGCCCGGATC ATTGACGGCA ACGCCTATCA GCGATAGTCT AGATGTGGAA CTGGCTAACG GTTGGGGGTT CACGACAACG TCTGGATCAT TATTCCACAA TCCCAAACAG CCCATAGTAA AGAACTACGG CTTTGTAGAT GTGAATGAAA ACTTAGACTT GTACAAGTAC CAGGTTTCGA AGCTTGGAGT CACCGTACTC TCGCGTCTTG AAGAAGATAT AGCCTGTAGA AACGGAGGCG TTATCATCGA CACACCTGCA TTGGGCATCA AAGACTTCAC GGTGATTGAA AATATCGTTT CAGACTTCGA GGTAAACCTC ATTGTTGTTT TGGGAAATGA ACGGCTCATG ATCGACTTAA AGAAACGCTT CAAACATAAG TCAGCTTTGC AGATTGTAAA AGTGCCCAAG AGCGAAGGTT TGGTAGAAGT AGATGAAGCG TTCATACGTA GAACCCAAGA AGAGTCCATC AAGGAGTATT TTAACGGAAA TTACAAAACA AGGTTGTCGC CCTTCAAAAC TGACATCGAT GTTAACGATC ATACTATCTA CAAATGCGTA CTTTCTCTGG ATGTTAACTC AGCTCTTTCT TTTCTACCAG CTGGTGATTC ATACACTGTT GCTGAAGACG AAGGCGACGA CAAGGATAAG GCAGAAGATG AGTTGAACAA ATACTATACC CTCTTGTCAG AACCCAGTTC GTCTAACTTA GACAACTCCA TTCTTGCCAT CACTCAGTTG CCTCTGACGC ACAAGCTGGG CAGAGAGTTG TTGAACACAA GCATTCTTGG CTATGTCCAT GTATCCAAGT TTGACGATGC CAAAGGTAAA ATCAAGGTAT TGTTACCTTT TCCAGGAGGA TTTCCCAGAA ACATGTTGAT TTCTACGAAC ATTGGATTCA ATGAGTAG
|
Protein sequence | VAIPEGSEWR IEVPHKTILK FKVTEGVAEI FGTELPINVE LQISGTKTMV YAPIIYETFK NKTVMSNESE EIVEYLSNDS VMANYINLHL VVEAMRQQVS DNNILNPTEL QSGPRVLIVG NGNSGKTSLA KLLSAYAIKS DSTPVLVNLN PRDGVFSLPG SLTATPISDS LDVESANGWG FTTTSGSLFH NPKQPIVKNY GFVDVNENLD LYKYQVSKLG VTVLSRLEED IACRNGGVII DTPALGIKDF TVIENIVSDF EVNLIVVLGN ERLMIDLKKR FKHKSALQIV KVPKSEGLVE VDEAFIRRTQ EESIKEYFNG NYKTRLSPFK TDIDVNDHTI YKCVLSSDVN SALSFLPAEP SSSNLDNSIL AITQLPSTHK SGRELLNTSI LGYVHVSKFD DAKGKIKVLL PFPGGFPRNM LISTNIGFNE
|
| |