Gene PICST_55332 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_55332 
Symbol 
ID4837068 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp683678 
End bp685075 
Gene Length1398 bp 
Protein Length430 aa 
Translation table12 
GC content43% 
IMG OID640388383 
Productpredicted protein 
Protein accessionXP_001382371 
Protein GI150863777 
COG category[A] RNA processing and modification 
COG ID[COG5623] Pre-mRNA cleavage and polyadenylation factor IA/II complex, subunit CLP1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.692936 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTCGCCATAC CTGAGGGTTC AGAATGGCGA ATTGAAGTAC CGCATAAGAC AATTCTCAAG 
TTCAAAGTAA CTGAAGGAGT AGCAGAGATT TTTGGCACGG AATTACCAAT AAATGTTGAG
CTACAGATAT CTGGAACGAA GACAATGGTG TATGCTCCCA TAAGTGGGGG TGCGAAGTTA
GTTTACGAAA CTTTTAAAAA CAAAACGGTG ATGTCGAATG AAAGTGAAGA AATTGTAGAG
TATCTCTCTA ATGACTCTGT AATGGCCAAC TATATAAATT TGCACTTAGT TGTCGAAGCC
ATGCGACAGC AGGTTTCAGA TAACAACATA TTGAATCCGA CAGAATTACA GCTGGGCCCG
CGGGTGTTAA TCGTAGGGAA CGGCAATTCA GGCAAGACGT CATTGGCGAA GTTGTTGAGT
GCATATGCTA TAAAACTGGA CAGTACGCCG GTGCTAGTAA ACTTGAATCC TCGGGATGGC
GTTTTCTCGT TGCCCGGATC ATTGACGGCA ACGCCTATCA GCGATAGTCT AGATGTGGAA
CTGGCTAACG GTTGGGGGTT CACGACAACG TCTGGATCAT TATTCCACAA TCCCAAACAG
CCCATAGTAA AGAACTACGG CTTTGTAGAT GTGAATGAAA ACTTAGACTT GTACAAGTAC
CAGGTTTCGA AGCTTGGAGT CACCGTACTC TCGCGTCTTG AAGAAGATAT AGCCTGTAGA
AACGGAGGCG TTATCATCGA CACACCTGCA TTGGGCATCA AAGACTTCAC GGTGATTGAA
AATATCGTTT CAGACTTCGA GGTAAACCTC ATTGTTGTTT TGGGAAATGA ACGGCTCATG
ATCGACTTAA AGAAACGCTT CAAACATAAG TCAGCTTTGC AGATTGTAAA AGTGCCCAAG
AGCGAAGGTT TGGTAGAAGT AGATGAAGCG TTCATACGTA GAACCCAAGA AGAGTCCATC
AAGGAGTATT TTAACGGAAA TTACAAAACA AGGTTGTCGC CCTTCAAAAC TGACATCGAT
GTTAACGATC ATACTATCTA CAAATGCGTA CTTTCTCTGG ATGTTAACTC AGCTCTTTCT
TTTCTACCAG CTGGTGATTC ATACACTGTT GCTGAAGACG AAGGCGACGA CAAGGATAAG
GCAGAAGATG AGTTGAACAA ATACTATACC CTCTTGTCAG AACCCAGTTC GTCTAACTTA
GACAACTCCA TTCTTGCCAT CACTCAGTTG CCTCTGACGC ACAAGCTGGG CAGAGAGTTG
TTGAACACAA GCATTCTTGG CTATGTCCAT GTATCCAAGT TTGACGATGC CAAAGGTAAA
ATCAAGGTAT TGTTACCTTT TCCAGGAGGA TTTCCCAGAA ACATGTTGAT TTCTACGAAC
ATTGGATTCA ATGAGTAG
 
Protein sequence
VAIPEGSEWR IEVPHKTILK FKVTEGVAEI FGTELPINVE LQISGTKTMV YAPIIYETFK 
NKTVMSNESE EIVEYLSNDS VMANYINLHL VVEAMRQQVS DNNILNPTEL QSGPRVLIVG
NGNSGKTSLA KLLSAYAIKS DSTPVLVNLN PRDGVFSLPG SLTATPISDS LDVESANGWG
FTTTSGSLFH NPKQPIVKNY GFVDVNENLD LYKYQVSKLG VTVLSRLEED IACRNGGVII
DTPALGIKDF TVIENIVSDF EVNLIVVLGN ERLMIDLKKR FKHKSALQIV KVPKSEGLVE
VDEAFIRRTQ EESIKEYFNG NYKTRLSPFK TDIDVNDHTI YKCVLSSDVN SALSFLPAEP
SSSNLDNSIL AITQLPSTHK SGRELLNTSI LGYVHVSKFD DAKGKIKVLL PFPGGFPRNM
LISTNIGFNE