Gene PICST_33476 details

Gene Information

Locus tag: PICST_33476
Symbol:
ID: 4840790
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009047
Strand:
Start bp: 735541
End bp: 737679
Gene Length: 2139 bp
Protein Length: 530 aa
Translation table: 12
GC content: 37%
IMG OID: 640392105
Product: predicted protein
Protein accession: XP_001386153
Protein GI: 150866518
COG category: [A] RNA processing and modification
COG ID: [COG5188] Splicing factor 3a, subunit 3
TIGRFAM ID:
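
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative and not part of the original record: the function names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive. Treating the difference between the gene length and the coding length as intronic sequence further assumes that the listed span runs from the start codon to the stop codon with no untranslated regions.

```python
# Values copied from the record above.
START_BP = 735541
END_BP = 737679
PROTEIN_LENGTH_AA = 530


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_aa + 1)


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)

print(gene_len)            # 2139, matches the Gene Length field
print(cds_len)             # 1593
print(gene_len - cds_len)  # 546 bp not accounted for by coding sequence
```

The non-coding remainder is consistent with the "Is gene spliced: Yes" field, although the record itself does not list intron boundaries.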


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.50882
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.174506
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAATGT TGCAACGTAA TGTTGGTGTG TCTTGTTCTT TATGAATCTC ACAGGCGATA 
TCAGCAACAG GATGGTGTAA ATATATACGA GATACCGTTG ATCTGAGAAT GTTAGGCTTG
ATGTCAATGA CTTGTGAAAG GCTGATAATG AGATGAGGAA TGAAAATCCA ATAAATCTTC
CTATGATTTT GTATCAATTG CTGTACAAAG GGGCCGTAAT TGACAAAAAG ACTCCATATA
TAGATTTCCT ATACTGAATC AACTATTTGT ATAGTTCAAT TATACTTGTG ATTTCAAAAT
TATAGTACTT GAAACTTGGG GATCGAGTGC TTATTTACAG TTATAGAATA TAGCGGTGCG
ATTGAACAAG TAGACAGAAT TAGCGAGAGA TAAAAAAAAT AGGGCGGTGC GGCTATTAAA
TCAGATGACT TCAATCACGC AGCTTTCTGC ATCTATATCG AGATGCCCCA GACATGTCTT
ACCCATAATC TCCAACTTTA TAGCAGACCA TCCGGTTAAA TACTTTAAGA ATAGTAATAC
ACTATAAGAA TGTGGCTGTT TCTTGAACTG CAGCGGTCGA TTCTTGAAGA GCTCGATGTA
ATCGAACTTG AATCCTCGCG TAGGTTCAGA AAAGATCCTC TATTGTATCC ACAGAATGAG
AATAAAGAAA AGCTACAGAT TGTCAGAACA AAGAGACCAC AGAAAGAAGT CAAGTTGCAG
CAGCATGAAT TGGCTGTTTT CCAACAGAAA TACAAGAAAC ATTGCAATTC ACTTAGAAAC
CATACGGCTA ACGATAGCGA CATAATACAA TCAATCCTTG GTACCCTAGA TGATTCCAAA
GCTACGTTTT CCAATTTTGA TTCTGCTCTA GCCCAAATTC AAGAGAAGCA TAACAAGACC
AATAATGGTG AAATTGAAGT GGCGGAGAGC ATACGCAATA TGTATACTAT GTTTTCCAGT
ATTCTCTTTT CTGGGGAAGA ATCTGTATTG CTAGATGATG ATATAAAGAG AGTAAGAAAA
GAAGGCAAAG AAAAGACAAA AGTCAAGAGA AAGCACATCA TCAGCATCAC TGCATCCCAT
CTTGATCCAG ATGGAATATA CTCTACTGAA GAAGTATATG GAAAATATTT GGATTTGACA
AAATTCCACG AGATCTACAG AAATCAGACT TCCAGCAACG TCTCGTATTT GGAGTACTTG
AAAGTGTTCG ATATCTTTCC ATACGCAGAA AGCTTCCGCA GTTCAAGCAT ATATCTACAG
TACTTGAGAG ATCTAAGTGA ATATTTGGTA GACTTCGTAC TGAGAACAGA GCCATTGCAG
AACTTCAATG AAGTGTTTGA ATCTATTAAG AAATCATATT CTCCTAAAGA AGAACCAGCA
ACTAGAGATG GAGTAGAAAA TGAACTGGGT GAAGTGTATT GTAGTGTTTG TCAAAAGGTC
TTTGCAAAGA TATCAGTTTA TCAAGGCCAT TTAAATGGTA AGAAACACAA GAAGAATGCT
AAAGAATTGC AAACTGCAAC ACCAAAAGAG TCAATCATTT CTGAGAGTGA CTTGCAAGAG
CATATTAATA CAGAATTGGG TAAGTTCCTC TCCAACTATA AAGAGGCAAC TATACAGAAT
ACAGAGAGAA AATCAGCCAT GACTGAAAGA GAAAGATTGA TAGAGAATAC CACGATCGTC
GGAGACGAAT CTGACTATAC GACTGTATAT GATTCAAGTT CAGATAGTGG AAATGATTCC
AGCGATGAAG AAGAAAACGA GAACTTAAAA CACCTACCTT TAGGAGCCGA TGGAAAGCCA
ATTCCATTTT GGCTCTATAA ACTTCAGGGG TTGCACAAGA CTTACAACTG TGAAATATGT
GGCAATGTTA CATACAAAGG CAGAGTCACT TTTGAAAAAC ACTTTAGTGC ACCTAAGCAT
CAATATGGCT TGAAATGTCT TGGAATAACT GAGCAATTTG TGTCCTACTT TAAGGATATA
ATACTGATTA ACGAAGCACA AGATCTCTGG AAAAGATTGA AAAGAGATAA AAGAATCAAG
GAAGGAGACA TCGAGAATGC TGTGGAAGTC GAAGACGCCG AAGGTAATGT CATGTCAGAG
AAGGATTACC TTGATTTGAA GAAACAAGGT CTATTATAG
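
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be stripped before the sequence can be measured. The following is a minimal sketch of one way to do that and to compute a GC fraction like the one reported in the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as sample input, so the printed value describes that line and not the whole gene.

```python
def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGAAATGT TGCAACGTAA TGTTGGTGTG TCTTGTTCTT TATGAATCTC ACAGGCGATA"

seq = clean(first_line)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 3))  # 0.383
```

Passing the full sequence text to `clean` and `gc_fraction` in the same way would give the gene-wide figure.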
 
Protein sequence
MEMLQRNVAR SILEELDVIE LESSRRFRKD PLLYPQNENK EKLQIVRTKR PQKEVKLQQH 
ELAVFQQKYK KHCNSLRNHT ANDSDIIQSI LGTLDDSKAT FSNFDSALAQ IQEKHNKTNN
GEIEVAESIR NMYTMFSSIL FSGEESVLLD DDIKRVRKEG KEKTKVKRKH IISITASHLD
PDGIYSTEEV YGKYLDLTKF HEIYRNQTSS NVSYLEYLKV FDIFPYAESF RSSSIYLQYL
RDLSEYLVDF VSRTEPLQNF NEVFESIKKS YSPKEEPATR DGVENESGEV YCSVCQKVFA
KISVYQGHLN GKKHKKNAKE LQTATPKESI ISESDLQEHI NTELGKFLSN YKEATIQNTE
RKSAMTERER LIENTTIVGD ESDYTTVYDS SSDSGNDSSD EEENENLKHL PLGADGKPIP
FWLYKLQGLH KTYNCEICGN VTYKGRVTFE KHFSAPKHQY GLKCLGITEQ FVSYFKDIIS
INEAQDLWKR LKRDKRIKEG DIENAVEVED AEGNVMSEKD YLDLKKQGLL
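
The record lists translation table 12 (the NCBI alternative yeast nuclear code), in which the codon CTG is read as serine instead of leucine. The sketch below illustrates the effect on a 21-base stretch copied from the gene sequence (GACTTCGTAC TGAGAACAGA G), which corresponds to the residues DFVSRTE in the protein sequence. It is a small hand-built example, not the pipeline used to produce the record: the codon table is assembled from the standard NCBI amino-acid string, and the names `TABLE_12` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = ["".join(c) for c in product(BASES, repeat=3)]
STANDARD = dict(zip(CODONS, STANDARD_AAS))

# Table 12 differs from the standard code in its sense codons only at CTG.
TABLE_12 = dict(STANDARD, CTG="S")


def translate(dna: str, table: dict) -> str:
    """Translate a DNA string in frame 0, ignoring whitespace."""
    dna = "".join(dna.split()).upper()
    return "".join(table[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# Fragment copied from the gene sequence above; it contains one CTG codon.
fragment = "GACTTCGTAC TGAGAACAGA G"

print(translate(fragment, STANDARD))  # DFVLRTE
print(translate(fragment, TABLE_12))  # DFVSRTE
```

Only the table 12 reading matches the protein sequence listed above, so translating this gene with the standard code would give leucine where the record has serine.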