Gene PICST_28766 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_28766 
Symbol 
ID4851516 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2034293 
End bp2035804 
Gene Length1512 bp 
Protein Length503 aa 
Translation table 
GC content42% 
IMG OID640393224 
Productpredicted protein 
Protein accessionXP_001388017 
Protein GI126274720 
COG category[A] RNA processing and modification 
COG ID[COG5182] Splicing factor 3b, subunit 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.194827 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACCCA AGAGAAGCAA GAACCAGCTT CGTAGAGAAC GGGTGAAGCT CCGAAAGCTA 
GAAGCTGACA AGAAAGAAGA TACATTGGAG AGCACTGAAA TACAAAAGCC AGACAAAGTA
AAAGATTCAG AAACGACAAA TAATGACAAT AAAGAACATA GAAATGACAA AGCTCACGAC
ATAGATACCG CTGAAGAGCA GAAAGATTCA GTCGAAAAGT CTGTTGAGAA TCTTGAAGAT
TTTATAGCGA TTGCCAAAGA TTCTTTTCAA TCTGTTCCGG TTAATACTTC AATAGATGAG
TCTCTTTATC AGCAGTTCCA GGGAGTGTTC AGTAAGTTTC AAGGAGCTAC TGTTGCTGAA
GAAGAAGTAG AATCTGTTCC AGAATCCAAA GGTGATGTTC TCTATAATAG TGGGTCTGAT
GAAGAGTCAG AATTGGAGTC CCTGGATTCT GAAGAAGAGG AAGAGCTTTC TAAACGACAA
CTTCGTAAAC GTAACAAAGT GCCATTGGCC TCGCTCAAGG CATCAACTAT ACGGCCACAA
CTCGTTGAAT GGTATGATGT AGATGCTCTG GACCCGTTCT TCCTCGTAGC ATTAAAGACG
AGCCCCAATG CGGTTCAGGT ACCGAGCCAC TGGTCAGCAA AGAGAGAGTA TCTTTCATCG
AAGAAAGGAA TAGAACGATT ACCGTTTCAG TTGCCAAAAT TTATCACCGA TACAGGTATT
CAAGATATGA GACATAGTGA TGATCAAACG TTGAGGCAAC AGCAAAGAGA CAGAGTACAG
CCCAAGATGG GTCGATTGGA TATCGATTAC CAGCGCCTTC ATGATGCTTT TTTTAAATAT
CAGGAAAAGC CACGATTGCT TGGTTTTGGA GATGTATACT TTGAAGGTAG AGAAGCAGCT
GATGAATATA GCAATGATCT CTCTAGCATA AGACCTGGCA AGGTGTCATC TGAGCTACGT
AAAGCTTTAG GTATTCCTGA AGGTGCACCT CCTTGGATTT CCATTATGAA GGATATAGGT
AAGCCTCCTG CCTATTCCAG TCTAGCAATA CCTGGATTAG ATACTTCATA CGATAATGAC
GGCTACAGAG ATAGTAAATC TGTGAACACT AGTAAATTGC ACGAAACAGA ACATTGGGGG
AAGCTAGAGG ACTACGAAGA ATCTGAAGAA GAAGAAGTGG ATGGTGAAGA GGAAGATGAA
GAGCTGGATG CTGACGATGA CGAAATGATT GCATATGAAC AGGAAGAGGA ACAAGCTGAT
GATGAACCTG TGAAGGTGCA GATTTCAGAA TATGGTGGAA TAAAGTCAAG ACCACACAAG
CCTGTTGATG AAAGCAATGA ACACAAGTCG CTTTACACTG TAATCAAAGA GAAACAACCT
CTGGAAGGAG CTGGTCTATT ACAGAGTGGC TTTTCCTACG ACCTCTCCAA AGATTCACAA
GTTGATGAGA CACCTGTTAA ACAGGATACG AAGGTTGAAA CTCTTGAACC TAAGAAGAAG
TTCAAGTTCT AG
 
Protein sequence
MPPKRSKNQL RRERVKLRKL EADKKEDTLE STEIQKPDKV KDSETTNNDN KEHRNDKAHD 
IDTAEEQKDS VEKSVENLED FIAIAKDSFQ SVPVNTSIDE SLYQQFQGVF SKFQGATVAE
EEVESVPESK GDVLYNSGSD EESELESLDS EEEEELSKRQ LRKRNKVPLA SLKASTIRPQ
LVEWYDVDAL DPFFLVALKT SPNAVQVPSH WSAKREYLSS KKGIERLPFQ LPKFITDTGI
QDMRHSDDQT LRQQQRDRVQ PKMGRLDIDY QRLHDAFFKY QEKPRLLGFG DVYFEGREAA
DEYSNDLSSI RPGKVSSELR KALGIPEGAP PWISIMKDIG KPPAYSSLAI PGLDTSYDND
GYRDSKSVNT SKLHETEHWG KLEDYEESEE EEVDGEEEDE ELDADDDEMI AYEQEEEQAD
DEPVKVQISE YGGIKSRPHK PVDESNEHKS LYTVIKEKQP LEGAGLLQSG FSYDLSKDSQ
VDETPVKQDT KVETLEPKKK FKF