Gene PICST_65037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_65037 
Symbol 
ID4851858 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp3024002 
End bp3026851 
Gene Length2850 bp 
Protein Length949 aa 
Translation table 
GC content44% 
IMG OID640393566 
Productpredicted protein 
Protein accessionXP_001387145 
Protein GI126275808 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5110] 26S proteasome regulatory complex component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.882076 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCCTG CCAAAGAAAC GGAAACCGTG CCTCCCAAGG AGGTTACTGA AGCTACTAAG 
AAGAAGGAAC AGAAGGAGGA AGAGTTGTCT GAAGAAGACC AGCGTTTGAA GGATGACTTG
GAACTTCTCG TGGAACGTTT GGCAGAGCCA GATCAAGAGC TGGGTTTGTA TGACAAATAC
TTGTCGACAT TGAAGACCTA TATTAAGGAT TCCACTACTT CAATGACAGC TGTGCCCAAG
CCATTGAAGT TCTTGAGACC TCACTATCCT GCCTTGACTG AATTGTACGA CAAATGGGCT
GAAACTCACG GTGTCAAAAC AAATGTTGTC GTAAGTTTGG CCGATATTTT GTCTGTATTA
GCTATGACTT ACTCTGACGA CGGAAAGAGA GACTCGTTGA AATACAGATT GTTGTCGTCT
GATACTACTA TCGCCGACTG GGGTCACGAA TACATGCGTC ACTTGGCATT AGAAATCGGA
GAATCGTACC AGGAGGAGTT GGGTTCGGAC GAATCCATCG AAAAATTGGT CCAATTGGCT
ATACAGATTG TTCCTTTCTT CTTGGAACAC AATGCAGAAG CTGACGCCGT AGACTTGCTC
TTGGAAATCG AAAGTATTGA CAAGTTACCT AAGTTCGTAG ATGACAGCAC TTACGCCAGA
GTATGTCTTT ATATGGTCAG TTGTGTACCT TACTTAGCTC CTCCCGACGA TAAGTCATTC
TTGCACACTG CCTATGCCAT CTATTTGGAA CATAGTCAAT TGACACAAGC CTTAACTTTG
GCTATCAAGT TAGATGATGA GGACTTGATT AAGAAAGTTT TTGAAGTCAC CGACGATGTC
TTGATTCATA AACAGTTGGG ACTCATGCTC GCCCAGCAGA AGAACAGCTT CAAGTACCCT
GGTGAGAATC CAGAAGTGCA GGAAACCATC CACAATGTCA AGTTGCACGA ATACTACAGC
TATCTTGCTA AGGAGTTGAA TCTCTTAGAT CCAAAGGTGC CTGAGGATAT CTATAAATCA
CACTTGGAAA ACGCCAAGTT TGGTTTGGGA ACCTCTGGCT CTATTGACTC TGCCAAGCAG
AACTTGGCTG CTGCGTTTGT CAACACCTTC ATCAACTTGG GATATGGTAC CGACAAGTTA
ATTCAAACTG AAGAAGACAA CAAGTCGTGG ATCTACAAGA CCAAGGGACT CGGCATGGTT
TCGACTACTG CCTCGTTGGG TTCTCTCCAC CAATGGAATA TCAACGAAGG TTTCCAGGTC
TTGGACAAGT ACACCTATTC TTCTGAGGAC GAAATTAAAG CTGGTGCCTT ATTGGGTACG
GGTATTGTTT CTGCTAATGT TCACGACGAT GTCGAAGCTG CTTTGGCTTT ACTTCAAGAC
TATGTCTCTG ACCCAAACAA GCTCTTGCAA TCTTCTGCCA TTAATGGGTT AGGTATTGCA
TTTGCCGGTT CCTCCAATGA GGAAGTGTTG AACTTGTTAT TGCCATTGGT TTCTGATTTA
GATATTCCTT TTGAAATCTC CTGCTTAGCA GCTTTGGCTT TAGGGCATGT ATTTGTTGGT
ACTTGTCACG GCGATATTAC ATCCACCATA TTGCAGACCT TATTGGAAAG AGACTTCATT
CAATTGACCA ATAAGTTCAT CAAATTCATG TCTTTAGGTT TGGGTTTGTT ATACATGGGC
AAGACCGAAC AAGTCGAGGA CGTATTGGAA ACCATCGATG CCATCGAACA CCCTATCTGT
AAGACCTTGA AGGTCTTAGT GAATATCTGT GCTTATGCTG GTACCGGTAA TGTTTTGCAG
ATCCAGTCCT TATTACAGAT GTGCACTGCT AAGCCTAAAG ATCAAGCCGC TGATGAAAAG
AAATTGGAAC AATCAGAAGA AGACCAATTG AAGAGCGAAA ACAAAGAAGA CAAGACAGCT
GAAGATGTCG CCGACGTAGA AATGGAAGAA GCCACAGCTG AAAAGAAGGC AAGCGAAAAG
GAAGAAAAAG AAGAAAAAGA GGAAGAGGAA GAGGAAGAAA AGGAAGAGGA AGAAGAGTTG
TTTCAGGGTA TTGCTGTTTT GGGTTTGGCC TGTATTGCTA TGGGAGATGA AATCGGTCAA
GACATGTCAC TTCGTCACTT TGCCCATTTG ATGCATTATG GTAATTCTCT GATCCGTAGA
GCTGTGCCAT TGGCAATGGG ATTGGTGTCT ACTTCATATC CACAAATGAA GGTGTTTGAT
ACCTTGTCTC GTTACTCGCA CGACCCAGAC TTGGAAGTAG CTCAGAATGC CATCTACTCC
ATGGGTTTGG TTGGTGCTGG TACAAACAAC GCCAGATTGG CACAGTTGTT GAGGCAATTG
GCTTCTTACT ATATCAAGTC GCCAGACTCA TTGTTCATGG TACGAATTGC ACAGGGAATC
TTGCACTTGG GTAAGGGCAC ATTGACGTTG AGTCCTTTCA ATAGTGAGAG AAGCATCTTA
TCCAAGGTTT CATTGGCATC GTTGTTGACT ATTTCTGTAG CATTATTAGA TCCAAAGGCC
TTCATCTTGA GTGACTCTAC TACCGAGACC TCCCATCAAG TATTGTACTA CTTGATTCCT
GGTGTAAAGC CCAGAATGTT GGTTACAGTA GATGAAGATT TAAACCCTAT CAAGGTGAAT
GTCAGAGTCG GTCAGGCTGT TGATGTGGTA GGACAGGCAG GTAGGCCCAA GACCATTACT
GGATGGGTGA CTCAATCTAC TCCTGTGTTG TTGAACTACG GAGAAAGAGC CGAGTTGGAA
AACACTGACG AATGGATCTC ATTGAGTGGG TCGTTGGACG GGGTTGTGAT TTTGAAGAAG
AACCCTGACT TCATGGAGGT GGATGCCTAG
 
Protein sequence
MSPAKETETV PPKEVTEATK KKEQKEEELS EEDQRLKDDL ELLVERLAEP DQELGLYDKY 
LSTLKTYIKD STTSMTAVPK PLKFLRPHYP ALTELYDKWA ETHGVKTNVV VSLADILSVL
AMTYSDDGKR DSLKYRLLSS DTTIADWGHE YMRHLALEIG ESYQEELGSD ESIEKLVQLA
IQIVPFFLEH NAEADAVDLL LEIESIDKLP KFVDDSTYAR VCLYMVSCVP YLAPPDDKSF
LHTAYAIYLE HSQLTQALTL AIKLDDEDLI KKVFEVTDDV LIHKQLGLML AQQKNSFKYP
GENPEVQETI HNVKLHEYYS YLAKELNLLD PKVPEDIYKS HLENAKFGLG TSGSIDSAKQ
NLAAAFVNTF INLGYGTDKL IQTEEDNKSW IYKTKGLGMV STTASLGSLH QWNINEGFQV
LDKYTYSSED EIKAGALLGT GIVSANVHDD VEAALALLQD YVSDPNKLLQ SSAINGLGIA
FAGSSNEEVL NLLLPLVSDL DIPFEISCLA ALALGHVFVG TCHGDITSTI LQTLLERDFI
QLTNKFIKFM SLGLGLLYMG KTEQVEDVLE TIDAIEHPIC KTLKVLVNIC AYAGTGNVLQ
IQSLLQMCTA KPKDQAADEK KLEQSEEDQL KSENKEDKTA EDVADVEMEE ATAEKKASEK
EEKEEKEEEE EEEKEEEEEL FQGIAVLGLA CIAMGDEIGQ DMSLRHFAHL MHYGNSLIRR
AVPLAMGLVS TSYPQMKVFD TLSRYSHDPD LEVAQNAIYS MGLVGAGTNN ARLAQLLRQL
ASYYIKSPDS LFMVRIAQGI LHLGKGTLTL SPFNSERSIL SKVSLASLLT ISVALLDPKA
FILSDSTTET SHQVLYYLIP GVKPRMLVTV DEDLNPIKVN VRVGQAVDVV GQAGRPKTIT
GWVTQSTPVL LNYGERAELE NTDEWISLSG SLDGVVILKK NPDFMEVDA