Gene PICST_56651 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_56651 
Symbol 
ID4838152 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp702929 
End bp706153 
Gene Length3225 bp 
Protein Length1074 aa 
Translation table12 
GC content41% 
IMG OID640389467 
Productpredicted protein 
Protein accessionXP_001383768 
Protein GI150864794 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1025] Secreted/periplasmic Zn-dependent peptidases, insulinase-like 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.798232 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TACTCTACCT CATCCCGTCT CACCCATCTC ATGCTGCATT ATACGGTCTT GGCCGAAAAC 
GACGTTATTG AGAAGCCTCT TCTCGACAAC AGAACCTATC GTTACTTAAA GCTCGACTCG
AACGATCTCC AGGTCCTCGT GATTCATGAC TCAACTGCTG ACAAGTCTGC GGCCTCTTTG
GATGTCAATG TAGGCTCGTT CGCTGACAAA AAGTATGGAA TTCCTGGTTT GGCTCATTTC
TGTGAACATT TGCTATTCAT GGGTACCGAA AAGTATCCGG CTGAAAATGA ATATTCCTCG
TATTTATCGA AACATTCTGG GTACTCCAAT GCTTATACTG CAGCAGAACA TACGAATTAC
TACTTTCAGG TAAGTGCCGA CTACTTAGAA GGTGCTCTAG ACCGGTTTGC TCAATTCTTC
GTGGCACCCC TTTTTAGCCA GAGCTGCAAG GATCGAGAAA TAAACGCTGT AGACTCAGAA
AACAAAAAGA ATCTCCAAAA CGATCTCTGG AGACTCTACC AGCTCGACAA GCTGAACAGT
AACCCTGACC ATCCTTACAA TGGGTTTTCT ACAGGTAACT ACCAGACTTT GCATGTAGAG
CCTTCTGAAA GAGGACTCAA TGTTCGTGAT GTCTTGCTCG ATTTCTACAG CAACAGCTAC
TCGTCGAACT TGATGAGTTT GGTTGTGTTG GGAAAGGAAG ATTTGGACAC TTTGTCAGCT
TGGGCTATCG AGAAGTTCTC GGCCGTGCCT AACAAGAGCT TAACAAGACC AAACTTCCAT
GGCGAAGTCA TCTTGACAGA TAAGTACCTC GGTAAGTTGA CCAGGGCCAA GCCCATTATG
GACAAGCATC AACTCGAGTT AACATTCATG GTTCCTGATG ATTTGGAAAC CAAATGGAAA
TCTAAACCCA ACGGCTACTT CTCCCATTTA TTGGGACATG AAAGCGAAGG CTCTGTGCTT
TTCTTCTTGA AACACAAGGG TTGGGTAACA GAATTGTCGT CTGGTAACAT GCGAGTCTGT
CAGGGTAACT CTTTCTTTAT CCTCGAATTT GAGTTGACTC CGGAAGGGTT GCAGAATTGG
AAGGAGATAG TAGTTTCTGT GTTTCAGTAC TTGAAGTTGA TTCTTCCAGA AGAGCCCAAA
AAATGGATCT ACGACGAGAT CTCTATGATG TCTGCCATTA ACTTCAAGTT CAGGCAAAAG
GCAGATGCCG CTAACACTGT TTCTCTGATG AGCAACACTT TGTACAAGTT TGCTGTAGAT
GGTTATATTC CTCCAGAATA CATTTTGAGT TCTTCTGTTT ACAGAGAATT CAACAAACAG
GAGATCATAG ACTTTGGCAA ATTCTTGAAT CCAAACAATT TCAAGATTTC TTTGGTTTCG
CAATCGTTAG ATGGCTTGAA CAAGTCCGAA AAATGGTACG GAACTGAGTA TGCTTATGAA
GATATACCAG TAGATTTATT ACAAAACGTT GAATCGGCGC AATTGAATCC ACACTTTCAC
TATCCCAAAC CCAACGATTT CATTCCCAAG GACTTTGAAG TGTTAAGAAA GAAGTCAGAA
ACTCCTTTGC AACATCCTTA CCTCATTGAA GAGAGCAACA AGCTCCAGGT CTGGTACAAG
CAAGACGATC TTTTTGAAGT TCCCAAAGGT AATATCGATA TAGTGTTCCA TTTGCCCAAC
TCCAACTTGG ACAAGAAGAC TTCTACCTAC TCGTCTTTGT TGGCTGAATT GATAACCGAT
GAATTAAACC AAGTCACTTA TTATGCCTCA TTGGTTGGCT TAAAGGTGCT GATATCATGC
TGGCGTGATG GATTCAACGT GAGAGTATCA GGTTACAGTG ACAAACTTCC AGTGCTTTTA
GATCAGGTTT TGTCTAAATT CTTTAATTTC AAACCTAACA AGGAAAGGTT TGAAGCTATC
AGATTCAAAT TGTATCAACA ATTCAAGAAT TTTGGATATG ATGTTCCGTA CCGTCAAATT
GGAACTCATA TTTTACTGTT GCTCAACGAG AAAACCTACA CTTATGATGA AAAAGTTCAA
GTTATGGACG AAGATCTTTC ATTTGACGAA TTGAACGAGT TTGCAACCAA GAATCTTTGG
AAATCAGGAA TCTTTACTGA AGTTTTGATC CATGGTAACT TTGATATCGC CAAAGGAGAT
GAAATCAGAA AATTGATAGC TAGTCACACC AAGAGTCTTG CTCCTATTGC TGATACTTTA
GATGATGTCA ACAAAGCCAT CAAGCTTCAG AACTTTGTGC TTCCATCTAA GGAGTTTATT
AGATACGAAT TGCCATTGCA AGATGAGAAA AATATTAACT CTTGTATTGA GTACTACATC
CAGATCAGTC CCACGAACGA TGATCCCAAA TTGAGAGTGT TGACTGATTT ATTTGGTACC
ATTATCCGTG AACCTTGCTT CAATCAATTG AGGACAAAGG AACAACTAGG TTACGTAGTT
TTTTCTGGTA CTAGATTGGG CAGAACGTCG ATAGGTTTCA GAATCTTGGT TCAGTCGGAA
AGAACTGCAG ACTATTTGGA GTACAGAATT GATGAGTTTT TGGGCAAGTT TGGCAAGCAC
ATCAACTCAG AGTTGACAGA AGTCGATTTC GTCAAATTCA AGCAAGCACT CAAGGACCTT
AAATTATCAA AATTGAAGCA CTTGAACGAA GAAACTTCTA GACTCTGGAA CTCCATTACT
GATGGCTACT TTGATTTTGA AGCCAGACAG AAGCATGTTA AGATATTGGA GACCATCAGT
AAGGAAGAGT TTGTCGATTT CTTTAACAAC TACATTGCTG ATGGATCTGA CAAGTCCGGC
AAGCTTGTCG TCTACTTGAA TTCGCAATCT CCTCCTGAAC AGACAACGCT CAAGTTGGCA
CATAGTTCTA TTATAAACTA TATCTACAGA AATGGCTACG AGGCTTCCAC CGAAAAGTTG
GAATCCATAG TGAAGGAGAA TCTTGAAAAT CACCAACAGT TGGTTAAACA AGTTGCTGAA
GAAATTCTGG AATATGTATC CAACAAACCA CCAGCTAACT TGAAACAAGA TTTGCTTATT
GCTTTTGAAA ACGATATCAA GACTCCAGTT CCCACCAAAT ACCGCCAGGG AACGGTGTAC
AAGGATATTT CAGAATTCAG AAAACACTAC TCTTTGGGAG GAGTTCCTTC AGCAGTAGAG
CCTTTGACCA AATATTACTA TCCTGGACGT AACCCACACT TATAA
 
Protein sequence
YSTSSRLTHL MSHYTVLAEN DVIEKPLLDN RTYRYLKLDS NDLQVLVIHD STADKSAASL 
DVNVGSFADK KYGIPGLAHF CEHLLFMGTE KYPAENEYSS YLSKHSGYSN AYTAAEHTNY
YFQVSADYLE GALDRFAQFF VAPLFSQSCK DREINAVDSE NKKNLQNDLW RLYQLDKSNS
NPDHPYNGFS TGNYQTLHVE PSERGLNVRD VLLDFYSNSY SSNLMSLVVL GKEDLDTLSA
WAIEKFSAVP NKSLTRPNFH GEVILTDKYL GKLTRAKPIM DKHQLELTFM VPDDLETKWK
SKPNGYFSHL LGHESEGSVL FFLKHKGWVT ELSSGNMRVC QGNSFFILEF ELTPEGLQNW
KEIVVSVFQY LKLILPEEPK KWIYDEISMM SAINFKFRQK ADAANTVSSM SNTLYKFAVD
GYIPPEYILS SSVYREFNKQ EIIDFGKFLN PNNFKISLVS QSLDGLNKSE KWYGTEYAYE
DIPVDLLQNV ESAQLNPHFH YPKPNDFIPK DFEVLRKKSE TPLQHPYLIE ESNKLQVWYK
QDDLFEVPKG NIDIVFHLPN SNLDKKTSTY SSLLAELITD ELNQVTYYAS LVGLKVSISC
WRDGFNVRVS GYSDKLPVLL DQVLSKFFNF KPNKERFEAI RFKLYQQFKN FGYDVPYRQI
GTHILSLLNE KTYTYDEKVQ VMDEDLSFDE LNEFATKNLW KSGIFTEVLI HGNFDIAKGD
EIRKLIASHT KSLAPIADTL DDVNKAIKLQ NFVLPSKEFI RYELPLQDEK NINSCIEYYI
QISPTNDDPK LRVLTDLFGT IIREPCFNQL RTKEQLGYVV FSGTRLGRTS IGFRILVQSE
RTADYLEYRI DEFLGKFGKH INSELTEVDF VKFKQALKDL KLSKLKHLNE ETSRLWNSIT
DGYFDFEARQ KHVKILETIS KEEFVDFFNN YIADGSDKSG KLVVYLNSQS PPEQTTLKLA
HSSIINYIYR NGYEASTEKL ESIVKENLEN HQQLVKQVAE EISEYVSNKP PANLKQDLLI
AFENDIKTPV PTKYRQGTVY KDISEFRKHY SLGGVPSAVE PLTKYYYPGR NPHL