Gene PICST_51091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_51091 
SymbolPRD1.1 
ID4850851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp226796 
End bp228772 
Gene Length1977 bp 
Protein Length658 aa 
Translation table 
GC content40% 
IMG OID640392559 
Productsaccharolysin (oligopeptidase) 
Protein accessionXP_001387284 
Protein GI126273711 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TCATGGAACC ATACCCCACA ACAGATCGCT GATCTTACGG AAGAATTGAT AGAAACAACA 
AAAGCATTCA ATGATCACAT CGCTTCTTTA AGTAGTAATT TAACTGTAGA GGGAGTCTTG
CTTCCGTATA TCGATTTTGA AAACGAGAGC CAGCTACTCA TCAATCAATT GTTCTTCTAC
CAGTACGTTT CCAGTGATAA GGATATCAGG GATGCCTCTA CTGCTGCCGA AGAACTCTTT
CTGGAGAAGA TGATTGAGCA GTCATTGAGA ACTGATGTCT ACGAGGTATT TAAAAAACTA
CAAGAAAAAG TAGATTCTGG AATGTTGTCT ATTGCTAGCA AAGAACACCA ATTGTTCTTG
AGCAAAACCA TGCTTGGGTT TAGAAAAAAC GGATTACATT TACCAGAGGA CCAGAGACAA
GTGGTCAAGT CGTATCTCCT GAAATTGAAA GAGCTCTGTA TACATTTTTC CAAGAATGCC
AACGAAGAAA ATGGGTACAT TCTTTTCAGC AAAGAAGAGC TTGAAGGTGT CCCTAAACTG
ACGGTAGATT CATTCGAGCA AGTAGACAAG GATGGCGTTC AATTGTATAA GATGACATTC
AAGTATCCAG ACATTTTTCC CGTTCTGGGA TTTGCTAATA ATGAAACGAC AAGAAAAACA
GTATATCTAG GCAATGGTGA CAAATGCAAG GCAAACAATG TAATATTGGA GGAAATAATA
GCGTTGAGAT ATAAGCTCGC AAAGCTTCTT GGGTTCAACA GTTTTTCAGA CTACGTTCTT
GATGAGACTT TGGCTCAGAA CGTAACTACT GCTGTATCAT TTTTGACGGA TTTGAGAAGA
AAATTAACTC CATTGGCTCA AATAGAACTT GAGAAACTTT CTGAGTTCAA GGGTGCAGAA
GTATTCAAGT GGGACTTCAA ATACCTCGAG AACAAGATGT TATCGAAACA ATACCAGGTC
AACGAGACAG AAATAGCCGA ATACTTCCCC ATGGAATCAA CCATAGAGAA GATGCTTGCT
ATTTATGAGA AGCTCTTTGA TTTGGAGTTC CAACCGGTTC TAACCAATAC CTCGGTCTGG
CATGAGGATG TCAGACAGTA TTTGGTGTTG ATTGGTGAAG GTTCAAACAA AAAGTTTCTT
GGTGTCATTT ATTTTGATTT GCATCCAAGA GAGGGCAAGT ATGGTCATGC TGCCAACTTT
GGAATTGCTC CAGGGTACGC AAAGAGAGAT GGAAAATCTC GTGCTTATCC GATCACTGCT
TTAGTTTGCA ACTTTAGCAA GAAAACGGAA TCAAAGCCTT CTCTTCTTAA GCATTACGAA
GTGAAGACTT TCTTCCATGA GTTAGGACAT GGTATTCATG ATCTTTTGGG CAGGACAGAG
GTGGCTCGGT TCCATGGAAC AAATGTACCA CGGGACTTTG TAGAGACGCC TTCGCAGTCG
TTTGAGTTCT GGACGTGGGA AAAGTCGATA TTGAAGAACT TATCGTCTCA TTATCTTACC
AACGAGTCAT TGAGTGATAC GTTGATCGAC AACCTAGTAT CTACAAAGCA TGTCAACGGA
GCATTGCATG CTTTAAGACA GTTGCATTTC GGACTCTTTG ATCTTGCTGT TCATCAACTT
GAAGACGATG AGAGCCTCGA GCTATTGAAT ATCAGCCGGT TATGGAACAA CTTGAGCAAT
GAAGTTTCTT TGATTTCACT GGGTAATTAC ACAGTTGATT CGTACGGGTC TTTTGGACAT
ATTGCCGGAG GTTACGAATC AGGATACTAC AGCTACTTCT TCAGCGAAGT GTTTGGTGAT
GATATTTATT ACACATTGTT CAAAGACGAT CCCATGAGTG TAGAAAATGG AAGAAAGTAT
AGAGATATCG TTTTATCTAA GGGAAATTCA GAGGATATAA TGGACAACTT GAAGTTGTTG
CTAGGAAGAG AACCTACCTC AGATGCTTTC TTAAAGGAAT ATGGATTGGA CAAGTGA
 
Protein sequence
SWNHTPQQIA DLTEELIETT KAFNDHIASL SSNLTVEGVL LPYIDFENES QLLINQLFFY 
QYVSSDKDIR DASTAAEELF LEKMIEQSLR TDVYEVFKKL QEKVDSGMLS IASKEHQLFL
SKTMLGFRKN GLHLPEDQRQ VVKSYLLKLK ELCIHFSKNA NEENGYILFS KEELEGVPKL
TVDSFEQVDK DGVQLYKMTF KYPDIFPVLG FANNETTRKT VYLGNGDKCK ANNVILEEII
ALRYKLAKLL GFNSFSDYVL DETLAQNVTT AVSFLTDLRR KLTPLAQIEL EKLSEFKGAE
VFKWDFKYLE NKMLSKQYQV NETEIAEYFP MESTIEKMLA IYEKLFDLEF QPVLTNTSVW
HEDVRQYLVL IGEGSNKKFL GVIYFDLHPR EGKYGHAANF GIAPGYAKRD GKSRAYPITA
LVCNFSKKTE SKPSLLKHYE VKTFFHELGH GIHDLLGRTE VARFHGTNVP RDFVETPSQS
FEFWTWEKSI LKNLSSHYLT NESLSDTLID NLVSTKHVNG ALHALRQLHF GLFDLAVHQL
EDDESLELLN ISRLWNNLSN EVSLISLGNY TVDSYGSFGH IAGGYESGYY SYFFSEVFGD
DIYYTLFKDD PMSVENGRKY RDIVLSKGNS EDIMDNLKLL LGREPTSDAF LKEYGLDK