Gene PICST_28404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_28404 
SymbolPRD1.2 
ID4851181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp1134745 
End bp1136808 
Gene Length2064 bp 
Protein Length687 aa 
Translation table 
GC content42% 
IMG OID640392889 
Productsaccharolysin (oligopeptidase) 
Protein accessionXP_001387447 
Protein GI126274167 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0339] Zn-dependent oligopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0460468 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACCG ATTTCATTGC GTTAGCTAGT AAAGAATCGT ATCCATTATG GAATCACACT 
GTCGAGGATC TCGAAAGACT TGCTAACCAA TTGGTTAACG AAGAGAAGGA GACCTACGAT
TACATTGCTA CAATTGAGAA TCCTACCATT GAGAATGTCG TCAAGCCCTA TGCCAGATAT
CTGCACAAGA ATGCTCTTTT AGAGAACCAA ATCACCTTCT ATCAGTATGT CTCTGCCTCG
AAGGACTTGA GAGATGCTTC TACTAGGGCC GAAGAGCAAT TGGAACAGGC ATCCATCGAG
CAATCGCTCA GAGTAGATGT CTTCCAGGTG TTCAACAAGT TGTACGAGTC AATCAAGGAC
GAGAACGACA TCGAGCCTGA AACCAAGAGA TTGGTAGATA AAGTAGTTAA GTACTATAAG
AGAAATGGTT TGGCTTTGCC TGAAGAACAG CGTGAGGAGA TCAAGAGGTT GAAAAAGGAG
TTGTCCACTT TATCAGTCAA ATTCTCCAAG AACATGAATG AAGAAAAAGA CTTCATTCTC
TTCACCACTG AAGAATTGGC TGGTGTACCT TTGGATGTCG TGGACCAGTA TGAAAAAGTT
GAAGAAGACG GTGTTCAAAA ACACAAGATG ACCTTCAAGT ACCCTGATCT TCATCCTGTT
TTGAAACATG CAACCAATCA GGAAACAAGA AAGAGAGCAT TTATTGCCAA CCAGAACAAG
TGTCCAGCCA ATGCTGAAAT CTTGGACACG ATTATCAGGA CCAGATTCGA GTTGGCTAAG
AAGTTGGGTT ACTCCACATA CTCCGAATAC GTTTTGGAAG ATAGAATGGC TAAAACCCAG
AAGAATGTTT TGGACTTCTT ATACGACTTG AAGAGCAAGT TAGTTCCCCT TGGCCAGAAA
GAAATTGCCA ACATGAAAGA ATTCAAGAAC AAGGATTTGA CTGCTAGAGG ATTGGAGCAA
CAGGATAAGT ACTACATCTG GGATTCCAAC TTCTACAATG AACTCTTATT GGAAAAGGAA
TACAAGGTCG ATAATACCAA AATTTCCGAG TACTTCCCCA TGGATGCTAC CATCGATAAG
ATGCTTGGTT TCTACGAAAC TTTGTTCGAC ATGAAGTTTG TCAGAATCGA CAAACCGGCA
GATGGAGCTA CGTGGCATGA AGATGTCAAA CAGTTTGCTG TGTACCAAAA CATCAAGTAC
GGTGAACCTA AGTTGGAATT CATGGGCTGG ATCTATTTTG ATTTGCACCC AAGAGAAGGC
AAATATAGTC ATGCTGCCAA CTTCGGTATT GGTCCTGGTT ACTTGGATGA AGATGGAGTT
ACCAGACACA CTCCAGTCAC TACCTTGGTG TGTAACTTCA CCAAGCCTAC AGCTGAGAAA
CCTTCTTTGT TGAAACACGA TGAAGTCACT ACATTTTTCC ACGAATTGGG CCATGGTGTG
CACAACATCT TGTCCAAGAC TAAATATGGT AGATTCCATG GTACTCATGT TGAGAGAGAT
TTTGTAGAAA CTCCATCACA AATGTTAGAG TTCTGGACTT GGTCAAAGAA TGAACTTAGA
AACTTATCTT CTCACTTCCA AACTGGAGAA CCAATCAACG ACGAGTTGAT TGACCAATTG
ATAAAGTCAA AACATGTCAA TACTGGCTTG TTTAACTTGC GTCAATTACA CTTTGGCCTC
TTTGATATGA AACTTCATAC CACTGCAACC AAGGAGGAAT TAGATGAGTT GGACTTGACC
AAGGAATGGA ATGAGATGCG TGATGAGATT GCTTTAGTCG ACAGCGACCA TATTCCCACC
AAGGGCTATT CGTCATTTGG TCACATTGCC GGGGGTTATG AAAGTGGTTA CTACGGCTAC
TTGTACTCTT CTGTGTACTC GGCTGATATC TACTACAGTT TATTCAAAAA AGATCCTATG
AACGTTGAGA ACGGTATCAG ATACAGAGAT ATAATTTTGA AGAGAGGTGG TTCCCGTGAA
ATCATCGACA ACTTGGTCGA GCTTTTGGGC AGACAACCTA ACTCTGATGC CTTCTTGGAA
GAAATCTTTG GTGGACAGCA ATAG
 
Protein sequence
MTTDFIALAS KESYPLWNHT VEDLERLANQ LVNEEKETYD YIATIENPTI ENVVKPYARY 
LHKNALLENQ ITFYQYVSAS KDLRDASTRA EEQLEQASIE QSLRVDVFQV FNKLYESIKD
ENDIEPETKR LVDKVVKYYK RNGLALPEEQ REEIKRLKKE LSTLSVKFSK NMNEEKDFIL
FTTEELAGVP LDVVDQYEKV EEDGVQKHKM TFKYPDLHPV LKHATNQETR KRAFIANQNK
CPANAEILDT IIRTRFELAK KLGYSTYSEY VLEDRMAKTQ KNVLDFLYDL KSKLVPLGQK
EIANMKEFKN KDLTARGLEQ QDKYYIWDSN FYNELLLEKE YKVDNTKISE YFPMDATIDK
MLGFYETLFD MKFVRIDKPA DGATWHEDVK QFAVYQNIKY GEPKLEFMGW IYFDLHPREG
KYSHAANFGI GPGYLDEDGV TRHTPVTTLV CNFTKPTAEK PSLLKHDEVT TFFHELGHGV
HNILSKTKYG RFHGTHVERD FVETPSQMLE FWTWSKNELR NLSSHFQTGE PINDELIDQL
IKSKHVNTGL FNLRQLHFGL FDMKLHTTAT KEELDELDLT KEWNEMRDEI ALVDSDHIPT
KGYSSFGHIA GGYESGYYGY LYSSVYSADI YYSLFKKDPM NVENGIRYRD IILKRGGSRE
IIDNLVELLG RQPNSDAFLE EIFGGQQ