Gene PICST_36688 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_36688 
SymbolMAS1 
ID4840047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp756785 
End bp758182 
Gene Length1398 bp 
Protein Length465 aa 
Translation table12 
GC content44% 
IMG OID640391362 
Productmitochondrial processing protease 
Protein accessionXP_001385848 
Protein GI126138650 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0178117 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.476468 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAGCTA GAGCATCACG TTTTAGTCGC TCTACTATTG CTGGACGTCG TTTGTTTACC 
ACAGCATCTC CTGTTCCAAC TTTCCAGACA TCAGTTCTTC CCAATGGTTT AACAGTAGCA
AGTGAATCCA TGCCAGGAAC TAGAACCGCC ACCGTAGGCG TGTGGATCAA TGCTGGTTCG
AGAGCCGACA ACCCTGCCAG TAGTGGCACT GCACATTTCT TGGAACATTT GGCTTTTAAG
GGAACCAACA AGAGATCGCA GTTGAACTTA GAATTGGAAA TAGAGAACAT CGGCTCTCAA
ATCAATGCTT ACACCTCAAG AGAGAACACA GTTTATTATA CAAAATGTTT GGAGACTGAT
ATCAACCAGA ACATCGACAT TTTGAGCGAT TTATTGACGA AGTCAAAGTT AGAAGAAAGG
GCCATCGAGA ATGAAAGACA TGTCATCTTG CAAGAAAGTG ACGAAGTCGA CAAGATGTAC
GATGAAGTGG TGTTTGACCA TTTGCACGCA GTTGCTTTCA AGAGTCAAGA CTTGGGCAGA
ACAATTTTGG GCCCCAGAGA GCTCATAAAG ACCATACAAC GAGATGATCT TGTAAACTAC
ATCACTACTA ACTATAAGGG AGACAGAATG GCACTTATAG GTGTAGGCTG TGTCAACCAC
GAGGACTTGG TCAAACAGGC ACAAAAGTAC TTTGGAGACA TCAAGAAGAG TGAAAAGCCC
TTTAAACAAA GTGGAGGTGA TTTGCCAGTC TTCTATGGTG ATGAAATCAG AATCCAAGAC
GATTCTTTGC CAACGACACA TGTTGCCTTA GCTGTAGAAG GTGTAAGCTG GTCAGCGCCA
GACTTCTTTA CGGCATCTGT TGCCAACGGT ATAATAGGAA CGTGGGATAG ATCTATCGGT
GTTGGATCCA ACTCTCCTTC CCCTCTAGCC GTAACAGCTG CTATTGGTGG CGCTGGAAAC
ACCCCTATTG CCAACTCGTA CATGGCGTAC ACTACATCGT ATGCCGATAC CGGGTTGATG
GGTGTGTATT TTACCGCCGA TAAAGATGCT AACTTGAAGT TGTTTATAGA TGCGGTTATG
AAAGAATGGG CTAGATTGAA GTCTGGTGAC ATTACTGTGG AAGAAGTGGA GAGATCGAAG
GCACAATTAA AGGCTTCCTT GGTTTTAGCA TTAGACGACT CTACGGCTAT AGCTGAAGAT
ATTGGAAGAC AATTAGTCAA TACAGGATTC CGTTTGTCTC CTGAAGAGGT CTTTGAGAGA
GTTGAGGCTA TCACTAAGAA GGACGTCATC GACTGGGCTA ATTACAGATT GAAGGATAAG
CCCATAGCCT TATCTGCCGT AGGTAACGTC AAGACACTTC CTTCTCACCA ATATCTCACT
AAGGGTATGT CCTTGTGA
 
Protein sequence
MLARASRFSR STIAGRRLFT TASPVPTFQT SVLPNGLTVA SESMPGTRTA TVGVWINAGS 
RADNPASSGT AHFLEHLAFK GTNKRSQLNL ELEIENIGSQ INAYTSRENT VYYTKCLETD
INQNIDILSD LLTKSKLEER AIENERHVIL QESDEVDKMY DEVVFDHLHA VAFKSQDLGR
TILGPRELIK TIQRDDLVNY ITTNYKGDRM ALIGVGCVNH EDLVKQAQKY FGDIKKSEKP
FKQSGGDLPV FYGDEIRIQD DSLPTTHVAL AVEGVSWSAP DFFTASVANG IIGTWDRSIG
VGSNSPSPLA VTAAIGGAGN TPIANSYMAY TTSYADTGLM GVYFTADKDA NLKLFIDAVM
KEWARLKSGD ITVEEVERSK AQLKASLVLA LDDSTAIAED IGRQLVNTGF RLSPEEVFER
VEAITKKDVI DWANYRLKDK PIALSAVGNV KTLPSHQYLT KGMSL