Gene PICST_49260 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_49260 
SymbolMAS2 
ID4840693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp678396 
End bp679886 
Gene Length1491 bp 
Protein Length496 aa 
Translation table12 
GC content45% 
IMG OID640392008 
ProductMitochondrial processing peptidase alpha subunit, mitochondrial precursor (Alpha-MPP) 
Protein accessionXP_001386145 
Protein GI126139245 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0300879 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.382382 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGCGTT CTAGCCTTGC AAGAAGAGCA TTTACTGCTG CCACCAAACC CAACATCGAA 
ACTTCCACTC TTTCGAATGG GCTTCGATTA GTTACAGACT CCACACCGGG CCATTTCAGC
GCTCTTGGGG CCTATGTAGA TGCTGGATCG AGATTTGAAA ACCCTAATAA GCCCGGCTTG
TCTCATATAT GTGACCGTTT GGCATGGAAG TCTACTGAAA AGTACTCAGG CATGGAGCTC
ATAGAGAACC TTGCCAAGTT GGGTGGAAAC TACATGTGTT CCGCACAAAG AGAATCTGTC
ATCTACCAGG CTTCTGTTTT CAACAAAGAT GTAGAAAAGA TGTTTGATTG TATTGCCCAA
ACTGTGAGAG CTCCTCGTTT CACTGACCAG GAACTCTTTG AGACTCTTCA AACTGCAGAG
TACGAAGTCA ACGAAGTTTC GCTAAAACAC GATATGTTTC TTCCGGAAGT TTTACATTCG
GCTGCATACC AAAACAATAC CTTGGGATTG CCCTTGTTCT GTCCCCCAGA ACGGATCCCA
GAAATCGGCA AATCTGACAT CATCAACTAC CACAACCAGT TCTTCCAGCC ACAGAACATC
GTAGTGGCAA TGGTAGGTGT GCCTCATGAA CATGCTGTCA AGTTAGCTGA AAAACAATTT
GGGGATTGGA AGCCGGCAAA GAGTTATAGG CCCGACTTCG GAACCGTCAA GTACACTGGT
GGTGAAATAT CCTTGCCTTT CCAGCCTCCC ATCTACAGTA ATATGCCTGA ACTATACCAT
ATGCAAATTG CGTTCGAGAC TACCGGTTTA CTCAGTGACG ACTTGTATGC GTTGGCAACT
TTACAGAAGC TACTTGGAGG TGGTTCCTCA TTTTCTGCTG GTGGTCCAGG TAAGGGTATG
TTTTCCAGAT TGTACACCAG AGTATTGAAC CAGTACGCAT ATGTAGAGAA CTGCATGAGT
TTCAACCATT CGTACATTGA TTCTGGTCTC TTTGGTATAA CGATATCGTG TTCTCCAAAT
GCTGGCCATG TGATGTCGCA GATCATCAGT TTTGAGTTGT CAAAATTGCT TGAAAAAGAT
CCTGCCAAGG GCGGACTCAC AGAGAAAGAA GTCAAGAGAG CCAAGAACCA GCTTATCAGC
TCCTTGTTGA TGAATATAGA GAGTAAGCTC GCCAGATTGG AAGACTTGGG CAGACAGATC
CAATGCCAGA ACAAGATCAC CACCATCGAC GAGATGATCC AGAAGATCGA AAGCTTGTCT
CTAGAAGACT TGAGAGTAGT AGCTGAAAAG GTACTTACTG GCAGTGTAAT AACTAAAGGC
ATAAGTAGCG GACAACCTAC TGTAGTAATG CAAGGAGACA GAGCTTCATT TGGTGACGTT
GAGTTCATTC TTCGTCACTA CGGTTTGGGG AAGTTTCAAG GTCCTCCATT GGAAGAACCT
AGAGATTTCT CCAAGATAGA AAAGCCTCAT AGATTTGGTA AATGGTTCTA G
 
Protein sequence
MQRSSLARRA FTAATKPNIE TSTLSNGLRL VTDSTPGHFS ALGAYVDAGS RFENPNKPGL 
SHICDRLAWK STEKYSGMEL IENLAKLGGN YMCSAQRESV IYQASVFNKD VEKMFDCIAQ
TVRAPRFTDQ ELFETLQTAE YEVNEVSLKH DMFLPEVLHS AAYQNNTLGL PLFCPPERIP
EIGKSDIINY HNQFFQPQNI VVAMVGVPHE HAVKLAEKQF GDWKPAKSYR PDFGTVKYTG
GEISLPFQPP IYSNMPELYH MQIAFETTGL LSDDLYALAT LQKLLGGGSS FSAGGPGKGM
FSRLYTRVLN QYAYVENCMS FNHSYIDSGL FGITISCSPN AGHVMSQIIS FELSKLLEKD
PAKGGLTEKE VKRAKNQLIS SLLMNIESKL ARLEDLGRQI QCQNKITTID EMIQKIESLS
LEDLRVVAEK VLTGSVITKG ISSGQPTVVM QGDRASFGDV EFILRHYGLG KFQGPPLEEP
RDFSKIEKPH RFGKWF