Gene PICST_89481 details


Gene Information

Locus tag: PICST_89481
Symbol: OCT1
ID: 4839065
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 117766
End bp: 120315
Gene Length: 2550 bp
Protein Length: 812 aa
Translation table: 12
GC content: 44%
IMG OID: 640390380
Product: mitochondrial intermediate peptidase involved in protein import
Protein accession: XP_001384676
Protein GI: 150865453
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0339] Zn-dependent oligopeptidases
TIGRFAM ID:
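
The listed gene length can be checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive; the record does not state this convention, but it reproduces the listed 2550 bp. The function and constant names are illustrative only. The comparison with the protein length assumes one codon per residue plus one stop codon, which fits a gene marked as unspliced.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both included."""
    return end_bp - start_bp + 1


# Values copied from the record above.
START_BP = 117766
END_BP = 120315
PROTEIN_AA = 812

gene_length = feature_length(START_BP, END_BP)

# Coding bases implied by the protein: one codon per residue plus a stop codon.
coding_bases = (PROTEIN_AA + 1) * 3

print(gene_length)                 # 2550
print(coding_bases)                # 2439
print(gene_length - coding_bases)  # 111 bases not accounted for by the ORF
```

The 111-base difference suggests that the listed gene sequence extends beyond the open reading frame on one or both sides.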


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.493678
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CAGGTATAGA GACAGCAGTG TTCACGATGC GTCTTCTGCG CCAGCTTCTT CGAAGTACAC 
CATTTCTCAC GCGGGCAAAG CCCGTGTCTG GTAAGGTGTC ACATTTCAGA CTGCGCACCG
ATCTCAAGGG AGGCTCATCC AACTCCTCTA AGTCGCCAGA TTCTGTTGGT GATGGTGCTT
CGGCACATCT TCGTCACATT TTCGACGACC AGAAGTACTT CAACAGCTTC ACCAAGTCTG
CAGCAGAAAC TTCGGGCAAG GTTTCGCTGC TTCCAGCCAT CTTCTCCTTC CGCAGGTCTG
GATTGTTCTG CAACGATAAT CTTCTGACTC CCCATGGATT GATAGACTTC TCGAAAAACT
CCCTAAGAGA AGCTAAGTCG CTTGTAGAAT CAATGCTCCA CGATGTGAAG TCTGATCCAG
CCGGCCGTTT GTCGTATATC AACAAGTTGG ATCAGTTGTC GGACATCTTG TGTAGAGTAA
TAGATGTAGC TGAGTTCATC AGAGTAGCCC ACCCATCCCA AAAATGGGTC AATGCTGCGC
AGCAAACCCA CGAAATCATG TTCGAATACA TGAACCAGTT GAATACAAAC GTAGAGTTGT
ACCAGAATCT CCGGGACATT TTGAGCGATT CCTCTGTGAC GGCCCAACTA ACAGAAGAAG
AAATTCAGGT TGGTGAGTAC TTGAAACAGG ACTTTGAAAG ATCGGGAATC CACATGAACC
CTTCTGCAAG GAATAACTTT GTAGCCATCA CCCAGGAGAT CTCTTTACTT GGATCACGTT
TTAACAACGA AATCCACAAC TTGAAGTCAT ACTGGTGTGA AATCCCTAGA TACGAGTTTG
AACAACTCGA GGACTCAAAC TTGAAAAAGG AGATTCTCGG CTACCAGTCC AAGGCCCCTC
CTTCCAAGCA TTCTTCCCAA ACTATCAGCA TCCCATTAGT GGGCCACATT CCCTTCACGA
TCCTTACCAC ATGTTCGATA GAGCTGATCA GAAGGGAGAT CTGGATTTCT TTGCATAATT
CTTCAGATGA GCAGATCGCT ACTCTTAACA ACTTCCTCAA ATACAGAGCT ACGTTGGCAA
AAATGTTGGG CTACAAGTCT TTTTCACACT ATCAATTGGA ACATAAAATG GCCAAGAATC
CCGAAAATGT AGTTACATTT TTGACTAACT TACAGAAGTC GTTGAGAGAA AAGGGTGTTA
CTGAAGAAAT CAAAAAGTTG TACCAATACA GAGATGATTC CACGATTTCA CAGGTACAGA
AGGCATCTAC TGAAGATATT ATTGATGGAG TTAAACCCTG GGATAGGGAT TACCTCTTGG
AAAAGCTCCA GAAAGCGTCT AACAAGAATT TGGAAGAGTT AGAAAACATC AACGAATACT
TGTCTGTTGG CACTATTGTC GCGGGATTGA GTGAATTATT TAAGCTGATC TACAATGTTG
AGTTTGTGCC TGTGGCAACG CTCAAGGGAG AAACGTGGGA TCAAAACCAA GTTCGTAAAG
TTGCGGTAGT TGACGATTCT ACAAAGAAGA AACTAGGGTT CCTCTATTTA GATTTCTGGT
CCCCCAAAGT CTTACCATCT CATTTCACGA TAGTTTGTCT GAGAAAGCTC AATTTAGATA
TTAAGAGCGA AACGAAAGAC AAGATGAGAC AATTGGTACA ATTGGATGAG GACGAAACGT
CACAACTCCC CGTGATTTCG TTGATTTGTA ACTTTCAGAA ATCAAATGAT GGTCACATAG
GTAGATTTGC AGGCGTAGAG AACGAGAAGC CTACATTACT TTCGTTGAAC CAAGTGGATA
CAGTTTTCCA TGAAATGGGT CATGCCATGC ATTCCATGAT TGGACGTACT GACTTGCATA
ACCTCTCTGG AACGAGGTGT GCCACTGACT TCGTAGAGTT GCCCTCGGTT CTAATGGAAT
CTTTCAGTAA GGACCCTCGA GTCTTGTGTA AAATTGCAAA GCACTACGAA ACGGGCGAGC
CATTATCTCC TAAACTATTG GCTCAGCACC AGACACAGAA AGTGATGTTA GACGAATGTG
AAACCTACAT GCAATCAAAG ATGGCCATGT TGGATCAAGT TCTACACAGC GAAGATGTCG
TCAGGACTAT TTCGGAAGAC TTTGCTAACT TCGACTCTAC GCCTATATAC CATAGTCTTG
AGTCCAAGTT GAAGGTTTTT GCCGATACCT GGTCTACTTG GCATGGTAAG TTTCCCCACT
TGTTCTCGTA TGGTGCCGTT TATTACTCCT ACTTGTTGGA TCGGGCCATC GCAGAGAAGA
TTTGGAATGG GTTGTTTGCA CACGATCCTT GGAGTAGAGA GGCGGGAGAG AAGTACAAAA
ACAGCATATT GAAGTGGGGA GGCACCCGTG ATCCTTGGGA ATGCCTTGCA GATGCGTTGG
AGAACGACGA GCTCAGCAAA GGAGACTCGC GAGCAATGGA AATAATCGGC AAGGATTCCT
TGTGACGTCA CAACAAAAGT AATTTTGGTT TTTAATGGCA TTATGAAACT TGTACATAGA
ACTTCTACAA TAGATAAAAT TACAGTATAA
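
The gene sequence is printed in space-separated groups of 10 bases, 60 per line. A minimal sketch of how such a block might be cleaned and its GC content computed is given below. The helper names are hypothetical, and only the first line of the sequence is used as input here; pasting the full block would give the whole-gene figure, which is not recomputed here.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, copied verbatim.
first_line = "CAGGTATAGA GACAGCAGTG TTCACGATGC GTCTTCTGCG CCAGCTTCTT CGAAGTACAC"

seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # GC percentage of this 60-base line only
```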
 
Protein sequence
MRLSRQLLRS TPFLTRAKPV SGKVSHFRSR TDLKGGSSNS SKSPDSVGDG ASAHLRHIFD 
DQKYFNSFTK SAAETSGKVS SLPAIFSFRR SGLFCNDNLS TPHGLIDFSK NSLREAKSLV
ESMLHDVKSD PAGRLSYINK LDQLSDILCR VIDVAEFIRV AHPSQKWVNA AQQTHEIMFE
YMNQLNTNVE LYQNLRDILS DSSVTAQLTE EEIQVGEYLK QDFERSGIHM NPSARNNFVA
ITQEISLLGS RFNNEIHNLK SYWCEIPRYE FEQLEDSNLK KEILGYQSKA PPSKHSSQTI
SIPLVGHIPF TILTTCSIES IRREIWISLH NSSDEQIATL NNFLKYRATL AKMLGYKSFS
HYQLEHKMAK NPENVVTFLT NLQKSLREKG VTEEIKKLYQ YRDDSTISQV QKASTEDIID
GVKPWDRDYL LEKLQKASNK NLEELENINE YLSVGTIVAG LSELFKSIYN VEFVPVATLK
GETWDQNQVR KVAVVDDSTK KKLGFLYLDF WSPKVLPSHF TIVCSRKLNL DIKSETKDKM
RQLVQLDEDE TSQLPVISLI CNFQKSNDGH IGRFAGVENE KPTLLSLNQV DTVFHEMGHA
MHSMIGRTDL HNLSGTRCAT DFVELPSVLM ESFSKDPRVL CKIAKHYETG EPLSPKLLAQ
HQTQKVMLDE CETYMQSKMA MLDQVLHSED VVRTISEDFA NFDSTPIYHS LESKLKVFAD
TWSTWHGKFP HLFSYGAVYY SYLLDRAIAE KIWNGLFAHD PWSREAGEKY KNSILKWGGT
RDPWECLADA LENDELSKGD SRAMEIIGKD SL
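
The record lists translation table 12 (the NCBI alternative yeast nuclear code), in which CTG is read as serine rather than leucine. The sketch below illustrates the effect on the first ten codons of the open reading frame, taken from the first ATG in the gene sequence above, which follows 26 leading bases. The codon tables are built by hand for illustration and the function name is hypothetical; this is not the database's own translation code.

```python
BASES = "TCAG"
# Amino acids of the standard code, in TCAG codon order (TTT, TTC, TTA, ...).
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
STANDARD = dict(zip(CODONS, STANDARD_AAS))

# Table 12 differs from the standard code in one sense codon: CTG -> Ser.
TABLE_12 = dict(STANDARD, CTG="S")


def translate(dna: str, table: dict) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = table[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the ORF, copied from the gene sequence above.
cds_start = "ATGCGTCTTCTGCGCCAGCTTCTTCGAAGT"

print(translate(cds_start, TABLE_12))  # MRLSRQLLRS
print(translate(cds_start, STANDARD))  # MRLLRQLLRS
```

Only the table 12 translation matches the first ten residues of the protein sequence above; the standard code differs at the fourth residue, where the codon is CTG.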