Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_89481 |
Symbol | OCT1 |
ID | 4839065 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 117766 |
End bp | 120315 |
Gene Length | 2550 bp |
Protein Length | 812 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640390380 |
Product | mitochondrial intermediate peptidase involved in protein import |
Protein accession | XP_001384676 |
Protein GI | 150865453 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0339] Zn-dependent oligopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.493678 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CAGGTATAGA GACAGCAGTG TTCACGATGC GTCTTCTGCG CCAGCTTCTT CGAAGTACAC CATTTCTCAC GCGGGCAAAG CCCGTGTCTG GTAAGGTGTC ACATTTCAGA CTGCGCACCG ATCTCAAGGG AGGCTCATCC AACTCCTCTA AGTCGCCAGA TTCTGTTGGT GATGGTGCTT CGGCACATCT TCGTCACATT TTCGACGACC AGAAGTACTT CAACAGCTTC ACCAAGTCTG CAGCAGAAAC TTCGGGCAAG GTTTCGCTGC TTCCAGCCAT CTTCTCCTTC CGCAGGTCTG GATTGTTCTG CAACGATAAT CTTCTGACTC CCCATGGATT GATAGACTTC TCGAAAAACT CCCTAAGAGA AGCTAAGTCG CTTGTAGAAT CAATGCTCCA CGATGTGAAG TCTGATCCAG CCGGCCGTTT GTCGTATATC AACAAGTTGG ATCAGTTGTC GGACATCTTG TGTAGAGTAA TAGATGTAGC TGAGTTCATC AGAGTAGCCC ACCCATCCCA AAAATGGGTC AATGCTGCGC AGCAAACCCA CGAAATCATG TTCGAATACA TGAACCAGTT GAATACAAAC GTAGAGTTGT ACCAGAATCT CCGGGACATT TTGAGCGATT CCTCTGTGAC GGCCCAACTA ACAGAAGAAG AAATTCAGGT TGGTGAGTAC TTGAAACAGG ACTTTGAAAG ATCGGGAATC CACATGAACC CTTCTGCAAG GAATAACTTT GTAGCCATCA CCCAGGAGAT CTCTTTACTT GGATCACGTT TTAACAACGA AATCCACAAC TTGAAGTCAT ACTGGTGTGA AATCCCTAGA TACGAGTTTG AACAACTCGA GGACTCAAAC TTGAAAAAGG AGATTCTCGG CTACCAGTCC AAGGCCCCTC CTTCCAAGCA TTCTTCCCAA ACTATCAGCA TCCCATTAGT GGGCCACATT CCCTTCACGA TCCTTACCAC ATGTTCGATA GAGCTGATCA GAAGGGAGAT CTGGATTTCT TTGCATAATT CTTCAGATGA GCAGATCGCT ACTCTTAACA ACTTCCTCAA ATACAGAGCT ACGTTGGCAA AAATGTTGGG CTACAAGTCT TTTTCACACT ATCAATTGGA ACATAAAATG GCCAAGAATC CCGAAAATGT AGTTACATTT TTGACTAACT TACAGAAGTC GTTGAGAGAA AAGGGTGTTA CTGAAGAAAT CAAAAAGTTG TACCAATACA GAGATGATTC CACGATTTCA CAGGTACAGA AGGCATCTAC TGAAGATATT ATTGATGGAG TTAAACCCTG GGATAGGGAT TACCTCTTGG AAAAGCTCCA GAAAGCGTCT AACAAGAATT TGGAAGAGTT AGAAAACATC AACGAATACT TGTCTGTTGG CACTATTGTC GCGGGATTGA GTGAATTATT TAAGCTGATC TACAATGTTG AGTTTGTGCC TGTGGCAACG CTCAAGGGAG AAACGTGGGA TCAAAACCAA GTTCGTAAAG TTGCGGTAGT TGACGATTCT ACAAAGAAGA AACTAGGGTT CCTCTATTTA GATTTCTGGT CCCCCAAAGT CTTACCATCT CATTTCACGA TAGTTTGTCT GAGAAAGCTC AATTTAGATA TTAAGAGCGA AACGAAAGAC AAGATGAGAC AATTGGTACA ATTGGATGAG GACGAAACGT CACAACTCCC CGTGATTTCG TTGATTTGTA ACTTTCAGAA ATCAAATGAT GGTCACATAG GTAGATTTGC AGGCGTAGAG AACGAGAAGC CTACATTACT TTCGTTGAAC CAAGTGGATA CAGTTTTCCA TGAAATGGGT CATGCCATGC ATTCCATGAT TGGACGTACT GACTTGCATA ACCTCTCTGG AACGAGGTGT GCCACTGACT TCGTAGAGTT GCCCTCGGTT CTAATGGAAT CTTTCAGTAA GGACCCTCGA GTCTTGTGTA AAATTGCAAA GCACTACGAA ACGGGCGAGC CATTATCTCC TAAACTATTG GCTCAGCACC AGACACAGAA AGTGATGTTA GACGAATGTG AAACCTACAT GCAATCAAAG ATGGCCATGT TGGATCAAGT TCTACACAGC GAAGATGTCG TCAGGACTAT TTCGGAAGAC TTTGCTAACT TCGACTCTAC GCCTATATAC CATAGTCTTG AGTCCAAGTT GAAGGTTTTT GCCGATACCT GGTCTACTTG GCATGGTAAG TTTCCCCACT TGTTCTCGTA TGGTGCCGTT TATTACTCCT ACTTGTTGGA TCGGGCCATC GCAGAGAAGA TTTGGAATGG GTTGTTTGCA CACGATCCTT GGAGTAGAGA GGCGGGAGAG AAGTACAAAA ACAGCATATT GAAGTGGGGA GGCACCCGTG ATCCTTGGGA ATGCCTTGCA GATGCGTTGG AGAACGACGA GCTCAGCAAA GGAGACTCGC GAGCAATGGA AATAATCGGC AAGGATTCCT TGTGACGTCA CAACAAAAGT AATTTTGGTT TTTAATGGCA TTATGAAACT TGTACATAGA ACTTCTACAA TAGATAAAAT TACAGTATAA
|
Protein sequence | MRLSRQLLRS TPFLTRAKPV SGKVSHFRSR TDLKGGSSNS SKSPDSVGDG ASAHLRHIFD DQKYFNSFTK SAAETSGKVS SLPAIFSFRR SGLFCNDNLS TPHGLIDFSK NSLREAKSLV ESMLHDVKSD PAGRLSYINK LDQLSDILCR VIDVAEFIRV AHPSQKWVNA AQQTHEIMFE YMNQLNTNVE LYQNLRDILS DSSVTAQLTE EEIQVGEYLK QDFERSGIHM NPSARNNFVA ITQEISLLGS RFNNEIHNLK SYWCEIPRYE FEQLEDSNLK KEILGYQSKA PPSKHSSQTI SIPLVGHIPF TILTTCSIES IRREIWISLH NSSDEQIATL NNFLKYRATL AKMLGYKSFS HYQLEHKMAK NPENVVTFLT NLQKSLREKG VTEEIKKLYQ YRDDSTISQV QKASTEDIID GVKPWDRDYL LEKLQKASNK NLEELENINE YLSVGTIVAG LSELFKSIYN VEFVPVATLK GETWDQNQVR KVAVVDDSTK KKLGFLYLDF WSPKVLPSHF TIVCSRKLNL DIKSETKDKM RQLVQLDEDE TSQLPVISLI CNFQKSNDGH IGRFAGVENE KPTLLSLNQV DTVFHEMGHA MHSMIGRTDL HNLSGTRCAT DFVELPSVLM ESFSKDPRVL CKIAKHYETG EPLSPKLLAQ HQTQKVMLDE CETYMQSKMA MLDQVLHSED VVRTISEDFA NFDSTPIYHS LESKLKVFAD TWSTWHGKFP HLFSYGAVYY SYLLDRAIAE KIWNGLFAHD PWSREAGEKY KNSILKWGGT RDPWECLADA LENDELSKGD SRAMEIIGKD SL
|
| |