Gene Information |
Locus tag | PICST_14973 |
Symbol | |
ID | 4840751 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | + |
Start bp | 169255 |
End bp | 171975 |
Gene Length | 2721 bp |
Protein Length | 871 aa |
Translation table | 12 |
GC content | 46% |
IMG OID | 640392066 |
Product | predicted protein |
Protein accession | XP_001386048 |
Protein GI | 150866443 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4935] Regulatory P domain of the subtilisin-like proprotein convertases and other proteases |
TIGRFAM ID | |
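The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the listed gene length excludes the stop codon (the displayed gene sequence ends on the codon for the final residue). Reading the leftover bases as intronic is likewise an assumption, though it fits the "Is gene spliced | Yes" field.

```python
# Values copied from the record above.
start_bp = 169255
end_bp = 171975
protein_length_aa = 871

# Inclusive coordinates: length = end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the listed gene length does not include a stop codon,
# so the coding portion is exactly three bases per residue.
coding_bp = protein_length_aa * 3

# Bases not accounted for by coding sequence (assumed to be intronic).
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, non_coding_bp)  # 2721 2613 108
```

The computed span matches the stated gene length of 2721 bp, and the 108 bp remainder is a multiple of three.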
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.251561 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGTTTC CGTTACGGAC TTTACTCCTC ATTCTCTCGG TGTTGCTTCA GGTGCAATCG ACAGCGATTC CCAAGAGAGA CTACGAGCTG AAGAACTACT ACATTTTCGA GATCGACACC TCGATTTCAC AACAGCCATT AGCTGACTTC ACCAGCAAGT ACCGTCTGCA CTACAAGTTC GAACACCAGC TTCAAGGTCT TGATAACCAC TATGTTTTCA GTATCAACAA ATCGCACCCT CACAACGACT TCTTAGGAAA CCACAAGTCC AACGACTTCA ACTTGATGAA GAGATCTCCG GGGTTTGAAG ACGAATACGA CTACTTGGTT TCGAACCCGC ACTTGAGATC AATTCACCAG TTGAAGCCCA GAACTTTGTC TAAAAGAATG CCTGTCTTGA TCACAGACGA CAAGGAATAC GAAAGAATAA TAGATAGCAA ACGTGCGGAT GCTATTACTA ATTTTCCCGT TGTTAATACT GTAGATGCTA ATACAGATGC TGCTGGGAAT ACTGTAGATG CCTCGAAGCA GCTTTTGAAG GACGTGTCGG ACAGTCTTAG TATCCGCGAT CCTGGCTTCA TTGAACAGTG GCATTTGATC AACACTGCAT ATCCTGGTCA CGACGTCAAC GTCACAGGCT TGTGGTATGA AGGTATAACT GGTACCGGTA TAGTCAGCGC AATTGTAGAC GATGGTTTGG ACGCTGAAAG TGAAGACTTG CGTGCCAACT TCAACGCTAA GGGTTCATGG GACTTCAATG ACAACACCAA CATCCCGTTG CCGCGCTTGT ATGACGATCA CCACGGCACT AGATGTGCTG GTGAGATCGC TGCTGTCAAG AACGATGTCT GTGGAGTAGG TGTGGCGTAT GATTCCACAG TAGCCGGTAT CCGGATCTTG TCTGGACCCA TTACAGCTGC CGAAGAGGCT GCCGCCTTGA TCTACGGCTT GGATGTCAAC GACATCTACT CTTGTTCCTG GGGTCCAACC GATGACGGAA GAACGCTTGC GGAACCGGAA ACTGTCGTGA AAAAAGCTAT GATCAAGGGT GTCCAAGAGG GTCGTAAAGA TAAGGGATCT ATCTATGTGT TTGCCTCTGG AAACGGTGGC AGATCATATG ACTCATGCAA CTACGATGGA TATACCAATT CCATCTTTTC CATAACAGTA GGTGCTATCG ACTACAAGGG AATTCATCCT GATTATGCCG AGGCCTGTTC AGCCGTCATG GTGGTGACCT ATTCATCTGG CTCGGGAGAA CATATCCACA CTACTGATAT CAAGAAGAGA TGTACAGCTA GCCACGGAGG TACATCTGCA GCTGCACCCT TAGCAGCTGG AATCTATGCC TTGGTATTGC AAGCTAATCC GAACTTGACA TGGAGGGATG TCCAGTATGT ATCTGTGCTA AGTTCGGTTC CCATCAACCA ACAGGATGGT AACTATCAAA CCACAGCATT GAATAGAGAG TACTCTCACA AGTATGGTTA CGGTAAGATA GACGCTTATC AGATGGTACA CTTTGCCAAG GACTGGAAGA ATGTAAAGCC TCAGGCGTTC TTCTACTCCG ACATCCAGCT GGTTAGAGAG ACTTTGGAAA CGAACGCTCC TCCCCCTAAA GAAAACAATG CGAACGATGG AGTTGAGAAA CCTCCCGCTG ATTCTCACAA GAGAGATGGC AATATCATCC GGAAGAAGAT CACTGTTACT GAAGAGGATT TGAAGATTAT GAATGTAGAA AGAGTAGAAC ATGTTACTGT GAAACTCAAC ATTATGGCTA CATTTAGGGG CCGTGTAGGA 
GTCAGATTGA TTTCTCCCAC TGGTGTGACC AGTGACTTGG CTACTTTCCG CCCTCGTGAT AACTCTGGTG TAGGGTTCAA GGATTGGACG TTCATGTCTG TTGCTCATTG GGGAGAGTCA GGTTTGGGGG ACTGGACCAT AGAGGTGTTT GGAGACGAAA ATTCTTCCAA ACAGAAGAAT ACCATTGTCT TTGAAAACTG GCAATTGCGT TTCTTTGGTG AGTCTATTGA TGCCGACAAA GCTGAAACAT ACGAGTTAGA AAAGGACTAT GCTGCTGTTA GAAGAGACAG ACTTTCGCAG AACAATGACA AGCAACCTGA GACAACTTCA GAGACTTTGT CATCGTCTGA GAGTGTGTCA TCTTCAGAAG TTGGAACCTC AACTGTGTCT ACTGAGAGCT CTACAAGTGT AACTAGTACT TCAGACTCTA CAACACCTGT AGAAGAGGAT CACAATGCTG AAAAAGTAAC AGAAAGTGTT TCCGCTAGCT CGAGTTCTAC CCAAGACGCC GAGGCTACGG AGTCGAGTGG AGCAGAAGAA GATGAAGACG GCAAGTTGAA ATATTCCGCA GACCATACGG GCCAGTATTT TATGGCACTT GCTGTTGTCG GCTTTATAGT CATCATTCTT TTCATGAAGT TCTTCAAGAC GCCAGGAAGC GGCAGAAGAA GAAGAAGAGA GGATTTTGAG TTTGACATAA TTCCTGGCGA GGACTATTCT GACAGCGAAG ACGACGAAGA TTCTATGGAA TTTGGACGTA GAAGCGGAAG AAGAGCACCA CCAGCACCTT CATTTATTCC TAATGAAGTC GACGATGAGG ACGACGATCG TGCCAGAGAT AGAGTCTATG ATGAGTTCAA TAGCGACACA TTGCCCGAGT ACGAAGAAGA GATGTTTAGA ATAGACGACG AGGATGAGGA C
|
Protein sequence | MLFPLRTLLL ILSVLLQVQS TAIPKRDYES KNYYIFEIDT SISQQPLADF TSKYRSHYKF EHQLQGLDNH YVFSINKSHP HNDFLGNHKS NDFNLMKRSP GFEDEYDYLV SNPHLRSIHQ LKPRTLSKRM PVLITDDKEY ERIIDSKHAN TDAAGNTVDA SKQLLKDVSD SLSIRDPGFI EQWHLINTAY PGHDVNVTGL WYEGITGTGI VSAIVDDGLD AESEDLRANF NAKGSWDFND NTNIPLPRLY DDHHGTRCAG EIAAVKNDVC GVGVAYDSTV AGIRILSGPI TAAEEAAALI YGLDVNDIYS CSWGPTDDGR TLAEPETVVK KAMIKGVQEG RKDKGSIYVF ASGNGGRSYD SCNYDGYTNS IFSITVGAID YKGIHPDYAE ACSAVMVVTY SSGSGEHIHT TDIKKRCTAS HGGTSAAAPL AAGIYALVLQ ANPNLTWRDV QYVSVLSSVP INQQDGNYQT TALNREYSHK YGYGKIDAYQ MVHFAKDWKN VKPQAFFYSD IQSKPPADSH KRDGNIIRKK ITVTEEDLKI MNVERVEHVT VKLNIMATFR GRVGVRLISP TGVTSDLATF RPRDNSGVGF KDWTFMSVAH WGESGLGDWT IEVFGDENSS KQKNTIVFEN WQLRFFGESI DADKAETYEL EKDYAAVRRD RLSQNNDKQP ETTSETLSSS ESVSSSEVGT STVSTESSTS VTSTSDSTTP VEEDHNAEKV TESVSASSSS TQDAEATESS GAEEDEDGKL KYSADHTGQY FMALAVVGFI VIILFMKFFK TPGSGRRRRR EDFEFDIIPG EDYSDSEDDE DSMEFGRRSG RRAPPAPSFI PNEVDDEDDD RARDRVYDEF NSDTLPEYEE EMFRIDDEDE D
|
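The record lists translation table 12 (the alternative yeast nuclear code), in which CTG encodes serine rather than leucine. The sketch below illustrates the effect on the first 90 bases of the gene sequence above. It is a minimal, standard-library-only example, not the database's own method: the helper names (`make_table`, `translate`) are hypothetical, and the codon table is built from the standard NCBI amino-acid string with a single CTG override.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of NCBI translation table 1, in TCAG codon order.
STANDARD_AAS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)


def make_table(aas):
    """Map each codon (TTT, TTC, TTA, ...) to its amino acid."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    return dict(zip(codons, aas))


TABLE_1 = make_table(STANDARD_AAS)
# Table 12 differs from table 1 only in CTG, which is read as Ser.
TABLE_12 = dict(TABLE_1, CTG="S")


def translate(dna, table):
    """Translate complete codons only; spaces in the input are ignored."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First 90 bases of the gene sequence, copied from the record.
first_90_nt = (
    "ATGTTGTTTC CGTTACGGAC TTTACTCCTC ATTCTCTCGG TGTTGCTTCA "
    "GGTGCAATCG ACAGCGATTC CCAAGAGAGA CTACGAGCTG"
)

print(translate(first_90_nt, TABLE_12))  # MLFPLRTLLLILSVLLQVQSTAIPKRDYES
print(translate(first_90_nt, TABLE_1))   # MLFPLRTLLLILSVLLQVQSTAIPKRDYEL
```

Only the table 12 result reproduces the first 30 residues of the protein sequence in the record. Translating the full gene sequence this way would not reproduce the full protein, since the gene is marked as spliced and this sketch does not remove introns.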