Gene Information |
Locus tag | PICST_68327 |
Symbol | |
ID | 4840445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | + |
Start bp | 601822 |
End bp | 608526 |
Gene Length | 6705 bp |
Protein Length | 2212 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640391760 |
Product | hypothetical protein |
Protein accession | XP_001386126 |
Protein GI | 150866499 |
COG category | |
COG ID | |
TIGRFAM ID | |
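The coordinate, length, and GC fields above can be cross-checked with a few lines of arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, which is consistent with the listed gene length. The function names (`span_length`, `bases_after_stop`, `gc_percent`) are illustrative and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def bases_after_stop(gene_length_bp: int, protein_length_aa: int) -> int:
    """Bases left in the span after the coding codons plus one stop codon."""
    return gene_length_bp - 3 * (protein_length_aa + 1)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(bases.count(b) for b in "GC") / len(bases)


# Values copied from the record above.
print(span_length(601822, 608526))        # matches the listed gene length, 6705
print(bases_after_stop(6705, 2212))       # bases in the span beyond the stop codon
print(gc_percent("ATGAGCATCC GTAAGAGCAC"))  # GC% of the first two sequence blocks
```

With the listed values, 2212 residues plus one stop codon account for 6639 bp, so the 6705 bp span appears to extend 66 bp past the stop codon. This is arithmetic on the listed numbers only, not a statement about how the database defines the span.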
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATCC GTAAGAGCAC GACATCTCCG GTTTCTGGAA CCGATCGGGC CGATCTCAAC GACAGCTTCG CCGTTTCGGA AGCAGAACCT TCACAACTTC ATGGCCATGT AGAGTACGGA GCAGACGTGT CAGTAGGAGT TGAAGATGCG TCTGGTTTCA CAGCTGAAGA CGCGTTTCAA GTCGGGGGCG AAGAAGACGA AGAGCAGGTA TCCACTGTGA CCAATGGCGC AGTCGTACCA CAATCCGTAG TGGCAGCAGA ATCCGCAGAC GAAGACGTGT TGCTGAGATC CACCAAAAAG GAAGAGTTTC TTCCGGAATC GACTTCAATT GCAGAACCTG TTAAGCTGGA TGTTATTACA AATGTCATCG ACGCCAACGC CGATACTGGC CTTGATGATG ATCCATGGAT GAAAGATGGC ACGCCAGAGT TCCCCCCCGC AGACGAAAAC CTCCCAGAAG TGCCTCCAGT AGCAGAAACA AAAGTAGAGA CGTCAAAAAG TATCGCCAAG CAGCAAATAT ATGGAGAAAA GGTGGAAGAT TTCACTGAAA ATTCGGCTTC TCCACAGGAA GTCGATTCGA AGCAAACATC AAGCACCATA GCAGACTTTA TTCCAACGCC AGAGCATATT CATTCAAATC TTCATACTAG TGAAACAGAA AATGAACCTA TTAAACAGTC TCAATTGCCT TGGGAAAGCC ATGTAGAACA GATAGAAGCA TTACCATGGG AAGAACAGCC ACAAAAGGTT GAACATGTAC TACCATGGGA AGAAAAACCC GATGAAGTAT CACAGAAAGA TGAAGCTTTG CCATGGGAAG AGCGAAGAGT AGAACACCCA GAACAAGCCG CAGAGTCTAC ATTGCCATGG GAAGAACAAC CTAATGTAGA AGAATTTCAA GCTCACGAAC CTGCATTACC ATGGGAAGAA CGACCCGAGG TAGAAGAGTC TTCATTACCT TGGGAGGAAA AAGTTGGAGA ATCTCAAGGC CAGGAAGCTG CATTACCTTG GGAGGAACAA CAAGAAGTCG AAGTATCATC ATTACTCTTA GAGGAAAAAG TTAGAGAATC TTCGTTGCCT TGGGAGGGGG AAGAAGGAGA ATCTCAAGCT CAAGAATCTA AGTTACCTTG GGAAGAGCAA AGAATACAAT CTACACCACC TTCGGAAGAA CAAGTATTAG AACCTGTAGA GGGTTCTTCA TTGCCTGTGG AAGAAAAAGA TCTGACAGAA GGATCATTGC CGTGGAACCA ACATCACGAA GGGGAAGCAC AAGAGATTTT ACCATGGGAA CAAAGCACGA AACCACAAGA TACGCAACCA CAAGTATCAC AGGAAGAACT ACATGATTAC TCTGTTTCGA ATGTAATTTC ACAAAGTGAG ATTGTGGAAG AACAAGCCCA GCCTGTATCA CAGGCTTACG AAAGTCAGTT TTTGGCTGGT TTCTTTACTG AACGTCGCCC GGAACAAGAA TCTGAAACCA AACAAGAGTC GTTAGATGAG TTGTTTGAGA AGGACGAAGA CTTTCTTCCA GAATTAAAGC AACCTGTTGA ACAAAGCACT CAAACTTCGA AACCAAAGGA GTTCAACTTA CCGGAATTGG ATCTTGACGA CGATTTACTT TTGGATGATG ACTTGTTAGA CGATGACGAA CCTGAACTTG TTCCAGAACC TGTAGCGCCA GTGGAGCCAA TTCAAGAAGC TATCCAGAAT AATGTTGCCA AGTCTCCACG TCAGACTTAC ATTCCAACTC AGCCTCATCA TAACATGTAT GCTCCTCAAA TCAGCAGAAC AGATACTGGA 
GAGTATGTAA AAAAACTTGA AGAAAACAAG AAGAGGAACG ATGCTTACGA CTTTCCAGAC ATTTTGATGC CACCGAAAGT TAAACCTGCT CCGAGACATC ATCAGCCTAC TACCAAATAC TCGCAACCAG TAGCATCACC AAATCTTTCC AACATCCCGT CTGCTCCACC TGCAACTACT ATTCCTCCAC CTGTGAGTAT TGGAATTGAA GCATCTAAGA AGGAGCTTCC TACAACTCCT CAAGTCGCAC CAGGAAAGCC CAAGTCTTTC TTTGAGGAAC TTCCTGTTTC AATGCCCAAG AAAGCTGCAA GAGCAGCACC CGTAAAGGCT GTCAACCAGC CACATATCCA GAAGTCTCCA CAGATTAGTA ATACGCCAAT CAGTGCGAGC TCAAAACCAG CACCAGTAAA TCCGTATATG CCTAGTAATA CTCAAAAACA GAATCCTCAT TCTCCTCTTC AAGGCTCTCA ATACTCGTTT CCAAGGAAGA CCAGTTCTGG GTTGGCACCG CCACCAGCTT TGAATCAATA TGCTCCTCCT TCTCTGAATC AATATGCAAC AGTATCTGCA CAAGTTCAAC CAGGACTTCC TCAGTCAAGC TTGCCGCAAT CAGGAATGTC TCAGCCTGGA TTGCAACCCT TCCCACAACC TCAGGGTCAT TTGGTTCCTC CCAACTTAGT GGCTCCTCAG ATTCAGAATT CTCCACTAGG ACAACCTCAG CAGACACACA ACTATGCTCC TCCAATCAAC ACCAACGTTG CTAGAGCTCA AGGAAATCTT TCAGCTACGA GTCCATATGT TCCCAACGCT GGTCCTTATG CCCCTAATAA TCACAATAGG TCTCATTCCC GAGCTAGTTC GTTGGTAGGT GGTGGAAAAG GTAAGGAGAT CAATCCGTAT GCTCCTGCTT TACCTCCTGT AAGTCATCTG GGTACTTCTC CTTCAATCGC TCAGTCTTCA TTAACTTCGA GAAACAGAGG TATATCCAAC CCTAGAAATA TCTACAGCAA GGCCCAGCCA GCTCCAAAGA TTTCTAATCC CAACTCGTTG AACCAGAGAC AGTTCCCTAT CTTCAACTGG AGCAACTCGC AGAAAGTTGT CACTTTAATT CCAAACAGTA GCCATAACTT GTACGAGCTG CACGGAGAAT CTATCAGAGT AAGAGCTGCG ATAGATTTGC TCAAGGATAA GGAGATGTAT TCTACGTTTC CAGGTCCATT GCTGAAGAGT AAAACCAAGA GAAAGGATGT AGAGAAATGG TTAGAATCCA ACATTGCTTT GCTCACCACT AACAGTATTG AAAACCAGGA TGAGTACTTG CTCAACCAGG TATTGTTGGA GGCAGTGAAG TTTGAAGGTG GTTTCAATTC TCATGAATTC ATCAAAGCAG TATGTATGGT ATTGAATCCT ACAGCAGACT ACAATACTCC TGGTGATATT TCGGGGATGA CAAGTGTCAG TGCCAATGCC TACAAATTGG ACAATGCTGG AATTGGTATT GTATTCGGAT TGATTCAGGG AGGTCATATT GACAAAGCGC TTGAATTCAC GTTGTCTAAA GGTGACTGGG CTTTAGCTCT TGTTGTATCG AACTTTGCTG GCCACGATCG TTTTGCTAAG GTAGCAGCGG ACTATGCAAG ATTCAGTTTT CCCTATCAGA AGTCCAACAA CAAAATCCAC CACATCATTC CTATTCTTCT CAAGTTGGCT GTAGGAAATG TGAAGAGTGT CATTGACGAT TTAAACGCAG TAGCTACAGA AGGTGAATAT GCCAGTCATC ACTGGCGAGA AATCGTATCC TCTGTAATTA 
TCAGCGGAGT CAGTAAGTCA CAGGACTTCT TGGTTGAATT CGGTAAATTT TTGGGTCTTC ACCACAACAT TGCAGCTTCG GAAATCTGCT ATGTAATTGC AGGTTTGCCA TTATCCCCTC AGGCCTTACA ACCTAGTGGT ATAGTAGTTT CTGTAATTGG ATCCCTCACT GGCACTTCAA TGTACACGGA AGTGTACGAG TACATTTTGA AGATCAGTAC TGTATTTGTT CAACAGGGAG TTGGCATTAT TCCGCACTTG TTGCCATTGA AGCTCAAACA TGCTACTGTT TTGGCTGATT ATGGGTTGTT TAACGAATCG CAGAAGTACA TTGATAGCAT CAACAACAAC ATGAAGACAT TGGGTAATCG TTCTCCTTAC TTGAATGCTG CGTTCATTCA TGAGTTACAG AGTTTGATTG TCAGACTTCA AGAACTAGGC TCCAGTGAAC TGAGTTGGTT TAGCGGAAAG ATGAGTAAAG TAAATCTTGA CAATATCTGG GGACAAATTG ATAAGTTTAT TGGCGGCGAA GAACCCAAGT CCAAGTCCGG CGAGAATGGC GTATTCAGTA AGTTCAGTCC TTCTGTTTCA AGAAACACTT CTACCTTGGA TTTTACCGCT ATGAATGTTT CTTCAAACAA GTATCCTCAT TCTCTGCCAC AGATGCCTTC GGGACAGTTT GGATCTTCCT CAATTGCTCC ATCTACTGCT GATGGTGTTC TGGTACCAGT TACTAGAGCC ATGCCTCCAT TGCAATCGTT CAATTCTACT CCTGCTATCA ACACCTTAAG CATGAATAGC AAGTACTCAC TTCCTCACTT ACAACAGACT AAGAATGCCA GTTTCTCTGT TCACTCACAG TCTACGCAAG TACCTCCATC TCATCATCTG ACCCCACAGC AGTCACATGT ACATCCTAAT GTTCCGCTTC ATCATGCGGC CGAGTCGAAG TATTCTCCAT CGAATGCTTC TCCACAGGTC CCTGACTCTA CTGCCTCTCC GTATGTTCCA CCAGTAAAGA GAACCACACC TAGAGTTGCT GCTAAAGCAG TGAATGCTAC TGGACCATAT GTGTATGCTA ATCCAGAGGG CCACGTCTCT AATTCTTCGA TTGGATCTCA TAGGCAGCAC GGTTATCATC CTCCTACCAG CCAGACTCCT ACTGTGAATG CTAAGAGACA TTCTGTAGCC AGCGTTATTT CCAACGAGGC CACACCTAAT AGCGAAGTGA TTCATTCGCA CCACAACCAC TCTCCACTGA TTCAAAGTGA TATCAGTATG GATTATCCAC TGGAATTCAA ACCTCCCCCC GCAGTCAAGC AAATTGTAGA CCATAAGCTT GTATCGGCTC TTAATGATCA TGCCCCAGAC ACCATTGACG AGTCTCCGGA ATTTAAGAGT GAAGCTATTG TAGAGAAAGT TCCACCTTTG AGTGCTTCCC AGGAAGTCTC AGGACAGCCA TCTGCTGCTT CCACTTCTGT GCCTTCTTCC GAGAAGGCTG CAACTGTGGA TGACGAAACT ACATCAAGTG CTGCATCTGC TCCACCTCCT CAAAGTAAGG CTTCATTAAA GAAACCAGCA AAGGTCAATC CATATGCTCC AGGTGCCAAC AGGTCTGGCA CTGCTAGAAA GAGCAAATAT GGACCTCCCA CGGGAGCTTC CTCTAGCAAG TATTCAGTAC CTGCTAACTT GGCTCAGCAG GATGAGCCAA TCAATGATGA CACCTTGAAG TACGGAAGCA TGTTTGATTA TAGTGGGTAT AAGACAGGAG AGCCAACGGT AGCTGACAAC GACAATGTAG AACCGCCTAC 
GCAATCTGAG ACTACTTCGA AGCATGCGGC TGAAGAAGAT GTTACTGAGT TGATAGTTGA GCCCAAGCCT TCTATCCCAG AGAGTCCTAG CGCTCCTAAG ATCGTCAGAG AACCAGAAAC TTCAGCACCA CCTCATTTCA AACAGCCCGA ATCGCACTTT GCGCCATTTG GCTATAACAC TCCATTGAAG GCTAGAACCA AACATTATGC AAATATCGAT GATAGTTTTG ACAACTCTGA GGTGTCTGGC GACCAAGATC TTACCGATAT CCATACTCCC AGAAAGAGAC CATTGATTCT CAACAACGGT CCTCCGCAAT TGACAGGCAA AGAATCAATG TTCAATCCTT ACCAAGGCGA TAGTGGAGTT AATAAGTTTG GACTTGACTT CCCCATTCCC GGCTCTCCAG ATTACACCAC TAGAGCCAAC AGTGTAGTAG ATCAGCCCGG CTACTTCTCT TCGAGACTAT CGCAATCACA GCAGTCAGCC TTGTACCAAC AATACGAAGT AGAGGACGAC ACTGTGAGAG ACTACGTACC AACTGTAGAA GAAGACGACG AAGAAGATGA AGATGAAGAA GATAGGGATA AGAAGAGATT ACAGGAAGAA GAAGAGAAAA GGGCCAAGGC AGCTTCTGAA GAAGCAAAAA AGAAAGCTTC TGAAGCTGCC GCTAAGAGAG ATCCCGGTAG AGGGTGGTTC AATTGGCTGG GTAAGAATGA TGGCAAGCCT AAGCCTATCA AGGCCAAGCT TGGAAACCCC AGCACTTTCT ACTACGACGA AAAACATAAA AGATGGCTCG ACAAGTCTAG GCCCATTGAA GAACAACTAC AGGCAGCAGC TCCTCCTCCA CCTCCTGCCA TGAAGAAAAA GGCACCTGCA GCTTCTAGTA ATATAACAGC TTCTTCTGGT CCACCTTCAG CTGGCCCACC TCCAGGAATT AATCCATCTG GGGTTGCTCC AACTGCTGCA CCTTCTGGTC CTCCACTGGC ACCACTGGGA GTTACCAGTG GACCTCCTTC CATCGGAACC GGCCCATCTA ACGGAGCCGG AGGTCCACCT TTTGCCAAAG CTACATCTGT ACAAAGCAGT GCCCCTAGCC TTGCTAATGC TGGTTTGGAT GATTTGCTTT CTATGGGTGG CAGTTCTGTT GCAGGAGGAA CTCGTAAAGC AAAGAGAAAC ACCCGGCGTG GACATATCAA TGTGTTTGAT AAAAAGTAGA AATGTACATA AAATAGCATT ACATCACAAT ATATATGGTA GACTTATAAG AGTAGAACAA TAATT
|
Protein sequence | MSIRKSTTSP VSGTDRADLN DSFAVSEAEP SQLHGHVEYG ADVSVGVEDA SGFTAEDAFQ VGGEEDEEQV STVTNGAVVP QSVVAAESAD EDVLSRSTKK EEFLPESTSI AEPVKSDVIT NVIDANADTG LDDDPWMKDG TPEFPPADEN LPEVPPVAET KVETSKSIAK QQIYGEKVED FTENSASPQE VDSKQTSSTI ADFIPTPEHI HSNLHTSETE NEPIKQSQLP WESHVEQIEA LPWEEQPQKV EHVLPWEEKP DEVSQKDEAL PWEERRVEHP EQAAESTLPW EEQPNVEEFQ AHEPALPWEE RPEVEESSLP WEEKVGESQG QEAALPWEEQ QEVEVSSLLL EEKVRESSLP WEGEEGESQA QESKLPWEEQ RIQSTPPSEE QVLEPVEGSS LPVEEKDSTE GSLPWNQHHE GEAQEILPWE QSTKPQDTQP QVSQEELHDY SVSNVISQSE IVEEQAQPVS QAYESQFLAG FFTERRPEQE SETKQESLDE LFEKDEDFLP ELKQPVEQST QTSKPKEFNL PELDLDDDLL LDDDLLDDDE PELVPEPVAP VEPIQEAIQN NVAKSPRQTY IPTQPHHNMY APQISRTDTG EYVKKLEENK KRNDAYDFPD ILMPPKVKPA PRHHQPTTKY SQPVASPNLS NIPSAPPATT IPPPVSIGIE ASKKELPTTP QVAPGKPKSF FEELPVSMPK KAARAAPVKA VNQPHIQKSP QISNTPISAS SKPAPVNPYM PSNTQKQNPH SPLQGSQYSF PRKTSSGLAP PPALNQYAPP SSNQYATVSA QVQPGLPQSS LPQSGMSQPG LQPFPQPQGH LVPPNLVAPQ IQNSPLGQPQ QTHNYAPPIN TNVARAQGNL SATSPYVPNA GPYAPNNHNR SHSRASSLVG GGKGKEINPY APALPPVSHS GTSPSIAQSS LTSRNRGISN PRNIYSKAQP APKISNPNSL NQRQFPIFNW SNSQKVVTLI PNSSHNLYES HGESIRVRAA IDLLKDKEMY STFPGPLSKS KTKRKDVEKW LESNIALLTT NSIENQDEYL LNQVLLEAVK FEGGFNSHEF IKAVCMVLNP TADYNTPGDI SGMTSVSANA YKLDNAGIGI VFGLIQGGHI DKALEFTLSK GDWALALVVS NFAGHDRFAK VAADYARFSF PYQKSNNKIH HIIPILLKLA VGNVKSVIDD LNAVATEGEY ASHHWREIVS SVIISGVSKS QDFLVEFGKF LGLHHNIAAS EICYVIAGLP LSPQALQPSG IVVSVIGSLT GTSMYTEVYE YILKISTVFV QQGVGIIPHL LPLKLKHATV LADYGLFNES QKYIDSINNN MKTLGNRSPY LNAAFIHELQ SLIVRLQELG SSESSWFSGK MSKVNLDNIW GQIDKFIGGE EPKSKSGENG VFSKFSPSVS RNTSTLDFTA MNVSSNKYPH SSPQMPSGQF GSSSIAPSTA DGVSVPVTRA MPPLQSFNST PAINTLSMNS KYSLPHLQQT KNASFSVHSQ STQVPPSHHS TPQQSHVHPN VPLHHAAESK YSPSNASPQV PDSTASPYVP PVKRTTPRVA AKAVNATGPY VYANPEGHVS NSSIGSHRQH GYHPPTSQTP TVNAKRHSVA SVISNEATPN SEVIHSHHNH SPSIQSDISM DYPSEFKPPP AVKQIVDHKL VSALNDHAPD TIDESPEFKS EAIVEKVPPL SASQEVSGQP SAASTSVPSS EKAATVDDET TSSAASAPPP QSKASLKKPA KVNPYAPGAN RSGTARKSKY GPPTGASSSK YSVPANLAQQ DEPINDDTLK YGSMFDYSGY KTGEPTVADN 
DNVEPPTQSE TTSKHAAEED VTELIVEPKP SIPESPSAPK IVREPETSAP PHFKQPESHF APFGYNTPLK ARTKHYANID DSFDNSEVSG DQDLTDIHTP RKRPLILNNG PPQLTGKESM FNPYQGDSGV NKFGLDFPIP GSPDYTTRAN SVVDQPGYFS SRLSQSQQSA LYQQYEVEDD TVRDYVPTVE EDDEEDEDEE DRDKKRLQEE EEKRAKAASE EAKKKASEAA AKRDPGRGWF NWSGKNDGKP KPIKAKLGNP STFYYDEKHK RWLDKSRPIE EQLQAAAPPP PPAMKKKAPA ASSNITASSG PPSAGPPPGI NPSGVAPTAA PSGPPSAPSG VTSGPPSIGT GPSNGAGGPP FAKATSVQSS APSLANAGLD DLLSMGGSSV AGGTRKAKRN TRRGHINVFD KK
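The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below shows the effect on two short fragments copied from the gene sequence above. It builds the standard table from the usual compact TCAG-ordered string and overrides only CTG; start-codon differences between the tables are not modelled. The names `STANDARD_TABLE`, `TABLE_12`, and `translate` are illustrative, and unknown codons are rendered as `X` by assumption.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

STANDARD_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), STANDARD_AA)
}
# Table 12 differs from the standard code in its sense codons only at CTG.
TABLE_12 = dict(STANDARD_TABLE, CTG="S")


def translate(seq: str, table: dict, to_stop: bool = True) -> str:
    """Translate frame 1 of seq; spaces between 10-base blocks are ignored."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = table.get(bases[i:i + 3], "X")  # X for codons with ambiguous bases
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


start = "ATGAGCATCC GTAAGAGCAC GACATCTCCG"      # bases 1-30 of the gene
with_ctg = "GAAGACGTGT TGCTGAGATC CACCAAAAAG"   # bases 271-300, contains CTG

print(translate(start, TABLE_12))           # MSIRKSTTSP
print(translate(with_ctg, TABLE_12))        # EDVLSRSTKK
print(translate(with_ctg, STANDARD_TABLE))  # EDVLLRSTKK
```

The table 12 result for the second fragment matches residues 91-100 of the listed protein sequence, while the standard code would place a leucine at the CTG position.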