Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_53208 |
Symbol | |
ID | 4851754 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 2748112 |
End bp | 2751543 |
Gene Length | 3432 bp |
Protein Length | 1143 aa |
Translation table | |
GC content | 39% |
IMG OID | 640393462 |
Product | predicted protein |
Protein accession | XP_001386857 |
Protein GI | 126275500 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.691313 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTCAT TAAACACGGC TGAGGCTGTG ACCAAATTCT TGCGGTCAAA AACTGCCACT ATCGACCAAA TCATAGAAAC ATGCCTTTCC CTTCTTGATA ATAGTACAGT TAACCAAGTA TATCTTCCCA ACAAATGTAC TTTTATTCTT GAGTTGCTTA GTGACCGGTT GAATGACTGG AGTAATGGCC CGTTCAAACT GTGGAAATAC AATAGTTCAA TCTGGCGTTT GCTAACAAAA GTATGGGTTC TGTTAGCTTC GGAAGCTTCA AGCAAGACTT CGAGAAACAA AATATTTAAG AAAGTAAAGC TAGTTGATAT CATTATAGAC ATATTGTCCA ATGCTCCGGA TCAGGAATTG CTTCTGGAGC TCATGAATTT TGTTTCTTTG GCCAGAGAGG GCACTTATAT CGAAATCGAT GAGAATAACT CTATGAACAT GCTTAGCTCA TTCTCCAATT GTAGTACTTT GCTCACAGCT AACGATTCGG ATATCGCCAG ATGGGTCTCA CTCGTTAGAG ACTTGTACCA ATTGCCTCGT CTTACGGTCA GCTTTTCACC TTCTAAAAAG TACTACAAGA AGTTCATCAC TACTTGTTCA CCTAATCTAT TGCATTTCAT CGTCATATAT GGCACAAAGG GCTACAGTTC CATATTCAGA GATATCCTTG TCAAAGAGCT CTTCTTCGGC GAGTTGGTTT CCAACTTTGT TTCTGATATT GCCAGCGTCT TGAAAGATGA CAAGATTCTA TTGGAATCGA GTTCTGTTCA GTTATTGTTC AAACTCGCTG TAGAATTCGT AGCCTCGAAG GACTCAGCAT TGTGTGAAGA ACTTTTCACA ACCATCATTG GTATCCCCAG ATTCTCCTCC TTGTCTGAAA CACTACTTTC CATTCTAGCC TCTGCTAATA AATCACTTTC TAGTAAGTTT TTCCTAGCAA TTTATGAGTC GGAATTCACA AAACAGAAGC AACAGATCAA CTGGAATCTC ACAAACTACT TGCTAGAACT TGACATAGAC TTGGCGATGC AGAAATGTGA AGAAGTCGTT ACTAAGGCTC CCAAAAGCTC CCGAGAATCT ATCGGTGTTT CTATCCTTGA AGCCTATGTT AAGGCTAGAG AATTGCCCGA TTTCATAGAG TCGATTTGGC CACGGGCAAT TAAAGTTGAT CTCATTTATG CCCAGTCCTC TTTTATCAAT GAAGTTTCCA AAAATATCAA GATCTTGTCA TCTAAACAAT TGGGTCTTCT CATTTCCGAT ATTTCGAACC TAGACGATGC GATTCAGGTT CCGTTGTTAA CAGCCATATC GAAGGGTTTA CTTAGCTGCC CTGTGTCTAC TCAGGATACC TTGAAACAAA CACTTCTAGG AAGTAAGAAC TATTACATGA CCTCCAAGAA TACCAGATGG GAAATTTTGT TCTACTTGCT TTGTTTGTAT GGAAGAAATG TCGACGTCCC AATTGAGTCG TTGAAGAGCG ATACCACTCT TTACTATTAT TATTGCATCT TACGTCTTCT TGAATTGGGA AAGGATTTTC CGGATATGAA TTCATTTCTG AAATCATTTG CTTCTCTTCT TTTGAGTAAT CCAGAGTTGC TGAGAGTGGT TTTGAAGAGA TGGTTAGTTG TTATTGACTA CTATTTTCCC AAGCCTCAAA TCGACTCTAT AATTGCATCT CTTGTAAGAC ACCTTCATGA ATTTGGTGAG TTGTTGACAC CAGTATTTTT TGAATGTCGC AAGTTACCAA AATCATTAAT TAATTATTGC ATTGACAATG TCGATACCGC CATCGAATTG ACTACCCTAA TTCCTATCTA CTGTTTTGAC AGAACTTCTA AGAAGAAGCT ACTTGAAATC TTCTTGGAAC AAACAGGCAA GACCATAAAC GAAGATAAGC AAACAAAGAT CAGACTTGCC ATCAAACATT TGCTTGAGCA GCCTACTTTC ACTTCTACGA TCGAAACTGA TTTCAGAAAA GCATTAAATT TGTTTAGAGT TTCCAATTCC AATTCATTGC AAATTGCTCA AGACATCTTT AGTATCATTT GGAAGAACCA CTTGAATCAA ATATCCAGCA AGCAGAATGA AGCCGACGTT GTGGATACTG TAAAGTACTT AACGAAGTAT TTCTCTTCCA AAAAGTTCAA AGGAATTGAT GCTGAATTCA GTGCTGCTGA AATCATCATT ATGAATATCA AGAAATATGA TCAGATAGAG TCCAGTTTAC TTGTCCATTG TGAAACCTTG AGGACCTACT TTTCCAATGC TGTCATATCA TCGTTGAAGC AGCAGTTAAG CAAGATGAAT ATTGACGTGG TTAGTAGACT TTTGAGGTTT TTGGAATTAT CTGGCGACGA ATCTTCTGAT GATGTATTAA AAATTGTAAA GGAAATTGGT CTGAAGATTG AAAATGATGA TGATGACTCA TTAAGAGAGG TAAGGCTAGC ACTCTTCACC TTGCTTTCTT CCAGACTCGA GCATACGTTT GAAAACTCAG TATACATTTT CTCCTTGTAC ATTGCACTTC AAGACGGTTT CAATGTGGAT GCTTCTTTGG CGTTAGATTC CTATTTGCAA GTCTTATCAT CCAATTATGC TCAATTCTTG AAGTGCTACA TCTATGTCCT TGGATCTCTT GAAGAAGCAC ACGATGAACA TTGTGATGGC TTGCTGAAAT TGATCCTCTT GATGGTTAAA TATTTGCAAA AGAGTCATCT AGAAGAGAGT AAAAAAGCTT TCACAAGAAC TATTCTGAGT ATTAGCACCA AGATTGGCTA TATCCACGCG GATGTTGCTG GCATGATCTT AGTTTCTCTC AAGAATAGTC TCAGTGAGAC TGCCTGGTTG TTTAGCCAGT ACCTGCTTGA AATGGTATTT TCGTTGACGA ATCTGATTGT CCTTCTGGAG AATGGCAACA ATGCTAACAA CTACATTCTT GCTACCCAAG TCATTTCTCA TGTGTTATTG TTCCATAGGT ATAAGCTCAC TTCTCGTCAT CATATCGTCA TTAACACCTT TAATGTGTTG ATGAGAAAGT TAAGTATGAA CAAGTCGGAT TTGGCATTGT CTGATAGCGA AGAAGCTGCT GCATCCTATT CAAGATTGAT TGCCAACTTG TGTGAGCCTT CGAAGCTAGT TGTTTCGAGA GAAGTGGCCA ATGAGAAGCT TACTACGGCT ACGTCGTTGA TCAAGAAGTC TCTCAGAAAG CATGTACCTA TGTTGTTGAT CAATTATATT TACCTCAACC TTCGGGTCAA CTTCAAGAGC AACGTTAATG ACCAGTTGAT GTCTGGAATC TACAGCATAT TCGACGTACT CTCACAGAGC GAGCTTCAAT TGGTGAGTTT GTCTCTCGAT ATTCCTGGAA AATCGTACTA CAGAACATTA TATAATAACT ACAAGGACCA CGGAAAGTGG AAAGATACAT AA
|
Protein sequence | MSSLNTAEAV TKFLRSKTAT IDQIIETCLS LLDNSTVNQV YLPNKCTFIL ELLSDRLNDW SNGPFKLWKY NSSIWRLLTK VWVLLASEAS SKTSRNKIFK KVKLVDIIID ILSNAPDQEL LLELMNFVSL AREGTYIEID ENNSMNMLSS FSNCSTLLTA NDSDIARWVS LVRDLYQLPR LTVSFSPSKK YYKKFITTCS PNLLHFIVIY GTKGYSSIFR DILVKELFFG ELVSNFVSDI ASVLKDDKIL LESSSVQLLF KLAVEFVASK DSALCEELFT TIIGIPRFSS LSETLLSILA SANKSLSSKF FLAIYESEFT KQKQQINWNL TNYLLELDID LAMQKCEEVV TKAPKSSRES IGVSILEAYV KARELPDFIE SIWPRAIKVD LIYAQSSFIN EVSKNIKILS SKQLGLLISD ISNLDDAIQV PLLTAISKGL LSCPVSTQDT LKQTLLGSKN YYMTSKNTRW EILFYLLCLY GRNVDVPIES LKSDTTLYYY YCILRLLELG KDFPDMNSFL KSFASLLLSN PELLRVVLKR WLVVIDYYFP KPQIDSIIAS LVRHLHEFGE LLTPVFFECR KLPKSLINYC IDNVDTAIEL TTLIPIYCFD RTSKKKLLEI FLEQTGKTIN EDKQTKIRLA IKHLLEQPTF TSTIETDFRK ALNLFRVSNS NSLQIAQDIF SIIWKNHLNQ ISSKQNEADV VDTVKYLTKY FSSKKFKGID AEFSAAEIII MNIKKYDQIE SSLLVHCETL RTYFSNAVIS SLKQQLSKMN IDVVSRLLRF LELSGDESSD DVLKIVKEIG LKIENDDDDS LREVRLALFT LLSSRLEHTF ENSVYIFSLY IALQDGFNVD ASLALDSYLQ VLSSNYAQFL KCYIYVLGSL EEAHDEHCDG LLKLILLMVK YLQKSHLEES KKAFTRTILS ISTKIGYIHA DVAGMILVSL KNSLSETAWL FSQYLLEMVF SLTNLIVLLE NGNNANNYIL ATQVISHVLL FHRYKLTSRH HIVINTFNVL MRKLSMNKSD LALSDSEEAA ASYSRLIANL CEPSKLVVSR EVANEKLTTA TSLIKKSLRK HVPMLLINYI YLNLRVNFKS NVNDQLMSGI YSIFDVLSQS ELQLVSLSLD IPGKSYYRTL YNNYKDHGKW KDT
|
| |