Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_54680 |
Symbol | |
ID | 4837496 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 1201638 |
End bp | 1205360 |
Gene Length | 3723 bp |
Protein Length | 1228 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640388811 |
Product | predicted protein |
Protein accession | XP_001382447 |
Protein GI | 150863836 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTATCG ATGATGATTC GCTCTATCTC TACCATCTCA CGCTAAGGGC TCCTTCCAAT TTCACGCTGT CGGTGTTAGG GCAGTTTTTG GGCGAAAAGA AGTCACAGGA GATTCTAGTA TCATCCGTTA GCACTCTTCA ATTGTTACGT CCGAATGCCG AAACTGGCAA GATAGAAGTG GTGGCGAGTC AAAATACTCT CGGTGTAATA CATAAAATTG AAAAGATCCG AATCGTGGGT ACTCAAAAGG ACCTAGCAGT TGTAGTAGGA GAGTCTGGAA AAGTGGTGTT TCTAGAGTTT GACGTAGATT TGCATAGGTT TGTGCCTGTT TTACAAGAAC CGTATGCAAA GACAGGATTT GGAAGAGTCA ATCCAGGAGA GTACCTTGCT GTAGATCCTC AGAGTCGCTG CATTTTTCTT GGTGCCATAG AAAGAAACAA ATTGATTTTC AAAGTAGAGA CAGACTCTCA AGGAAAGGTA GAGCTCTCTT CACCTTTAGA GGCACACTCG AAACATACTT TGACATTAAG TGTCGTAGCG TTGGATACTC AGTTTTCCAA CCCTGTTTTC GCTGCCATTG AGTGCGATTA TTCTAACTAC CACAGTGATG GAAAGGTTCA GTTCGATGCG GATTCCTCGC CTCTTCTACT CAATTATTAC GAGCTTGACC AGGGGTTGAA TCATATAGTC AAAAAAAAGT CCACAAATAC CATTCCTTCA TCGGCTACGC ATTTGATCCC ATTACCGAGC CATGTTGGTG GGGTTTTCGT CTGTTGCAAA AACTATATCA TATATGATAA TCTTCATAAA AATCTCGAAA GACTCTATCT TCCTTTACCA CTTAGAAAGG ACAGCGAATA TACCGTAGTT GTCAGTCATG TCGTCCATAA ATTGAAAAAG AACAATTTCT TTGTTTTACT TCAATCTTCT ATGGGTGACT TATTCAAAGT GACTGTTGAA TACAACAGCG ACAAAGAGTT GATTGAGGAT ATTCAGATCG GCTATTTTGA CACCATTCCC GTTTCATCGT CGTTGAACAT TTTGAAAAGT GGATTTCTCT TCGCAAATGT GTTGAACAAT GACAAGCTCT ACTACCAGTT CGAGAAATTG GGAGACGACG ACGAAAATAT TCAGTTGAAA GCATCTCCTG ATATTTCATC TATTGACGAA GAAGATAGAA GCAACAGAAC ATTTACAGTG AAAGCTCTCG ACAACTTAGC ATTAGTAGAG ATTTTCACTT CTCTCAGTCC GATAACTGAT GCTGGCATTG TGGAGTCGAT TTCTAGTGGT ACAGCTGACT CATTACAACA GATGATTACC GCATCTTCAC ATTCGCATTT GAAATCACTA GTACATGGAA TTCAAACCTC GACTCTTGTA TCTTCACCAT TGCCTATCAT CCCAACTGGA GTGCTCACAA CGAAGTTGTT TGCAGATTCT CGTAGCGATG AATACTTGGT CATATCCTCC ACAGTAGCTT CCCGAACTCT TGTATTGTCT ATTGGCGAGG TAGTTGAAGA AGTCGAAAAC TCTCAATTTG TCAATGATCA ACCTACTCTT GCAGTTCAAC AAGTAGGGAC TTCTTCAGTA GTTCAAATCT ACACAAATGG GATACGACAC GTAAAACACA CAAGAACCGA AGATAAAGAA CAGTCTATAT CCAGAAAGAT CACAGATTGG TATCCGCCGG CTGGCATCAC TATTGTCAAT GCCAGCACTC ACAGGGAACA GGTGATCATT GCTCTTTCAA ATGCCGAGAT TTGCTACTTT GAAGTAGATG CCACAGACGA CCAGTTAATA GAGTATCAAG ACAGAGTAGA AATGTCTAAT TCAATTACAT CGATCGCTAT ATGCGAGGAG ACAGCAAACA AAAAGAACTT GTTTGCTGTA GTTGGCTGTT CTGATGAAAC CATTCAGGTT TTATCTTTGC AACCACACAA TTGTCTTGAG ACATTATCGT TGCAGGCTCT TTCAGCTAAC AGTACTTCAT TGCTGATGCT CCAGAACGAC AACACAACAA TGGTTCACAT TGGAATGGAC AATGGACTTT ATGTTCGGAC TTCTATAGAA GAAATCAGTG GAAATCTATC TGACACTCGC ATAAAGTATC TTGGCTCCAA GCCCGTGACT TTATCAGTAA CCAAATTACC AAATGGAAGC AAAGCGATTT TGGCTATCTC CTCCAGACCT TGGATTTGTT ACTACAATAG GAGCGAGTTC AAAGTCACTC CTCTCCTTGG TGTCAAAATC TTGAAAGGAG CATCTTTCAG TTCTGAGGAC ATAGGAGGTG AAGGAATTGT TGCTCTATCT GACAACAATT TAATTGTCTT CACTATAGGC AAAGAAGACG TAGAGTTCGA TATAAACCAG GACGTCAATA TTGAAAAAAT CCGTTTAAGG TATACACCCA GGAAGTTAAT TATAGACGAT GACGGAAAGA GCTCTAAAGT AAATTACATA TATGCCTTAC AGTCAGAATA CGGAACTAAG AGTCCATTCT CACCTTCAAA CTTGAATTCG GAAGATGACC CTGAAAGTGA AATAGACCAA GATTATTACG ACGCTTTTGG CTATGAGACT GAAGTTGATA AATGGGCATC ATGCATTCAA GTGGTAGATT TTGAAAATCT GAGTATCATT CAAACAGTTG AGTTCTCTAG CAACGAGAGT GCTATATCCA TGGCAAAATT GCACTTTGTA TCGTCAGGCA AGGGTAATAT GGAACATTTA ATTATTGGAG TTACTACAGA TAGGAAGTTT CTCAAAAATT CAGTTGGGAA AAGTTACCTA TTCACATTCA AAATCCAAAA GAATACCAGA AAATCCAATA AGAAAAGACT AGAGTATCTT CATAAGACAG AGATAGATTG TTCACCTACA GTGATGATTC CTTTTAATGG AAGATTGTTG GTTGGCATGG GAAAGTATTT ACGACTTTAT GATATTGGAC ATCGCCAATT GCTTCGAAAA TCGTCTACAA ATATTGACTA CATTTCTTCT ATAGTAGACC TTGTACATAC TGGAGGAGAG AGAATAGCAT TTGGAGATTC TCATTCGTCC ATTGTATTTG CCAAGTTTGA CTCTGCTGAG AACAGATTTG TACCATTTGC TGACGACATA ATGAAGCGAC AAATTACAGC AGTCGCAGCT TTGGATTACG ATACCGTTAT AGGCGGTGAC AAATTTGGAA ATGTATTCGT TTCTCGAGTT CCCGATTCCG TTTCGAAAAA GTCTGATGAA GACTGGAGTC TATTGAAAGT CCAGGAATCA TATTTGAATG CTTCTCCATC TAGAACGAAG AACCTCTGTG AGTTTTTCCT TCTGGATACA CCAACTTCCT TCACCAAAGG CAGTATGACG ATTGGTGGAC ATGATGGCAT TATTTACACT GGTATTCAAG GAACTGTAGG ATTGCTTTTG CCTCTTTCTA CAAAGCTGGA AGTCCAGTTC ATAAACAGTT TGGAGCAATC GTTGCGACAA GTATTCGACT ATAACTTTGA TGACTACGAT AGTAAGCAAA TGGGTTTCAA TTTACTTGGT ATGGATCACT TGAAATTCAG AAGTTATTAT AATCCAGTGA AGAACGTCAT TGATGGGGAT TTGATAGAGA AGTACTATGA GCTTAGCCAA AGCTTGAAAA TAAAAATTGC CCGTGAATTG AATAGAACAC CAAAAGAAGT CGAGAAGAAG ATCTCTGACT TACGAAATAG ATCAGCATTC TAG
|
Protein sequence | MSIDDDSLYL YHLTLRAPSN FTSSVLGQFL GEKKSQEILV SSVSTLQLLR PNAETGKIEV VASQNTLGVI HKIEKIRIVG TQKDLAVVVG ESGKVVFLEF DVDLHRFVPV LQEPYAKTGF GRVNPGEYLA VDPQSRCIFL GAIERNKLIF KVETDSQGKV ELSSPLEAHS KHTLTLSVVA LDTQFSNPVF AAIECDYSNY HSDGKVQFDA DSSPLLLNYY ELDQGLNHIV KKKSTNTIPS SATHLIPLPS HVGGVFVCCK NYIIYDNLHK NLERLYLPLP LRKDSEYTVV VSHVVHKLKK NNFFVLLQSS MGDLFKVTVE YNSDKELIED IQIGYFDTIP VSSSLNILKS GFLFANVLNN DKLYYQFEKL GDDDENIQLK ASPDISSIDE EDRSNRTFTV KALDNLALVE IFTSLSPITD AGIVESISSG TADSLQQMIT ASSHSHLKSL VHGIQTSTLV SSPLPIIPTG VLTTKLFADS RSDEYLVISS TVASRTLVLS IGEVVEEVEN SQFVNDQPTL AVQQVGTSSV VQIYTNGIRH VKHTRTEDKE QSISRKITDW YPPAGITIVN ASTHREQVII ALSNAEICYF EVDATDDQLI EYQDRVEMSN SITSIAICEE TANKKNLFAV VGCSDETIQV LSLQPHNCLE TLSLQALSAN STSLSMLQND NTTMVHIGMD NGLYVRTSIE EISGNLSDTR IKYLGSKPVT LSVTKLPNGS KAILAISSRP WICYYNRSEF KVTPLLGVKI LKGASFSSED IGGEGIVALS DNNLIVFTIG KEDVEFDINQ DVNIEKIRLR YTPRKLIIDD DGKSSKVNYI YALQSEYGTK SPFSPSNLNS EDDPESEIDQ DYYDAFGYET EVDKWASCIQ VVDFENSSII QTVEFSSNES AISMAKLHFV SSGKGNMEHL IIGVTTDRKF LKNSVGKSYL FTFKIQKNTR KSNKKRLEYL HKTEIDCSPT VMIPFNGRLL VGMGKYLRLY DIGHRQLLRK SSTNIDYISS IVDLVHTGGE RIAFGDSHSS IVFAKFDSAE NRFVPFADDI MKRQITAVAA LDYDTVIGGD KFGNVFVSRV PDSVSKKSDE DWSLLKVQES YLNASPSRTK NLCEFFLSDT PTSFTKGSMT IGGHDGIIYT GIQGTVGLLL PLSTKSEVQF INSLEQSLRQ QMGFNLLGMD HLKFRSYYNP VKNVIDGDLI EKYYELSQSL KIKIARELNR TPKEVEKKIS DLRNRSAF
|
| |