Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_73961 |
Symbol | |
ID | 4841059 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | - |
Start bp | 347377 |
End bp | 351627 |
Gene Length | 4251 bp |
Protein Length | 1325 aa |
Translation table | 12 |
GC content | 38% |
IMG OID | 640392374 |
Product | predicted protein |
Protein accession | XP_001386654 |
Protein GI | 150866904 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.828275 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.279842 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TGCAATATTC GATTGACAAT TCCTACTATA CATCTCGTTT GCACCAATTG CTTTGCTGTC AAGGAGTTTT CCACAGTCTC CCTTACCATT TTTTTACTTA AAGGTAGAAA ATTCCAGGAG TGTTTCTCGA TAACTTCACC AATTTCTCCG ATACAAGAAG ACAGTCCTTC GGAAGCAAAT AGACAAGTCA GACAAAGTCA GGATGTCGCA CTACATGGAC GAAATTGCTC CTGATGTCAT ATCGCCCCAT GATACAAAGC CTACTGTAGA CCAGCCTTTG AAGCACTTAG ATACTGTTAT TGCTTTATAT GATTTCCCAG GTACCCAACC TTCGCATTTG CCATTAAACC TTGGTGACAC CATCTATGTT TTGTCAAAAT CCGACACGGG CTGGTGGGAC GGTGTTCTTG TGGGCCAGAC AGGAGAATTG CAGAGAGGCT GGTTTCCTCA TCACTACGTG AGATCTGTAA ACTACGTGCA ACCCGTTTTA AACAAGTTGA AAAGCAATAA GGAGTTGGAC TCCATTACCG CAAGCAACAC TGCGGCCAAT GTTTTGATTC CATCTTTTAC CAATCTCTTA CAGAAGAACT TGATCGACTC GGAAAAGAAC ACTCCAGCCA ACAGCACGAG AAAGAACTCG GTGGTCAGTT TTGCCAGCTC TGAAACAAGC ATACCTTCAG ACTCCAAAGG CAGTCTGCTT CCTAAGCATC ATCTGGAGGC ATCTACTCAA TTTCTGCAAC ACCAACCGAC ATCGATTTCT CATACTTTAA CGTCAGCAGA CTTCAACAAC GATAATATAA TCTTCACAGA CGTTGAAGAA GCTGAGCAAA TGATTTTAGA ATACAAAGAG AAAAACAAGA AGAATTTAAC ATGGATTCCG AGAAATACTA CAAGTGGTGA CTTTGTTTTT TACTGTGAAC AATTGAATAT TTATTGTCAA ACAATGCCCA TGGTTCCTTT TGAGCTTACT GACGTGACTG CTGAAGTTGA AATGCCCAGC AAAGAAGCTT TAACAGATAA ATTTGATGTT CAAGTGTATG GCACACGGTC ATCTTCAGGC GACATTGGTT CTCGGGCCAA TTCTGCAGGT ACTTTTGATC CTTTGAAAAG AGACTCTAAT GCATCCTCGG TTACCCAAAG CTCGGGGGCT TCTTCGTATC ATCATTTCAC CAAGCCTTTC TTTTCTATGG ACAATCTTTT TTACAAACAT CCCTCGGATT TGACTCGTTG GAGTGAACTC AGAGAGCAGT ATAACTACCT CTTGGATTTA TCATCGAAAT CGCTCAAAGA CCATAATAAG CAACTATTTA GTACACACTT TTCGAGATTA AATAAGGTCA TGTCCATAAT ATTGGCCACT TCTAGATTAA ATCAAGATGA TTTCTTCGGC ACGAAGTACG AAAGATCAAT TAGGAGAAAA TTGAAGAGAA TCGCAGGGGC ATTTGCTCAG TTGTACATTA ACGGTATTCT TCATTTAAGT GTGATGCACC ATTCGCAATC TACATCAGAA GCTCAATTAT TTAGTTTCGA CATCTTCAAA TTAAATAAGT CCACAAGTAA AGCCGGAAGT GAAGCGTTTT CTGCTCACTC CTCCGTGTCT ACCATCCGTC AAAATTCTAA TGAGAGTACA GTGCCTGCTT CAACTGGAGT TAATCACGAC CAGCAAGAAA TGAGTTATCT TCTGCAATTA GAAAATGATA TTGAAACATT GAGATCCAAT GTCAATTCGT TGGTGAAGAT ATTTATGAAA TTATCTGAAG ACAAAGTCGT GAAAAGATCA GACTATGATT CGTCAGATGC ATCCGAAGAT GACGGTCAAG ACCGTTATGA CATTTTACCC CAAACATATC CCAGGTTTAT TGTTGATGAA TTCAATGGAG GAAATTGGTG CAACCCTTTC TTCACTACCA ACAACCCTGT CTTGAATGTC AGTGGGGATG ATTTGAAGAA CAGATACCAT TCGAAAGTTA TAATTGACCA CCTGACATAC GAATCTGTTT ATCAATTGCT GGAAGAAATG ATTAAATTGA GTAAGGAAAC ACTAGAGTAT CTTAAACCGC AAGCACAGAA GCTTTATTAC AATGATGTAT TGAAGAACGA AAGAAATACT CAGATTTTGA GATTGATATA CAAGTACTTA TATCATGGAA GCTCAATGGT AGATCTTATT GAGTCATTTG ATTTTACTGT CTTTTGTTTG ATCAATAGAA GCTCTTCTGG CGAGGAAGCA GACCCTGAAG GAGTGAGAAA TCAACTAAAT GATGAAAACA ACCAAAAGGA TGAATCAAGA TCAACAAATA CGTCGAATTT GACATTTGAT TACCCTGTTG TTCTTGACTT CTTTCAATTG AAGCAAGAAT TCCACAACTT AATTTTGAGG ATAATCATGT CAACACAGTC TTTGACCTTG GATGACCCGG AAGTATTTAA GGCCTTGCGA GATGAGGATC CTTTGTTCTA TAACAGAGAT ATTTTGAAAT TGCAAACAGA AAAGGCTGCT CTCTTGTTAA CTAATATATT GGCCGATCAA GTCAATCCTA ACAAGGGAGA TTCAATTTCC TTAAATCCAG ATACGCTTAT GTCTACTTAT TTAACTGAAG GAATCAAGTT CTTTGATGTG CTTTTGGGCA CTATTCAGCA ATTGATTGAT GAACGTGAAA CTATATTGAA TTACGCAACA AGGGTTATGC ATGATGACTT CAATGTTCAG TTGTTGGTGA TTGAAAGAAA CAACACGTTG TTATCAGAGA AGTCAGAGGA TCATTCCTAC TATAGTGGTG GTCACAAGAA GTCAAGTGAC TTACCATGGT ATTTGGAAGG AGATGAGGAA TTTGATTTAC TATTAGATTT GAAGGGTAAT ATCAAGGGAG GAACAAAGGA GGCTTTGATT GCCCACTTAA CTCATCATGA TTTATTTGAT AGTAATTTCA ATGCCGCATT TTTGTTGATG TTTGCATCAA TAATGCCACT TTCAGAATTT GTTGGATTGT TGATTCACCG TTTCAACTTG GAGGCTCCCG AGGGATTAAG TTACGAAGAA TATAACACTT GGATCACAAA GAAGCAAAAC CCAATCAGAT TGAGAGTCAT GAATATCATG AAACTTTTAG TTGAAAAACA TTGGTCAAAG TCATACTACA ATGAGAGTTT GTTGAAGAGA TGGATTGCAT TTGCTCAATC TCCTGCAGTA CAATCGTACT CGATTGGTAA GCAATTGACA GGTTACTTGA TTAGAATCTT GAATGGAGAA CTCATCTATA TTGAAAGGGA GCCAGTTATT CCGAACACTA AACCACCTGC TTCATTGACA AAGGGATCAT CCTTGAAGAA ACTCAAGTTA TTGGATATTG ACTATATTGA GTTGGCAAGG CAGTTGACGT TGAGAGAATT CAGATTGTAT TCAAAGATCA CTAAGTTCAC GTGTTTAGCA AAAGTTTGGG GTAAGAAGTC GGGATTAAAT GAACGTTTTG AAAACATCAC TGCATTCATC AAAGCTTCCA ATCAGTTGAC TAACTATGTG GCATATATGA TTTTAAGAAA GACCGATGCT AAAAAGAGAA TTCAGGTTAT CAGATACTTT GTTCAAGTAG CAGAAAAGTG CCGTCAATAT AACAACTTTT CGAGTATGAC AGCAATTATT TCTGCGTTGT ATTCTTCTTC TATTCATAGA TTGAAGAAAA CGTGGAAGTA TGTCAGTGCT GATACCTTGT CGCACTTGCA AAGTATGAAC AAATTAATGA ACTCTTCCAG AAATTTCAAT GAATATAGAG ACGTTTTGAA GTTTATTGGG TCTGAACCTT GTGTTCCATT TTTTGGTGTT TTCTTAACCG ATTTGACATT TGTGTACCAT GGAAATCCGG ATTATTTGAT GAATCGTACC AGAATGATCA ACTTTGCTAA GAGAGCCAAG ACATGTGAAA TCGTTACGGG TATCGACAGA TTTAAGACAA CAGGGTACAA CTTCCAGACG GTTCCTGAAA TACAGAAATA CTTGGATTCG TGGTTTGACA AATGTCCTAC CATCGAAGAA CAATATCAAT TGTCGCTTAA TTTGGAACCA AGAGAAGGAC AACCATCTCA ACATTCTCAA AGTACATCTT CGGCCGTTTC CGGAAGAGAT CTTCCGCTGT TTAGAAACTC GAAGATGTCG TCTACAATTA CGAATCTTGC ACTCAAGTAG AGATACTCGA GGTATTTACT TCTGATAATA TATAAGATTG TATATAATGA AATACAGTAG AGAATAGGTC A
|
Protein sequence | MSHYMDEIAP DVISPHDTKP TVDQPLKHLD TVIALYDFPG TQPSHLPLNL GDTIYVLSKS DTGWWDGVLV GQTGELQRGW FPHHYVRSVN YVQPVLNKLK SNKELDSITA SNTAANVLIP SFTNLLQKNL IDSEKNTPAN STRKNSVVSF ASSETSIPSD SKGSSLPKHH SEASTQFSQH QPTSISHTLT SADFNNDNII FTDVEEAEQM ILEYKEKNKK NLTWIPRNTT SGDFVFYCEQ LNIYCQTMPM VPFELTDVTA EVEMPSKEAL TDKFDVQVYG TRSSSGDIGS RANSAGTFDP LKRDSNASSV TQSSGASSYH HFTKPFFSMD NLFYKHPSDL TRWSELREQY NYLLDLSSKS LKDHNKQLFS THFSRLNKVM SIILATSRLN QDDFFGTKYE RSIRRKLKRI AGAFAQLYIN GILHLSVMHH SQSTSEAQLF SFDIFKLNKS TSKAGSEAFS AHSSVSTIRQ NSNESTVPAS TGVNHDQQEM SYLSQLENDI ETLRSNVNSL VKIFMKLSED KVVKRSDYDS SDASEDDGQD RYDILPQTYP RFIVDEFNGG NWCNPFFTTN NPVLNVSGDD LKNRYHSKVI IDHSTYESVY QLSEEMIKLS KETLEYLKPQ AQKLYYNDVL KNERNTQILR LIYKYLYHGS SMVDLIESFD FTVFCLINRS SSGEEADPEG VRNQLNDENN QKDESRSTNT SNLTFDYPVV LDFFQLKQEF HNLILRIIMS TQSLTLDDPE VFKALRDEDP LFYNRDILKL QTEKAALLLT NILADQVNPN KGDSISLNPD TLMSTYLTEG IKFFDVLLGT IQQLIDERET ILNYATRVMH DDFNVQLLVI ERNNTLLSEK SEDHSYYSGG HKKSSDLPWY LEGDEEFDLL LDLKGNIKGG TKEALIAHLT HHDLFDSNFN AAFLLMFASI MPLSEFVGLL IHRFNLEAPE GLSYEEYNTW ITKKQNPIRL RVMNIMKLLV EKHWSKSYYN ESLLKRWIAF AQSPAVQSYS IGKQLTGYLI RILNGELIYI EREPVIPNTK PPASLTKGSS LKKLKLLDID YIELARQLTL REFRLYSKIT KFTCLAKVWG KKSGLNERFE NITAFIKASN QLTNYVAYMI LRKTDAKKRI QVIRYFVQVA EKCRQYNNFS SMTAIISALY SSSIHRLKKT WKYVSADTLS HLQSMNKLMN SSRNFNEYRD VLKFIGSEPC VPFFGVFLTD LTFVYHGNPD YLMNRTRMIN FAKRAKTCEI VTGIDRFKTT GYNFQTVPEI QKYLDSWFDK CPTIEEQYQL SLNLEPREGQ PSQHSQSTSS AVSGRDLPSF RNSKMSSTIT NLALK
|
| |