Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_53071 |
Symbol | |
ID | 4851594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 2245305 |
End bp | 2248112 |
Gene Length | 2808 bp |
Protein Length | 935 aa |
Translation table | |
GC content | 41% |
IMG OID | 640393302 |
Product | predicted protein |
Protein accession | XP_001386779 |
Protein GI | 126274971 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0581449 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTCCG ATTTTAGGTT CTCTAATCTC CTCGGAACAG TTTACAGACA GGGTAATCTT GTTTTCACCG ACGATGGAAC AAAGCTCTTG TCCCCAGTAG GAAATAGAAT GTCTTGTTTT GATCTCATTC ATAATAGATC CTTCACTTTC AACTACGAGC ACCGTAAAAA CATAGCGAGA ATCGCCTTAA ACAAGCAAGG AACTCTTTTA TTATCGGTGG ATGAAGATGG CCGTGCCATT CTAGTCAATT TTATCCTGAG GACGGTATTG CATCACTTCA ACTTCAAGGA AAGAGTGCTT GATTTGCAGT TCAGCCCAGA TGGTCATTAC TTCGCCATTG CTTGTAATAG ATTTATTCAA GTATGGAAAA CGCCTGATTT TACCGAAGAC AGACAGTTTG CTCCATTTGT CCGCCATCGT ATCTATGCTG GCCATTACAG TGATGTCACT TCTGTTACTT GGTCCAGCGA CTCTCGTTTT TTCATCAGTA CTTCAAAGGA TATGACTTCC AGAATTTTTT CGCTACAGTC TCTGGAAAAC GATGTGGCAA TGACATTGGC CGGTCACAGA GACTACGTAG TAAAGGCATT CTTCAACAAG AGTCAGGAAA TCATATATAC AATAAGTAAG GATGGTGCTT TGTTCACTTG GGAATACACA GAAAAACCGG GTGACGACGG AAGTGAGGAA GATAGTGAAG ATGAAGAACA AGAGGACACA CAGACAAACA ATAAGCCAAT GAGCTGGAGA ATTACGGCTA AGAACTTCTT TCATTCTGAC GCCAAAGTCA AATGTGCAAC CTTCCACGCT GACTCTAACT TGTTAGTGGT TGGTTTCTAC AACGGAGAGT TTAGATTATA CGAATTGCCT GATTTTAATC TTGTACAGCA ATTGAGCATG GGTCAGAATG CCGTTGATAC AGTCAATATC AACAAGAGTG GTGAATGGCT TTCATTTGGT TCGGCTAAAT TGGGTCAATT GTTAGTATAT GAATGGCAAT CCGAATCCTA CATTTTGAAG CAACAGGGCC ATTTTGATTC AATGAATGCA CTTTGCTATT CTCCTGATGG ATCCAGAGTC GTGACCGCTT CAGATGATGG TAAGATTAAG ATTTGGGATG TGATATCTGG TTTCTGTCTT ATGACATTTA CAGAACACAC ATCATCAGTC ACCGATGTCA AGTTTGCAAA GAAGGGACAA GTTTTATTCT CTTCGTCTTT GGATGGTACC GTTAGAGCTT GGGACTTGAT CAGATTCAGA AATTTCAGAA CCTTCACCGC CACGGAACGT ATTCAATTCA ACTCTTTGGC TGTTGATCCT AGCGGTGAAG TGGTCGTAGC TGGATCTCAA GACACTTTTG AAATCTATGT TTGGTCTGTT CAAACCGCTC AATTGCTTGA ATCTTTGACT GGCCATGAGG GTCCAATTTC TTGTTTGACT TTCGGTCAAG AAAACTCGGT TTTGGCATCA GCTTCCTGGG ATAAGACTAT TCGTATTTGG AACATTTTTG GTCGTAGTCA GCAAGTTGAA CCAATTGAAA TTGAGAGCGA TGTTCTTTGT TTGGCAATGA GACCAGATTG CAAAGAATTG TCTGTTTCCA CTTTGGACGG TCACATTGCT GTTTACGATG TAGAAGATGC GAACCAGTTG CATTTAATAG ATGGGAAGAG AGATATCATT GGTGGTAGAT ATTTGGAAGA TAGATTCACT GCCAAGAATG CTGCTCGATC TAAATATTTC ACAACTATCA ACTATTCCTT TGATGGATTG ACCTTGGTTG CTGGTGGTAA TAACAACTCA ATTTGCATGT ACGATGTCAG TAATGAGGTG TTGTTGAAGA GATTTACCGT TTCGCAGAAC ATGACTCTTA ATGGTACTCA GCAAGTCTTG AACAGTAGCA AAATTACAGA TGGTGGAATA TCCCTCGACT TAATTGATAG GGCCGGTGAA AACTCCGATG TTGAAGATAG ATTGGATAAT TCTTTGCCAG GGTCTCATCG AGGAGACCCA AGTGTCAGAT CATACCGTTC CGAAATTCGC GTGAACGCCA TTCAGTTTTC ACCGACTTCT TCTGCTTTCG CAGCTGCATC TACTGAAGGA TTGCTAATCT ATTCTATTGA TAGCACGGTG ATTTTTGATC CATTCGATTT AGATGTGGAC ATTACACCAG AGGCTACACT AGAGACTTTG GCTGATCAGG ATTTCCTCAC GTCTCTTGTT ATGGCCTTCA GATTGAACGA AGAGTACTTG ATACAGAAAG TTTATGAAGC TATCCCATTA CAAGATATCA AGTTGGTTTC TAGCGATTTG CCAATTGTCT ATGTTAATAG AATGCTTGCT TTCATTGGAA ACACATTAAT AAAATACGAG AATCAGCACT TTGAGTACAA TTTACTTTGG ATCAGGCTGA TCTTGTCTGC TCACGGAAAA TATATAAATG CACACAAGCA CGAATTTGTT AGCAGCTTGA AGTTGATTCA AAGATTCTTG GCTAAGGTTG GTAAGGAAGT GGTAGCAGTT GGAAAAAAGA ATGGCTACTT TTTGGAATAT TTACAAGTTT CGAAGGACTT GAACAATCAA GTACAAGAGG AGGAGGGCGA AAGAATAGAT GAAGACGAAG AAAATGAACT GGATAATGAA GCCATAATTG AAGATGAAGA CAATGAAGAA GAATCTGAAT GGTTTGGCCC TGGTGGAGAA AAATCTACTA AGAGAATTGA GTATGGTTCT GAACTGGGCG ATAGTTCCGC TGAGGAAGAG GAAGGCGACG ATGAAGATGA TGACTCTGAT GACAACATCG AGGTCTAA
|
Protein sequence | MKSDFRFSNL LGTVYRQGNL VFTDDGTKLL SPVGNRMSCF DLIHNRSFTF NYEHRKNIAR IALNKQGTLL LSVDEDGRAI LVNFILRTVL HHFNFKERVL DLQFSPDGHY FAIACNRFIQ VWKTPDFTED RQFAPFVRHR IYAGHYSDVT SVTWSSDSRF FISTSKDMTS RIFSLQSLEN DVAMTLAGHR DYVVKAFFNK SQEIIYTISK DGALFTWEYT EKPGDDGSEE DSEDEEQEDT QTNNKPMSWR ITAKNFFHSD AKVKCATFHA DSNLLVVGFY NGEFRLYELP DFNLVQQLSM GQNAVDTVNI NKSGEWLSFG SAKLGQLLVY EWQSESYILK QQGHFDSMNA LCYSPDGSRV VTASDDGKIK IWDVISGFCL MTFTEHTSSV TDVKFAKKGQ VLFSSSLDGT VRAWDLIRFR NFRTFTATER IQFNSLAVDP SGEVVVAGSQ DTFEIYVWSV QTAQLLESLT GHEGPISCLT FGQENSVLAS ASWDKTIRIW NIFGRSQQVE PIEIESDVLC LAMRPDCKEL SVSTLDGHIA VYDVEDANQL HLIDGKRDII GGRYLEDRFT AKNAARSKYF TTINYSFDGL TLVAGGNNNS ICMYDVSNEV LLKRFTVSQN MTLNGTQQVL NSSKITDGGI SLDLIDRAGE NSDVEDRLDN SLPGSHRGDP SVRSYRSEIR VNAIQFSPTS SAFAAASTEG LLIYSIDSTV IFDPFDLDVD ITPEATLETL ADQDFLTSLV MAFRLNEEYL IQKVYEAIPL QDIKLVSSDL PIVYVNRMLA FIGNTLIKYE NQHFEYNLLW IRLILSAHGK YINAHKHEFV SSLKLIQRFL AKVGKEVVAV GKKNGYFLEY LQVSKDLNNQ VQEEEGERID EDEENELDNE AIIEDEDNEE ESEWFGPGGE KSTKRIEYGS ELGDSSAEEE EGDDEDDDSD DNIEV
|
| |