Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_79644 |
Symbol | |
ID | 4840649 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | - |
Start bp | 346758 |
End bp | 349748 |
Gene Length | 2991 bp |
Protein Length | 772 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640391964 |
Product | predicted protein |
Protein accession | XP_001386269 |
Protein GI | 150866614 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.241607 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.527548 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTCATAGTGC ACTCTGTGCT CAACTCTCGC TTCGCGTCCC ATAGCCTCAA AAGACAAGAA AAAGTCTCGA CCACAAAATC AAGCCAAACA AATCAAGCCA ACCAAATCTT CAACTTACAA AACCATCCCA CGCATTGATT AGAGAAGGAG CTATCCGCGT GTCAGTATCT ATCTTAGACC GTTATCTATC TTAGACCGTT ATCATAGACC TCATCCTGGC CAACTTGCAT CTCGTAGACC CGCTTGTCCC ACGATTGTAT CTTGCCATAG ATCTCCGTAT CTTCACAGCT GATATTCTCT TCTAGAAATA ATTCCATTTC ACCATCTGAC CTGTGGTAGC TGCTATAAAA AGCGCCTTAT TATTGCCTCG CACTCCCCAG CCATTTTTTC CGCGACTGAG ATCATCTGAA CAGATCATCT ACAGATATCT TTTTTCCACA CATTAGTATA CACTCGTATT TCCATTTTCC TCTACTTATT CCCACTACTT TTCTTTCTTC CAATCTGCTG AATCTTCTAC TGCTAGTACT CCGTCTACAG TTCTTCCAAT TTTCACAATG CGGTTGAACC TCATATACAT CTGGATTTTC AGCTTGCTCC AGGCAGCTTT GGCTCAGTCG GTTCTTAAGA CCTCGTCCTT GTTGACATGT ATGGACAACT CGAAGTTCAC TGCCTCCTTC TTCGATGTCA GATACTTTCC CCACAACACA AGTGTCTACT TTCAAGTGAA TGCGATCTCG TCACTCGACA CAAATGTGAC TGCTCAAATC ACGTTGATTG CCTACGGACT CAATATTCTT TCGCGTAACG TCTCGCTCTG TAATCTTAAC TACCCCGAGA TCTGCCCCTT GACTTCCGGC CACTTGGATT TGACATCCCT GTACAATGTG TCGAGAAGCA TCACTGACCA GATCCCTGGA ATTGCCTTCA CAATCCCGGA CCTCGACGCC AGAGTCCGTG TCACAATCAC AGAAGATGGT AAGTCAGACC AGTTAGCTTG TGTTGAAGCA GTATTGACCA ACGGGAAAAC TGTCCAGACA AAATACGCTG CCTGGCCCAT TGCTGCCATT GCTGGTCTCG GTGTTATTAC ATCGGGAGTC GTCTCAGTGA TTGGCCACTC CAACACAGCA GCACACATTG CCTCGAACTC GATGTCGCTT TTCGTTTATT TCCAGTCTTT GGCCATTACA GCCATGATGG CCGTGGCTAA AGTGCCTCCT ATTGCCGCAG CCTGGGCCCA GAACTTCCAG TGGTCGCTCG GTATCGTCAG GGTCGGATTT GTCCAAAACA TTGCAAATTG GTACCTCCAG GCTACTGGGG GTACCCCTAC GGATATCTTG GGATCGCAGT ATCTCTCTAT TTCCGTCCAA AAGAAGCTAA AGAAAAGAGC CTACGAATTG TTCGAGTCCT TCTACAAGCC TCAAGAAAGT GTGGTTTCAG GCTTATCCAA GCGTGCCTCG ATCACGCTTG ATTCCGACGA CTTTGGCTAC AGCGACTCTC TCAACTCCAC GTTGTACTCT TTAAACGAAA AGGACAAAGA CTTGTCTTCC AAGATCTTAG TTCTTCGTGG TATCCAGAGA GTTGCTTTCT TGACCAGAAT AGAAATCACT GACCTCTTCA TGACCGGTAT CATTTTCCTC TTGTTCTTTG CTTTTGTCAT GGTGGTGTGT CTAATGTTGT TCAAGGCAAT CATCGAAATC TTGATCAGAG CCAAGTTGAT GAATGAAGGA AAGTTCAACG AATACAGACT GCAGTGGTCT CTTGTCATCA AAGGTACCCT CTACAGATTG TTTGTATTGG CTCTTCCGCA AATCGCCGTG TTATGCCTCT GGGAATTGAC TACGAGAGAC TCAGTGGGTA CTACCGTGAT TGCTGTCTTC TTGTTTGTCT TATCCGTAGT TTTATTGTTT CAAGCAGCCA TCAGAGTATT CATGTTTGGT AGAAAGTCTG TGCTGCAATA CAAGAACCCA GCCTATTTGT TGTATGGCGA CGGTGCCTTT TTGAACAAAT TCGGTTTCCT CTACGTTCAA TTCAGAGCTG ATTGCTACTA CTTTATTCTC GTCAGTTTGG TATACATGCT TGCTAAGTCA TTGTTTGTGG CAGTCTTACA AACCCACGGA AAAGTACAAT CTGTCATTGT CTTCGTCATT GAATTGGCCT ACTGTGTACT TGTTAGTTGG ATTAGACCAT TTATGGACAA GAGAACTAAT GCGTTCAATA TCACCATTGC TGTCATCAGC ACTCTCAATG CCCTTTTCTT CATGTTCTTC TCCTTCGTCT TTAGGCAACC GCATGTTGTG GCTTCGGTCA TGGGTGTCGT TTACTTTGTC ATTAATGCCG TATTTGCCTT GTTCTGTTTA ATCTTCACCG TTGTCACCTG TGTTTTGGCC TTACTTTATA AGAACCCTGA TGCGAGATAC CAACCAATGA AGGATGACAG AGTCTCGTTC CTTCCTAGAT TTGACAACCC GAAGCAAGCA CAAAACGGTG AAGAAGATTT GGAGTTGATG GCCTTGGGTG CTACTGCCAG AAAGGGTCAC GAACACGGAG GCAAACCTGC CAACTTGTAC GATGAAGATG AATCGATGTA TGAAGAAGAT TCCATGTTTC CTAACAAGGA TTCCAGAAAC GAGCTGAACT CCAACTCCAA CTTCAATTTC TCCCACGATG CCAATGACTC TAAGCATGAT TCTTACTTAG AGACAATGGA ACCTACCCAA CCCGGTTCCA CGATTGTTGG TAATCCTGGT GCCATAACAG GGTATCATAA TAGTGCATAT GTTGGTGGCT CAAGCAGAGG TCCACCTGTC AATCCATATT CACAATCGAC ATCTTACAAC ACAAGTCAAA GTGGCAGTCG TGTCAACTTC ATATGATAGA AGTAAAGATC AGAGGACTAC TTGTATCAAT TTATCTTTTC AATTCTTATC TTATTCTTAT TTATGTATTT TAGCTTAACG ATATATTACC AATTCAATCA GTTTCGAATC G
|
Protein sequence | MRLNLIYIWI FSLLQAALAQ SVLKTSSLLT CMDNSKFTAS FFDVRYFPHN TSVYFQVNAI SSLDTNVTAQ ITLIAYGLNI LSRNVSLCNL NYPEICPLTS GHLDLTSSYN VSRSITDQIP GIAFTIPDLD ARVRVTITED GKSDQLACVE AVLTNGKTVQ TKYAAWPIAA IAGLGVITSG VVSVIGHSNT AAHIASNSMS LFVYFQSLAI TAMMAVAKVP PIAAAWAQNF QWSLGIVRVG FVQNIANWYL QATGGTPTDI LGSQYLSISV QKKLKKRAYE LFESFYKPQE SVVSGLSKRA SITLDSDDFG YSDSLNSTLY SLNEKDKDLS SKILVLRGIQ RVAFLTRIEI TDLFMTGIIF LLFFAFVMVV CLMLFKAIIE ILIRAKLMNE GKFNEYRSQW SLVIKGTLYR LFVLALPQIA VLCLWELTTR DSVGTTVIAV FLFVLSVVLL FQAAIRVFMF GRKSVSQYKN PAYLLYGDGA FLNKFGFLYV QFRADCYYFI LVSLVYMLAK SLFVAVLQTH GKVQSVIVFV IELAYCVLVS WIRPFMDKRT NAFNITIAVI STLNALFFMF FSFVFRQPHV VASVMGVVYF VINAVFALFC LIFTVVTCVL ALLYKNPDAR YQPMKDDRVS FLPRFDNPKQ AQNGEEDLEL MALGATARKG HEHGGKPANL YDEDESMYEE DSMFPNKDSR NESNSNSNFN FSHDANDSKH DSYLETMEPT QPGSTIVGNP GAITGYHNSA YVGGSSRGPP VNPYSQSTSY NTSQSGSRVN FI
|
| |