Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_33787 |
Symbol | |
ID | 4840941 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | - |
Start bp | 489732 |
End bp | 491492 |
Gene Length | 1761 bp |
Protein Length | 586 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640392256 |
Product | predicted protein |
Protein accession | XP_001386684 |
Protein GI | 150866925 |
COG category | |
COG ID | |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.715519 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.121325 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCCTG ATCCAAATAC AAAATACGAC GATCAGCAAC TGAAAAGGCG CAAACTCGAA CAACAATCGT ACCCTGATGT ATCTAGAAAT ATTTCAATCT ACAGCATCTC CAGTAAGCAC CCTTTGAACG TAAGACCAGC AGGAAACTCA TATTTGACTC TGGAAGATGT TGCATTGAAA GAAGCAAAAC GGAACCTGTT GGGACATCTC AATATGTTCC CAGAAGAGTT ACTTATGGAA CTCTTGACAT ACATAGACGA TAAAGAGACC TTGCGGAACC TTTCTCATAC TTCCAGAATC CTATACGCAT ATCTTTATGA TGAAGAGATC TGGAAGAAGC TCTTTGTGAA AAGCATAGAA GACTCGACTC AGAATCTGCC TCAAAAGTGG AACGGTTCAT GGAGATGTAC CGTACTTGGA ATTGATAAGA AGCATCTGGC CAATATCATA CTACCAGATA ACCTTGTCTG CTCCGATATC TTATACAGAC CTTTCCAATG TTCACAGATC AACTATGAAA AGTTATTCCG TAAAATCATA CAGGAAGAAG AAACCTACCA TCTCGATGCC TTGTCAGATA ACCTCAAGCA ATTACCACCT GGCCGAATTC AGAGAATACC AGAATCAGAA TTGTCTCTCG AGCAATTCAA TACAGAATAT CATGATGTGC CCTTCATATT AACCAATAAA GACAAGACCA GGTGGCCACG CTGGGATTTT CCAACTTTGT TAAGTCGGTT TCCGAATGTA AAATTCCGTC AAGAGGCCGT TCAGTGGGAT TTGGCACTTT ATTCTGAGTA TTTGAAGTCT AACCTTGATG AAAACCCATT GTACTTATTC GATTGTAGCA GTGAAGCTAT GACTACTTTA CGTAAGGAGT ATGACTCTCC TCTGATATTC AAAGAAGACT TGTTTACTCT TTTTAACTTG AATAATGGAC AACTGAACTG CCGTCCAGAC CATGCTTGGT TGATAGTAGG ACCAGAAAGA TCTGGTTCTA CCTTCCACAA GGATCCCAAT TATACATCTG CATGGAATGC AGCTTTGAAG GGCAGAAAGC TTTGGGTGAT GTTACCTCCT GGAATCACTC CACCTGGTGT AGGCACTGAT GAAGAAGAAA GCGAAGTGAC TTCACCTGTA GGAATTGCTG AATGGGTTAT CTCAGGTTTC TTTAACGATT CGTTGAAGAT CAAGGAATGC TTAGTGGGAA TCACATTCCC AGGTGAATGT ATGTACGTTC CATCAGGTTG GTGGCATTCG GTTATAAACT TGGACGACTC GGTTGCGTTG ACTCAGAACT TTGTACCGTT TTCCAAATTG ACCAATGCCA TGAACTTCTT GAAAAATAGA AGGGACCAAA TCAGTGGGTT CCGCCCCTAT CCAGTCAAAG AATCAATTGA CTATGCGGTA GAGACGCTTC TTAAAGGAAA GAATAACGAG GATATAGAGA AGATGAGGGA GTACAGTGAA AAATTCAATT CCTTGAACTT GGGAGAGAAG TTAATTAATG AAGACTGCGG TGAAATCAGT GAACTACCAC CCATGCCTGT TTACGAGCTT TTCAAGCAGT TGTTGATACT TAACGGAAAA GAAGATGAGT TGGCTACAGC TTTGGAAGAG TTGAAGAAGC TAGAATCGAG AAACAGAGCA AAAACTTCAG GTAGGAGTGA AGCATGGGAG AAATTGACTA CTCCGGCACT TGAAGAGCAA CAGGGATTCA GTTTCGGGTT CAACCTCGAT GAAAGCAGCG ATGAGGAATG A
|
Protein sequence | MSPDPNTKYD DQQSKRRKLE QQSYPDVSRN ISIYSISSKH PLNVRPAGNS YLTSEDVALK EAKRNSLGHL NMFPEELLME LLTYIDDKET LRNLSHTSRI LYAYLYDEEI WKKLFVKSIE DSTQNSPQKW NGSWRCTVLG IDKKHSANII LPDNLVCSDI LYRPFQCSQI NYEKLFRKII QEEETYHLDA LSDNLKQLPP GRIQRIPESE LSLEQFNTEY HDVPFILTNK DKTRWPRWDF PTLLSRFPNV KFRQEAVQWD LALYSEYLKS NLDENPLYLF DCSSEAMTTL RKEYDSPSIF KEDLFTLFNL NNGQSNCRPD HAWLIVGPER SGSTFHKDPN YTSAWNAALK GRKLWVMLPP GITPPGVGTD EEESEVTSPV GIAEWVISGF FNDSLKIKEC LVGITFPGEC MYVPSGWWHS VINLDDSVAL TQNFVPFSKL TNAMNFLKNR RDQISGFRPY PVKESIDYAV ETLLKGKNNE DIEKMREYSE KFNSLNLGEK LINEDCGEIS ELPPMPVYEL FKQLLILNGK EDELATALEE LKKLESRNRA KTSGRSEAWE KLTTPALEEQ QGFSFGFNLD ESSDEE
|
| |