Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_39580 |
Symbol | |
ID | 4851704 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 2595600 |
End bp | 2596730 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | |
GC content | 47% |
IMG OID | 640393412 |
Product | predicted protein |
Protein accession | XP_001387070 |
Protein GI | 126275321 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTCCGT TCAGAACAAG AGGTTACAAC GGCTATGGAG TACAATATTC GCCATACTTC GATAACAAGC TTGCTGTGGC TACAGCTGCT AATTATGGAT TGGTCGGTAA TGGACGGCTC TTTATATTGA ATATAGAGCC CAACGGCACC ATTGTAGAAC AGACATCGTG GGAAACCCAG GACGGACTCT TCGATCTTGC GTGGAGTGAG GTTCACGAAA ATCAGGTAAC AGCAGCCAGC GGCGATGGGT CCATAAAATT GTTTGACTTG ACGGTGGGAC AATTCCCTGT CATGAACTGG AAGGAGCATA CGAGAGAAGT TTTCTCTGTC AACTGGAACT TGGTGGATAA AACTAACTTC ATCTCTGCAA GCTGGGACGG ATCTATGAAA GTGTGGTCAC CACAGCGTCC AGATTCGCTT TTGACCTTGA GCCATGCACA GGACTTCACC ACCAAATCTC TGCCTGTAGA GCTGACTGCC AGACCACCTT TATCGCATCA ACAACAACAT CAACAGCTGC AACATGTGAA CACAGCTAAT TGCATCTATA ACGCTACCTT TTCTCCGCAT TCACCATCAA CTGTAGTTAG TGTAAATGGA TCTTCCCACG TTCAGATATG GGATATAAGA GCACCCAGAC CCTTACAAAT AGATTACGTT GCCCACGGGG GTCTTGAAGC CCTTTCGTGT GATTGGAACA AGTACAAGCC CACGATTATA GCATCAGCTG GTACTGATAA ATCAGTGAGA ATATGGGACT TAAGGATGAT CACCAAAATC GACCAACCAC ACGCCCATGC TCCTATGCCT GCGTACCACA TCAGAGGTCC TACTCCCTTG AACGAACTTC TTGGTCATCA GTTTGCTGTT AGAAAAGTAC AATGGTCTCC TCACGATGGC CAGGAATTGA TCAGTACTTC CTACGATATG TCCGTGCGAG TTTGGAGAGA TGAGTCTAAC GAGAGAGCCA GATTCTTGAA CATGAAAAAT GGAGGCTGCA AGGGTGTTAT GGGGCAGCAC AAAGAGTTTG TCATTGGTTG TGACTACAGT TTGTGGGGAG AACCAGGTTG GGTGGCGTCC ACAGGCTGGG ACGAAATGGT GTATGTTTGG GACAGCAAGA GGTTACAGTA G
|
Protein sequence | MLPFRTRGYN GYGVQYSPYF DNKLAVATAA NYGLVGNGRL FILNIEPNGT IVEQTSWETQ DGLFDLAWSE VHENQVTAAS GDGSIKLFDL TVGQFPVMNW KEHTREVFSV NWNLVDKTNF ISASWDGSMK VWSPQRPDSL LTLSHAQDFT TKSLPVELTA RPPLSHQQQH QQLQHVNTAN CIYNATFSPH SPSTVVSVNG SSHVQIWDIR APRPLQIDYV AHGGLEALSC DWNKYKPTII ASAGTDKSVR IWDLRMITKI DQPHAHAPMP AYHIRGPTPL NELLGHQFAV RKVQWSPHDG QELISTSYDM SVRVWRDESN ERARFLNMKN GGCKGVMGQH KEFVIGCDYS LWGEPGWVAS TGWDEMVYVW DSKRLQ
|
| |