Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_31740 |
Symbol | |
ID | 4838701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 1566321 |
End bp | 1567697 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 12 |
GC content | 47% |
IMG OID | 640390016 |
Product | predicted protein |
Protein accession | XP_001384255 |
Protein GI | 150865153 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGCCC TACTCTTGGA TAGAGCCTTT GGTAAGGTAG CACCGCTTGA GTTTGCTACG ACCGTAACGG AGTCGTACTA TTCTTTACTC CAACAGAACG CACTTACACG CGTGTTTCCT CCCAATTGTC ATACAAATGC AGCTATAAAC AGTCTCTCGT TAGAAACGGA AGACTACCAG TACCTTTTGA GCGGATGTGC TGATTCTTCC ATCAAGCTCT GGGACCTAAA TTCACAACTG GAGATAGACA ATGGTTCCAG CACCATCCAC CAGGATTTAA ACAAACAGCA TTCGGACTAC GATATTTACG ACTACGATCA TCCGGTCCAG ACTTTTACCA ACATTGCTAC AGTTCCCCGG AAGTCTGCCC ATACATTTGG TATTTCTGCC ATTCAGTGGT GGCCGTACGA TACAGGGATG TTTGTTCTGG CCAGTTTTGA TCACACTGTG AAAATATGGG ATACCAATGA ACTCACACCG GTACACTCTT TCGATGTTAC CAATCGGGTA TATGCCATCG ACCTCTCGGG AAGCGAGTCA CCGAATGGCT TTTCTTCCTC GGCTTTGGTA GCTGTAGGCA GTGACCAACC ATTCATTCGG CTCTTGGACT TGCGATCTAC TTCAAGTGCC CATACGCTCA CAGGTCACAA GGGGAAGACG TTGGCTGTCA AATGGCATCC GCTCAATCCT AACTTACTTC TGTCTGGAGG ATTTGACGGT GAAGTCAAGA TTTGGGATAT CAGGCGAAGC AAGAGTTGCC TTTGCCGCTT GGATATGCTC CGTACCAACA ATCAAGCAGA CAGTGCAGAT AATCTTGCTA AAGCCTCGGT CAAAGCCCAT CTGGGTCCTG TCAATGGTCT CGTCTGGAAT GAACAGGGTA CAGAGCTATA TACTGCTGGT AACGACGACA AGGTGCGAGT CTGGGACATG ATTTCCTCTT TGGCTCCACC TATCAATAAA TTGGTCAACT TTGGGCCATT GACACGAAAC AAGTATCCCC AGACTATCCC CATTATGCTT AACCCCAGCT ATGAGACCGA GTTGCAGTAT TTATTATTTC CCTCTGATAA TAGCGACTTG TTTGTATTCA GAACTGTTGA CGGCAAGATG GTTTCGCGAT TATCTAGAAA AGGCACCAAG AACAGCGGTA GGACATGTTC TATGGTTAAT GCAGGGCCAT TTACAGGGAA GTATTATTGT GGGACAATTG ATGGAGAAAT CATCGCCTGG TCGCCGCATT GGGAACAGCC CAATATTGAG GATTTAGTCG AGGACACGAA CGAGGTGGAT GTTCAAGATG TCTTATCCAA GCGAAAGTTG GCTGAAGAAG CTCGACGCAA CCTTGAGGAC GATCCCTACT TTAATGGCGA ACCGTAG
|
Protein sequence | MQALLLDRAF GKVAPLEFAT TVTESYYSLL QQNALTRVFP PNCHTNAAIN SLSLETEDYQ YLLSGCADSS IKLWDLNSQS EIDNGSSTIH QDLNKQHSDY DIYDYDHPVQ TFTNIATVPR KSAHTFGISA IQWWPYDTGM FVSASFDHTV KIWDTNELTP VHSFDVTNRV YAIDLSGSES PNGFSSSALV AVGSDQPFIR LLDLRSTSSA HTLTGHKGKT LAVKWHPLNP NLLSSGGFDG EVKIWDIRRS KSCLCRLDML RTNNQADSAD NLAKASVKAH SGPVNGLVWN EQGTELYTAG NDDKVRVWDM ISSLAPPINK LVNFGPLTRN KYPQTIPIML NPSYETELQY LLFPSDNSDL FVFRTVDGKM VSRLSRKGTK NSGRTCSMVN AGPFTGKYYC GTIDGEIIAW SPHWEQPNIE DLVEDTNEVD VQDVLSKRKL AEEARRNLED DPYFNGEP
|
| |