Gene Information |
Locus tag | PICST_60945 |
Symbol | |
ID | 4839247 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 1039445 |
End bp | 1040695 |
Gene Length | 1251 bp |
Protein Length | 399 aa |
Translation table | 12 |
GC content | 46% |
IMG OID | 640390562 |
Product | predicted protein |
Protein accession | XP_001385209 |
Protein GI | 150865830 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
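The lengths in this table can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive, which the record does not state explicitly, and that the protein length excludes the stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1039445
end_bp = 1040695
protein_length_aa = 399

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Coding length implied by the protein: one codon per residue plus one stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that the coding sequence does not account for.
non_coding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, non_coding_bp)  # 1251 1200 51
```

Under these assumptions the coordinates reproduce the listed 1251 bp, and the 51 bp remainder is consistent with the gene being marked as spliced. The record itself gives no exon coordinates.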
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0140768 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTCTC CCCACGCAAA CCCTAACGGC ATCGAAACCA AAATCATTGC AGCACGTAGC GAAGCCCAAG CGTTGTACAA CGAGTTGGAG AAAGTGAAAA ATAGGCTCCA CGACTCTACG TTGGACGAGA TATCGTATTC CATCCCAGGC ATTCCCAAGA ACTTCAACAA TTTGAAGCTC TACAATACGC TTCGGGGCCA TAATAACAAG ATCGCCAAAA CGCAATGGAG CTCCGACTCA AGCAAGCTCC TTTCAGCTAG TCAGGACGGC TATATGATTC TCTGGGATGC CGTTACTGGT TTCAAGAAAC AGGCTATCAA TCTTGAGAAC CAATGGGTTC TCACATGTAG CTACTCGTCA GACGGAAAGC TAGCAGCGTC CGCAGGACTC GACAATGCCT GTACTATCTA CAAGGTAAAA CAGGATGGCG ATTTCCGCTT TGGGGGCACC AGAGGTGAAG CCAGAAAAGG GTCTGCTACT GGAAATGACC TTGACATTTT GCCAGTTCAG TCTGTGTTCA AGGGCCACAC AGCGTATGTA TCAGACTGCG GGTTCATCAC CAACACCACT ATAATTACAG CCAGTGGTGA CATGACTTGT TCGCTATGGG ATATAACCAA AGGAGTCAAG TCGCGAGATT TTGTAGAACA CTTGGGCGAC GTTCTCTGTA TGAGTATCTT TCCCTCCAAT AAGCTCAATG ACAACCTCTT TGTTTCTGGT TCTTCTGACG GTAGTGCAAA GATTTGGGAT TTACGAAGTC CTACGCCTGC TCTGAGTTTT TTTGTCTCCA ATAGCGACAT CAACACTGTT CTGATCTTTC CTAATGGAAA CTCGTTTGCA ACAGGTTCAG ATGATGGACT AATTCGACTC TTTGATATTA GAGCAGATTG CGAATTGAGC AACTATTCTC TCTTATCTCA GTTCCAGAAA CAAAACCACA AGATCCCCAA AGCCAAGATC CCTAGTAGGA GACACAGCAC CACTGACCAG GTGAGCACAG GGTCCATCAG CATCTACTCT AGCATAGATA ATCCGGGAGT TTTTTCCCTT GATTTCAGCA ATAGTGGAAG ACTACTCTAC GCATGCTACT CAGAATTTGG CTGCTTAGTA TGGGATGTCT TGAAGAATGA GATCGTAGGC TCCGTAGGAA ACGATCATGT CAACAAGATC AACCATATAA GCGTATCTCC TGACGGAACG GCCGTTGCCA CGTCGTCATG GGACTCCACG ATCAAAATCT GGTCCGTGTG A
|
Protein sequence | MTSPHANPNG IETKIIAARS EAQALYNELE KVKNRLHDST LDEISYSIPG IPKNFNNLKL YNTLRGHNNK IAKTQWSSDS SKLLSASQDG YMILWDAVTG FKKQAINLEN QWVLTCSYSS DGKLAASAGL DNACTIYKVK QDGDFRFGGT RVQSVFKGHT AYVSDCGFIT NTTIITASGD MTCSLWDITK GVKSRDFVEH LGDVLCMSIF PSNKLNDNLF VSGSSDGSAK IWDLRSPTPA SSFFVSNSDI NTVSIFPNGN SFATGSDDGL IRLFDIRADC ELSNYSLLSQ FQKQNHKIPK AKIPSRRHST TDQVSTGSIS IYSSIDNPGV FSLDFSNSGR LLYACYSEFG CLVWDVLKNE IVGSVGNDHV NKINHISVSP DGTAVATSSW DSTIKIWSV
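The record lists translation table 12, the alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The sketch below is a minimal illustration of that difference and not the database's own pipeline. It builds the standard codon table from the usual TCAG-ordered amino acid string, overrides CTG, and translates a short in-frame fragment taken from the Gene sequence row. The function and constant names are hypothetical.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_codon_table(aas):
    """Map each codon to its amino acid using TCAG ordering."""
    return {
        a + b + c: aas[16 * i + 4 * j + k]
        for i, a in enumerate(BASES)
        for j, b in enumerate(BASES)
        for k, c in enumerate(BASES)
    }


STANDARD_TABLE = build_codon_table(STANDARD_AAS)
# Translation table 12 differs from the standard code at CTG (Ser instead of Leu).
TABLE_12 = dict(STANDARD_TABLE, CTG="S")


def translate(dna, table=TABLE_12):
    """Translate an in-frame DNA string; spaces are ignored, stops appear as '*'."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# In-frame fragment from the Gene sequence row; it contains one CTG codon.
fragment = "CCTACGCCTGCTCTGAGTTTTTTTGTCTCC"
print(translate(fragment))                  # PTPASSFFVS
print(translate(fragment, STANDARD_TABLE))  # PTPALSFFVS
```

The table 12 result matches the corresponding stretch of the Protein sequence row (PTPASSFFVS). Because the gene is marked as spliced, translating the full Gene sequence row straight through is not expected to reproduce the protein. The non-coding bases would have to be removed first, and the record does not give their coordinates.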