Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_77338 |
Symbol | |
ID | 4838961 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 549461 |
End bp | 552413 |
Gene Length | 2953 bp |
Protein Length | 922 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640390276 |
Product | predicted protein |
Protein accession | XP_001384402 |
Protein GI | 150865259 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.98376 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CACGCATCGT AGTTACCTTC ATATCACCAT GAAATTGGAC GTGGTCAAAC AGTTCTCCAC GCGTTGTGAC CGTGTCAAGG GAATCGACTT CCATCCCTCT GAGCCGTGGA TCCTCACCAC CTTGTACAAT GGTAAGATCG AGATTTGGTC CTATGCCACA AACACGCTTG TAAAGTCGAT CCAGGTAACG GAGATGCCCG TCCGGACGGG AAAGTTCATC GCTCGTAAGA ATTGGATTGT TGTCGGCTCC GACGACTTCC AGATCCGGGT CTATAACTAC AACACGGGTG AAAAAATCAC CCAGTTCGAG GCCCATCCAG ACTATATCAG GTCCATTGCC GTTCATCCTT CCAAGCCATA TATATTGACG TCGTCTGATG ACTTGACCAT CAAGTTGTGG AACTGGGACA ATTCCTGGAA GTTGGAACAG GTCTTTGAAG GACACCAACA TTACGTTATG AGTGTCAACT TCAACCCCAA GGACCCAAAC ACGTTTGCTT CTGCCTGTTT GGACAGAACC GTCAAGATCT GGTCCTTGGG TTCCTCGGTG CCAAACTTCA CGTTGGTAGC TCACGATGCC AAGGGTGTTA ACTATGTTGA TTACTACCCT CAGGCTGATA AGCCTTATTT GATTACATCA TCAGATGATA AAACGATCAA GATATGGGAT TACCAGACTA AATCATGTGT CGCCACATTG GAGGGTCATT TGTCGAACGT GTCGTTCGCG ATTTTCCACC CTGAGTTGCC GTTGATCGTA TCTGGTTCTG AAGATGGAAC CATTCGTTTC TGGAACTCCA ACACCTTCAA GTTGGAGAAG TCTATTAATT ACTCGTTGGA ACGTGTGTGG TGCATTGGTA TCTTGCTGAA GTCCAACTTG ATTGCCGCGG GGTTCGACTC TGGCTTCGTT ATCGTCAAAC TTGGAAATGA AGAGCCCTTG TTCCTGATGG ACTCCAACAA CAAACTCATC TATGCTAAGA ACTCAGAGGT TTACCAGTCA GTGATTAAGC CCTCTTCTAC GGAAGGATTG AAGGATGGAG AAGCCTTGCC TTTGCAACAG AGAGAATTGG GTAACATCGA GATATATCCA CAGTCCTTAT CGCATTCTCC TAACGGTCGA TACGCTGCAG TCTGTGGAGA TGGGGAATAC ATTGTCTACA CTGCTTTGGC ATGGAGATCG AAATCATACG GTAATGCTTT GGACTTTTCC TGGAATACAC ACGATACTTC CAACGCATGT TCTTTTGCTG TGCGTGAATC GCAAGTATCG GTCAAGATTT TGAAAAACTT TCAGGAGTAC TTGACGCTCG ACTTGATCTA CCAGGCTGAC AAGATCTTTG GTGGTGCCTT GTTGGGAGTC AAGCTGGAAG GTTGTATTTC TTTCTACGAT TGGATCCACG GTAAGCTAGT TAGACGTGTT GACTTAGACG ACGACATCCA GGACGTAATC TGGTCCGACA ATGGCGAGTT GTTGGCCATT GTTACTTCTT CCAGTGTCGG CGATAGTAAT TCTGTAGGAG CTAAGAAGAG TGATGAGACG TACTTCTTGA GTTACAGCCA GGAAGCTTTC GAACTGGCCT TACAAGCTGA CGAGCTTGAT CCAGAAGAAG GTGCTGAATC GTCTTTCAAT GTTTTGTACA CTCTTCCAAC CTCTGAGCCT ATATTGTCGG GCAAGTTCAT TGGCGATGTT TATGTCTATA CCACAGCCTC TACCAACCGA TTGAATTACT TTGTGGGTGG AGAAGTGATC AACTTGGGAC ATTTCGACCA CAAATACTAC ATAATTGGCT ACAAGGCGCA AGAAGGCAAG TTGTATCTTA TTGACAAGTC GTTCAACGTC GTCTCCTGGT TCGTCAACGC CGAGGTCTTA GAGTTACAGA CCTTGGTAAT GCGTGGTGAT CTTGAACAGT ACGCTGTCAA AACTGTAGAA GATGAGGAGA CAGGTGAACA GATCCCAGAC TTGGCTAGTG TAGAAATCGA CAACTTGTCG GACGATTACG CAAACCTCAA GTCGGGATTC AGCAAGACTG AATTGAACCA GTTGTCGCGT TTCTTTGAGA AGTTAGGTTA CTTATCGTTG TCGTACTCGT TGTCGCAGGA TTTCGACTCC AAGTTCCAGA TATCACTCTC TACCGGTAAC TTGAAACAGG CATATGAATT GTTGTCTACT AACCAAAAGG AAAACCCATC TACAGCATTA GCCAACTCCA ACAAATGGAA GAGACTTGGA GACCTTGCAT TAACCAAGTG GCAGATTAAA TTGGCGGAAG ACTGCTTCTG GCTTGCCAAC GACTATTCTT CTTTATTGTT GTTGTTGTCT TCGTCTAACA ATCAGAAAGA GCTCTCCAGG TTGGCTACCG AATGTGAGGC CAAGGGTAAA TACAATATTG CATGGCAGGC ATGGTGGTTG ACTGGACAGA AGGAAAAGTG CTTGGACTTG TTGGTCAAGA GTGAAAGATT GCCGGAGGCT GCTATCTTTG GTGCCAACTA CGGTGTAAGC AGCGAAAAGT TGGAATCTAC TGTGAAATCG TGGAAAAACA AACTTGATAG CAAAAACAAA AGTAAGGTCA GTGCAAGATT AGAGGACAGC TTATCGGGAT TAAAGATCTC TACCAATGGC AGTGCAGCTC CGTTAATTGA CCTTGAAGCT ACCGAAGCAG TTGCTGAAGT TGAAGATGTA GCTGAACCAG AAACGGAAGC AGATGCTGAA GAGGCCAAAG AAGTACCACA GGCTGAAGAG GAAGAAGCTG CAGTGGAAGA GGATGAAGTG GAAGAGGATG ATGATGAAGA TGCTTAAACT AAATAAATTT TACGATCCTG AAAAGTACAC TTCCATTCAT TGTACAATTA CATACATGAA ATACAAAATG ATAAATTATA TTTGTTTTGG TTTTAGCTTC ATTTTATACA TTTTATATTT ACGTCTTTTA AGTTCATTGT TGGCATTAGT CGT
|
Protein sequence | MKLDVVKQFS TRCDRVKGID FHPSEPWILT TLYNGKIEIW SYATNTLVKS IQVTEMPVRT GKFIARKNWI VVGSDDFQIR VYNYNTGEKI TQFEAHPDYI RSIAVHPSKP YILTSSDDLT IKLWNWDNSW KLEQVFEGHQ HYVMSVNFNP KDPNTFASAC LDRTVKIWSL GSSVPNFTLV AHDAKGVNYV DYYPQADKPY LITSSDDKTI KIWDYQTKSC VATLEGHLSN VSFAIFHPEL PLIVSGSEDG TIRFWNSNTF KLEKSINYSL ERVWCIGILS KSNLIAAGFD SGFVIVKLGN EEPLFSMDSN NKLIYAKNSE VYQSVIKPSS TEGLKDGEAL PLQQRELGNI EIYPQSLSHS PNGRYAAVCG DGEYIVYTAL AWRSKSYGNA LDFSWNTHDT SNACSFAVRE SQVSVKILKN FQEYLTLDLI YQADKIFGGA LLGVKSEGCI SFYDWIHGKL VRRVDLDDDI QDVIWSDNGE LLAIVTSSSV GDSNSVGAKK SDETYFLSYS QEAFESALQA DELDPEEGAE SSFNVLYTLP TSEPILSGKF IGDVYVYTTA STNRLNYFVG GEVINLGHFD HKYYIIGYKA QEGKLYLIDK SFNVVSWFVN AEVLELQTLV MRGDLEQYAV KTVEDEETGE QIPDLASVEI DNLSDDYANL KSGFSKTELN QLSRFFEKLG YLSLSYSLSQ DFDSKFQISL STGNLKQAYE LLSTNQKENP STALANSNKW KRLGDLALTK WQIKLAEDCF WLANDYSSLL LLLSSSNNQK ELSRLATECE AKGKYNIAWQ AWWLTGQKEK CLDLLVKSER LPEAAIFGAN YGVSSEKLES TVKSWKNKLD SKNKSKVSAR LEDSLSGLKI STNGSAAPLI DLEATEAVAE VEDVAEPETE ADAEEAKEVP QAEEEEAAVE EDEVEEDDDE DA
|
| |