Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_33460 |
Symbol | |
ID | 4840618 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | + |
Start bp | 698987 |
End bp | 701440 |
Gene Length | 2454 bp |
Protein Length | 817 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640391933 |
Product | predicted protein |
Protein accession | XP_001386147 |
Protein GI | 150866515 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.242264 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCATTT GCATAGCTAG GACTTTGGCT CGAAGTCCCG TGCGACGTTT AGGGCGACCG GTTCGACTAA ATTTGCGAAT AGCATATTCT TCCAAAACCA AACCGCATCC AACTACTTCT AAGGAAGAAG AACCTACTTT CTTTGAAGAA AACTCCGAAA ATAACAAGAA TAGTATTCCA TTCAAAGACA AGTTGTCAAG CTTTATAGTT AACTTCATTA AGCAAGAGTC GCAGGCATTG GCATCTTCAT TTGACAACGA TCTTCTCTAT GGAAATGCCA TAGATGTCAA TACAGACTAC AGTATCATTC TCGAGCCACA ACTTCAGAGC GAAATAGACA ATGCCTTGGA TAACTACAAC AAGCTCTTTA TACTATTGTC ACATAACCCG ATATCCATTC CTGCTCAGGC ATATTTGCAT TTTTTGGAGA AAATTGACAC TCCCCTAGAT CTGAAACTTA GGTCTCTTTT GTTGAAGAGA CTTCTATACC ACCAGCAGTA TGAAACGTGC TGGCGGATTT GCATAGACAC ATATACTTCT TTGACTGATA TTGAAGATTT CATAGACTTG GCAGCTGTAA GTTTAAGGGA GAACAACAAT TCCACCTTTG GGCTAAATCT GCTTCTTGTT GCTTCCCATT CTCAGGTGTT CAACCAGAGA CTTCATAACC ATATACTTGA CACCCTAAGT TTCAAGTTCA AGATTCCTCG TCCAGATCTT GAACATAAAT TGAGTTTCTA TGACGAGCTC CAGACGATGC AGACCCTAGA AGATTTGGCG TTATTTAGGG AGAATAATGG GCTGTATATG TCAAATGATG TCGACTACGA AGTCATGTAT TTACGAAAAC ACATTCTGCT CATCCGGAGC GATACAAGCA TAAAAGTGAG CGACTGCTAC GGTTTGCTTC ACGAAAGAGA AAACTTGGTT AGAATGCCAG GTTGGCTCAG TTTCATTTCA CCATCATTCT TGGGATCCAC GACATTGAAT AATGCTGTAG GATCTTTAGT CTCAACCCCA TCCATATCTA CGAAAATCGT AGAAAGCATC AATCACATAT TGAGGACAAA CAAACATATT GGACTCAACG AAGCTGATGT CGTGTACATA CTTAATTCAC AGGCCAATAT TAAGGCCTAC AACATCTACC TGTTATATAT TGCATCCAAT CCCCTGTTGC CCAACAAGCG GATTGTCAAC CTTCTCATGG CACAAGTTAT TTTACAACTA CATTACGTTC AGATTCGGTC GATACTCTTT CGTCACTACC AAGTCCTAGA CGACGATGTC TTGGTTGAGG CATTGGTAAG AGTATTGGGA GAGTCTACTA AAGACTTTGA AGAGATTATA GGTAAACTCT TCAAGAACTC TACTTTTTCA CAATCACTAA CAATTTCTAC TACTATTGTC GACTTAGCTG TTAACTCTGG CTACTCGGTG TCACAGATTG AGCGAATGTT GCTCATTTTC CACGGGTTCG ACAAGTCTGG CAAGTTGTTA GGAAACTTAC TCAAGTCTGA GACACTCCTG TCATATTCTG AACAACAACA TATAGAACTA TACTCCAAAT TAATCATGGC ACCAGAGATT AGCTCGACCA AGACGCTACT TGAATTGAAC CGTTGTATAC TTCGCCGTGG TTTGATTGAG GAGACGTTGA TAAGTGGCAT TCTTGAACGA GTACTCAATA GCACCCTACG AAAAGATTTC ATCTTGGCAA GAGCTCGCGA TCTGAAAAGA AAACTTCCCC AGGGTTTCCA ACGTATTCAC ATGTTAGCCA ATGTGAGTGA GCGAGCCAAT TTCCACAACT CACTCAGAGC CTTGGGCCAG ACCTATTCTC TCTTAGGTGC TAAAGACATG GCACGAGTTG TGGACATCAC CAGCAACTAT ATCTTCTCGC GCCACTTCAC GTTTTGTAGA GACAAGTTTG GGCGTGATTA CTTGATTAAT AATGTCGTAT CTGAGATGAT GCGATTTGTA GAACGAGAAT CACGTACAAA GCCGAAAGAA ACGATTTTCA AAGTCAGAGA CTTGTTAACA GAGTTGAAGA GCGACTCCAA GGTGATCCGG TGCCATTTGT TCAGAATGAT GGTAAGAGAG GATCCTTCAA AGGCGATACA GTTGCTTCAG TTCTACAGTG ACAACAAGTC CAGCTTGGCT GGGATCATAC CGTATATGAT TTCGGGAATT CTTTCTACAG AGAAGTTGGA GAAGAATCGC AAACTTCAGG TTCTAGATCG ATTCCTTTCT GAGCTTGTGG TTTTGGGATA CAGGCATAGA ATTACGCAGA AGACGGGCCA CGAGTTGGTG AGACTCTTGA AACAGGATAG TGTTTCAGGC AAAGCGCTTA CGCCGCAGCT GGTGAGCTGG ATTCTTGAGT TTTCACGTAA CAATAAGGCT CTTAACAGAG TGCTACAAGT ACATTTCCGC AGAGACAAAA AGAACACGTT GTAA
|
Protein sequence | MFICIARTLA RSPVRRLGRP VRLNLRIAYS SKTKPHPTTS KEEEPTFFEE NSENNKNSIP FKDKLSSFIV NFIKQESQAL ASSFDNDLLY GNAIDVNTDY SIILEPQLQS EIDNALDNYN KLFILLSHNP ISIPAQAYLH FLEKIDTPLD SKLRSLLLKR LLYHQQYETC WRICIDTYTS LTDIEDFIDL AAVSLRENNN STFGLNSLLV ASHSQVFNQR LHNHILDTLS FKFKIPRPDL EHKLSFYDEL QTMQTLEDLA LFRENNGSYM SNDVDYEVMY LRKHISLIRS DTSIKVSDCY GLLHERENLV RMPGWLSFIS PSFLGSTTLN NAVGSLVSTP SISTKIVESI NHILRTNKHI GLNEADVVYI LNSQANIKAY NIYSLYIASN PSLPNKRIVN LLMAQVILQL HYVQIRSILF RHYQVLDDDV LVEALVRVLG ESTKDFEEII GKLFKNSTFS QSLTISTTIV DLAVNSGYSV SQIERMLLIF HGFDKSGKLL GNLLKSETLS SYSEQQHIEL YSKLIMAPEI SSTKTLLELN RCILRRGLIE ETLISGILER VLNSTLRKDF ILARARDSKR KLPQGFQRIH MLANVSERAN FHNSLRALGQ TYSLLGAKDM ARVVDITSNY IFSRHFTFCR DKFGRDYLIN NVVSEMMRFV ERESRTKPKE TIFKVRDLLT ELKSDSKVIR CHLFRMMVRE DPSKAIQLLQ FYSDNKSSLA GIIPYMISGI LSTEKLEKNR KLQVLDRFLS ELVVLGYRHR ITQKTGHELV RLLKQDSVSG KALTPQSVSW ILEFSRNNKA LNRVLQVHFR RDKKNTL
|
| |