Gene Information |
Locus tag | PICST_61283 |
Symbol | |
ID | 4839791 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | - |
Start bp | 1225479 |
End bp | 1228472 |
Gene Length | 2994 bp |
Protein Length | 997 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640391106 |
Product | predicted protein |
Protein accession | XP_001385930 |
Protein GI | 150866359 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
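The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon (the gene is recorded as unspliced, so the CDS is contiguous). The variable and function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1225479
end_bp = 1228472

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS that ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon yields one residue except the terminal stop codon.
    return cds_length_bp // 3 - 1


print(gene_length)                  # 2994, matching "Gene Length"
print(protein_length(gene_length))  # 997, matching "Protein Length"
```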
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0585405 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGTTTT CATACTTCCC AGCGTTGGTT GCTCTTTGCA GCACCGTTAG CGCATTGGGT GGGTTGCAAA ATATTGTGTT CAAAAATCTG AAAGACGATT TGCAATTGGC TGCACACAAA AAGTCGGCTA CGTTATTCTT GGATACTGAA GACTGGCCAG GTGTTGTCAG AGCTGGTTTG GACTTAAGTG ATGATTTCAA GAAAGTCACT GGTGAAGCCT TGCCAGTTGT CAATTTTACT GGTAGTGGAA TTTGTACTAA AGTACATGGT AAGGTTGAAT CTGCAATTAT TATTGGTACT GTTGGCAACT CTTCAATTAT CGATACATTA GTTTCTGCAA AGAAGTTAGA TGTCAGTGAA ATTGAAGGCA AGTGGGAATC TTATGTCATG AAGGTTATTG AGAATCCAGC ACCATGTATC AGTAGTGCTT TGGTAATTGC TGGAAGTGAC AAGAGAGGTT CAATTTTTGG TGCCTATGAT ATTTCTGAAC AAATCGGTGT CTCTCCATGG TACTGGTTCG CCGACGTCGT GCCAGCTACT CATAGTGAAA TCTATGTTTC CAAGTCAATC GTAAAGGTCC AAGGAGAGCC CAGTGTTAAA TACAGAGGTA TTTTCCTTAA CGATGAACAA CCAGCTTTGG CAGCTTGGGT TGCCGAATTC TTCCCAGAAG GAAAATACAA CTCTTACTTC GTCCACCAAT TCTACGTGAA GTTATTCGAA TTGTTGCTCA GAATGAGAGC CAACTTCTTG TGGCCCGCTA TGTGGGCCAG TATGTTTGGA GAGGATGATC CCGAAAACCA GTACTGGGCT GACTACTATG GTATTGTCAT GTCCACATCC CACACTGAGC CACTTATGAG AGCCACTAAT GAGTGGACTA CTTTCGGTAA TGGTTCTTGG GATTACAGTA CTAACAAAGA TAACATCGTT GAATTCTGGA AAGAAGGTAT CGCCAGAAGC AAGCCATATG AAAACATGTG GACAACTGGT ATGAGAGGAT TCGGAGACAC TCCTATCACT GGTGGTGTTG AAATCAGTTT ACTTGAAGAC GTCATCAAGA CTCAAAGAGA ACTTTTAACT GAGTACTTTA ACAATACCGA TATCACTGAT ATCCCACAAG TTTGGTGTTT GTATAAGGAA GTTCAAGCCT ACTTCCAAGA AGGTATGCAA GTTCCTGAAG ACATTACTTT GTTATGGGTG GATGATAACT GGGGAAACAA CAGAAGATTG CCCCTTGCTA ACGAAACAGA CAGAGCTGGT GGTGCTGGTG TTTACTATCA TTTTGACTAC GTTGGTAGTC CAGTTGATTT CAAGTGGATT AACACTGTTT CTTTGGAAAA GACTTGGGAA CAGATGCATT TGGCAAAACA AAAACAAGCT GACCAGATCT GGGTTGTTAA CGTTGGTGAC ATGAAGCCTT TAGAAATTCC AATTGAATAC TTCATTTCTT TAGGTTATGA CTTTGATACA TGGGGACCAA TTAACAAGGT CATGACTTGG GCTACCGCAT GGGCTCAAAG AGAATTCGGA GCATACCTTA AGGAAGACGA TGTCAAGGAA GTTGCTGACA TTATTGATCT TTACGGTTTC TATGCAAACA GAAAGAAGTA CGAAGCATTA AACACCACCA CATTTCATTT ATACAACTAC AACGAAGCCG AAACCGTGCT TAGTGAATGG GCAGACTTGG CTGACAGAGC ATGGGCTGTT TACAAAAAAT TGCCAAAGAA TGTTCAGCCA TCATTTTTCC AATTGGTTCT CCATCCTGCT GTTGCTGGTT ACACCGTTTA CGACATTTTG 
ATTTCTGCTG GTAAGAACAA CTTGTATGCT GAACAAAGAA GAAATCAAGC TAATGCATTA TCCACTCATG TACTTGACAG ATTCCAATAT GACAACACCT GGAAAGGTGA ATATGACTCC TTGCTTGGTG GAAAATGGAA GCATATGATG GATCAAACTC ACTTGGGCTA CTTCTACTGG CAACAGCCAA TGAGAAATGT TGCTCCTCCA CTTGCATATG TTCAACTTGA AGAAAACTCA TTAGCTGGTT CTTTGGGTGT CACTGTAGAA CAATCCAGAG GATCTGTTCC AGGTGATGAT GCTTATAATG CTGTTGCATA CAGTAACAAT ACTTTGGTAT TGCCAACTTT AGATCCTTAC ACTGACAGTA GACACATTAC TATTTTCAAC AAAGGTATTG AAGATTTCTA CTTTGAGGTT TCTCCTTATG CTGATTACGT TAAGGTTAAT CCCTCGTCAG GATCTGTTTC TGCAACAAAC AACAGTATCT GGAACTCGGT TGATGTTGAA GTCACGGTAG ACTGGGACCA GGCCCCAGAA GGATACAATA TCGTGTTCAT GAATATCACT TCCAATACCA CTTACGAATT CTTTGGTATG CCAACTGTCA ATTTGCCAAT CAACAAGACT CAAGCACCAG ATGATTTCAA GGGTTTTGTG GAGACTAACC AACACATCTC CATCGAAGCA GAACATTTCT CTAACAACCA ATCTACAAAT GACACATACT ATGTTACAAT TGACAGATAT GGAAGAACCT TGTCTGGTGT CACTTTGTTC CCCGTCACTG CTGACTCCCA AGAAGCAACC GAAGACTATT CGTACTTGGA ATACAACTTG TACAGTTTTA GTGCACCACA ATATGGATCT AACATTACTG TTTACACTGG TTCATCCTTG AACATTGATC CATCTCGTCC TTTGAAGTAT GCCATTGCCA TCGACGATCA AGAACCTCAA GTAGTTCAAA TTGTTGTTGA TCCAACTGAC CCTACTGCCA TGCCTGCCCA TTGGGAAGAC GCTGCAAGTG ACGGTGTATG GATTCATAAC ACTACCCATA CTTTTGATGC TGGTGAACAT ACCTTGAAAT TATGGGCATT GGAACCAGCA GTTGTTTTTG AAAAAGTTGT TGTCGACTTT GGTGGTGTTG TTCCTTCGTT CTTGGGACCA CCAGAAACTT ACATCAAAAA GTAG
|
Protein sequence | MLFSYFPALV ALCSTVSALG GLQNIVFKNS KDDLQLAAHK KSATLFLDTE DWPGVVRAGL DLSDDFKKVT GEALPVVNFT GSGICTKVHG KVESAIIIGT VGNSSIIDTL VSAKKLDVSE IEGKWESYVM KVIENPAPCI SSALVIAGSD KRGSIFGAYD ISEQIGVSPW YWFADVVPAT HSEIYVSKSI VKVQGEPSVK YRGIFLNDEQ PALAAWVAEF FPEGKYNSYF VHQFYVKLFE LLLRMRANFL WPAMWASMFG EDDPENQYWA DYYGIVMSTS HTEPLMRATN EWTTFGNGSW DYSTNKDNIV EFWKEGIARS KPYENMWTTG MRGFGDTPIT GGVEISLLED VIKTQRELLT EYFNNTDITD IPQVWCLYKE VQAYFQEGMQ VPEDITLLWV DDNWGNNRRL PLANETDRAG GAGVYYHFDY VGSPVDFKWI NTVSLEKTWE QMHLAKQKQA DQIWVVNVGD MKPLEIPIEY FISLGYDFDT WGPINKVMTW ATAWAQREFG AYLKEDDVKE VADIIDLYGF YANRKKYEAL NTTTFHLYNY NEAETVLSEW ADLADRAWAV YKKLPKNVQP SFFQLVLHPA VAGYTVYDIL ISAGKNNLYA EQRRNQANAL STHVLDRFQY DNTWKGEYDS LLGGKWKHMM DQTHLGYFYW QQPMRNVAPP LAYVQLEENS LAGSLGVTVE QSRGSVPGDD AYNAVAYSNN TLVLPTLDPY TDSRHITIFN KGIEDFYFEV SPYADYVKVN PSSGSVSATN NSIWNSVDVE VTVDWDQAPE GYNIVFMNIT SNTTYEFFGM PTVNLPINKT QAPDDFKGFV ETNQHISIEA EHFSNNQSTN DTYYVTIDRY GRTLSGVTLF PVTADSQEAT EDYSYLEYNL YSFSAPQYGS NITVYTGSSL NIDPSRPLKY AIAIDDQEPQ VVQIVVDPTD PTAMPAHWED AASDGVWIHN TTHTFDAGEH TLKLWALEPA VVFEKVVVDF GGVVPSFLGP PETYIKK
|
| |
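The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. This matters for this gene: the 30th codon is CTG, and the protein sequence shows S at position 30. The following is a minimal sketch of that translation, built from the standard genetic code with the single CTG override. It uses only the first 90 nucleotides of the gene sequence above, and the function names are illustrative rather than taken from any particular tool.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(ctg_as_serine: bool = True) -> dict:
    """Return a codon -> amino acid map; optionally apply the table 12 CTG -> Ser change."""
    codons = ("".join(triplet) for triplet in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AA))
    if ctg_as_serine:
        table["CTG"] = "S"  # the one difference between table 12 and the standard code
    return table


def translate(dna: str, table: dict) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 90 nucleotides of the gene sequence, copied from the record.
first_90 = (
    "ATGTTGTTTT CATACTTCCC AGCGTTGGTT GCTCTTTGCA GCACCGTTAG "
    "CGCATTGGGT GGGTTGCAAA ATATTGTGTT CAAAAATCTG"
)

table12_protein = translate(first_90, build_table(ctg_as_serine=True))
standard_protein = translate(first_90, build_table(ctg_as_serine=False))

print(table12_protein)   # MLFSYFPALVALCSTVSALGGLQNIVFKNS
print(standard_protein)  # MLFSYFPALVALCSTVSALGGLQNIVFKNL
```

Translating with the standard code would give leucine at position 30, which disagrees with the protein sequence in the record, so any downstream tool should be told to use table 12 for this organism.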