Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_25346 |
Symbol | |
ID | 4840383 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 864869 |
End bp | 866407 |
Gene Length | 1539 bp |
Protein Length | 500 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640391698 |
Product | predicted protein |
Protein accession | XP_001385530 |
Protein GI | 150866060 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0929212 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTTGT TGAATCTAGG GGCCCAATCC CGTAAAACAG GTATTAAGCC TCGACAGAAC TTGACCAAAG ACAAGTACGA CATGGAGGAT ATAGACGAGT TCTTTGCTGA TGATGATGAG TTGAGCCGTA TACACAGAAG ATCGAACACC TCCCGTTCCC GTAAATCTAC ATCCGCCCAA TCATCTACTC CAAAGGACAA CTTTGACAAT GTAGCACGGT TTTTGAACTT CACAGATGCC GAAGCTGAGA ACTTCAACTT GTCTCCGATA TCTCTGTCGT CTAAGCTTGT TCCCAACAAG GACAACAAGA AATCGCCGTT GCTCTCCCCT TTGGCTGTCC GTAGGGACAA TAAGGAGTAT GATAGAGTAA AAACTAAGGG CCTTGGAAAG CAGCAGCTTG CAAAGAACGA TATCTCTGAG GTTGATGATG ACTTGTATGA TTATGAACCA GATTTTTACG ATGACCATAA CGACAACGGA CAGTCATTCG AAGAACATCT GAGTTTATCA CCAGTGCCCT TGTCACCAGT GTTACCTATT AGTGGTAATG GCAAGAAGAC ATCTTCTACT CCTAAGTCAA AGTCAACATC GTCAAAATTA TCATCTGCAT TGACTAAGCG CATGGCTCTT GGAAAGTCAT CCAAGTCCAG CGTGCCAAAA GTTCTAGAAC AAGAAGAAAG TGACGAAGGT GTGTTAGAAG CAAAGTCTAA TAATGAAAGC AGCGAATTGC TTTCCCCTCC ACCAACAAAC AAGAGATCAT CCAAAGAGAA ACCTGCTAAA TCCACATCTA CTATCTCAAA ACCTTCTCCA TTACCGTCAC CCCCACCTGA TGGGTTGAGG CGATCTAAAA GAACGAGAGT ACGACCTGTT GCATTTTGGA GGAATGAGAA AGTTCGCTAT CGTCGTGCCA ACGAAAACTC CCAGGATCCA AATACAACAT TAGGTAGCGA TATCAAGAAC ATACCCTTAC AGGAAATCCA AGAAGTTGTA CACATACCCG AACCAGCATC CAATAATACA TTAGCACCGT CGAGAAAGAG GTCACGACTG AAATCAAGGA CAACTCCTCC AAAATTCAAG AAAGCTACCA AGAAAGAAGT ATATGACTAT GAATCAGATC CTGAAATCAG TGGTTCAGAG TGGTTCAAGA ACGATACCCT CCAGCTTGAG GTGTTCGAAA ATGAGAAAAA GGTCAGGACG CTAGTGGCAT ATACACCAGA TGGTGCCGAT TTACGAGATC CGCCCGCCCC ACAGGAAGGA GACGAGAATT TCAAAGTCGC AACACTATTT GAGCATGATA AGGATTTCAG TGCTAGCGGC CTTTTGGAGT TTGATTTTGG TGGTTATAAG CAATTTCGAA ACTCTGGAGA GTTTGTCTAC AGCTTTCATG TAGTTAAAGG CTTGATAGAA GTTACTCTTA ATAATACCAA ATTTGTGGTA ACTCGAGGCT GCTCATTTCA AGTTCCAGAT GGTAATTCCT ATGGTTTCAA GAATATTGGC CAGGATTCGG CCCGACTTTT CTTTGTCCAA TGTAAGATA
|
Protein sequence | MDLLNLGAQS RKTGIKPRQN LTKDKYDMED IDEFFADDDE LSRIHRRSNT SRSRKSTSAQ SSTPKDNFDN VARFLNFTDA EAENFNLSPI SSSSKLVPNK DNKKSPLLSP LAVRRDNKEY DRNDISEVDD DLYDYEPDFY DDHNDNGQSF EEHSSLSPVP LSPVLPISGN GKKTSSTPKS KSTSSKLSSA LTKRMALGKS SKSSVPKVLE QEESDEGVLE AKSNNESSEL LSPPPTNKRS SKEKPAKSTS TISKPSPLPS PPPDGLRRSK RTRVRPVAFW RNEKVRYRRA NENSQDPNTT LGSDIKNIPL QEIQEVVHIP EPASNNTLAP SRKRSRSKSR TTPPKFKKAT KKEVYDYESD PEISGSEWFK NDTLQLEVFE NEKKVRTLVA YTPDGADLRD PPAPQEGDEN FKVATLFEHD KDFSASGLLE FDFGGYKQFR NSGEFVYSFH VVKGLIEVTL NNTKFVVTRG CSFQVPDGNS YGFKNIGQDS ARLFFVQCKI
|
| |