Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_37102 |
Symbol | |
ID | 4841028 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | + |
Start bp | 25482 |
End bp | 28076 |
Gene Length | 2595 bp |
Protein Length | 536 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640392343 |
Product | predicted protein |
Protein accession | XP_001386406 |
Protein GI | 150866721 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATACCC TTAGTTCCCA TATTGAGAGC AACGTCAGAG AAGTGTGTGT CTGCTCTGGA GTTGCAAATT AGTTTCTAGT CGAATCAAAA CGACTTCACC CAAAATGCTA CTTTTTGCAA TCAAACTGCC AAAAACTGGT AAACTCAACT GCCAAATAAA CCAACACAAA AGCACTAAAT CTAACTGGCA ATTGATATCA ATGGATCCCT TGGGGATATA AGATCACCCC ACGGACGATA CTGCCGCTAT CGCTCCACGC TGAGCCCAAA CCAATGCACA TTGCAACAGA TTATGCAAGT GGGGAATCAT ACATGTGACT ATCGCGTGAC CGTTGCAATG TCTCCATCTT TTTGCCCTTA TCAGCACATC GTTTTTTTGT CGCGTCCGCA TGCAGGGATC CAACCGTTAG TATCATTCCC CCTTCTTTCT AACTCTATCC ACAGTTGCTC CCCTCAGTTG TTCTTCTGAC ACTTCAGTTA CAAAGTGCCA TTTTGTTTAC CAATCCTTTC TGATAGCTCA AGATATACTC AGCAGGGGGT ACATCAATTC TGTTACAAAT TCCCGCGACA TACTTTCCAA GGCTTCTTGT AATCGTATAA CACCAGTCGT CCTTAGATCT CGTTCGTGTT TCATTTCCGC CAAACACTGG TTTGCTGCCA AATTCGGGCA CTTAACCAAT CTGTCCCACA TCCCAACCCT GACCGAGGTA TATATACGTA ATACTCGTAC TTGCAGTTTC TTTGTTGTGA GTTGTCCGAT CCCTCGATCC CGCCACTGAT ACATTACTTA CCAGTTCTAC TAGTTCATAT TCCACAAACT TCGCATTACC AATGGACTTA GCTAAGGTCC GCAATCTGGA CACAGGCTCG TTGAAAAGAG TCCGGTCACT TCCAGTCAGC GCCGAGGATG AAGGCAACCT CGCGTATCAA CATCAACAGC ACCCTAGCTT CTCCAACGAG GCTTCGTTCC GCAAGTTACT CTCTTCCCAT CTGTCCATCT ACAACGTGTT GAACAATCCC GACTTAGACG TCTCCCCTAG CTTCAGCAGG ACACAAAGCC CGTTGTTATT AACGCCTGAG TTGCCTGACC CGTCGTTCTC CATCACGCTG TGGCAGTCTA TAAGCAATGA CTCTGTGTTT TTGGACAATC TCAATCTCAA GAATCTTTCG CCATCGCCAC ACAATGTACC TCAATTCCAG AAGCGAAACG AAAACGATGA ATCCGGCAAA AGGAGACCCA CTTCTTCCAT CTTGTCGGTT ACAACTGAAG AGGACGACAC AACGAAAGAC TCAACAGTGT TGATCTCGAC ACCAAAGCCG TCTAACGTTA TGGTCTTCGA GACAGACGAG GAAGACAACG ACGATGACCG TAACCATGTT AATCCTCCGA ATTCATTTAT CATGCCGAAG ATGAGCATTT CGGAAAGACC TAATATTCAT GACTATAGTG GCAGAAGCTC GCGATTTCAG GTCACGCTTC TTAGCTCCTA TGGTTCGTAC AAGGTGGATA CCAACTACTT GGTCAAGAGC ATAGAGCGTG AATTGAGTTG TGATTGCATC GGCATTCGCC ATGTAAATTT AGATATCCAC AATGATTACA GATCCTCTAG TTTCCTGCGA TTTGACAAAT CGTTGGTCAA GAATTCCGAT TTGATATTTG TGGTTAACGA TGGCTCTTCC GTGTTTCTCG AATATTTGAC TAGCGTCTTT GGTGGAGACG TGCAATTAGA CGAGGACTCT ATGGAAGCTT TGCCGAAACT CACAATCATC AACATGATGA CAGTCAACTA CTTTGTCAAC TTGTTTGAGT TGATAAATTA CTTAAAACCC TACCAAATCT GGAAGACTTC ATCTTTAAAA CAGGAGAAAT TGGTCAATAA AGTTAAAGAC TTCATCGAAA TCGAGTTGAA CCAACTGGAC CATTTTGAAT CCAATAAGGT TGCTACAAAG GATACAAATT TGTCATTGGT TGTCTCGAAT GTCGGCAGAT CACAAACTAT GTATTCCAAT TTAATTCTGC ACAAGAGAGC CGACTACAAG GGCATAGAGA AGAAATTCAA AACTGATCTT CAGGGTTCAT CTAGTTTTAG CGATCCGTTG CTGATTTCTT CCAACTTTGC CCATATTAAC ATCTTGTATT CAATCTTGAT GAAATTATTT TCGACTTCAC AATTGAGTCA AAACATCGTT GCTGTTGATA AGTCAACTCC CAAATCCTCG CGATTTTGGT TGATTTGTAG TTTCACAGTA GGTATTGGCT TTGGTATTGG TATCGCAAGC GGTGCCACTT CCGTTGTTGG GTTATACATA TATGAGAAGT TTCTTCAGTT TAGCCCTGGC CAAACACAGC AATGTATTCC AGTTTCATCT CCTGCAACTA AACCAATTGT GGATACCGTC GTTGATTTAT CGAAGGAGTT TCAAGGTTCT ATGTTCCAGT TTTACAATGA AGTTTCGACT GACTTAATTG GAGAGCTCAG ATCATTTTCA ACGTTATATG TTAGTTATTT AAGGTCCGCT GGTGATATTG TTATTGATTG TATTAGAGGA GGATTAGAGA AAGTTGTTGG TCTTGTTGTG TACACTAATT GCTGA
|
Protein sequence | MDTLSSHIES NVREASFRKL LSSHSSIYNV LNNPDLDVSP SFSRTQSPLL LTPELPDPSF SITSWQSISN DSKRNENDES GKRRPTSSIL SVTTEEDDTT KDSTVLISTP KPSNVMVFET DEEDNDDDRN HVNPPNSFIM PKMSISERPN IHDYSGRSSR FQVTLLSSYG SYKVDTNYLV KSIERELSCD CIGIRHVNLD IHNDYRSSSF SRFDKSLVKN SDLIFVVNDG SSVFLEYLTS VFGGDVQLDE DSMEALPKLT IINMMTVNYF VNLFELINYL KPYQIWKTSS LKQEKLVNKV KDFIEIELNQ SDHFESNKVA TKDTNLSLVV SNVGRSQTMY SNLISHKRAD YKGIEKKFKT DLQGSSSFSD PLSISSNFAH INILYSILMK LFSTSQLSQN IVAVDKSTPK SSRFWLICSF TVGIGFGIGI ASGATSVVGL YIYEKFLQFS PGQTQQCIPV SSPATKPIVD TVVDLSKEFQ GSMFQFYNEV STDLIGELRS FSTLYVSYLR SAGDIVIDCI RGGLEKVVGL VVYTNC
|
| |