Gene Information |
Locus tag | PICST_53294 |
Symbol | |
ID | 4851904 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 3145676 |
End bp | 3149065 |
Gene Length | 3390 bp |
Protein Length | 1117 aa |
Translation table | |
GC content | 41% |
IMG OID | 640393612 |
Product | Hypothetical WD-40 repeat protein |
Protein accession | XP_001386930 |
Protein GI | 126275972 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.149122 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTCAT TAGCACCCAA AACGTTTTCC ACGCCTACGA AGAAAGGATT CTCGTATGTG TTGGGCCACT CTTCTCATAA CGAAAACCAT ATTCTTCCCA TCAATGCCAC ACAATACTCC CTGGCTTCCC AGCAGTTGTA CACAGCCGGC CGAGATGGCA CTATCAAGGT ATGGAGTCAC CAAGAACACT TAAACCCGTT TTCAGACGGG ATAAGTCAAC AAAACAACGA GCAGAGTGAT ATCGCTGGTG TTGAGTTCTA TGAACCAGCC AGTAACGGCG AGGAGTATCC AGATATCGAT GAGAAGATCT TGAAGTTGGA AACTTCTATA TCGTCCAATC CATTGCCCTA TAGCTACAGC AGGCAGCGCA ATCGAAATAG TCATGTAGAA AGTGTATCCG AATATTCGAT AGTGCGCAAC CATAATATCC ATTTCGATTG GATCAATGAT ATGAAGTTGA TCAACAACGA CCGGGATTTG GTTTCATGCT CATCTGATCT CTCGATCAAG TTGTTGAACC TCAATGTTGA TGATAACTCG ATAACGACGA CTACAAACAA CTTCAATACA AATAATAGCG AGGTCCATCG CTTCCCCAAC ATGCATACAG ACTACATCAA GAAACTCTCT TACAACAAAA ATACCAACCA CTTGTTCAGT GGAGGGCTTG ATGGTGACAT TATTGCCTGG GACTTGTTGA CGCTCAAGCC TTTTCTTCTG GTGCCTAATC GGTCTTCGTC TATGGAACCA ACGGCTTCCA TATACTCTCT AGCCAATAGT GAGAACTTGA TCTCTACTGG TGGACCCAAT AACACCATAA ATATATATGA TCGAAGAAGC CAGAACCCGT TCATAAGAAA GTTGATCGGA CATCAGGACA ATATCCGATG TTTGCTAATG AACGAGCGGT TCATTTTGAG TGGCTCCTCT GACACTTCCA TCAAGCTCTG GGACTTGCGG AATTTCAAAG TTTACAAAAA CTTTGATATC CATGACTACC CTGTATGGTC TTTGTCAAGC GAAGACAACA ACTTTGCTAA GTTCTATTCT GGGGACAAGG GAGGAAACAT CATAAAGACA GACTTATCTT TTCTTTCTCA CTCTGTTCCT GAAGAGGACC AGTTTCACGG CTTTGAGACG TTTAATTCAA ACGATAATCT TGTTATAGAC GAGAAGTTAG GTATATCGAC CATCGTTGCC AAAGATTCAT CTCCAATCTT ATCTTTATGT TGGGAATCTA ACGAGGATAC GTTGTTTGCT TCGAACTATG AGTCTTTGAA TCGGTTCTAC AACCCAGATA CAAACCAATT ATCCAAGTAT CAGTATTTGA GGACTTGTTT AGACTATTCC ATCAACAAGG AAAATCAGCT CAATGACGAC CTAGCTTCAG GTCTAGCTCC CGAAGACGCC ACTGTTCCAG GACAGAACGA TCAAACAGAT TTGAATTCCG ACTTTTACGA CCTCATATCT CATCTCTCTA TGGATACCAA TGTCAACACT TTTGATATTC AATCGACTTT TTCAGCACAT CAACATGCTA TGTTTGAGTC AGGAACTCCA CCAACAGACA TGTATATGGA AAGTGATAAA TCAGCAGAAA ACGACGACGA GGGAGAATAC AACTCGATGT TCTTGAACGT GAATGGTGGT CCATCACAAG AGTTTATCAA TGCATTCAAA GACGAGTACG AATCTCAGGA AGCAAATCCT CTTTATCAAG AGCCCATTAA ACCTGTAGAA AATCTGCAAA GCAAATTTAT AGACAACACT CCAGTTGAGA TCTTGTTGAA TCCGATTCCT 
GCAGATCAGA TAACGTTGAT TCCATTTAAC AGACAGCCCA TTAGTGATTA CAAGATATCT GCTAAAAGTA TCATTTCTAA GAGATCGTTT AATAACAAGC TTCAGCTCTT GGTTTTGTAC CTCAATGGTG ATATCAAGAT ATGGGACATT ATCTTGTGCA AGGAGCTTCA AACATTCACA TACGACAAAT CCAAGTTGTC GATGGTTAAC AGCAAAGATT TGGACCAACG ACTCAAAGAA ATGGATGCTA TCTTCCGGAA ATTTCAGACT TCTGATACTT TGAACAATTG GTGTGAAGTT GAAATTCGAG CTGGTAAGCT ATTGGTTACT GTGAAAGAAT CTTCGTACAT GAATGTAGAA GTTTACTACG ACGATTTGAT CAAGAATTAC CCATTCTTAG ATATTGACCA CCCAGAGAAC AGCATGTTAC CTAGAAATAG AATCAAGGTG ACAGACGATG ATCGGTTCCA TATTGGTGCC GTATTACTTA ATTCAATTTT CCGCAACTAC GCTTTGTATG AATGGGAGTT TGATTGCAAA GTGAGAGAAG AAATGAGATC TTTGCGGAAA AGTAATAGAA CATTGAGTAA TACTCCCCAG GAGGACGACA GCGATAGTAA TTCTATCTCA AGTAGTATTA GGAAACTCAA GAAGTTTAGC AAAAAGTCCT CAAGGACTAA TTTGACTAAT TTGGCTCAAC TGAGTGGTAG CCCAGCTAGT TCAGCACAGA ACAGTGTTCG TGAAATGAGC ATTCTGGAAA CTCCTTTAAC TGAGTTCTTG AACTTCAGCG ACGATTCTCC TGCTGCTGTT TCGTCTTCTT CAAACGTCAA CTACGATAAC TCAATCATGA AATTGTTACA AACTAATAAG AGAATCTACT GGGATAAATA CAATAATTCT TCATATATTG TCGGTAAAGG AAAGTCGGTT CCGTCCATTT TGCATGTGGA TTCTATCCAT CCTTCATTGG ACGAGAACCA GACCCCTGAT ATACCATACA TGCCTATTGT TAACAACAAG AGGTTGCCAC AAGATTTGTT GATTATCATC TTCGAGTATT CTCCAGATCT TGGTAACTAT CGTGATGTGT GTAGCTTCAC ATTGGAAGAT ATACAGAAGA TTGACATCAA AGCGAACGAA AAGTCCAATT TGGTGGATGA GTTAAGAATG CAGTTGCCTC GTTGGATTGG ACATCCTATT CTCTTCAACA GGTTCCCCCA AAAGGAACAC CCTAAGATAG CTTTCCAGCT ATTTGAAGTG GACTACACCA GTCTACCAGC AAATAAGAAG ATAGGGGGCA AGGCACAGAA AAAGATCAAG AAATTGCCTG TCCTCGAGAG TCTGATCAAG TTGACGTCGC ATAACATGCT CAGAGTGAGT AAAGTTTTGT CTTTCTTGAC CGAAAAATTT GATTCCAAGA CGTCAGAAAT GAAAGACAAG AAGAGCTTGC CTACAGACTG GTTAGCTCTT GAGTGTAGAG GTGAGGAGTT GCCTTCAGAT ATGACATTAC AGACGATTAA AACGACGATC TGGAAGAGTA GTTCCGATAT TGAGTTGAGG TTTAGAAGGA AGTTTGATGT TGAAAAGTAG
|
Protein sequence | MPSLAPKTFS TPTKKGFSYV LGHSSHNENH ILPINATQYS LASQQLYTAG RDGTIKVWSH QEHLNPFSDG ISQQNNEQSD IAGVEFYEPA SNGEEYPDID EKILKLETSI SSNPLPYSYS RQRNRNSHVE SVSEYSIVRN HNIHFDWIND MKLINNDRDL VSCSSDLSIK LLNLNVDDNS ITTTTNNFNT NNSEVHRFPN MHTDYIKKLS YNKNTNHLFS GGLDGDIIAW DLLTLKPFLL VPNRSSSMEP TASIYSLANS ENLISTGGPN NTINIYDRRS QNPFIRKLIG HQDNIRCLLM NERFILSGSS DTSIKLWDLR NFKVYKNFDI HDYPVWSLSS EDNNFAKFYS GDKGGNIIKT DLSFLSHSVP EEDQFHGFET FNSNDNLVID EKLGISTIVA KDSSPILSLC WESNEDTLFA SNYESLNRFY NPDTNQLSKY QYLRTCLDYS INKENQLNDD LASGLAPEDA TVPGQNDQTD LNSDFYDLIS HLSMDTNVNT FDIQSTFSAH QHAMFDDKSA ENDDEGEYNS MFLNVNGGPS QEFINAFKDE YESQEANPLY QEPIKPVENL QSKFIDNTPV EILLNPIPAD QITLIPFNRQ PISDYKISAK SIISKRSFNN KLQLLVLYLN GDIKIWDIIL CKELQTFTYD KSKLSMVNSK DLDQRLKEMD AIFRKFQTSD TLNNWCEVEI RAGKLLVTVK ESSYMNVEVY YDDLIKNYPF LDIDHPENSM LPRNRIKVTD DDRFHIGAVL LNSIFRNYAL YEWEFDCKVR EEMRSLRKSN RTLSNTPQED DSDSNSISSS IRKLKKFSKK SSRTNLTNLA QLSGSPASSA QNSVREMSIL ETPLTEFLNF SDDSPAAVSS SSNVNYDNSI MKLLQTNKRI YWDKYNNSSY IVGKGKSVPS ILHVDSIHPS LDENQTPDIP YMPIVNNKRL PQDLLIIIFE YSPDLGNYRD VCSFTLEDIQ KIDIKANEKS NLVDELRMQL PRWIGHPILF NRFPQKEHPK IAFQLFEVDY TSLPANKKIG GKAQKKIKKL PVLESLIKLT SHNMLRVSKV LSFLTEKFDS KTSEMKDKKS LPTDWLALEC RGEELPSDMT LQTIKTTIWK SSSDIELRFR RKFDVEK
|
| |
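The coordinates and lengths listed under Gene Information can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the gene span includes the stop codon. The function names `span_length` and `coding_length` are hypothetical helpers, not part of any database tool.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a closed interval [start_bp, end_bp] (assumed 1-based, inclusive)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
gene_len = span_length(3145676, 3149065)  # Start bp, End bp
cds_len = coding_length(1117)             # Protein Length in aa, plus a stop codon
non_coding = gene_len - cds_len           # bases in the span that are not translated

print(gene_len, cds_len, non_coding)
```

Under these assumptions the span length matches the listed Gene Length of 3390 bp, and the untranslated remainder is consistent with the record's "Is gene spliced | Yes" entry.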