Gene PICST_53294 details

Gene Information

Locus tag: PICST_53294
Symbol:
ID: 4851904
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 3145676
End bp: 3149065
Gene Length: 3390 bp
Protein Length: 1117 aa
Translation table:
GC content: 41%
IMG OID: 640393612
Product: Hypothetical WD-40 repeat protein
Protein accession: XP_001386930
Protein GI: 126275972
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
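
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes the coordinates are 1-based and inclusive and that the coding sequence ends with one stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


span = gene_length(3145676, 3149065)   # genomic span of the gene
cds = coding_length(1117)              # coding bases for 1117 aa plus a stop codon
print(span, cds, span - cds)           # prints: 3390 3354 36
```

Under these assumptions the span matches the listed Gene Length of 3390 bp, and the 36 bp not accounted for by coding sequence is consistent with the record marking the gene as spliced.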


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.149122
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCCTTCAT TAGCACCCAA AACGTTTTCC ACGCCTACGA AGAAAGGATT CTCGTATGTG 
TTGGGCCACT CTTCTCATAA CGAAAACCAT ATTCTTCCCA TCAATGCCAC ACAATACTCC
CTGGCTTCCC AGCAGTTGTA CACAGCCGGC CGAGATGGCA CTATCAAGGT ATGGAGTCAC
CAAGAACACT TAAACCCGTT TTCAGACGGG ATAAGTCAAC AAAACAACGA GCAGAGTGAT
ATCGCTGGTG TTGAGTTCTA TGAACCAGCC AGTAACGGCG AGGAGTATCC AGATATCGAT
GAGAAGATCT TGAAGTTGGA AACTTCTATA TCGTCCAATC CATTGCCCTA TAGCTACAGC
AGGCAGCGCA ATCGAAATAG TCATGTAGAA AGTGTATCCG AATATTCGAT AGTGCGCAAC
CATAATATCC ATTTCGATTG GATCAATGAT ATGAAGTTGA TCAACAACGA CCGGGATTTG
GTTTCATGCT CATCTGATCT CTCGATCAAG TTGTTGAACC TCAATGTTGA TGATAACTCG
ATAACGACGA CTACAAACAA CTTCAATACA AATAATAGCG AGGTCCATCG CTTCCCCAAC
ATGCATACAG ACTACATCAA GAAACTCTCT TACAACAAAA ATACCAACCA CTTGTTCAGT
GGAGGGCTTG ATGGTGACAT TATTGCCTGG GACTTGTTGA CGCTCAAGCC TTTTCTTCTG
GTGCCTAATC GGTCTTCGTC TATGGAACCA ACGGCTTCCA TATACTCTCT AGCCAATAGT
GAGAACTTGA TCTCTACTGG TGGACCCAAT AACACCATAA ATATATATGA TCGAAGAAGC
CAGAACCCGT TCATAAGAAA GTTGATCGGA CATCAGGACA ATATCCGATG TTTGCTAATG
AACGAGCGGT TCATTTTGAG TGGCTCCTCT GACACTTCCA TCAAGCTCTG GGACTTGCGG
AATTTCAAAG TTTACAAAAA CTTTGATATC CATGACTACC CTGTATGGTC TTTGTCAAGC
GAAGACAACA ACTTTGCTAA GTTCTATTCT GGGGACAAGG GAGGAAACAT CATAAAGACA
GACTTATCTT TTCTTTCTCA CTCTGTTCCT GAAGAGGACC AGTTTCACGG CTTTGAGACG
TTTAATTCAA ACGATAATCT TGTTATAGAC GAGAAGTTAG GTATATCGAC CATCGTTGCC
AAAGATTCAT CTCCAATCTT ATCTTTATGT TGGGAATCTA ACGAGGATAC GTTGTTTGCT
TCGAACTATG AGTCTTTGAA TCGGTTCTAC AACCCAGATA CAAACCAATT ATCCAAGTAT
CAGTATTTGA GGACTTGTTT AGACTATTCC ATCAACAAGG AAAATCAGCT CAATGACGAC
CTAGCTTCAG GTCTAGCTCC CGAAGACGCC ACTGTTCCAG GACAGAACGA TCAAACAGAT
TTGAATTCCG ACTTTTACGA CCTCATATCT CATCTCTCTA TGGATACCAA TGTCAACACT
TTTGATATTC AATCGACTTT TTCAGCACAT CAACATGCTA TGTTTGAGTC AGGAACTCCA
CCAACAGACA TGTATATGGA AAGTGATAAA TCAGCAGAAA ACGACGACGA GGGAGAATAC
AACTCGATGT TCTTGAACGT GAATGGTGGT CCATCACAAG AGTTTATCAA TGCATTCAAA
GACGAGTACG AATCTCAGGA AGCAAATCCT CTTTATCAAG AGCCCATTAA ACCTGTAGAA
AATCTGCAAA GCAAATTTAT AGACAACACT CCAGTTGAGA TCTTGTTGAA TCCGATTCCT
GCAGATCAGA TAACGTTGAT TCCATTTAAC AGACAGCCCA TTAGTGATTA CAAGATATCT
GCTAAAAGTA TCATTTCTAA GAGATCGTTT AATAACAAGC TTCAGCTCTT GGTTTTGTAC
CTCAATGGTG ATATCAAGAT ATGGGACATT ATCTTGTGCA AGGAGCTTCA AACATTCACA
TACGACAAAT CCAAGTTGTC GATGGTTAAC AGCAAAGATT TGGACCAACG ACTCAAAGAA
ATGGATGCTA TCTTCCGGAA ATTTCAGACT TCTGATACTT TGAACAATTG GTGTGAAGTT
GAAATTCGAG CTGGTAAGCT ATTGGTTACT GTGAAAGAAT CTTCGTACAT GAATGTAGAA
GTTTACTACG ACGATTTGAT CAAGAATTAC CCATTCTTAG ATATTGACCA CCCAGAGAAC
AGCATGTTAC CTAGAAATAG AATCAAGGTG ACAGACGATG ATCGGTTCCA TATTGGTGCC
GTATTACTTA ATTCAATTTT CCGCAACTAC GCTTTGTATG AATGGGAGTT TGATTGCAAA
GTGAGAGAAG AAATGAGATC TTTGCGGAAA AGTAATAGAA CATTGAGTAA TACTCCCCAG
GAGGACGACA GCGATAGTAA TTCTATCTCA AGTAGTATTA GGAAACTCAA GAAGTTTAGC
AAAAAGTCCT CAAGGACTAA TTTGACTAAT TTGGCTCAAC TGAGTGGTAG CCCAGCTAGT
TCAGCACAGA ACAGTGTTCG TGAAATGAGC ATTCTGGAAA CTCCTTTAAC TGAGTTCTTG
AACTTCAGCG ACGATTCTCC TGCTGCTGTT TCGTCTTCTT CAAACGTCAA CTACGATAAC
TCAATCATGA AATTGTTACA AACTAATAAG AGAATCTACT GGGATAAATA CAATAATTCT
TCATATATTG TCGGTAAAGG AAAGTCGGTT CCGTCCATTT TGCATGTGGA TTCTATCCAT
CCTTCATTGG ACGAGAACCA GACCCCTGAT ATACCATACA TGCCTATTGT TAACAACAAG
AGGTTGCCAC AAGATTTGTT GATTATCATC TTCGAGTATT CTCCAGATCT TGGTAACTAT
CGTGATGTGT GTAGCTTCAC ATTGGAAGAT ATACAGAAGA TTGACATCAA AGCGAACGAA
AAGTCCAATT TGGTGGATGA GTTAAGAATG CAGTTGCCTC GTTGGATTGG ACATCCTATT
CTCTTCAACA GGTTCCCCCA AAAGGAACAC CCTAAGATAG CTTTCCAGCT ATTTGAAGTG
GACTACACCA GTCTACCAGC AAATAAGAAG ATAGGGGGCA AGGCACAGAA AAAGATCAAG
AAATTGCCTG TCCTCGAGAG TCTGATCAAG TTGACGTCGC ATAACATGCT CAGAGTGAGT
AAAGTTTTGT CTTTCTTGAC CGAAAAATTT GATTCCAAGA CGTCAGAAAT GAAAGACAAG
AAGAGCTTGC CTACAGACTG GTTAGCTCTT GAGTGTAGAG GTGAGGAGTT GCCTTCAGAT
ATGACATTAC AGACGATTAA AACGACGATC TGGAAGAGTA GTTCCGATAT TGAGTTGAGG
TTTAGAAGGA AGTTTGATGT TGAAAAGTAG
 
Protein sequence
MPSLAPKTFS TPTKKGFSYV LGHSSHNENH ILPINATQYS LASQQLYTAG RDGTIKVWSH 
QEHLNPFSDG ISQQNNEQSD IAGVEFYEPA SNGEEYPDID EKILKLETSI SSNPLPYSYS
RQRNRNSHVE SVSEYSIVRN HNIHFDWIND MKLINNDRDL VSCSSDLSIK LLNLNVDDNS
ITTTTNNFNT NNSEVHRFPN MHTDYIKKLS YNKNTNHLFS GGLDGDIIAW DLLTLKPFLL
VPNRSSSMEP TASIYSLANS ENLISTGGPN NTINIYDRRS QNPFIRKLIG HQDNIRCLLM
NERFILSGSS DTSIKLWDLR NFKVYKNFDI HDYPVWSLSS EDNNFAKFYS GDKGGNIIKT
DLSFLSHSVP EEDQFHGFET FNSNDNLVID EKLGISTIVA KDSSPILSLC WESNEDTLFA
SNYESLNRFY NPDTNQLSKY QYLRTCLDYS INKENQLNDD LASGLAPEDA TVPGQNDQTD
LNSDFYDLIS HLSMDTNVNT FDIQSTFSAH QHAMFDDKSA ENDDEGEYNS MFLNVNGGPS
QEFINAFKDE YESQEANPLY QEPIKPVENL QSKFIDNTPV EILLNPIPAD QITLIPFNRQ
PISDYKISAK SIISKRSFNN KLQLLVLYLN GDIKIWDIIL CKELQTFTYD KSKLSMVNSK
DLDQRLKEMD AIFRKFQTSD TLNNWCEVEI RAGKLLVTVK ESSYMNVEVY YDDLIKNYPF
LDIDHPENSM LPRNRIKVTD DDRFHIGAVL LNSIFRNYAL YEWEFDCKVR EEMRSLRKSN
RTLSNTPQED DSDSNSISSS IRKLKKFSKK SSRTNLTNLA QLSGSPASSA QNSVREMSIL
ETPLTEFLNF SDDSPAAVSS SSNVNYDNSI MKLLQTNKRI YWDKYNNSSY IVGKGKSVPS
ILHVDSIHPS LDENQTPDIP YMPIVNNKRL PQDLLIIIFE YSPDLGNYRD VCSFTLEDIQ
KIDIKANEKS NLVDELRMQL PRWIGHPILF NRFPQKEHPK IAFQLFEVDY TSLPANKKIG
GKAQKKIKKL PVLESLIKLT SHNMLRVSKV LSFLTEKFDS KTSEMKDKKS LPTDWLALEC
RGEELPSDMT LQTIKTTIWK SSSDIELRFR RKFDVEK
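
The GC content field in the Gene Information section is presumably computed over the gene sequence listed above. The sketch below shows one way to compute such a value from a sequence printed in 10-base blocks. It is an illustration under that assumption, not the database's own method, and `gc_fraction` is a hypothetical helper name.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()  # drop block spacing and newlines
    if not bases:
        return 0.0
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence from this record, used as sample input.
sample = "ATGCCTTCAT TAGCACCCAA AACGTTTTCC ACGCCTACGA AGAAAGGATT CTCGTATGTG"
print(f"GC fraction of sample: {gc_fraction(sample):.2%}")
```

Passing the full gene sequence instead of the sample line would give the figure to compare against the reported 41%.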