Gene PICST_77338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_77338 
Symbol 
ID4838961 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp549461 
End bp552413 
Gene Length2953 bp 
Protein Length922 aa 
Translation table12 
GC content45% 
IMG OID640390276 
Productpredicted protein 
Protein accessionXP_001384402 
Protein GI150865259 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.98376 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CACGCATCGT AGTTACCTTC ATATCACCAT GAAATTGGAC GTGGTCAAAC AGTTCTCCAC 
GCGTTGTGAC CGTGTCAAGG GAATCGACTT CCATCCCTCT GAGCCGTGGA TCCTCACCAC
CTTGTACAAT GGTAAGATCG AGATTTGGTC CTATGCCACA AACACGCTTG TAAAGTCGAT
CCAGGTAACG GAGATGCCCG TCCGGACGGG AAAGTTCATC GCTCGTAAGA ATTGGATTGT
TGTCGGCTCC GACGACTTCC AGATCCGGGT CTATAACTAC AACACGGGTG AAAAAATCAC
CCAGTTCGAG GCCCATCCAG ACTATATCAG GTCCATTGCC GTTCATCCTT CCAAGCCATA
TATATTGACG TCGTCTGATG ACTTGACCAT CAAGTTGTGG AACTGGGACA ATTCCTGGAA
GTTGGAACAG GTCTTTGAAG GACACCAACA TTACGTTATG AGTGTCAACT TCAACCCCAA
GGACCCAAAC ACGTTTGCTT CTGCCTGTTT GGACAGAACC GTCAAGATCT GGTCCTTGGG
TTCCTCGGTG CCAAACTTCA CGTTGGTAGC TCACGATGCC AAGGGTGTTA ACTATGTTGA
TTACTACCCT CAGGCTGATA AGCCTTATTT GATTACATCA TCAGATGATA AAACGATCAA
GATATGGGAT TACCAGACTA AATCATGTGT CGCCACATTG GAGGGTCATT TGTCGAACGT
GTCGTTCGCG ATTTTCCACC CTGAGTTGCC GTTGATCGTA TCTGGTTCTG AAGATGGAAC
CATTCGTTTC TGGAACTCCA ACACCTTCAA GTTGGAGAAG TCTATTAATT ACTCGTTGGA
ACGTGTGTGG TGCATTGGTA TCTTGCTGAA GTCCAACTTG ATTGCCGCGG GGTTCGACTC
TGGCTTCGTT ATCGTCAAAC TTGGAAATGA AGAGCCCTTG TTCCTGATGG ACTCCAACAA
CAAACTCATC TATGCTAAGA ACTCAGAGGT TTACCAGTCA GTGATTAAGC CCTCTTCTAC
GGAAGGATTG AAGGATGGAG AAGCCTTGCC TTTGCAACAG AGAGAATTGG GTAACATCGA
GATATATCCA CAGTCCTTAT CGCATTCTCC TAACGGTCGA TACGCTGCAG TCTGTGGAGA
TGGGGAATAC ATTGTCTACA CTGCTTTGGC ATGGAGATCG AAATCATACG GTAATGCTTT
GGACTTTTCC TGGAATACAC ACGATACTTC CAACGCATGT TCTTTTGCTG TGCGTGAATC
GCAAGTATCG GTCAAGATTT TGAAAAACTT TCAGGAGTAC TTGACGCTCG ACTTGATCTA
CCAGGCTGAC AAGATCTTTG GTGGTGCCTT GTTGGGAGTC AAGCTGGAAG GTTGTATTTC
TTTCTACGAT TGGATCCACG GTAAGCTAGT TAGACGTGTT GACTTAGACG ACGACATCCA
GGACGTAATC TGGTCCGACA ATGGCGAGTT GTTGGCCATT GTTACTTCTT CCAGTGTCGG
CGATAGTAAT TCTGTAGGAG CTAAGAAGAG TGATGAGACG TACTTCTTGA GTTACAGCCA
GGAAGCTTTC GAACTGGCCT TACAAGCTGA CGAGCTTGAT CCAGAAGAAG GTGCTGAATC
GTCTTTCAAT GTTTTGTACA CTCTTCCAAC CTCTGAGCCT ATATTGTCGG GCAAGTTCAT
TGGCGATGTT TATGTCTATA CCACAGCCTC TACCAACCGA TTGAATTACT TTGTGGGTGG
AGAAGTGATC AACTTGGGAC ATTTCGACCA CAAATACTAC ATAATTGGCT ACAAGGCGCA
AGAAGGCAAG TTGTATCTTA TTGACAAGTC GTTCAACGTC GTCTCCTGGT TCGTCAACGC
CGAGGTCTTA GAGTTACAGA CCTTGGTAAT GCGTGGTGAT CTTGAACAGT ACGCTGTCAA
AACTGTAGAA GATGAGGAGA CAGGTGAACA GATCCCAGAC TTGGCTAGTG TAGAAATCGA
CAACTTGTCG GACGATTACG CAAACCTCAA GTCGGGATTC AGCAAGACTG AATTGAACCA
GTTGTCGCGT TTCTTTGAGA AGTTAGGTTA CTTATCGTTG TCGTACTCGT TGTCGCAGGA
TTTCGACTCC AAGTTCCAGA TATCACTCTC TACCGGTAAC TTGAAACAGG CATATGAATT
GTTGTCTACT AACCAAAAGG AAAACCCATC TACAGCATTA GCCAACTCCA ACAAATGGAA
GAGACTTGGA GACCTTGCAT TAACCAAGTG GCAGATTAAA TTGGCGGAAG ACTGCTTCTG
GCTTGCCAAC GACTATTCTT CTTTATTGTT GTTGTTGTCT TCGTCTAACA ATCAGAAAGA
GCTCTCCAGG TTGGCTACCG AATGTGAGGC CAAGGGTAAA TACAATATTG CATGGCAGGC
ATGGTGGTTG ACTGGACAGA AGGAAAAGTG CTTGGACTTG TTGGTCAAGA GTGAAAGATT
GCCGGAGGCT GCTATCTTTG GTGCCAACTA CGGTGTAAGC AGCGAAAAGT TGGAATCTAC
TGTGAAATCG TGGAAAAACA AACTTGATAG CAAAAACAAA AGTAAGGTCA GTGCAAGATT
AGAGGACAGC TTATCGGGAT TAAAGATCTC TACCAATGGC AGTGCAGCTC CGTTAATTGA
CCTTGAAGCT ACCGAAGCAG TTGCTGAAGT TGAAGATGTA GCTGAACCAG AAACGGAAGC
AGATGCTGAA GAGGCCAAAG AAGTACCACA GGCTGAAGAG GAAGAAGCTG CAGTGGAAGA
GGATGAAGTG GAAGAGGATG ATGATGAAGA TGCTTAAACT AAATAAATTT TACGATCCTG
AAAAGTACAC TTCCATTCAT TGTACAATTA CATACATGAA ATACAAAATG ATAAATTATA
TTTGTTTTGG TTTTAGCTTC ATTTTATACA TTTTATATTT ACGTCTTTTA AGTTCATTGT
TGGCATTAGT CGT
 
Protein sequence
MKLDVVKQFS TRCDRVKGID FHPSEPWILT TLYNGKIEIW SYATNTLVKS IQVTEMPVRT 
GKFIARKNWI VVGSDDFQIR VYNYNTGEKI TQFEAHPDYI RSIAVHPSKP YILTSSDDLT
IKLWNWDNSW KLEQVFEGHQ HYVMSVNFNP KDPNTFASAC LDRTVKIWSL GSSVPNFTLV
AHDAKGVNYV DYYPQADKPY LITSSDDKTI KIWDYQTKSC VATLEGHLSN VSFAIFHPEL
PLIVSGSEDG TIRFWNSNTF KLEKSINYSL ERVWCIGILS KSNLIAAGFD SGFVIVKLGN
EEPLFSMDSN NKLIYAKNSE VYQSVIKPSS TEGLKDGEAL PLQQRELGNI EIYPQSLSHS
PNGRYAAVCG DGEYIVYTAL AWRSKSYGNA LDFSWNTHDT SNACSFAVRE SQVSVKILKN
FQEYLTLDLI YQADKIFGGA LLGVKSEGCI SFYDWIHGKL VRRVDLDDDI QDVIWSDNGE
LLAIVTSSSV GDSNSVGAKK SDETYFLSYS QEAFESALQA DELDPEEGAE SSFNVLYTLP
TSEPILSGKF IGDVYVYTTA STNRLNYFVG GEVINLGHFD HKYYIIGYKA QEGKLYLIDK
SFNVVSWFVN AEVLELQTLV MRGDLEQYAV KTVEDEETGE QIPDLASVEI DNLSDDYANL
KSGFSKTELN QLSRFFEKLG YLSLSYSLSQ DFDSKFQISL STGNLKQAYE LLSTNQKENP
STALANSNKW KRLGDLALTK WQIKLAEDCF WLANDYSSLL LLLSSSNNQK ELSRLATECE
AKGKYNIAWQ AWWLTGQKEK CLDLLVKSER LPEAAIFGAN YGVSSEKLES TVKSWKNKLD
SKNKSKVSAR LEDSLSGLKI STNGSAAPLI DLEATEAVAE VEDVAEPETE ADAEEAKEVP
QAEEEEAAVE EDEVEEDDDE DA