Gene PICST_52735 details

Gene Information

Locus tag: PICST_52735
Symbol: STB6
ID: 4851186
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 1146639
End bp: 1148855
Gene Length: 2217 bp
Protein Length: 591 aa
Translation table:
GC content: 38%
IMG OID: 640392894
Product: SIN3 binding protein
Protein accession: XP_001387449
Protein GI: 126274172
COG category:
COG ID:
TIGRFAM ID:
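
The gene length listed above follows from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption, though it reproduces the listed 2217 bp); the function name `gene_length_bp` is hypothetical and not part of any database API:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
length = gene_length_bp(1146639, 1148855)
print(length)  # 2217
```

The 591 aa protein corresponds to 1773 bp of coding sequence (591 x 3), which is shorter than the 2217 bp gene span. That is consistent with the record marking the gene as spliced.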


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0582833
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0303882
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTTTT ACTACCGCAA GCTGATGGGC AATGGCACTG TTGCCAGCAA GTTCCGCTTG 
AACAGCATCC AGCTCAATCA GTCCAACATG CGTGTTCACC AGAAAGAGCC TTCTATTAGC
AAGAGTCAGA GCCAGATACC AGAAAATGAC AATAGACAGA TACGAAACCA ACATTCAAAT
GTCTCGAGAC AAGAAAAGGT CGCAGAAAAC AAAGCCGCTA ATAACTCGAA TGTAGAAAGA
ACGCAGGAAT CCAAAACGGA CTATGTCTCC AGTTCGTCCT CATTGAGCTC TTCTCCAAAA
CCTAGAAATC CAAGTGTTAG TGCTGAAATA ACTAATGTTG GTAACAAAAA CAGCAGCAGC
AATATTAAAG GCAAATCTAT TGACTCTAAT GTCAACGTTA CGAATGGAGA TGCTACTGCT
AAGATTGCTA CTGATTCAAA TAAATACGAA TCAAACCTAC ATACCTTCTA CAACAATAAT
CAAAACCAAC AGCCTTCAGA AATCACTTCA AAAATGGCTC TGGAGCCAGC AGATAATTAT
CCCATAGAAA GTCCCGAAAC TCACCACCCC AGCAATTGTA TTCGTTGCTA TCGGTTGAAG
AAAAAGTGTT CCAGACAGTA TCCCAAATGT TCTACATGTA CCAAAGGGCA TTTTGACTGT
GAATACGTAA CTAGGTCTAA CAAACGTAAA CGCAGGAAAA AGGCGGATTT GAATAATCTC
GAGAACACTA TTGATTTAAC TCAAAACAGT GACATCGAAA ATCAGAATGA AATCGAACCT
GATCTTGTAG AACATAGCAA CGCTCCTTCG TCTACGATTT TCCATATAGA AGCTACGGGA
AAAGATGGTA ATGTAGATAA ACTGGTTGTG GCACATAAAT TAGTATCTGT ATCGTCTCTT
CTTACCGACG AAACTCCAGA TCCTTCAAAA TTGGTCTCTA CTCATGAATC ATCTCGACCT
AGAAAAGTTC CCTCAGCTGC CCTTTCAAGT ATGGCCAGAA GAACAAGAGA ACACAACGAA
TCACTAGCGG AAAAACTGAC TAAGAAGCTC TTCCATCCTC TAAAGACAAA CCTCAATGAT
GATTTCATCA CCGTATTACC CATGAAGAAC TATATCTCCG CCACATTTGT ATACAATTAC
TTTGAGAATT TTGGCTCCAA ATACCCTTTC GTTAACAAGG AAGAATTCAT GCAGCGATTT
GTAAAGATCG ACTTTAACAA GGAGGCTATC GTCAACTTGG ATATTTATAT GGTAATGAGC
ATTGGCTGTA TCATATATGA TTCATATAGC AATGTGCTGC TTTTCGACAA ATTCTTCAAG
GAATCCATAA TAGAGTCTAT TGTGGACGGA TTAGATTTTA CCTTTAACGG ACAAAATAGC
GAGAAAGAAG AACTTCAAAA CTTGGAATTA ATAACCCTCC TCACAATCTA CAGTATTGCG
TCTTTGAACA AACAGAATTG CTGGGCTCTT GTGGGAGTTT TGAATAGATT GGCCCTTCAA
TTGGATCTAT ACAAAGCAAC GGACAATGTG CGAAAACAGA GACTTTTCTG GTCTATCCAC
AATATCGAGA TGGAATTGAG CTTGTTGCTA AACAAGCCCG CACAGACTCC TCAAGATAAG
TTTATCACTT TAGACTACCC ATTGAAAAAT AAGTATTTTA AATGCGAAGA AGAGGTGCTA
CTTTCACATG AGATATGGTA TGCCAAAATC CAAGAGAAGA TTCTTGTATT GCAATTGTCT
AACGATGATG ACAAACAGAA ATTGGTGACA TTGTCCTCTG AGATTGAGAA GTGGAGAGTT
GGTATAAGTG GAGTGGTTCA TCGAGTATAT AGAGAATCGC TGCTCTTGAA GAACTATACC
GACTTGATCA ATTTAAATTA CTACTATTTG CTTGTGGAGC TAGATCAACT TTCTCCTAAC
AAATCCTTCC AATTCACATT ACAATTTGTT TCCAACTCGT TTTCAATTCT TACTTCAAAT
GCAGAAGTTG AAATCGACAA CGAAGACCAA AACTCAGGTT CTAAAAAACC AGTAGTACGT
TTGTCTTTGG CGTCATCCTT ATTTTGGTAC AAGAAACTAT TTAAAGTCGT CAGGTTCAAC
ATGGATTCCT TAAGCAGGAT GATTGCTTCT ACAACAACAG TCAAGTTTGA CTTTGGAATC
AAATTGAACG AATTCAGAAG TAATATTCTG TTGATAACCA ACCTCTTGAA ATATGTC
 
Protein sequence
MDFYYRKLMG NGTVASKFRL NSIQLNQSNM QKVAENKAAN NSNVERTQES KTDYPADNYP 
IESPETHHPS NCIRCYRLKK KCSRQYPKCS TCTKGHFDCE YVTRSNKRKR RKKADLNNLE
NTIDLTQNKA TGKDGNVDKL VVAHKLVSVS SLLTDETPDP SKLVSTHESS RPRKVPSAAL
SSMARRTREH NESLAEKLTK KLFHPLKTNL NDDFITVLPM KNYISATFVY NYFENFGSKY
PFVNKEEFMQ RFVKIDFNKE AIVNLDIYMV MSIGCIIYDS YSNVLLFDKF FKESIIESIV
DGLDFTFNGQ NSEKEELQNL ELITLLTIYS IASLNKQNCW ALVGVLNRLA LQLDLYKATD
NVRKQRLFWS IHNIEMELSL LLNKPAQTPQ DKFITLDYPL KNKYFKCEEE VLLSHEIWYA
KIQEKILVLQ LSNDDDKQKL VTLSSEIEKW RVGISGVVHR VYRESLLLKN YTDLINLNYY
YLLVELDQLS PNKSFQFTLQ FVSNSFSILT SNAEVEIDNE DQNSGSKKPV VRLSLASSLF
WYKKLFKVVR FNMDSLSRMI ASTTTVKFDF GIKLNEFRSN ILLITNLLKY V
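
The sequences above are printed in blocks of 10 residues separated by spaces and line breaks, so they need to be joined before any length or composition check. A minimal sketch, assuming the blocks are pasted in as a plain string; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the exact method the database used for its GC content figure is not stated in the record:

```python
def clean_sequence(block: str) -> str:
    """Join a blocked sequence dump into one uppercase string without whitespace."""
    return "".join(block.split()).upper()


def gc_fraction(block: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(block)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence above, used only as a small example.
example = "ATGGATTTTT ACTACCGCAA "
print(len(clean_sequence(example)))  # 20
print(gc_fraction(example))          # 0.35
```

Applied to the full gene and protein blocks, `len(clean_sequence(...))` can be compared with the Gene Length and Protein Length fields, and `gc_fraction` with the GC content field.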