Gene PICST_28393 details


Gene Information

Locus tag: PICST_28393
Symbol: SPT20
ID: 4851170
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 1105477
End bp: 1107840
Gene Length: 2364 bp
Protein Length: 787 aa
Translation table:
GC content: 45%
IMG OID: 640392878
Product: histone acetyltransferase SAGA complex member
Protein accession: XP_001387440
Protein GI: 126274156
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3264] Small-conductance mechanosensitive channel
TIGRFAM ID:
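
The coordinate and length fields above agree with each other, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS includes its stop codon; the function name `feature_lengths` is hypothetical and not part of any database API.

```python
def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends with a stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive coordinates, hence the +1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the final codon is the stop codon
    return gene_len, protein_len


# Values taken from the record: Start bp 1105477, End bp 1107840.
gene_len, protein_len = feature_lengths(1105477, 1107840)
print(gene_len, "bp;", protein_len, "aa")  # 2364 bp; 787 aa
```

Under these assumptions the result matches the listed Gene Length (2364 bp) and Protein Length (787 aa).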


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.3451
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAACG GCACGGCGGT CTCGCCCCAG CGGCCACCGC AGCACCAGCT CAGTCCTGCC 
CAACTCCAGA GACAACTGCA ACAGGCCCTC ATCCAGCAAC AGCAGCAAAA GCAGCAGCAA
CAAACCAGAC CCAGACCCAA CATCCAGAAC TACCATTTTG CCACTCTGTC AGCAGATATT
CTCCGGAAGT ATGCAAAATA TCCTGCTTCT ATTACACTTC ACATATACGA AACCCACTAC
CGCTTCAACA ACTCGCTGGA TTCCAACATA ATACCCAAGA ACTCGCCAAT GATCAAGGAC
TTTTTGCACC ATGTGATGAA GGAGCAGATT CCCGTAGAAA TGAGCGAGTT GCTCAAAGAC
TTCTCCATCA AGTCTTACGA TGGCTGCCTA ATTCTTCAGG TGTTTGATCA TCGAAACATG
GTATCGACTA CAGCAGGGGT GACAAAAGCT GGTTCAAGTG GAGAATCCAA AGATGTGAAA
AAAGAGATCA TAGCTAATGG TTCAGCAGAT AATGCTGTTG CTAATAGTTC GAGTGGTTCT
ACTTCAGCTT CAGCTGCGGG CACGAACTCC ACAGCAACAG CATCGGCTTC AACAACAGCA
GTTTCATCGG GATCTTCTCC TGCTACTGCA TCTGGCAGTG GATCTTCCGC TGGAACTGCT
GCTTCAACCA TACCTAAGCC TAAAACGTAT AGAACTTTAC TTCGTCCTAC GCAGCTTTCT
CTATACTACG ATTTATTGTA TCATACCGAT TCAGCGTTGA CCAAATTCAC CGACTCCTTA
TCTCTTCAGA TGGAGTCGGA GATTCTAACT TTGACTAACC GGAAGTTGGA TCTCGCTGTA
CCTTTAAATC CTTATCTATG TGACGAGTAT CTTCGTCCCG AGCCTGAATA CCCCAAGAAA
GTATGGGATG AAGCAATTCA AGACTACAAG CTTATTCATC TGCATCGTGA AGCCGCGGAT
CATAAATCTA GGAAGATCCA TCTGGACGAG ATGGTGCTTC ACAAGACTTC TGAATATGAA
GAAATTATGT TACTTTTGTC TAACAAGTAC AAGAGACCAG ATGAGTCTCA AGATAAGAAG
TTGATAATTG TAGGTCCTTC GGCTCTGGCA GCAGCAGCCG CTACTACCAC TACTCCAGCT
AATCCACCTA AATCAACAAG TAAGGAAGGT ACTTCTGGAC CGGATTCGAA GGTTAAAAAA
GGTGAGGATT CTGCTCCATC TATTCCAGCA GTTCCGGCAG TTACTCCAGC GCAATCTACT
ACGAGAAGTA CTGGTCAATT CATGAGATTG AGACTAATCG AAGAAATAAG AAAGAAGAGG
GAAGCAGAAA GAGCTCTGCA AGAAGCCAAA CTACAGGCCC AGACTCAGGC TGTTCAGAGT
GATGGAAATC TAGCACTCAG TTCCGTCAAC CAGACACCAC AGGATAAGCG AAAATTGGCA
GAAATGGAAA TACAACGTCA ACAGCAACTT CAGCAGCTGA GCCAACAACA ATTGCAACAG
TCTCCACCTT CACAACAGGG CCAGCCTCAA CAGAAGAAAT TGAAGAAAGA ACCTGTAAAG
TCTCAAATGC CTCAGCAAGT ACAACAGTTT GGTAACAATA ATGGTAATAA TAATAGTAGC
AACGGAGGAG GAATTGTGAC TCCACAACAG AATAACATTC CCGTTGCTCC ACAACCAATA
AGGAATGGAA CAGCCTTAGC TGCTCAACAA CAGCAATTAC AGCAGCAAGC ACAGCAGCAA
GCACAACAAC AAGCACAACA GCAAGCACAG CAACAGCAAC AACAGCAGCA ACAGCAACAA
CAACAGCAAC AACAGCGTAT GGTTCAGAAT ACTTCAGCTC CAGGAAGTCT GCAATCTTCT
CAACAAAGCT CTAGCCAAAC TAGTGGAAGT GCTTCTTCTG GAAGTGGACT GCAGCCCACT
CTTTTGCAAC AGCAACAGCA GCAGATTTTC CAGAACTCTT TAACTCCTGA AGAGCAACAG
GTGTTTAGAC AATTGCAACA GAGAATGAAC ACTTTTGCCG TAATGGGAAA TACTGGGGTA
GCACCAAATA GACAACAATT GACACCACAA CAACAACAAC AAGCCCAACA ACAAGCCAAG
GCAATACAGC AGCAGTTGAT GCAGAAATTC CCTCTCTACT TCCAGCGTTT GCGTCAATTC
CACCTTATCC AACAACAGAG ACAACAGAAA CAAAAACAGA TCCAAGAGAA ACAAATGCAA
GCAGCTGCTG CAGCAGCTGC TCAGTCAAAT AATAATGCTC GTTTTAACCA GCAGCAACAT
CCTAACATTT CTGGAAAGAT TGTCCCAGGC TCTTCTCAGC TGCAAGCAGA CAAGAAGAAG
AGGAACTATA AGAAAAAGAA TTGA
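
The GC content field in the gene information can be recomputed from a sequence laid out in blocks of ten like the one above. The following is a minimal sketch, assuming GC content is simply the share of G and C bases among all bases once whitespace is removed; the helper name `gc_percent` is hypothetical.

```python
def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces and line breaks between blocks of ten) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Example on the first row of the gene sequence only; pass the whole
# sequence to obtain a value comparable with the record's GC content field.
first_row = "ATGAACAACG GCACGGCGGT CTCGCCCCAG CGGCCACCGC AGCACCAGCT CAGTCCTGCC"
print(round(gc_percent(first_row), 1))
```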
 
Protein sequence
MNNGTAVSPQ RPPQHQLSPA QLQRQLQQAL IQQQQQKQQQ QTRPRPNIQN YHFATLSADI 
LRKYAKYPAS ITLHIYETHY RFNNSLDSNI IPKNSPMIKD FLHHVMKEQI PVEMSELLKD
FSIKSYDGCL ILQVFDHRNM VSTTAGVTKA GSSGESKDVK KEIIANGSAD NAVANSSSGS
TSASAAGTNS TATASASTTA VSSGSSPATA SGSGSSAGTA ASTIPKPKTY RTLLRPTQLS
LYYDLLYHTD SALTKFTDSL SLQMESEILT LTNRKLDLAV PLNPYLCDEY LRPEPEYPKK
VWDEAIQDYK LIHLHREAAD HKSRKIHLDE MVLHKTSEYE EIMLLLSNKY KRPDESQDKK
LIIVGPSALA AAAATTTTPA NPPKSTSKEG TSGPDSKVKK GEDSAPSIPA VPAVTPAQST
TRSTGQFMRL RLIEEIRKKR EAERALQEAK LQAQTQAVQS DGNLALSSVN QTPQDKRKLA
EMEIQRQQQL QQLSQQQLQQ SPPSQQGQPQ QKKLKKEPVK SQMPQQVQQF GNNNGNNNSS
NGGGIVTPQQ NNIPVAPQPI RNGTALAAQQ QQLQQQAQQQ AQQQAQQQAQ QQQQQQQQQQ
QQQQQRMVQN TSAPGSLQSS QQSSSQTSGS ASSGSGLQPT LLQQQQQQIF QNSLTPEEQQ
VFRQLQQRMN TFAVMGNTGV APNRQQLTPQ QQQQAQQQAK AIQQQLMQKF PLYFQRLRQF
HLIQQQRQQK QKQIQEKQMQ AAAAAAAQSN NNARFNQQQH PNISGKIVPG SSQLQADKKK
RNYKKKN