Gene PICST_36036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_36036 
Symbol 
ID4838737 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1408821 
End bp1410026 
Gene Length1206 bp 
Protein Length323 aa 
Translation table12 
GC content44% 
IMG OID640390052 
Productpredicted protein 
Protein accessionXP_001384228 
Protein GI126135408 
COG category[B] Chromatin structure and dynamics 
COG ID[COG5034] Chromatin remodeling protein, contains PhD zinc finger 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAACC AGCAGCAACA ACAGCAGTTC CAGCAGTACA AACAGCGTGC AAAGAACATT 
GCAAACATCA ACGAGCTTCT TCCAGGGTTA AACGACATTT CCGATGCGTT TGAAGTGTTA
CCTATGGACC TCATAAAGTA CTTCACACTA CTTAAGGAAA TCGATGCCAA ATGTATTAAC
ACCGTTCCGC AGATCAACTT CTTGATCAAA AAGTACATTG GAGGTTTACA TGAGGGCAAG
CTGTCCAAGG ATGGAGCCGA AGGAGGAGAA GACAGCAACG AAAATAACAA TGACAAATCC
GAAAATGACA AAGACAAGCT TGAAAACGAC AAGGAATCAA AAGAGACTTC CAAGAGAGAC
CTGGAATTGA GAAGGTTGAC ACTCATCAAA GACAAGATCA ACGAAATAAT TCCCTGCTTA
GAAGAGAAGA TGCACGTTAC GTCGGTGGCG ACCGACTTGT TGAGCAAACA TATGTTCCGT
ATCAACAATG ACTACAAACT AATTGTCAAC AACAACGAGA TTCCAGAGTC AATCCGTATC
GGTCCTCTAA GCCACCCGGC AATGATCATG GATGCTAACG TAGCCAACGG GTCAAGTGTT
GACAGATCAG CACAGGCACA AAGAAGTGAA AGTAGAAGAG AAGCTCTAGC AGCCAAGAAA
GCCAGCAAAG AAGACAATTC TGATGACCAT CTTTCTGTGA AGAAGAAGAA GAACAAAGAG
AGCACACCTT CAGAAGCACT CAAGGGCACA CCGAATGGAA ACCCAGGAGT AACCAACACC
AAGAAGAGAT CGAGAAAGGA AAATGATGAT ATCCAGAGAC CAGTTACCCC TGGCGGCGGC
GCTACCAACG GAACAGCACC AGCCAAGAAG AAGAATAAGC CCAAGAAGGA AGACGACAGT
AGCAATAATT CACGTGCAAA CAACACCAGC AGCAATAATG TCAACTCCAA CGAACTTGAC
AACGATGACA ATAACACTAT AGACAGAGTC AAGGCTGAAG TTCCAGCAAG TAAGGCCAAG
AATACAGGCG AACCTACGTA CTGTTACTGT AACCAAGTTT CCTTTGGTGA GATGGTGGGA
TGCGACGGTG ACGACTGTAA GCGTGAATGG TTCCATTTGC CATGTATTGG ATTCAAGAAC
CCACCTAAGG GTAAATGGTA CTGTGACGAC TGTCTTGTGA AAATGAAGAA AATGAAGAAA
TTATGA
 
Protein sequence
MKNQQQQQQF QQYKQRAKNI ANINELLPGL NDISDAFEVL PMDLIKYFTL LKEIDAKCIN 
TVPQINFLIK KYIGDKINEI IPCLEEKMHV TSVATDLLSK HMFRINNDYK LIVNNNEIPE
SIRIGPLSHP AMIMDANVAN GSSVDRSAQA QRSESRREAL AAKKASKEDN SDDHLSVKKK
KNKESTPSEA LKGTPNGNPG VTNTKKRSRK ENDDIQRPED DSSNNSRANN TSSNNVNSNE
LDNDDNNTID RVKAEVPASK AKNTGEPTYC YCNQVSFGEM VGCDGDDCKR EWFHLPCIGF
KNPPKGKWYC DDCLVKMKKM KKL