Gene PICST_73625 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_73625 
Symbol 
ID4840543 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp888704 
End bp891604 
Gene Length2901 bp 
Protein Length636 aa 
Translation table12 
GC content42% 
IMG OID640391858 
Productpredicted protein 
Protein accessionXP_001386360 
Protein GI150866686 
COG category[B] Chromatin structure and dynamics
[K] Transcription 
COG ID[COG5076] Transcription factor involved in chromatin remodeling, contains bromodomain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.171811 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GCCAGCACAT AATATTTTTT CGCCGCCCGC CCATCCAGCC CGCGTTTTGC AGTTTCGTCC 
ACCGCACCAC AAAGTCTGCA GATTGCGCTT ACGTCAGCTT GCTGGTTCAC CTCGACACGT
CTGGCTGTGG GTAATTTTTT AATTTTTTGC AAAACCCGGT CGCAATCCGA AGATATATAA
AAAACGCTTT TGCGTCTTTT TTTTTGTCTT TTTCCAACCT GAAAAAACAG ATTTAGTTTT
CGTGGGTGCA GAGTGTATGT GTTATATACA GGTGTTTATT TCACCACCAG TGAACGAGAG
AGCCGAAAGA TACCAGAGGA TAGGATAGAT ATGACAGTTT GGTGGCCGTT CGAGAGCTGC
TGAAGTGATG GCAAATGACT GAGATGTGAA AGAGGGTATA AATTCGAAGT CACTTCCGTA
CTTTTTTTTC AGCCTCGTTT CTGCGGTTTT GCCATCAAGT TTATGTGTCT CCTGTTATCT
GTCAGTTTCG CTGGATTACC AGTGTCTCAG ATTTGTTCAC AGTGATAAAT TTCACTTGCC
AATTCCAAAA ACCAATAGAA TATCCATACT AACAAAACTT CTAGAACTTG GTTTGTTATT
TTGCTTAGAC TCCGATACTA TTCTTATCTC TTTACAGATA TATCTTAATT GTATCATAGG
TGTTCTCTTT GTTGTTTGTC ATCTTATCAC TTCCGTATAT TGCATCTTTC ATCGCTTCAA
GCCTTGTTAA TAATCTTATT CCCTATATCT TTTCAAGAAT ATGTCAGAAG TTGTACCAGA
AACGAATACT CCAGTTCAGA CCCCTTCATC CGAAACACAC TTCCACAAAT CAGACACAGC
CATAGTCAAC GAATATAAGA AAATGACCCC TGAAGAGCCG GAAAAACCGC TTTCTCCTCC
AAATCCCTCT CCGAGCCCTG AGAAGCGAAA GTTAGAAGTG GACGAAAATG AAGAGTCCAA
AAGGCAAAAA TACGATTCAG AAGCTCCTGA AGCTGTAGCC AATGAGGCTG CTCCCAATTC
GATTAACGTA GAAGAATCTA AAGAAGCTTC TCCAGTTGTT CCTGCAACAG CAGGGACAGC
TGTATTTTCG GAACCGGCTC CAAAGCCAGC TGCAGAACCA GATATGGACA ATTTGCCTGC
CAACCCATTA CCCCCACATC AAGCCAAGTT TGCCCTCAAC ACCATAAAAG CCATAAAGCG
GTTGCGTGAT GCTGTGCCAT TTTTACACCC AGTAGACATC GTCAAGTTGA ACATTCCCTT
CTACTACAAC TACATTCCTA GACCTATGGA CTTGTCAACT ATCGAAACCA AAGTACATGT
AAATGCCTAC GAAGACTCCA ATCAGATAGT TGAGGACTTC AACTTGATGG TAGCTAATTG
TAAGAAGTTC AACGGCGAAA ATGCTGGTAT TTCCAAAATG GCTGATAACA TTCAAGCTCA
CTTTGAAAAG CACATGTTGA ATTTTCCTCC CAAAGTTTTA CCATCGGCAG TTGCTGCGGC
TAAACCTTCT GCAACTGGAT TGGCTTCGAA GAGAAGAACC GAAGCTGATG CCGTAAAGCA
ACAACAGCGC GAGTCAGTAG CTGCTCATAG ACCAAAGAGA ACCATACACC CTCCAAAGTC
AAAAGAAATC CCATATGATA CTAAGCCCCG TAAGAAGAAG TTTGCAGCAG AGTTGCGATT
CTGTTCCCAG ACTGTCAAGG AGTTAATGTC GAAAAAGCAT AACGGCTACA ACTTCCCGTT
TGTCGCTCCT GTAGACCCTG TAGCCTTGAA TATTCCAAAC TACTTCAAAG TAGTGAAAGA
ACCGATGGAC TTGGGCACAA TTCAATCCAA GTTGACTAAT AACCAGTACG AAAATGGAGA
TGAGTTTGAA CGTGACATAC GTTTGGTATT CAAAAATTGT TACATTTTCA ATCCTGAAGG
AAGTGAAGTG AACATGATGG GACATCGTCT TGAAGCAGTT TTCGACAAGA GATGGGCTGC
TCGTCCTGTT CCAGAACCAA CGCCTGTCAA TTCTGAAATC GAAGATTCTG AAGAAGAATC
GAGCGACGGC GAAGATGAAG AACTGGAAAT CAACGAGTCC ATGTTATCAG ATGTTCCTGC
CATTCAGTTC TTGGAAAATC AATTGCTCAG AATGAAGAAG GAACTAGATG AATTGAAGAA
GGAACATTTG AAGAAGTTGA GAGAGCAGCA GGCAGCTAGG AAGAAAAAGA GAAAGTCCAA
GAAGGCTGCA GCTAAGAAGT CTTCGGCTCC GCCTAGGGCA CCATCGATTT CCTCGACTCC
TGTTGTTACT TACGAAATGA AAAAACAAGT CAGTGAGATG GTTCCTAATC TTTCGGACAA
GAAATTGCAG TCTCTTATCA AGATCATCAA GGATGATGTT GAAATTAGCA ACGAAGATGA
AGTAGAATTG GACATGGACC AATTGGAAGA CCGCACTGTC TTGAAGTTGT ACAACTTCTT
GTTTGGCAAG AAGGCTTCAG CCAAGCTTGC TAAGAAGCCA AAGAAACCTG TTATAACTAA
CAGTGTTGAT GAATTGGCCC ATTTGAGAAG CCAGTTGGCG TTGTTTGACG ACGACAATAA
CAATGGCTCT ACCAATGGAT TCATGAATAT TGGCAACGAC CATGAATCTT CAGAAGACGA
TCTCTCCTCT GAAAGTTCTG AAGAGGAATA AGCTCCTCCA CGATATGAAT AAATTAATAG
TGGTTTGTTG TCACAAACTA TGGAAATGGA AGCTATGAAT ACTATGTAAA ACTAGCGCCT
TGTACAAAGA AAGCTTTTAC TGTTGAGCTA CGCCCACCGC ACCCGTTTCA CACGTATACT
ATATTAGTTT GGATATGTTA TTCTTACTTC TATCACAGTT TCAGTTTTAT AGTGTATTAT
AAAAGGATTA ATTAATTGAA G
 
Protein sequence
MSEVVPETNT PVQTPSSETH FHKSDTAIVN EYKKMTPEEP EKPLSPPNPS PSPEKRKLEV 
DENEESKRQK YDSEAPEAVA NEAAPNSINV EESKEASPVV PATAGTAVFS EPAPKPAAEP
DMDNLPANPL PPHQAKFALN TIKAIKRLRD AVPFLHPVDI VKLNIPFYYN YIPRPMDLST
IETKVHVNAY EDSNQIVEDF NLMVANCKKF NGENAGISKM ADNIQAHFEK HMLNFPPKVL
PSAVAAAKPS ATGLASKRRT EADAVKQQQR ESVAAHRPKR TIHPPKSKEI PYDTKPRKKK
FAAELRFCSQ TVKELMSKKH NGYNFPFVAP VDPVALNIPN YFKVVKEPMD LGTIQSKLTN
NQYENGDEFE RDIRLVFKNC YIFNPEGSEV NMMGHRLEAV FDKRWAARPV PEPTPVNSEI
EDSEEESSDG EDEESEINES MLSDVPAIQF LENQLLRMKK ELDELKKEHL KKLREQQAAR
KKKRKSKKAA AKKSSAPPRA PSISSTPVVT YEMKKQVSEM VPNLSDKKLQ SLIKIIKDDV
EISNEDEVEL DMDQLEDRTV LKLYNFLFGK KASAKLAKKP KKPVITNSVD ELAHLRSQLA
LFDDDNNNGS TNGFMNIGND HESSEDDLSS ESSEEE