Gene PICST_75782 details


Gene Information

Locus tag: PICST_75782
Symbol: NOT5
ID: 4837356
Type: CDS
Is gene spliced: No
Is pseudogene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 2384466
End bp: 2386535
Gene Length: 2070 bp
Protein Length: 610 aa
Translation table: 12
GC content: 45%
IMG OID: 640388671
Product: negative transcriptional regulator
Protein accession: XP_001382683
Protein GI: 150864013
COG category: [K] Transcription
COG ID: [COG5665] CCR4-NOT transcriptional regulation complex, NOT5 subunit
TIGRFAM ID:
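
The coordinate and length fields above can be cross-checked with a few lines of arithmetic. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the coding region ends with a single stop codon; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2384466
end_bp = 2386535
protein_length_aa = 610

# Assumption: both endpoints are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: one stop codon follows the last amino-acid codon.
cds_nt = (protein_length_aa + 1) * 3

print(f"gene length: {gene_length} bp")
print(f"coding nucleotides implied by protein length: {cds_nt}")
print(f"nucleotides outside the coding region: {gene_length - cds_nt}")
```

Under these assumptions the computed span agrees with the listed Gene Length, and the coding region implied by the protein length is shorter than the displayed gene span.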


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
CAACACAACA GATACTCCCT GTTGAGATAC ATAAGCATCA CGATAGTTGC CGCATATACC 
CGAGATCATC TTTTCTTTGG TAATAGCTTC AGCATTCGCA TCTAACCCCC CAGCATCTGG
AATCTTGATT TCCATATAGT GAATGATTTC CCCAGTGCCG ATATGAGTAC GAGAAAACTA
CAACAAGAGT TCGACAAGAC GAACAAAAAG ATAGCCGAAG GGTTGTCGGT GTTTGACGAC
ATATACGACA AGTTGATGAC ATCAGAAATC AGTTCTCAAA AAGAGAAGTT GGAGTCAGAC
TTGAAGAAAG AAATCAAGAA GTTGCAGAGA CTGAGAGACC AATTGAAACA ATGGCTTGGT
GACTCAAGTA TTAAATTAGA CAAAGACTTG TTGCAAGAGA ACAGAACCAA GATCGAGCAT
GCTATGGACC AGTTTAAAGA TTTGGAGAAG TCGTCCAAAA TTAAGCAGTT TTCCAACGAA
GGATTAGAAT TGCAGAGTCA AAGGACGAAA TCATCGAGAT TTGGCCCTGA AGATGCCAAA
AGGGCTGATG CCTGTAATTA TGTGAGCGAC ATAATCGATT TGTTGAACCA GCAAAATGAT
GAATTAGACC AGAACGTCAA CCTGCTCTTA GTGCAATTGA AGAAAGCCAA GTCTCTGAAC
CAGGCTCCTA TCCAGTCGTC TATAGAAGAC GAAAGATACA AGATAGAACG TAACAACACG
CATTTGACGA AACTCGAGAG CATCTTACGA AACTTGGAGA ACGACCGCTT GGACCCCCAG
AAGGTTGACG ACATCAAGGA TGACATAGAA TACTATGTAG AAAACAACCA AGAGGAGGAC
TATGTTGAGT ACGACGACTT CTACGATGCC TTGGAGATAG ACGACGAAGC TACTTTAGAA
GTGCAGGGTT CTCTAGCTCA AATGGCTGTA GATACCCAGG AATACGAGCC TAAAATCGAT
GCTCCTGAAC GAAAAGACGA GCCTGTGACT CCAGCAAAAC CTGAATCCAA GCCTCTTCCC
ACTTCTGCTG CATCAACTAC TACTCCTCAT AAGCCGTCTC CCAGCTCTAC GGCTGCTGGA
GGGAACACTC CTACCAAAAA AGCTCACATT GTGCCTGCTC CAGCACCTCC TCCAATTGCC
TACTCTAATG TAATCAAGGC AGCTCAGATG AACGCTCTGG TGGCAGCTGT TTCCGGCTCT
CCTGCCGCGT CAGCTGCTAC TTTATCAGGT GCCTCGCCAG TAGTTTCGGG CAAAGCACCT
CCAGGCTTGA ATCATCTCGA CAAGTCACCT TCCGTATCTC CTAATGTGAC CAATCTGAAG
ATAGCTGAGG AAGACACGCC TAAGTTGATA CAAGCACAAA CAGCTCAGAA CCAACTTCAA
CTCCAGCAGC ACCATCAGCA ACAACCTCCC GCTAACTCAC AACTGCCTGT CGGAACAGAA
GAGTACAAGG GTTTGTATGA TACTGTGTCA CGTTTCACGA CGTTGCCTCA GTCTAGATTA
CAGAACCCAT TGCCCTTCCA GGCGATAGCT GCGTTGTTAG AGTCGTCATT GCTCAATTGC
CCCGATTCTT TTGATGCCGA AAAGCCAAGA CAGTATATTC CTGTCAACGT GCATCCCTCG
TCCATCGACT ATCCCCAGGA GCCAATGTAC GAGTTGAATT CTAGCAACAT CATGAGAAAA
TTTGACAACG ATACGTTGTT CTTCTGTTTC TACTACAGCG AAGGAGTGGA CAACTTGGCC
AAGTGGAACT CGGCACAAGA ATTGTCTAGA CGTGGGTGGA TCTTCAACAC CGAGTTGAAG
CAGTGGTTTT TGAAGGACAC CAAGAACGGA GGCAAGAACC GTTCGATGTC GGTTATCCAG
AAAGAAGAAG AGCAGGACAG TGTAGACGAC TCAGAAAAGG AGGAGAACTA CAAGTACTTT
GACTACGAAA AGACGTGGTT GACCAGAAGA AGAGAAAACT ACAAGTTCAC CAACAACTTG
AGAGAAACAT TTTAGGGTGG TTGTAGTTGT TACATGTCTT GTTAGCATAG CTCTGTTGTA
TATGTATTTT AGTATAAATG TAATTATTCA
 
Protein sequence
MSTRKLQQEF DKTNKKIAEG LSVFDDIYDK LMTSEISSQK EKLESDLKKE IKKLQRSRDQ 
LKQWLGDSSI KLDKDLLQEN RTKIEHAMDQ FKDLEKSSKI KQFSNEGLEL QSQRTKSSRF
GPEDAKRADA CNYVSDIIDL LNQQNDELDQ NVNSLLVQLK KAKSSNQAPI QSSIEDERYK
IERNNTHLTK LESILRNLEN DRLDPQKVDD IKDDIEYYVE NNQEEDYVEY DDFYDALEID
DEATLEVQGS LAQMAVDTQE YEPKIDAPER KDEPVTPAKP ESKPLPTSAA STTTPHKPSP
SSTAAGGNTP TKKAHIVPAP APPPIAYSNV IKAAQMNASV AAVSGSPAAS AATLSGASPV
VSGKAPPGLN HLDKSPSVSP NVTNSKIAEE DTPKLIQAQT AQNQLQLQQH HQQQPPANSQ
SPVGTEEYKG LYDTVSRFTT LPQSRLQNPL PFQAIAALLE SSLLNCPDSF DAEKPRQYIP
VNVHPSSIDY PQEPMYELNS SNIMRKFDND TLFFCFYYSE GVDNLAKWNS AQELSRRGWI
FNTELKQWFL KDTKNGGKNR SMSVIQKEEE QDSVDDSEKE ENYKYFDYEK TWLTRRRENY
KFTNNLRETF
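
The record lists translation table 12, in which the codon CTG is read as serine rather than leucine. The sketch below shows how the nucleotide and protein sequences above relate under that table. It is a hand-rolled illustration, not the database's own pipeline: the helper names `translate` and `gc_content` are hypothetical, and the two test fragments are short excerpts copied from the gene sequence above.

```python
BASES = "TCAG"

# Amino acids for all 64 codons in TCAG order, following NCBI table 12.
# It differs from the standard code at one position: CTG gives S, not L.
TABLE_12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TO_AA = {
    b1 + b2 + b3: TABLE_12[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TO_AA[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = "".join(dna.split()).upper()
    return (dna.count("G") + dna.count("C")) / len(dna)


# Excerpt starting at the ATG in the third line of the gene sequence.
start_fragment = "ATGAGTAC GAGAAAACTA CAACAAGAGT TC"
print(translate(start_fragment))

# Excerpt containing a CTG codon, which table 12 reads as S.
ctg_fragment = "AAGTTGCAGAGACTGAGAGACCAA"
print(translate(ctg_fragment))
```

The first fragment translates to the opening residues of the protein sequence, and the second reproduces the `KLQRSRDQ` stretch, where a standard-code translation would give L in place of the S. Running `gc_content` on the full gene sequence is one way to compare against the listed GC content.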