Gene PICST_67047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_67047 
SymbolCRZ1 
ID4837466 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1932122 
End bp1935092 
Gene Length2971 bp 
Protein Length781 aa 
Translation table12 
GC content45% 
IMG OID640388781 
Productzf-C2H2 Zinc finger, C2H2 type 
Protein accessionXP_001382598 
Protein GI150863944 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.928485 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GAAAATCAAG ATTGTGAGAG ATTTCGCAGA ATCAGCTTCC CATTCACTCC TTCATAGACT 
AGATAGAGAA ACACTTAGAG TAGCAGTTGG CCGCGGAGTT AATTGAGACA TCTATAATTC
AGTACGTGTA GCGTCACCAC TTGCAAGCGT ATTTTATTGC AAGAGCCGTC CGCAATTCCC
CACCACAAGC CCAGGTGAAT TAGTCCATTC TTGTCGTTAT TTTCGCCAGT GACACGCTCG
GCTGCGGATA TATTCTAGAG AAGTCGTACT AGGGACAAAT AACGACAGAT TCTTCAGCCC
CGATTAGTGC ACTTTTGCGT TGGCCGTCCA GTAACGTGGT TCCACCAGAT TGTTCAGATT
ACAGCACCTA TAAAGTGCTA ACCATAGACA AGGAAACCCC GGCAACACCT TCAACAAGAC
TAGAGAGTCA GTTTGCCTTT TCACACTTTT CTCTTGCCAT TCGAGTCTTC CAACTGAACT
TTTCACTTTT TGTCTCGCAA TATTATCGTT TCCTGTCTGG AACCGTTTTT TCGTCCCCCT
CACAAATTGG ATAGATATTT ATCCCGTATT CTCTTCATAT AAAATAGAGT TGCACGATTT
CTTTCCAGAC TAGTGCCAGT CCTCGATGAC GTCCAACAAC CCCCGCCCGA GTGTCAAGAT
GGACGACGAC GAGCTATACA GCGACATTTT GAACCTATCT CCAACGTCTA ACCTCGATTT
CGACATGGGC TCTGGAAATT TCAGAGACAT GAGAGACAAT GGAAACAACA ATATCCAGAA
TAACCACAAC AACAATCGGA ACAACAGCAC CAATAGCAAC ATCCACAGTG TCAGTGTCAG
CTCTGACAAC ACCAAAAACA CCAGCAACAA CATCAATACA AACAACATCA ACAGTATCAC
CGGTAGAACA ACTCTCAGCA ATCCAAGCTA TACCTCCAAT GCTGCGAGCC TTTTTGCTCT
CAAAGAAGGA GCTCCCATTC CCGGTTTTGG GCTAGACGAC TCGGCAGCCC TCGACTTGAG
CTACACCTAC GATAGCTTCT CGTTCAACAG CAGAAATGAA AACATCAACA TCTACTTAAA
TGACCAGCAA CAGCTTCTCC AACAACCGGG AAACAACTCA CACAATCTTT CGCTATACCA
AGAAAAGGAC AAAAGCTCCA ACATCAATAC AAACAACATC GCAAATAGCA ATAACAACCC
CAATAGTAGA AACAGCAGTA CCAATATCAC CACGCCTGGT TCGAACGGAG ACTTTCTTTC
ACCAACGGGA AATCTCAAGT ACCAGCACAA TGGCAATATA AGTTCACAAA ACAGTAACAC
TATCAGCCAC AATCTGCAGC AGGAACGTAG CGGATCAGGT TTACTTAATC CGAATTCACC
GAGTGCTTTT TCTTCACACT CGCTCTATTC CGAAAACTCG AGCCAGCCAG CCAGTCCATA
TTTGGATGCG ATGTCTCAAC TAAGCAACGT AAACGTAAAC GGTCTACAAC CTCCAGCCGT
AGAAAGAGCT TATTCTGATG TAGGAAAATC CAACACTCCT CAGTTACTCA ATCAGGGTTC
TACACAATAT ATTGATGCTG ACTCTGCCCA CATGTTGAAT ACGTTCGATA CTGAAATCGC
CTTGGGAGGC TCAATTTCCA GCACAAACTT GGCTGGTTTG GATAGCCCAC AATACGCCCA
GATCAATGCT GGATTTCAGC AGCAAAGCTT CGGTGGGTTT TCTATGAGTA ACCAGCCACT
TTCTGTAGAT TTCAACTCTC ATAGTTTGAT GGCTACACCT CCTCCAATGC AACAGAATTC
TTCTGGTGCC AATCTTCAAA CTCAGCAACA ACAATCACAA GTGATGCAGC AGCAATACGA
TACAATTAGC ACCACTGCCA CCAATAACTC CAATACGGCT AACCAGTTCA ATTTACTTAC
AGAAAACAAT CTAAGTAGCT ACAACCAGCT TCAGGATGTA ACTTCTACGC AAATGAGAGA
CGATTATGTC TCCGAGGATA TCGTGATATC AATCCAGCAG GCGCCAGAGC CGGTAGCTGC
AAAAACGCCG TCTTTGTTCA GTAACTCCTC AGCCAATTCA TCCATCAACA ATTCACCAAG
AGTGGGAAAC ACCAATACTG GAGTACCGTT ATCGCGCTCC GCCAGTGGAG GAGGAATTTA
TTCCTCTACT AATAGCTTGA TACCCAATTC ACAATTGTTG TCATCTCAAG ACCATGATAA
CGGTGTTTCG TTGTTGAAGC CAGACGAGTA CCAAGCAATG AAGAGAGGAA GAAGAAAGAG
TCATTCAAGC AAGTCTTCAA CTTCTAAGTC CAGATCACGT TCTCGTTCAG TTTCCAGAAC
TAGAAGTGGA GGTGAGGAAG ACTATGATGA AGACGAATAC GATGATGAAG ATGATGATGA
AAAGGATTCC AGGTTAGTAA TATCTTCAAG AGAAAAAATG TTGGAGCTAG CCTCACCGAA
TCAATCATCC AAAAGAACTC AAAAACATCC CAGCGTGTAT GCTTGTCATT TGTGTGACAA
ACGATTCACC AGGCCATACA ACTTGAAATC GCATTTGAGG ACACATACAG ATGAAAGACC
GTTTATATGT AATGTCTGTG GTAAGGCTTT TGCCAGACAA CATGACAGAA AGAGACACGA
AGACTTACAT ACAGGTGAAA AGAAGTTCCA ATGTAAAGGG TTTTTGAAGA GCGGAAAGCC
TTATGGATGT GGCCGGAAGT TTGCTCGTGC TGATGCGTTG AGACGCCATT TCCAGACAGA
GGCAGGCAAG GAATGTATAA GACTATTGAT AGAAGAGGAG GAACGGGAGC GGTTGAAGAA
CGGAGACACA TCCACAGTGC ACGACTCCAT CGACAGTATT ATAGCGTCGT CGACGGGAGA
GCCAAGGGCC GAATATATGG GCTCGTATGG GCCCGGATCC ATCTCCACGG AGCATTCCAT
TCCTCTGGTG GCAATATCAC CGCCAGAATA G
 
Protein sequence
MTSNNPRPSV KMDDDELYSD ILNLSPTSNL DFDMGSGNFR DMRDNGNNNI QNNHNNNRNN 
STNSNIHSVS VSSDNTKNTS NNINTNNINS ITGRTTLSNP SYTSNAASLF ALKEGAPIPG
FGLDDSAALD LSYTYDSFSF NSRNENINIY LNDQQQLLQQ PGNNSHNLSL YQEKDKSSNI
NTNNIANSNN NPNSRNSSTN ITTPGSNGDF LSPTGNLKYQ HNGNISSQNS NTISHNSQQE
RSGSGLLNPN SPSAFSSHSL YSENSSQPAS PYLDAMSQLS NVNVNGLQPP AVERAYSDVG
KSNTPQLLNQ GSTQYIDADS AHMLNTFDTE IALGGSISST NLAGLDSPQY AQINAGFQQQ
SFGGFSMSNQ PLSVDFNSHS LMATPPPMQQ NSSGANLQTQ QQQSQVMQQQ YDTISTTATN
NSNTANQFNL LTENNLSSYN QLQDVTSTQM RDDYVSEDIV ISIQQAPEPV AAKTPSLFSN
SSANSSINNS PRVGNTNTGV PLSRSASGGG IYSSTNSLIP NSQLLSSQDH DNGVSLLKPD
EYQAMKRGRR KSHSSKSSTS KSRSRSRSVS RTRSGGEEDY DEDEYDDEDD DEKDSRLVIS
SREKMLELAS PNQSSKRTQK HPSVYACHLC DKRFTRPYNL KSHLRTHTDE RPFICNVCGK
AFARQHDRKR HEDLHTGEKK FQCKGFLKSG KPYGCGRKFA RADALRRHFQ TEAGKECIRL
LIEEEERERL KNGDTSTVHD SIDSIIASST GEPRAEYMGS YGPGSISTEH SIPSVAISPP
E