Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_67047 |
Symbol | CRZ1 |
ID | 4837466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 1932122 |
End bp | 1935092 |
Gene Length | 2971 bp |
Protein Length | 781 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640388781 |
Product | zf-C2H2 Zinc finger, C2H2 type |
Protein accession | XP_001382598 |
Protein GI | 150863944 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.928485 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GAAAATCAAG ATTGTGAGAG ATTTCGCAGA ATCAGCTTCC CATTCACTCC TTCATAGACT AGATAGAGAA ACACTTAGAG TAGCAGTTGG CCGCGGAGTT AATTGAGACA TCTATAATTC AGTACGTGTA GCGTCACCAC TTGCAAGCGT ATTTTATTGC AAGAGCCGTC CGCAATTCCC CACCACAAGC CCAGGTGAAT TAGTCCATTC TTGTCGTTAT TTTCGCCAGT GACACGCTCG GCTGCGGATA TATTCTAGAG AAGTCGTACT AGGGACAAAT AACGACAGAT TCTTCAGCCC CGATTAGTGC ACTTTTGCGT TGGCCGTCCA GTAACGTGGT TCCACCAGAT TGTTCAGATT ACAGCACCTA TAAAGTGCTA ACCATAGACA AGGAAACCCC GGCAACACCT TCAACAAGAC TAGAGAGTCA GTTTGCCTTT TCACACTTTT CTCTTGCCAT TCGAGTCTTC CAACTGAACT TTTCACTTTT TGTCTCGCAA TATTATCGTT TCCTGTCTGG AACCGTTTTT TCGTCCCCCT CACAAATTGG ATAGATATTT ATCCCGTATT CTCTTCATAT AAAATAGAGT TGCACGATTT CTTTCCAGAC TAGTGCCAGT CCTCGATGAC GTCCAACAAC CCCCGCCCGA GTGTCAAGAT GGACGACGAC GAGCTATACA GCGACATTTT GAACCTATCT CCAACGTCTA ACCTCGATTT CGACATGGGC TCTGGAAATT TCAGAGACAT GAGAGACAAT GGAAACAACA ATATCCAGAA TAACCACAAC AACAATCGGA ACAACAGCAC CAATAGCAAC ATCCACAGTG TCAGTGTCAG CTCTGACAAC ACCAAAAACA CCAGCAACAA CATCAATACA AACAACATCA ACAGTATCAC CGGTAGAACA ACTCTCAGCA ATCCAAGCTA TACCTCCAAT GCTGCGAGCC TTTTTGCTCT CAAAGAAGGA GCTCCCATTC CCGGTTTTGG GCTAGACGAC TCGGCAGCCC TCGACTTGAG CTACACCTAC GATAGCTTCT CGTTCAACAG CAGAAATGAA AACATCAACA TCTACTTAAA TGACCAGCAA CAGCTTCTCC AACAACCGGG AAACAACTCA CACAATCTTT CGCTATACCA AGAAAAGGAC AAAAGCTCCA ACATCAATAC AAACAACATC GCAAATAGCA ATAACAACCC CAATAGTAGA AACAGCAGTA CCAATATCAC CACGCCTGGT TCGAACGGAG ACTTTCTTTC ACCAACGGGA AATCTCAAGT ACCAGCACAA TGGCAATATA AGTTCACAAA ACAGTAACAC TATCAGCCAC AATCTGCAGC AGGAACGTAG CGGATCAGGT TTACTTAATC CGAATTCACC GAGTGCTTTT TCTTCACACT CGCTCTATTC CGAAAACTCG AGCCAGCCAG CCAGTCCATA TTTGGATGCG ATGTCTCAAC TAAGCAACGT AAACGTAAAC GGTCTACAAC CTCCAGCCGT AGAAAGAGCT TATTCTGATG TAGGAAAATC CAACACTCCT CAGTTACTCA ATCAGGGTTC TACACAATAT ATTGATGCTG ACTCTGCCCA CATGTTGAAT ACGTTCGATA CTGAAATCGC CTTGGGAGGC TCAATTTCCA GCACAAACTT GGCTGGTTTG GATAGCCCAC AATACGCCCA GATCAATGCT GGATTTCAGC AGCAAAGCTT CGGTGGGTTT TCTATGAGTA ACCAGCCACT TTCTGTAGAT TTCAACTCTC ATAGTTTGAT GGCTACACCT CCTCCAATGC AACAGAATTC TTCTGGTGCC AATCTTCAAA CTCAGCAACA ACAATCACAA GTGATGCAGC AGCAATACGA TACAATTAGC ACCACTGCCA CCAATAACTC CAATACGGCT AACCAGTTCA ATTTACTTAC AGAAAACAAT CTAAGTAGCT ACAACCAGCT TCAGGATGTA ACTTCTACGC AAATGAGAGA CGATTATGTC TCCGAGGATA TCGTGATATC AATCCAGCAG GCGCCAGAGC CGGTAGCTGC AAAAACGCCG TCTTTGTTCA GTAACTCCTC AGCCAATTCA TCCATCAACA ATTCACCAAG AGTGGGAAAC ACCAATACTG GAGTACCGTT ATCGCGCTCC GCCAGTGGAG GAGGAATTTA TTCCTCTACT AATAGCTTGA TACCCAATTC ACAATTGTTG TCATCTCAAG ACCATGATAA CGGTGTTTCG TTGTTGAAGC CAGACGAGTA CCAAGCAATG AAGAGAGGAA GAAGAAAGAG TCATTCAAGC AAGTCTTCAA CTTCTAAGTC CAGATCACGT TCTCGTTCAG TTTCCAGAAC TAGAAGTGGA GGTGAGGAAG ACTATGATGA AGACGAATAC GATGATGAAG ATGATGATGA AAAGGATTCC AGGTTAGTAA TATCTTCAAG AGAAAAAATG TTGGAGCTAG CCTCACCGAA TCAATCATCC AAAAGAACTC AAAAACATCC CAGCGTGTAT GCTTGTCATT TGTGTGACAA ACGATTCACC AGGCCATACA ACTTGAAATC GCATTTGAGG ACACATACAG ATGAAAGACC GTTTATATGT AATGTCTGTG GTAAGGCTTT TGCCAGACAA CATGACAGAA AGAGACACGA AGACTTACAT ACAGGTGAAA AGAAGTTCCA ATGTAAAGGG TTTTTGAAGA GCGGAAAGCC TTATGGATGT GGCCGGAAGT TTGCTCGTGC TGATGCGTTG AGACGCCATT TCCAGACAGA GGCAGGCAAG GAATGTATAA GACTATTGAT AGAAGAGGAG GAACGGGAGC GGTTGAAGAA CGGAGACACA TCCACAGTGC ACGACTCCAT CGACAGTATT ATAGCGTCGT CGACGGGAGA GCCAAGGGCC GAATATATGG GCTCGTATGG GCCCGGATCC ATCTCCACGG AGCATTCCAT TCCTCTGGTG GCAATATCAC CGCCAGAATA G
|
Protein sequence | MTSNNPRPSV KMDDDELYSD ILNLSPTSNL DFDMGSGNFR DMRDNGNNNI QNNHNNNRNN STNSNIHSVS VSSDNTKNTS NNINTNNINS ITGRTTLSNP SYTSNAASLF ALKEGAPIPG FGLDDSAALD LSYTYDSFSF NSRNENINIY LNDQQQLLQQ PGNNSHNLSL YQEKDKSSNI NTNNIANSNN NPNSRNSSTN ITTPGSNGDF LSPTGNLKYQ HNGNISSQNS NTISHNSQQE RSGSGLLNPN SPSAFSSHSL YSENSSQPAS PYLDAMSQLS NVNVNGLQPP AVERAYSDVG KSNTPQLLNQ GSTQYIDADS AHMLNTFDTE IALGGSISST NLAGLDSPQY AQINAGFQQQ SFGGFSMSNQ PLSVDFNSHS LMATPPPMQQ NSSGANLQTQ QQQSQVMQQQ YDTISTTATN NSNTANQFNL LTENNLSSYN QLQDVTSTQM RDDYVSEDIV ISIQQAPEPV AAKTPSLFSN SSANSSINNS PRVGNTNTGV PLSRSASGGG IYSSTNSLIP NSQLLSSQDH DNGVSLLKPD EYQAMKRGRR KSHSSKSSTS KSRSRSRSVS RTRSGGEEDY DEDEYDDEDD DEKDSRLVIS SREKMLELAS PNQSSKRTQK HPSVYACHLC DKRFTRPYNL KSHLRTHTDE RPFICNVCGK AFARQHDRKR HEDLHTGEKK FQCKGFLKSG KPYGCGRKFA RADALRRHFQ TEAGKECIRL LIEEEERERL KNGDTSTVHD SIDSIIASST GEPRAEYMGS YGPGSISTEH SIPSVAISPP E
|
| |