Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_31236 |
Symbol | PZF1 |
ID | 4838921 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 292394 |
End bp | 293743 |
Gene Length | 1350 bp |
Protein Length | 449 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640390236 |
Product | Strongly-conserved Zn-finger binding protein (TFIIIA) |
Protein accession | XP_001384352 |
Protein GI | 150865224 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.269579 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTCCG ACACTGCGTC GGTAACATCT ACGGGCTCGT CAGCGCTACC GAAGAAGTAC CTCTGTGACT TTGAAGGATG CACCAAAGCC TATGCAAAAC CTTCCCTACT AGAACAGCAC AAGAGATCAC ATACCAACGA AAGACCTTAC AAATGTTCCA GTCCCGACTG TGGCAAGTCC TTCATGAGAC AATCGCACTT GGATGCTCAT TTATTGTCAC ATGCCGATAA CGGAACCAAA CCATACCATT GTTCAGTGTG TGGAAAGGGT GTGAACTCAC TTCAGCATCT CAAGAGACAC GAAATCACCC ATACAAAGTC GTTCGTCTGC ACCCACGAAG GCTGTAGCGA ATCATTTTAC AAACACCAGT CGCTCAGACA TCATATACTA TCTGTACATG AGAGAACACT CAGTTGTAGT ATTTGCAATA AGAACTTCTC TCGTCCTTAT AGACTAGCCC AGCATAACTT GAAATACCAC AGCGATTCTC CAGCTTACCA ATGTGACCAT GCGGGCTGTT TTTCCAACTT CAAGACGTGG TCAGCGTTGC AGCTTCATAT CAAAACAGAA CATCCCAAGT TGAAGTGTCC CGTTTGTGGA AAAGGCTGCG TAGGAAGAAA GGGTCTTCGC TCCCATATGA TTAGTCATGA TGAGGAAAAG ATGATAAAGC TTTGGAACTG CAACTACTGT AATATTGGGA AATTTTCCAA GAAAATTGAT CTTGTTGAAC ATTATAACTC GCTTCACGAC GGAAATATAC CCGAAGATTT GCTTAAGCCT AACGAGAAAA TGCGTTTGGA AGAGTTACTA AGTGAAACTG ACGATGTAAC GAATTTGGCC GACTTGAAAA GCTTACCGGG ATCAAGGTAC GAGTTTTTAG ATGAAGAAGA GGATGAAGAG CAAGAGTTGG TTCTCGAGAA TCGATTCGAG GCTCCGAACT CTATTAAATC CATGGACAGT TTTGAGAACT CGTTGAGAAG AATATCGGTG ATCGGCCTAA TTCTGAACAA CTTCTCATCC AAGACTATCA AGTGTCCCAA AAAAAATTGT GCGAGAGCTT TCAGTAGAGA GTACGACCTC ACAAGGCACT TGAAATGGCA CGAAGAACAC ATGAAGAAAA TCGAAGACTT TTTGAACTCC GTAGAGAAGG AAGAGACAAT CTCTCCTTCA AAGATAGAGG ATGATGAATA TGACTCTGCT CTGGAACCAC CACTGAAAAG ACAAAAACTT CCAGCTCGGT ATGAAACCTT AACTAATGAT AACGACAACG ACAATGACAA TGACGACGAC CTAGATGCAC TTATAGATGT AGAATTACGC CTGATCAAAG CTGGTGACTC CTCCTTCTAA
|
Protein sequence | MSSDTASVTS TGSSALPKKY LCDFEGCTKA YAKPSLLEQH KRSHTNERPY KCSSPDCGKS FMRQSHLDAH LLSHADNGTK PYHCSVCGKG VNSLQHLKRH EITHTKSFVC THEGCSESFY KHQSLRHHIL SVHERTLSCS ICNKNFSRPY RLAQHNLKYH SDSPAYQCDH AGCFSNFKTW SALQLHIKTE HPKLKCPVCG KGCVGRKGLR SHMISHDEEK MIKLWNCNYC NIGKFSKKID LVEHYNSLHD GNIPEDLLKP NEKMRLEELL SETDDVTNLA DLKSLPGSRY EFLDEEEDEE QELVLENRFE APNSIKSMDS FENSLRRISV IGLISNNFSS KTIKCPKKNC ARAFSREYDL TRHLKWHEEH MKKIEDFLNS VEKEETISPS KIEDDEYDSA SEPPSKRQKL PARYETLTND NDNDNDNDDD LDALIDVELR SIKAGDSSF
|
| |