Gene PICST_31236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31236 
SymbolPZF1 
ID4838921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp292394 
End bp293743 
Gene Length1350 bp 
Protein Length449 aa 
Translation table12 
GC content43% 
IMG OID640390236 
ProductStrongly-conserved Zn-finger binding protein (TFIIIA) 
Protein accessionXP_001384352 
Protein GI150865224 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.269579 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTCCG ACACTGCGTC GGTAACATCT ACGGGCTCGT CAGCGCTACC GAAGAAGTAC 
CTCTGTGACT TTGAAGGATG CACCAAAGCC TATGCAAAAC CTTCCCTACT AGAACAGCAC
AAGAGATCAC ATACCAACGA AAGACCTTAC AAATGTTCCA GTCCCGACTG TGGCAAGTCC
TTCATGAGAC AATCGCACTT GGATGCTCAT TTATTGTCAC ATGCCGATAA CGGAACCAAA
CCATACCATT GTTCAGTGTG TGGAAAGGGT GTGAACTCAC TTCAGCATCT CAAGAGACAC
GAAATCACCC ATACAAAGTC GTTCGTCTGC ACCCACGAAG GCTGTAGCGA ATCATTTTAC
AAACACCAGT CGCTCAGACA TCATATACTA TCTGTACATG AGAGAACACT CAGTTGTAGT
ATTTGCAATA AGAACTTCTC TCGTCCTTAT AGACTAGCCC AGCATAACTT GAAATACCAC
AGCGATTCTC CAGCTTACCA ATGTGACCAT GCGGGCTGTT TTTCCAACTT CAAGACGTGG
TCAGCGTTGC AGCTTCATAT CAAAACAGAA CATCCCAAGT TGAAGTGTCC CGTTTGTGGA
AAAGGCTGCG TAGGAAGAAA GGGTCTTCGC TCCCATATGA TTAGTCATGA TGAGGAAAAG
ATGATAAAGC TTTGGAACTG CAACTACTGT AATATTGGGA AATTTTCCAA GAAAATTGAT
CTTGTTGAAC ATTATAACTC GCTTCACGAC GGAAATATAC CCGAAGATTT GCTTAAGCCT
AACGAGAAAA TGCGTTTGGA AGAGTTACTA AGTGAAACTG ACGATGTAAC GAATTTGGCC
GACTTGAAAA GCTTACCGGG ATCAAGGTAC GAGTTTTTAG ATGAAGAAGA GGATGAAGAG
CAAGAGTTGG TTCTCGAGAA TCGATTCGAG GCTCCGAACT CTATTAAATC CATGGACAGT
TTTGAGAACT CGTTGAGAAG AATATCGGTG ATCGGCCTAA TTCTGAACAA CTTCTCATCC
AAGACTATCA AGTGTCCCAA AAAAAATTGT GCGAGAGCTT TCAGTAGAGA GTACGACCTC
ACAAGGCACT TGAAATGGCA CGAAGAACAC ATGAAGAAAA TCGAAGACTT TTTGAACTCC
GTAGAGAAGG AAGAGACAAT CTCTCCTTCA AAGATAGAGG ATGATGAATA TGACTCTGCT
CTGGAACCAC CACTGAAAAG ACAAAAACTT CCAGCTCGGT ATGAAACCTT AACTAATGAT
AACGACAACG ACAATGACAA TGACGACGAC CTAGATGCAC TTATAGATGT AGAATTACGC
CTGATCAAAG CTGGTGACTC CTCCTTCTAA
 
Protein sequence
MSSDTASVTS TGSSALPKKY LCDFEGCTKA YAKPSLLEQH KRSHTNERPY KCSSPDCGKS 
FMRQSHLDAH LLSHADNGTK PYHCSVCGKG VNSLQHLKRH EITHTKSFVC THEGCSESFY
KHQSLRHHIL SVHERTLSCS ICNKNFSRPY RLAQHNLKYH SDSPAYQCDH AGCFSNFKTW
SALQLHIKTE HPKLKCPVCG KGCVGRKGLR SHMISHDEEK MIKLWNCNYC NIGKFSKKID
LVEHYNSLHD GNIPEDLLKP NEKMRLEELL SETDDVTNLA DLKSLPGSRY EFLDEEEDEE
QELVLENRFE APNSIKSMDS FENSLRRISV IGLISNNFSS KTIKCPKKNC ARAFSREYDL
TRHLKWHEEH MKKIEDFLNS VEKEETISPS KIEDDEYDSA SEPPSKRQKL PARYETLTND
NDNDNDNDDD LDALIDVELR SIKAGDSSF