Gene PICST_37460 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_37460 
SymbolZPR1 
ID4851051 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp818805 
End bp820298 
Gene Length1494 bp 
Protein Length497 aa 
Translation table 
GC content43% 
IMG OID640392759 
Productnucleolar zinc-finger protein 
Protein accessionXP_001387386 
Protein GI126274037 
COG category[R] General function prediction only 
COG ID[COG1779] C4-type Zn-finger protein 
TIGRFAM ID[TIGR00310] ZPR1 zinc finger domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0530225 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.336313 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCCG ACGAGGAACA AAAACCGCAA GATCTCTTCA CTAGCGTGGG AGAACAGGCT 
CAAGAAGTAG ATGATCAGGA AACTAATGTT GCCGGAAGCA ACGAAGTCAG ACAGACCGGA
GCGGAGGATG CTGAGGGACA TCCAGTTCAG GAAATCGAGT CGTTATGTAT GAACTGTCAC
AAGAACGGAA TCACAAGAAT GCTTTTGACC AAGATTCCAT ACTTCAGAGA AATCATCTTG
ATGTCATTTG AATGTACTCA CTGCGGATTC AAGAACAGCG AAATCCAGCC AGCTGCACAA
ATAGCCGAAA AGGGATCAAG GTACGTGTTG AAACTTGAAA CTAAAGAAGA TTTCAACCGT
CAAGTCGTGA AGTCTGAGAC TGCCTCTGTA AGATTCTCAG AGTTGGACAT TGAGATTCCA
CCAAAGAGGG GCCAGTTATC AAACATCGAG GGTTTATTGG AAGAAATGAT CGAAGATTTG
GAATCCGACC AACCCGCCAG AGAAACCATG CAACCGGAAA TCTACCAAAA GATCAAAGAA
GTTATCACCA AAATTAGATC GTTTATCAAT GCTGAACCCA ACACCTTACC ATTGACCTTC
ACCATTGATG ATCCAGCAGG TAACTCGTGG ATTGAATACC TTCCAGGAGA GCCCAGTCAT
AAGTGGGCCA TGTACGAATA CTCAAGAACT GCTGAGCAAA ATGTGTTTTT GGGCTTGATA
TCGGCTGACG ATGTGGCCAG ACACCAACAA GAAGAATTGG CCAATAAGAA GAATGCCACT
TCTAAGAACA TCTCTTCTTC TTTGAATAAG AGCTCTGAAA AGGATGACGA ACATAACCCA
AGAGCTACGG GCTTTATCTC TGACGAAACT GAAATTGAAA ATTTTGAAAA CGAAGTGCAA
ACCTTCCAGG CTACTTGTTC TTCTTGTTTC CAGCCATGCT CAACACACAT GAAGACTGTT
AACATTCCTC ACTTCAAGGA TGTCATCTTG ATGTCGACTG TATGTGATCA TTGTGGCTAT
AAGTCTAACG AAGTCAAAAC AGGTGGAGAG ATTCCTCCTC GCGGAAAGAA GATCACCTTG
AAAATCACCG ATCCTGAAGA CTTGGCTAGA GATATATTGA AGAGTGAAAC TTGTGGATTG
TCAATTCCAG AATTGAACTT GGATTTGACT CCAGGAACAT TAGGAGGTAG ATTCACTACT
ATTGAAGGAT TGTTGACTCA AGTTGCAGAA GAGTTGAACT CCAGAGTGTT CAGTCAATCA
TCAGACTCTA TGGATGAAGC TACCAAGTCC AGATGGACAA GTTTCTTCGC TAGATTGCAA
GATGCTATCG ACGGCAAAAT ACCATTCACT ATTATTGTCG AAGACCCATT GGCTTCTTCG
TACATCCAAA ACGTCTACGC TCCAGACAAT GACCCCAACA TGACTATCGA AGAGTTTGAA
AGATCCTTTC AACAAAACGA AGACTTAGGT TTGAATGATA TGAAAACTGA CTAA
 
Protein sequence
MAADEEQKPQ DLFTSVGEQA QEVDDQETNV AGSNEVRQTG AEDAEGHPVQ EIESLCMNCH 
KNGITRMLLT KIPYFREIIL MSFECTHCGF KNSEIQPAAQ IAEKGSRYVL KLETKEDFNR
QVVKSETASV RFSELDIEIP PKRGQLSNIE GLLEEMIEDL ESDQPARETM QPEIYQKIKE
VITKIRSFIN AEPNTLPLTF TIDDPAGNSW IEYLPGEPSH KWAMYEYSRT AEQNVFLGLI
SADDVARHQQ EELANKKNAT SKNISSSLNK SSEKDDEHNP RATGFISDET EIENFENEVQ
TFQATCSSCF QPCSTHMKTV NIPHFKDVIL MSTVCDHCGY KSNEVKTGGE IPPRGKKITL
KITDPEDLAR DILKSETCGL SIPELNLDLT PGTLGGRFTT IEGLLTQVAE ELNSRVFSQS
SDSMDEATKS RWTSFFARLQ DAIDGKIPFT IIVEDPLASS YIQNVYAPDN DPNMTIEEFE
RSFQQNEDLG LNDMKTD