Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_37460 |
Symbol | ZPR1 |
ID | 4851051 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 818805 |
End bp | 820298 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | |
GC content | 43% |
IMG OID | 640392759 |
Product | nucleolar zinc-finger protein |
Protein accession | XP_001387386 |
Protein GI | 126274037 |
COG category | [R] General function prediction only |
COG ID | [COG1779] C4-type Zn-finger protein |
TIGRFAM ID | [TIGR00310] ZPR1 zinc finger domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0530225 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.336313 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGCCG ACGAGGAACA AAAACCGCAA GATCTCTTCA CTAGCGTGGG AGAACAGGCT CAAGAAGTAG ATGATCAGGA AACTAATGTT GCCGGAAGCA ACGAAGTCAG ACAGACCGGA GCGGAGGATG CTGAGGGACA TCCAGTTCAG GAAATCGAGT CGTTATGTAT GAACTGTCAC AAGAACGGAA TCACAAGAAT GCTTTTGACC AAGATTCCAT ACTTCAGAGA AATCATCTTG ATGTCATTTG AATGTACTCA CTGCGGATTC AAGAACAGCG AAATCCAGCC AGCTGCACAA ATAGCCGAAA AGGGATCAAG GTACGTGTTG AAACTTGAAA CTAAAGAAGA TTTCAACCGT CAAGTCGTGA AGTCTGAGAC TGCCTCTGTA AGATTCTCAG AGTTGGACAT TGAGATTCCA CCAAAGAGGG GCCAGTTATC AAACATCGAG GGTTTATTGG AAGAAATGAT CGAAGATTTG GAATCCGACC AACCCGCCAG AGAAACCATG CAACCGGAAA TCTACCAAAA GATCAAAGAA GTTATCACCA AAATTAGATC GTTTATCAAT GCTGAACCCA ACACCTTACC ATTGACCTTC ACCATTGATG ATCCAGCAGG TAACTCGTGG ATTGAATACC TTCCAGGAGA GCCCAGTCAT AAGTGGGCCA TGTACGAATA CTCAAGAACT GCTGAGCAAA ATGTGTTTTT GGGCTTGATA TCGGCTGACG ATGTGGCCAG ACACCAACAA GAAGAATTGG CCAATAAGAA GAATGCCACT TCTAAGAACA TCTCTTCTTC TTTGAATAAG AGCTCTGAAA AGGATGACGA ACATAACCCA AGAGCTACGG GCTTTATCTC TGACGAAACT GAAATTGAAA ATTTTGAAAA CGAAGTGCAA ACCTTCCAGG CTACTTGTTC TTCTTGTTTC CAGCCATGCT CAACACACAT GAAGACTGTT AACATTCCTC ACTTCAAGGA TGTCATCTTG ATGTCGACTG TATGTGATCA TTGTGGCTAT AAGTCTAACG AAGTCAAAAC AGGTGGAGAG ATTCCTCCTC GCGGAAAGAA GATCACCTTG AAAATCACCG ATCCTGAAGA CTTGGCTAGA GATATATTGA AGAGTGAAAC TTGTGGATTG TCAATTCCAG AATTGAACTT GGATTTGACT CCAGGAACAT TAGGAGGTAG ATTCACTACT ATTGAAGGAT TGTTGACTCA AGTTGCAGAA GAGTTGAACT CCAGAGTGTT CAGTCAATCA TCAGACTCTA TGGATGAAGC TACCAAGTCC AGATGGACAA GTTTCTTCGC TAGATTGCAA GATGCTATCG ACGGCAAAAT ACCATTCACT ATTATTGTCG AAGACCCATT GGCTTCTTCG TACATCCAAA ACGTCTACGC TCCAGACAAT GACCCCAACA TGACTATCGA AGAGTTTGAA AGATCCTTTC AACAAAACGA AGACTTAGGT TTGAATGATA TGAAAACTGA CTAA
|
Protein sequence | MAADEEQKPQ DLFTSVGEQA QEVDDQETNV AGSNEVRQTG AEDAEGHPVQ EIESLCMNCH KNGITRMLLT KIPYFREIIL MSFECTHCGF KNSEIQPAAQ IAEKGSRYVL KLETKEDFNR QVVKSETASV RFSELDIEIP PKRGQLSNIE GLLEEMIEDL ESDQPARETM QPEIYQKIKE VITKIRSFIN AEPNTLPLTF TIDDPAGNSW IEYLPGEPSH KWAMYEYSRT AEQNVFLGLI SADDVARHQQ EELANKKNAT SKNISSSLNK SSEKDDEHNP RATGFISDET EIENFENEVQ TFQATCSSCF QPCSTHMKTV NIPHFKDVIL MSTVCDHCGY KSNEVKTGGE IPPRGKKITL KITDPEDLAR DILKSETCGL SIPELNLDLT PGTLGGRFTT IEGLLTQVAE ELNSRVFSQS SDSMDEATKS RWTSFFARLQ DAIDGKIPFT IIVEDPLASS YIQNVYAPDN DPNMTIEEFE RSFQQNEDLG LNDMKTD
|
| |