Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_87504 |
Symbol | RPN4 |
ID | 4837664 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 72301 |
End bp | 73687 |
Gene Length | 1387 bp |
Protein Length | 420 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640388979 |
Product | hypothetical zinc finger protein |
Protein accession | XP_001382244 |
Protein GI | 126131438 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.482923 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.021979 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTTCGC AATCCAGCGT CACTGCACCG GATACCGTGT ATTCCTTGGA AGAACAAGTC GATATCCTTA ATCTCGTAAA TCCTTCATCT ATTGAGGAAT ACTCGGAGCA TCTCTACAAG ATAGCTAAGA TCTTGTACGA GGACAACCTT GACCAGAAGC TTCCCATATT GTTCAAGAAG TTACAGGACA AGTACTATTC CTTGATGGAC CAACTTGACA ACTTTGTAGA CTCGGAAGAC CCCATCCACT CCAAAGTGTA TGTAATCGTA GAGAGAAATT TCGACCTTTT CATAAAGATC TCAACAAACC ACAAGAACGT CGAGCTTTCG ACAAGGACCA TTCGCTTTTT AACCAACATA ATTATGAGCT TGAACTACTG GGAAGTTTAC AATCTCTTGT CCTGGAAGCC AGTTATATAT CATTTCTTGT CTGTTATTCA ATTTGACATG AACGACTGCT ACAACAAATT CATCAGTGAC TATGCCAAAT ACAACTACAA AAGATTGACC CAGCCACAGC CAATGTCGCA CAAGTCTAGA AACCGAAGAA GACTTAAGAG AAGAAGAAAG TACAGTGGCA AGCTTGGATC TCCAGGGTCA GAAAATGGAG TTAATGACGA TGGATCTCTT TCTCCAAACC CATACTACTA CGTAAGCTCC GACACAACTG GCCGTGGCAG TTTGAGAGGT ATGAGTAAAG AAAGAAGAAA CAGAATGATG AGAGCTGAAA ACAAGCGTAA TTCCCATCGG GCTATTAAGA AACCATCCAA TACCAAGCCT TCCAACAAGT CGTCAAATTA TGATCCTGAT GTTGTTCATG AATGTCAATT GCCATCGCCA GATGAACCTC ATAAGCTTTG CTTACGTCGT TTCTCAAGAA AATATGAATT AATAAGACAT CAGGAAACTG TGCATTCTAA GAAGAAGAAA CTTTTCAAAT GCTTTGTTTG TGTAAAACAA CACCCTGGAG TTGGCCCAAG AATATTTACC CGCCACGACA CCTTGGCCAA ACATATTCGT GTTAACCACA AGATTTCTGG TAAGGAAGCC AAGGCAGAAG TTGCATATTC GAAGAAACAT GCCGAGGTTG TTGAAGAAGG TGATATTACA GTCCATGTCG GACGCAGAAA GACCAAGGTT GACTTTGAAT TGCGCGCTCA TATGGAGAAA CGAAAGGCAG AAAAGGATGA AATGGATGAT TCCGGTTATT TGGTTCATAG CGATTTCGAC TCTGGAGACG AAGAAGTTAC ATTCAATACG TAGTTTGCAT TCGGCCATCG AACATTCAAT TCTTTATTGA TTTTATAGTA TCATCAAACG CATATATACA TGGAAGTATG ATTCAATGTT TTAATAGTTT ACATTTATAA TACAAATAGG ACTACCA
|
Protein sequence | MSSQSSVTAP DTVYSLEEQV DILNLVNPSS IEEYSEHLYK IAKILYEDNL DQKLPILFKK LQDKYYSLMD QLDNFVDSED PIHSKVYVIV ERNFDLFIKI STNHKNVELS TRTIRFLTNI IMSLNYWEVY NLLSWKPVIY HFLSVIQFDM NDCYNKFISD YAKYNYKRLT QPQPMSHKSR NRRRLKRRRK YSGKLGSPGS ENGVNDDGSL SPNPYYYVSS DTTGRGSLRG MSKERRNRMM RAENKRNSHR AIKKPSNTKP SNKSSNYDPD VVHECQLPSP DEPHKLCLRR FSRKYELIRH QETVHSKKKK LFKCFVCVKQ HPGVGPRIFT RHDTLAKHIR VNHKISGKEA KAEVAYSKKH AEVVEEGDIT VHVGRRKTKV DFELRAHMEK RKAEKDEMDD SGYLVHSDFD SGDEEVTFNT
|
| |