Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_65910 |
Symbol | RPN4 |
ID | 4839779 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | - |
Start bp | 614320 |
End bp | 616001 |
Gene Length | 1682 bp |
Protein Length | 491 aa |
Translation table | 12 |
GC content | 47% |
IMG OID | 640391094 |
Product | zf-C2H2 Zinc finger, C2H2 type |
Protein accession | XP_001385813 |
Protein GI | 150866273 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTCCA TCGCCGTGTT TCCTCCCTTG AAGAGATCCA TCACAGACAT CATGGACGAG GAACTCTACC ACATCCCCAA CTCCCCCATC CAGTTCAGCT CGCGCAACGC CACCCAGCCC CAGACGCAGA CGCAGCAGAA CCAGAACCAG TCACAGCTCT TTTCGGGTTT CAAATCGATC AACTCGTCGC CGGCTTTGGC CCACAACACC TTGTTCATAC ATAACGCCGT CAACGGCTCC AGCTCGTCGT TGAATGCGTC CACCACTGAT AGTTTCTTGG ACCAGTACGT GTACTCGAAC AATCTCAACG CCACATCTTC CATCAACTCG TCGACAAACG TAAGCACCGC AAATGCCAAC AGCAATCACG GCAGTAACGT GAATACAAAC CCTGCTAACT TCGACGCCAT GGACGTTGAC GAAGCGTTCA ACACCCCCAG TCTATTTGTC AGCCCCTTTA CCGACTACGG CAATCCAGAC TCATCGCTTC TCACCAACAA CCAAATCAAC CCCTTCTCGT TCGTGAGGCC CCAGCAGTTA CAGAAACAAG CCCCTCCTCC TCTTCCACAG GCTTTCCCCG CTCGTCCCAC CAGAAGAAGA CATATCACAA CCTTAGACGA TGACTCATCG CTTCGCTCTT CTACCAAGAA GGAAGACGAC TACTTATTGT TTAACCCCGA CATCCAGCCC TCGCACCTCA TCAACAACAA GTCGTTCTTC AACGACGATT ACTTGTTTGT GCCCAACAGT CAGCAGTACG ACCTTGACAA CTCAAACAAC GGCACCACTG GCACAGTAGC CAACCTTGCA GCCAACGGGA TTATTCCAGG CTACGAAAAC GACTATTTGC TTGTGGATGA CTTTGACGAA GAGATCGAAG CCGATTTGTC AGATGACGAG GAAGATGACG ACAACTACTT TCATTTTGAC GACGACTTGG ACGACCTTGT GATGAACAGC AACGAATACC CTGAAGACAA CATTAACATC AACGTCAACT TGATGGACAT GGACGACTAC TTGAAGAACA CAAGCAGCAC CAATAACAGT AATAATATAA CCGATGTAGA TCCAGCCGAA ACCATCAGAT TGAACAAGAA TGACGTGATG AACGGTGGCT ACATTGACGT TAAGCAGGAC AAGATCGCAC AATCTCACGA AATCTCTTCG CATGAAATTG ACCACGAGGA TGACATTGAC GAACTTGAAC TTGAAGACGA AGAATTTGAC GACAGAAGAC ATTCGACGAA GTCGTTTGCC CATAAGTCGG CAGCCGAAAT ATCCGCCAAT AACCCCAACC ACCAGTGCGA CTTGATCAAT CCATTGACGG GAGTCCCTTG CAACAAGCAA TTCTCCAGAC CTTACGATTT GATTAGACAT CAAGAGACGA TCCATGCTTC GAAGAAGAAG ATCTTTAGGT GTGTCATTTG TGAAGGTAGA GCCAACGGGG GTCCAGGTAA CGGAAAACTG AAGACATTCT CCAGAGGCGA TGCTTTATCC AGACACATCA AAGTGAAACA TGGATTGGTA GGTAAAGAGG CGATTGACAT CATCAATGCC GCTAAAGAGA ATGTAGAGTA TGTGAGCGTA TAATGATTTT GGTTTGGTTT ATTTGGTTTT TTGGAGTTCT TGTACATAGT CTTCTACATC TTATATATAG CATAATATAA TCTTAATGAT AG
|
Protein sequence | MTSIAVFPPL KRSITDIMDE ELYHIPNSPI QFSSRNATQP QTQTQQNQNQ SQLFSGFKSI NSSPALAHNT LFIHNAVNGS SSSLNASTTD SFLDQYVYSN NLNATSSINS STNVSTANAN SNHGSNVNTN PANFDAMDVD EAFNTPSLFV SPFTDYGNPD SSLLTNNQIN PFSFAFPARP TRRRHITTLD DDSSLRSSTK KEDDYLLFNP DIQPSHLINN KSFFNDDYLF VPNIANLAAN GIIPGYENDY LLVDDFDEEI EADLSDDEED DDNYFHFDDD LDDLVMNSNE YPEDNININV NLMDMDDYLK NTSSTNNNPA ETIRLNKNDV MNGGYIDVKQ DKIAQSHEIS SHEIDHEDDI DELELEDEEF DDRRHSTKSF AHKSAAEISA NNPNHQCDLI NPLTGVPCNK QFSRPYDLIR HQETIHASKK KIFRCVICEG RANGGPGNGK SKTFSRGDAL SRHIKVKHGL VGKEAIDIIN AAKENVEYVS V
|
| |