Gene PICST_65910 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_65910 
SymbolRPN4 
ID4839779 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp614320 
End bp616001 
Gene Length1682 bp 
Protein Length491 aa 
Translation table12 
GC content47% 
IMG OID640391094 
Productzf-C2H2 Zinc finger, C2H2 type 
Protein accessionXP_001385813 
Protein GI150866273 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTCCA TCGCCGTGTT TCCTCCCTTG AAGAGATCCA TCACAGACAT CATGGACGAG 
GAACTCTACC ACATCCCCAA CTCCCCCATC CAGTTCAGCT CGCGCAACGC CACCCAGCCC
CAGACGCAGA CGCAGCAGAA CCAGAACCAG TCACAGCTCT TTTCGGGTTT CAAATCGATC
AACTCGTCGC CGGCTTTGGC CCACAACACC TTGTTCATAC ATAACGCCGT CAACGGCTCC
AGCTCGTCGT TGAATGCGTC CACCACTGAT AGTTTCTTGG ACCAGTACGT GTACTCGAAC
AATCTCAACG CCACATCTTC CATCAACTCG TCGACAAACG TAAGCACCGC AAATGCCAAC
AGCAATCACG GCAGTAACGT GAATACAAAC CCTGCTAACT TCGACGCCAT GGACGTTGAC
GAAGCGTTCA ACACCCCCAG TCTATTTGTC AGCCCCTTTA CCGACTACGG CAATCCAGAC
TCATCGCTTC TCACCAACAA CCAAATCAAC CCCTTCTCGT TCGTGAGGCC CCAGCAGTTA
CAGAAACAAG CCCCTCCTCC TCTTCCACAG GCTTTCCCCG CTCGTCCCAC CAGAAGAAGA
CATATCACAA CCTTAGACGA TGACTCATCG CTTCGCTCTT CTACCAAGAA GGAAGACGAC
TACTTATTGT TTAACCCCGA CATCCAGCCC TCGCACCTCA TCAACAACAA GTCGTTCTTC
AACGACGATT ACTTGTTTGT GCCCAACAGT CAGCAGTACG ACCTTGACAA CTCAAACAAC
GGCACCACTG GCACAGTAGC CAACCTTGCA GCCAACGGGA TTATTCCAGG CTACGAAAAC
GACTATTTGC TTGTGGATGA CTTTGACGAA GAGATCGAAG CCGATTTGTC AGATGACGAG
GAAGATGACG ACAACTACTT TCATTTTGAC GACGACTTGG ACGACCTTGT GATGAACAGC
AACGAATACC CTGAAGACAA CATTAACATC AACGTCAACT TGATGGACAT GGACGACTAC
TTGAAGAACA CAAGCAGCAC CAATAACAGT AATAATATAA CCGATGTAGA TCCAGCCGAA
ACCATCAGAT TGAACAAGAA TGACGTGATG AACGGTGGCT ACATTGACGT TAAGCAGGAC
AAGATCGCAC AATCTCACGA AATCTCTTCG CATGAAATTG ACCACGAGGA TGACATTGAC
GAACTTGAAC TTGAAGACGA AGAATTTGAC GACAGAAGAC ATTCGACGAA GTCGTTTGCC
CATAAGTCGG CAGCCGAAAT ATCCGCCAAT AACCCCAACC ACCAGTGCGA CTTGATCAAT
CCATTGACGG GAGTCCCTTG CAACAAGCAA TTCTCCAGAC CTTACGATTT GATTAGACAT
CAAGAGACGA TCCATGCTTC GAAGAAGAAG ATCTTTAGGT GTGTCATTTG TGAAGGTAGA
GCCAACGGGG GTCCAGGTAA CGGAAAACTG AAGACATTCT CCAGAGGCGA TGCTTTATCC
AGACACATCA AAGTGAAACA TGGATTGGTA GGTAAAGAGG CGATTGACAT CATCAATGCC
GCTAAAGAGA ATGTAGAGTA TGTGAGCGTA TAATGATTTT GGTTTGGTTT ATTTGGTTTT
TTGGAGTTCT TGTACATAGT CTTCTACATC TTATATATAG CATAATATAA TCTTAATGAT
AG
 
Protein sequence
MTSIAVFPPL KRSITDIMDE ELYHIPNSPI QFSSRNATQP QTQTQQNQNQ SQLFSGFKSI 
NSSPALAHNT LFIHNAVNGS SSSLNASTTD SFLDQYVYSN NLNATSSINS STNVSTANAN
SNHGSNVNTN PANFDAMDVD EAFNTPSLFV SPFTDYGNPD SSLLTNNQIN PFSFAFPARP
TRRRHITTLD DDSSLRSSTK KEDDYLLFNP DIQPSHLINN KSFFNDDYLF VPNIANLAAN
GIIPGYENDY LLVDDFDEEI EADLSDDEED DDNYFHFDDD LDDLVMNSNE YPEDNININV
NLMDMDDYLK NTSSTNNNPA ETIRLNKNDV MNGGYIDVKQ DKIAQSHEIS SHEIDHEDDI
DELELEDEEF DDRRHSTKSF AHKSAAEISA NNPNHQCDLI NPLTGVPCNK QFSRPYDLIR
HQETIHASKK KIFRCVICEG RANGGPGNGK SKTFSRGDAL SRHIKVKHGL VGKEAIDIIN
AAKENVEYVS V