Gene PICST_79520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_79520 
SymbolPNU1 
ID4840548 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp920602 
End bp921668 
Gene Length1067 bp 
Protein Length325 aa 
Translation table12 
GC content47% 
IMG OID640391863 
ProductMitochondrial nuclease 
Protein accessionXP_001386365 
Protein GI126139685 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1864] DNA/RNA endonuclease G, NUC1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.553306 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CAAGATGAGT AAAGTTCTTG TTAATACCTT GGGATTAGGA ACTGTGGGCG TGGCCTCGTT 
TTTCTGGGGC CGTTCTTCAA CTCCAGCTGA TGTTGCTACT GAAAGCAAAA CAGATTCCAA
GAATCTTCCA GCAATAGTAA ATGGAGGCAA CGGGGCTCCA GACAAGGCAT TATTCAATCC
TGAACTTGTG AAGCCAAGTC AGTTCTTCAA ATACGGCTTT CCTGGCCCTA TTCACGATTT
ACAAAACAGA AGTGAATTTG TCAGTTGCTA CAATAGACAG ACTAGAAACC CATACTGGGT
CGTAGAGCAT ATTACCAAGG AGTCAGTGCA AAGGGGCAGC GGAGTAGACC GGAAGAATTC
CGTCTTCAAG GAAGATGAAG CTATTCCGGC CAAGTTCAGA AGCAGATTGA GAGACTTCTT
CAGAAGTGGC TACGATAGGG GACACCAGGC TCCAGCAGCT GACGCTAAGT TCAGCCAAGT
TGCTATGGAT GAAACGTTCT ACTTGACCAA CATGTCTCCA CAGGTGGGCG ATGGATTCAA
CAGAGACTAC TGGGCACACT TCGAAGACTT TGCACGTAGA TTGACTAACA GATACGACAA
TGTGCGTATA ATGACGGGGC CATTATTCTT GCCAAAGAGA TGTGACGACG GAAAGTACAG
AGTCACCTAC GAGGTTATTG GGTCTCCGCC AAATGTTGCC GTACCGACCC ACTTCTTCAA
GTTGATTGTG GGGGAGAACA ACGGTGACGA CCGGATCAGT GTCGGAGCCT TTGTGTTACC
CAACGAGCGC ATCGATAACA CCGACGACTT GACCAAATAC CAGGTGCCCG TGGAAGCCTT
GGAGAGATCT ACAGGACTAG AGTTGTTGCA GAAGGTTCCT TTCAGCAAGA AGAAGGACTT
GTGTCGCGAG GTCAAGTGCG AGATACTAGT GCGAGAGTTC CCAAAGCAAG CCAAGAATGT
GTTGGCCTTG CCCGGTAAAT AGATTGCTAC GACACATGCT AGAAACACAT GCGACATGGC
TAATACATAC TAAATAGCTA CACATACACA TACTTCGTAA GAGCAAA
 
Protein sequence
MSKVLVNTLG LGTVGVASFF WGRSSTPADV ATESKTDSKN LPAIVNGGNG APDKALFNPE 
LVKPSQFFKY GFPGPIHDLQ NRSEFVSCYN RQTRNPYWVV EHITKESVQR GSGVDRKNSV
FKEDEAIPAK FRSRLRDFFR SGYDRGHQAP AADAKFSQVA MDETFYLTNM SPQVGDGFNR
DYWAHFEDFA RRLTNRYDNV RIMTGPLFLP KRCDDGKYRV TYEVIGSPPN VAVPTHFFKL
IVGENNGDDR ISVGAFVLPN ERIDNTDDLT KYQVPVEALE RSTGLELLQK VPFSKKKDLC
REVKCEILVR EFPKQAKNVL ALPGK