Gene PICST_85400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_85400 
SymbolPAP1 
ID4840620 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp703131 
End bp704882 
Gene Length1752 bp 
Protein Length556 aa 
Translation table12 
GC content42% 
IMG OID640391935 
ProductPoly(A) polymerase PAPalpha (Polynucleotide adenylyltransferase alpha) 
Protein accessionXP_001386148 
Protein GI126139251 
COG category[A] RNA processing and modification 
COG ID[COG5186] Poly(A) polymerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGTCA GAACGTATGG TGTCACTGAT CCGATATCTA CAGCCAACCC CACGCCCAAG 
GAAAACAGTC TCAATGATGC ATTGATTGGT GAACTCCGCA GTCGTGGATC ATTTGAGAGT
GAGCAGGCAA CGAAGAAGAG AGTGGAAGTC CTCACTCAAT TCCAGAAAAT GGTACAAGAG
TTTGTGTTCA CCGTTTCCAA GAAGAAAAAT ATGTCCGATG GGATGGCCAA GGATGCTGGA
GGTAAAGTGT TTACTTTTGG GTCTTATAGA TTGGGTGTTT ACGGCCCAGG CTCGGATCTT
GATACCTTGG TTGTCGTTCC CCAGCATGTA TCGAGGGAAG ACTTTTTCAC TGTATTCGCC
GAAATTATTC GTAAACGTCC AGAATTAGAA GAAATTGTAC CTGTTCCAGA CGCTTTTGTG
CCAATTATTA AGATTGAGTA CGATGGCGTC CCGCTTGATT TGATTTTTGC ATGCTTAAAT
GTTCCTAGAA TTCCTCTTGA TATGACTTTG GACGACAAAA ATCTTTTGAG AAATCTTGAC
GAGAGAGATT TGCGGTCCCT CAACGGAACA AGAGTCACGG ATGAAATATT GCAGTTGGTA
CCAAAACCGA CGGTATTCAA GCATGCTTTG AGATGCATCA AGATGTGGGC TCAACAGAAA
GCTGTGTACG GGAATGTATT CGGGTTTCCG GGAGGGGTGG CATGGGCAAT GTTGGTTGCT
CGTATATGTC AATTATATCC TAATGCTGTA AGTGCTGTCA TTGTTGAGAA ATTTTTCAAT
ATCTATACAA AATGGAATTG GCCCCAGCCT GTCCTCCTTA AGCCAATCGA AGACGGACCA
TTGCAAGTTC GTGTATGGAA CCCAAGGCTT TATGCCAATG ATCGTCTTCA CCGAATGCCA
GTTATCACTC CAGCTTATCC TTCGATGTGT GCCACTCATA ATATTACCAG CTCGACACAG
AAGGTGATTT TGTCAGAGTT AGAGAAAGGT GCAGAAGTTG TACGTCAAAT TAATTTGGGC
AAAAAGTCCT GGTCTGATCT CTTTGATAGA CACGATTTCT TTCATCATTA CAAATTCTAT
CTATGTGTAG TTGCAGCAAC AGTGGCCTCG GACGAAGAAC ATCTAAAGTG GACGGGTTTG
ATTGAGAGTA AGTTAAGGCA TTTGGTCCTC AAACTTGAGG TCACAGAAGG TGTAGAGTTG
GCACATCCAT ACGTAAAAGA CTTTTCCGAC TCGTACGTAC TTGGAGATGC AAACCCCTCT
GATGTCATTG GGGCGTATGG AACCTTGTCT GGTCAACCAT TTTTAGATAC ACTTGCCAAA
AGCGATGATA GCAATAAAGG TGAGAAGCAG ATCCACCTCA CCAAACTCTT TGTTGGATTG
AATCTTCAAC AGCTTAAGTC GGTGGACGGA GTAAAGAAAT TAAATATCCA ATATCCATGT
TCAGAGTTCT ACAATATTTG CAAGGGATCT GGATTTTATG ACGAAGCTAT AAACCATATC
AATATTAAGA ATGTGAAACT TGTTGACCTT CCTGATGACG TTTACGTTGA AGGGGAAGTG
AGACCTACTA AAAACAGCAA AAAGAGGAAA AAGGAAGTTA GCCTTGACAA GCAGAAGCGC
CCAAAAAGTA CAATTTCAAC TCCCCCAGTA TCTGTTAACG GGTCTGGTTG ATTCTTTTCC
TGTAGGAGGC TACGTATTAT GAGGTTACGG AAATCATTTA ATCTATTTCA GTAAATATAC
ATCCTTGACA GT
 
Protein sequence
MSVRTYGVTD PISTANPTPK ENSLNDALIG ELRSRGSFES EQATKKRVEV LTQFQKMVQE 
FVFTVSKKKN MSDGMAKDAG GKVFTFGSYR LGVYGPGSDL DTLVVVPQHV SREDFFTVFA
EIIRKRPELE EIVPVPDAFV PIIKIEYDGV PLDLIFACLN VPRIPLDMTL DDKNLLRNLD
ERDLRSLNGT RVTDEILQLV PKPTVFKHAL RCIKMWAQQK AVYGNVFGFP GGVAWAMLVA
RICQLYPNAV SAVIVEKFFN IYTKWNWPQP VLLKPIEDGP LQVRVWNPRL YANDRLHRMP
VITPAYPSMC ATHNITSSTQ KVILSELEKG AEVVRQINLG KKSWSDLFDR HDFFHHYKFY
LCVVAATVAS DEEHLKWTGL IESKLRHLVL KLEVTEGVEL AHPYVKDFSD SYVLGDANPS
DVIGAYGTLS GQPFLDTLAK SDDSNKGEKQ IHLTKLFVGL NLQQLKSVDG VKKLNIQYPC
SEFYNICKGS GFYDEAINHI NIKNVKLVDL PDDVYVEGEV RPTKNSKKRK KEVSLDKQKR
PKSTISTPPV SVNGSG