Gene PICST_56697 details


Gene Information

Locus tag: PICST_56697
Symbol:
ID: 4838162
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 1610751
End bp: 1612061
Gene Length: 1311 bp
Protein Length: 427 aa
Translation table: 12
GC content: 47%
IMG OID: 640389477
Product: predicted protein
Protein accession: XP_001383930
Protein GI: 150864919
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000378422
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
AATCACCAAA TTCACCAAAA CCATCAGAAC CAGCAACAAC AGCAGTATCT GCAGATACCA 
CTCGGAAAGT CCGAGTTTGA GCTCACGGAG TATGATCTCA AGTCGCGCGA CTCGAAGTAC
CGTAGATGGA CGCCTAAGAT GGACCAGTTT CTCATCAAGT TGCTTTCAGA TGTGGTGCAC
AGCTATCCCA AGGGAGCTGA GGCAGAGATG ACGAAAAAGG CATGGGCCTA TGTCACGGGC
CAGTTGCGTG CAGCCAACCC AGAAACAGTC TATTCCACTT ATACCAAATA CTCGTGCCAG
CAGCATTTGC TCAATGTGAA TCATCACCGA TATAAGATTT GGTACTACTT GATGCTTCAC
CAGAAAAACG CACCTGCTAC CAGTTACGCA TACCGGTGGA ATCCAGAATT GGGCCGGTTC
CAAGTTATCG ACAATGCTAA CAGTACGTTG ATTCTTGATG AAAGACAAGT CAAGTCGTTG
TTGTATAGCG ATTCGCTTCT GCTTCCACAT CTCCAGTCGT TTAACAAAGG CAACTTGATT
GTTAACGACT TCTTCTTGAG CGACAACTTG CGCTACATGT CAGTTTACCA TAATGAGGTT
TTGCCGTTGC TCATCAGGCT AGATCCCAAG TACGCTGAAG GGTTGGGAGA TCTTTACGCG
GACATCCCCA AGTTCGACTA CCAGGAAGCC AGTCTTGAGT ACTTCAAGCC TTTGGTTCCA
GCCAGAGCTC ACAAGATGGC ACCCGTAAAT CTGGCTGTTC AGGTTCAACA GGTGCAACAG
TCTGTTGTCA AGAAGAGAAC CCATTCGGAT ATACCTTCTG ATATTTCGCT TCCATTTTCC
AAGTCTCTCG GCTCATTGAC GGATGATACC GATCCAGACG TCAGTGGACA GCAACAACTG
GTCCCAGACG AAGACTCAGT AGATCCAGCT CTCAAAAGGT CTAGAAACTC ACTCCAAGAC
ACAACGTCCA ACACAATGGA TTTCGAGAAT GCTTTGGCAA CTGCAGCCAT CGCAGCCATC
AACTCTCCAC CTGTAACTAA TGGAAGAGAC TCACTTCCTT TCTACATCAA GGACCGGAAG
TGGTTCAACA GATTGCTCAA TCTCCACGAG TCGGGTCTCA TAGGTGTACA GGAAGTGCTT
ACCGTCTGTG AAGGTGTCAG AGACGGCAAG ATCCCCTTGT TCATGCTCAA TGTTCTAGAC
CAATCGTACT ACCCTACTCG AAACAATACC GGTTTGTCTG AAGAGTTGCC TGATGATGAG
ACTGCTAAAA GAATCAGAGA GTTCATGCTA CCAATGGTAT ATAATTCGTG A
 
Protein sequence
NHQIHQNHQN QQQQQYSQIP LGKSEFELTE YDLKSRDSKY RRWTPKMDQF LIKLLSDVVH 
SYPKGAEAEM TKKAWAYVTG QLRAANPETV YSTYTKYSCQ QHLLNVNHHR YKIWYYLMLH
QKNAPATSYA YRWNPELGRF QVIDNANSTL ILDERQVKSL LYSDSLSLPH LQSFNKGNLI
VNDFFLSDNL RYMSVYHNEV LPLLIRLDPK YAEGLGDLYA DIPKFDYQEA SLEYFKPLVP
ARAHKMAPVQ QSVVKKRTHS DIPSDISLPF SKSLGSLTDD TDPDVSGQQQ SVPDEDSVDP
ALKRSRNSLQ DTTSNTMDFE NALATAAIAA INSPPVTNGR DSLPFYIKDR KWFNRLLNLH
ESGLIGVQEV LTVCEGVRDG KIPLFMLNVL DQSYYPTRNN TGLSEELPDD ETAKRIREFM
LPMVYNS
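
Translation note

The record lists translation table 12 (the NCBI alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates this on the first 60 bases of the gene sequence above, which should reproduce the first 20 residues of the protein sequence. The names translate, STANDARD_CODE and TABLE_12 are illustrative and not part of the record. Because the gene is marked as spliced, translating the full 1311 bp gene sequence directly is not expected to reproduce the 427 aa protein; only the opening stretch is used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard genetic code in NCBI codon order
# (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = ["".join(c) for c in product(BASES, repeat=3)]
STANDARD_CODE = dict(zip(CODONS, STANDARD_AAS))

# Table 12 differs from the standard code in one sense codon: CTG -> Ser.
TABLE_12 = dict(STANDARD_CODE, CTG="S")


def translate(dna, code=TABLE_12):
    """Translate a DNA string codon by codon, ignoring spaces.

    Trailing bases that do not fill a complete codon are dropped.
    """
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(code[dna[i:i + 3]] for i in range(0, usable, 3))


# First line of the gene sequence from this record.
GENE_START = "AATCACCAAA TTCACCAAAA CCATCAGAAC CAGCAACAAC AGCAGTATCT GCAGATACCA"

print(translate(GENE_START))                 # table 12
print(translate(GENE_START, STANDARD_CODE))  # standard code, for comparison
```

Under table 12 the 17th residue is S, matching the protein sequence in the record; under the standard code the same CTG codon would be read as L.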