Gene PICST_60945 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_60945 
Symbol 
ID4839247 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp1039445 
End bp1040695 
Gene Length1251 bp 
Protein Length399 aa 
Translation table12 
GC content46% 
IMG OID640390562 
Productpredicted protein 
Protein accessionXP_001385209 
Protein GI150865830 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0140768 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTTCTC CCCACGCAAA CCCTAACGGC ATCGAAACCA AAATCATTGC AGCACGTAGC 
GAAGCCCAAG CGTTGTACAA CGAGTTGGAG AAAGTGAAAA ATAGGCTCCA CGACTCTACG
TTGGACGAGA TATCGTATTC CATCCCAGGC ATTCCCAAGA ACTTCAACAA TTTGAAGCTC
TACAATACGC TTCGGGGCCA TAATAACAAG ATCGCCAAAA CGCAATGGAG CTCCGACTCA
AGCAAGCTCC TTTCAGCTAG TCAGGACGGC TATATGATTC TCTGGGATGC CGTTACTGGT
TTCAAGAAAC AGGCTATCAA TCTTGAGAAC CAATGGGTTC TCACATGTAG CTACTCGTCA
GACGGAAAGC TAGCAGCGTC CGCAGGACTC GACAATGCCT GTACTATCTA CAAGGTAAAA
CAGGATGGCG ATTTCCGCTT TGGGGGCACC AGAGGTGAAG CCAGAAAAGG GTCTGCTACT
GGAAATGACC TTGACATTTT GCCAGTTCAG TCTGTGTTCA AGGGCCACAC AGCGTATGTA
TCAGACTGCG GGTTCATCAC CAACACCACT ATAATTACAG CCAGTGGTGA CATGACTTGT
TCGCTATGGG ATATAACCAA AGGAGTCAAG TCGCGAGATT TTGTAGAACA CTTGGGCGAC
GTTCTCTGTA TGAGTATCTT TCCCTCCAAT AAGCTCAATG ACAACCTCTT TGTTTCTGGT
TCTTCTGACG GTAGTGCAAA GATTTGGGAT TTACGAAGTC CTACGCCTGC TCTGAGTTTT
TTTGTCTCCA ATAGCGACAT CAACACTGTT CTGATCTTTC CTAATGGAAA CTCGTTTGCA
ACAGGTTCAG ATGATGGACT AATTCGACTC TTTGATATTA GAGCAGATTG CGAATTGAGC
AACTATTCTC TCTTATCTCA GTTCCAGAAA CAAAACCACA AGATCCCCAA AGCCAAGATC
CCTAGTAGGA GACACAGCAC CACTGACCAG GTGAGCACAG GGTCCATCAG CATCTACTCT
AGCATAGATA ATCCGGGAGT TTTTTCCCTT GATTTCAGCA ATAGTGGAAG ACTACTCTAC
GCATGCTACT CAGAATTTGG CTGCTTAGTA TGGGATGTCT TGAAGAATGA GATCGTAGGC
TCCGTAGGAA ACGATCATGT CAACAAGATC AACCATATAA GCGTATCTCC TGACGGAACG
GCCGTTGCCA CGTCGTCATG GGACTCCACG ATCAAAATCT GGTCCGTGTG A
 
Protein sequence
MTSPHANPNG IETKIIAARS EAQALYNELE KVKNRLHDST LDEISYSIPG IPKNFNNLKL 
YNTLRGHNNK IAKTQWSSDS SKLLSASQDG YMILWDAVTG FKKQAINLEN QWVLTCSYSS
DGKLAASAGL DNACTIYKVK QDGDFRFGGT RVQSVFKGHT AYVSDCGFIT NTTIITASGD
MTCSLWDITK GVKSRDFVEH LGDVLCMSIF PSNKLNDNLF VSGSSDGSAK IWDLRSPTPA
SSFFVSNSDI NTVSIFPNGN SFATGSDDGL IRLFDIRADC ELSNYSLLSQ FQKQNHKIPK
AKIPSRRHST TDQVSTGSIS IYSSIDNPGV FSLDFSNSGR LLYACYSEFG CLVWDVLKNE
IVGSVGNDHV NKINHISVSP DGTAVATSSW DSTIKIWSV