Gene PICST_28789 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_28789 
Symbol 
ID4851539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2085588 
End bp2087054 
Gene Length1467 bp 
Protein Length488 aa 
Translation table 
GC content42% 
IMG OID640393247 
Productpredicted protein 
Protein accessionXP_001388027 
Protein GI126274790 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.78945 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.187778 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTACTTA GAAAGCTGGT CAGACACACT CTCAGTGACC CACCAGACTA CGAGTTTAAG 
TCACCCACGT ACTCTACGAA ATGCTCCAAG GATGCCGATG AAAGGAATTT CGGAAACTTC
GACTCTGTCA GACTGAATTT TGACACGTCG GAAGATTCTC ATTTTGTTTG GAAATACAAA
GCACTTACCA AAGTATTCAT GCACTTGCAG CTGAAGCTCA AGCGACATAT CAGACAGAGC
AACCCTGCCC AGTCCACTTC TATTCCAAGT TCAAGCAAGT TTGCTGAAAG ACTTAATATA
CAATCCATAG CTACTAAGCT GAGTTTTAGC TTGAATGGCT TTGCTAAGTT TTGCAAAGGA
GAAAGGTCAC CTGATGATTT TAACAACATA TTCACGAACG ACGAATCCAA TACTCTGCAT
GGACTCGGTT CTTTCGGCAA ATCCATCAAA CCCTACAAGC TTGAGGTGAT CCTACCAGAC
ACCACATTCT GGGACGATTT CAAGGATATT ATTGAAAAGT ATGGCATCAA AACTCCACTT
GAGGAAGCTA CCAATGCACT TATCTCCTGT GAAAATGACG TTCAAACTGC TCTTGACTGT
CAAGAGTCGA ATTTTCAAAT CTCAAAATGT TTTCATGCTA TTTATGAAAA TGATACTTTT
ACAACAAGGC TGACTGGCAA GGATCTTAGT GTCAACGAGA ACTCGGGAAC CACTCATACT
CGCAGCACAA CCGTGGTCGT AGAACCACAA AATTCTCCTT GTTCTGCCAC TCTGGAAGCT
GCTGAAGAGG TAGTTGAAGA AATTCCAAAT CCCCTCCAGA ATGAGCTACT ACTTCCAGAA
TCAGACCCAA CTACAGATCT GATTGAATTA GTTGCTCGAC TTGAAACTGA ATACATTTCT
CCACCGGAAG AAAATATCGT CCAGCAAGTT GCTCAATCAA AAGAATCTTC GTCCTCTGAA
AGTGAATCAA AAGCAAAGGA ACCAGAAGAA GAAGAAGAAG AAGAAGAAGA AGAAGATATT
TCACAGTCAC AAGAATCATC GCACGACGAT GGATCAAAGG AGGGTCTGGA TTTCAGTGGT
AAGAACCGCT GTCATGGAAC AGTCGTTGGC CGTGAACTTT CAATCAAATA CAACACAGGA
AGATTGTTTT GTGATCAAGA AGATTATGTA GATGCTTCAA CTGGAGTCGA ATTCCGCTTC
AACAGGTGGG GAGACCCAGA AGAGCTTGTT CCATATGGAT ATCAAAGAAC TTGTCATCAT
TTCAGAAGGA GGAGGTCACT TAAACTGGCT ATCCGGAACC TTGAACGCAG ACCACTGACT
GTTATCCAAT ATTCCAATAT GTTGGCAAAC GTTGAACTGC TTCACCGTGC AAATCTAATT
GCCTACAGAA GGGAAGCTAC TATCCTTCTT GATTGCGTTT CGAGTCTGGA AGAGGATAGA
CCTTTCACTT TTATAATGAA CCCATAG
 
Protein sequence
MLLRKLVRHT LSDPPDYEFK SPTYSTKCSK DADERNFGNF DSVRLNFDTS EDSHFVWKYK 
ALTKVFMHLQ LKLKRHIRQS NPAQSTSIPS SSKFAERLNI QSIATKLSFS LNGFAKFCKG
ERSPDDFNNI FTNDESNTLH GLGSFGKSIK PYKLEVILPD TTFWDDFKDI IEKYGIKTPL
EEATNALISC ENDVQTALDC QESNFQISKC FHAIYENDTF TTRLTGKDLS VNENSGTTHT
RSTTVVVEPQ NSPCSATLEA AEEVVEEIPN PLQNELLLPE SDPTTDLIEL VARLETEYIS
PPEENIVQQV AQSKESSSSE SESKAKEPEE EEEEEEEEDI SQSQESSHDD GSKEGLDFSG
KNRCHGTVVG RELSIKYNTG RLFCDQEDYV DASTGVEFRF NRWGDPEELV PYGYQRTCHH
FRRRRSLKLA IRNLERRPLT VIQYSNMLAN VELLHRANLI AYRREATILL DCVSSLEEDR
PFTFIMNP