Gene PICST_30814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_30814 
Symbol 
ID4837787 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp1010677 
End bp1012492 
Gene Length1816 bp 
Protein Length503 aa 
Translation table12 
GC content41% 
IMG OID640389102 
Productpredicted protein 
Protein accessionXP_001383479 
Protein GI150864596 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.018903 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.204154 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGAGTT ATGCCCCTGG CGGACACTCG TTGAAAGACA AATACAAAGA AGGCGGAAAG 
GCCAAGGAGT TGTTTTCCAA CCACGATTCG TTCTTATTCT CCAAAGAGTA CTTGGAAAAT
GTGTTGGACG TCTCAAACGA CATGAAAGAC AGATTGCTGA ATTCGCACAA GAACTATGTC
GACGTACAGA TGAAGAAATT GATAGAGGTA TACGGCTTGG ACACATTTGG CAACATTTTG
AAAACAGACA CCGAGTGGGA CGACTACCAC GGCAGTTCTG GCTATGTATT GGTAGGAGGT
GGAAAGTACA GTTTCTTGTC GTATCTTGTA ATCAGACAAA TCCGAGCTAC TGGAGCTAAG
AAACCAATCG AGTTGTTCAT TCCAAGTAAG TTAGAATACG AAAAGGCTTT CTGTGAGACG
ATTTTGCCTA AGTATAATGC CAGATGTAAC GTATTCGATA CGAAGTTGGC TGGTAACCTC
AAGAAATCTT TCAATCTTGG AGGTTACCAG TACAAGATGT TGGCTCTTAT GAGCTCCTCC
TTCGAAAGGG TAATGTATAT TGACTCTGAC AACTTCCCTA CTAGAAACAT GGATTATTTA
TTCGACTCGG AGTTGTTCAA CGAGAAGGGG TTGATCTTGT GGCCCGATGC TTGGGCAAGA
ACGACTAATC CTGTCTTCTA CGAAATAGCC GGAATCAAAG TCAAGGAGAA CAAGCTCAGA
TACTCCACCT ACGATAAGAA ACAAGCCGAA AAGGAAGGAA AGCCATTAAA GCCATTGTCT
GAATTTAGTT TCAAAGACTC ATGGTTCCAC GATTTCGAGG GAGCACTTCC GGATCCAACT
TCGGAAACAG GCATGCTTTT GATTAATAGA ACGTCCCATC TCAAGACATT ACTCTTGGCC
TTATACTACA ATGTTTATGG ACCATTCTAC TACTATCCCT TGTTGACACA GGGTTCTGCA
GGAGAAGGTG ATAAAGAGAC GTTTATTGCG GCAGCCACTG CCATGCAGCA GACATATTTC
CAAACATTGA AACAATTCAA ATGGACTGGC TATGTTTCAC AAAATGATAA CAAATTCACT
TCGAAGGCTT TGGCGCACTA TGATCCCATT CAATCACAGG ATACGTCGAA AGATGACATT
GATATCGTCT TCATGCATTT GTCGTATCCT AAGTACTATC CTAACTGGCT TGTGGATAAC
CATGACTTGG TCTATCGTGA AAGTGGCGAC CATATTAGAA TGTACGAGTC GATCTATGAG
AACGTTGGCT ACGATTTTGA TTTGCGCGTG TTACAATTCT TCACTCAGGC TATTTGTCCC
AACTACTACG ATTCTCAAAC ATCGAAGGCC GTGGATGGAG AAGATATTGA TATGATGGAA
GAATACATGG GTGACTACCT AGCTTATGTA GACGATGACG AAGAGCACAA CATCAACAGA
TGTAAGGATG TTTTCATTCC TCACTTGCAA TGGTTGAAGG AAACCACCAA GTTCAAAGAA
GGGCTGGTGA TAGTATAGTT ACAATAAAGA GACCTACTCA TATATAATGT CATAAATTAA
TCGCTCCCTT ACAATTGACT TTTGCATCGC TTTTTAATTA TAATGAATGT AGTTCCACAG
CAGTTAATGT AGAATAGTGA ACAGGAAAAG ATTCAGTGGA ATTTCAGAAC TCTTCCCAGG
GTACCAAACA AGATCTACCT AAATGATTTT GCCATCACAT TATTCTCACT CCATTAAACG
GCATCGGAAA GTAGTGATGC ACATAAAAGC AACCTGAAAT GGTGTCGAAT GTAGCTTCCT
TTCATCACTG TCGTAA
 
Protein sequence
MESYAPGGHS LKDKYKEGGK AKELFSNHDS FLFSKEYLEN VLDVSNDMKD RLSNSHKNYV 
DVQMKKLIEV YGLDTFGNIL KTDTEWDDYH GSSGYVLVGG GKYSFLSYLV IRQIRATGAK
KPIELFIPSK LEYEKAFCET ILPKYNARCN VFDTKLAGNL KKSFNLGGYQ YKMLALMSSS
FERVMYIDSD NFPTRNMDYL FDSELFNEKG LILWPDAWAR TTNPVFYEIA GIKVKENKLR
YSTYDKKQAE KEGKPLKPLS EFSFKDSWFH DFEGALPDPT SETGMLLINR TSHLKTLLLA
LYYNVYGPFY YYPLLTQGSA GEGDKETFIA AATAMQQTYF QTLKQFKWTG YVSQNDNKFT
SKALAHYDPI QSQDTSKDDI DIVFMHLSYP KYYPNWLVDN HDLVYRESGD HIRMYESIYE
NVGYDFDLRV LQFFTQAICP NYYDSQTSKA VDGEDIDMME EYMGDYLAYV DDDEEHNINR
CKDVFIPHLQ WLKETTNFLS SSS