Gene PICST_33767 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33767 
Symbol 
ID4840933 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp435132 
End bp436481 
Gene Length1350 bp 
Protein Length449 aa 
Translation table12 
GC content37% 
IMG OID640392248 
Productpredicted protein 
Protein accessionXP_001386478 
Protein GI150866771 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.539804 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.198679 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTCAA TAGCCTTGTC TATAATCGAG AAGCTAACGA CAAGCTACTT AAGAGGATTA 
GAGACAAAGG ATCAAGATTC GTCATTAAAA AGAAACAAAG AAGATGTCCA TCTAGCCAAA
TATCTTGCAA AGGATTCCGT CATATTCGAC CAACTGCCAT CTGAATTACT TTACTTCAGC
TTAATAAAAG AAAAAGAAAA ATCGGAAATA CTCTTCGAAG ATAAGAGTGC CTTGAAACAG
GTGAATGTTC CATTGGAACA GTCGAAGAAA GTCGCAAATA AACAAAAACA AGGAAAGAAA
AAGAAGAATA GTAAAAATCA TGAAAAGGAT GTAAGCCAGT TGATTGGAGG AAGTTTGAGT
TATGGGTACG AACAATTTAG CCCTCCCAAG CAAGAGGAAA TATATCTGTT GGAAAATATG
CTTCGAGCTA TTGAATATCT CGTGAGAGAA AAAGGCAAAC CATTGGAAGA TTTCAAAATG
CTTTCCCTAA GAAGAAATTT ACATTTATTA ATGACTATTC CCATTTCAAA GCAAACAACC
AGCTTTAATC TTATCTACTG GAACGGTCTC ATATTCCTTT CATATGATTG GAAATCATCT
GAGAAATCAA AAGAAACCAG AAAAGAAAAT GCCAGCGATA AGAGAGCCGA TAACTTAAGA
CTTCTTCAGT ATACTGGATT TGCATTTGAG AGGCTCATCA CATCATCACC GGCACTGGTG
AGTTTCGATG ATAATAGTAT CACTTCTTTC TACAGCTTGG TCAGCCACAA AGTGGGTGAA
ATTCCTATCC ATTTTACAGC AGAGATAGAT GCATGCAAGG ATATTACAAA GGACGGACTC
TCCAACTACA TTGAATTGAA GGCTAGAGCA ATCCCTTCCG GTCCCAAAGG GAGAGCCAAT
AGCTCCTTTC AAAGGAAGCT ACTTTCAGCT TATTGTCAAA ACAAGCTAAT TGGAAGCCAG
AATCTTGTAA TAGGATTCCG TTCTCCAGAA TTGAAGGTAT CGTCTATTAA AAGATATGAG
ACTAATGAGC TTAACGGCAT TATCAATAAA GAACCAGTCT ATTTTACAGA AAATTCGACT
CTAAACTGCT CCAAAATGGT GAAATGGTAC AAACTTGTGA TATCATGGAT TACGGAACAT
AACCAAATAA CTACTGTAGA CCACGATGCC TCAGTTCCAT TGGCATATAG ACTAGAATTC
ACTTTGAAAG ACAAAGTGAT GGAATCGTGC TTGGAAATTA ACCCTGTTGA AGAGCAAGAC
GTCCAGAGTT TGATCAAAGA ATTGATTCCA CCATGGTTTC AGAAGTTTAT GAATGAGAAT
AAGAATAAGA AACGAAATGA CTACAGATAG
 
Protein sequence
MNSIALSIIE KLTTSYLRGL ETKDQDSSLK RNKEDVHLAK YLAKDSVIFD QSPSELLYFS 
LIKEKEKSEI LFEDKSALKQ VNVPLEQSKK VANKQKQGKK KKNSKNHEKD VSQLIGGSLS
YGYEQFSPPK QEEIYSLENM LRAIEYLVRE KGKPLEDFKM LSLRRNLHLL MTIPISKQTT
SFNLIYWNGL IFLSYDWKSS EKSKETRKEN ASDKRADNLR LLQYTGFAFE RLITSSPASV
SFDDNSITSF YSLVSHKVGE IPIHFTAEID ACKDITKDGL SNYIELKARA IPSGPKGRAN
SSFQRKLLSA YCQNKLIGSQ NLVIGFRSPE LKVSSIKRYE TNELNGIINK EPVYFTENST
LNCSKMVKWY KLVISWITEH NQITTVDHDA SVPLAYRLEF TLKDKVMESC LEINPVEEQD
VQSLIKELIP PWFQKFMNEN KNKKRNDYR