Gene PICST_41956 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_41956 
Symbol 
ID4836776 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2662056 
End bp2663087 
Gene Length1032 bp 
Protein Length343 aa 
Translation table12 
GC content44% 
IMG OID640388091 
Productpredicted protein 
Protein accessionXP_001383263 
Protein GI150864446 
COG category[I] Lipid transport and metabolism 
COG ID[COG0020] Undecaprenyl pyrophosphate synthase 
TIGRFAM ID[TIGR00055] undecaprenyl diphosphate synthase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGATT GGTTATCCAC ATTTCCTGGC TACCGACAGG CGCTAACTAC GGCTAAGCGG 
GCATTTGGGA GATTCATCCA GACAGGGCCC ACGCCGAAAC ACGTAGGAAT CATCATGGAT
GGTAACAGAA GATATGCCAA AAATCACAAG ATAGAGATCA AAGAGGGGCA CAATCTTGGA
TTCGACAGCA TGGCCAATGT GCTCGAGATT TTGTATGAAT CTGGTGTCAA GTGTGCTAGT
GTCTACGCAT TTTCCATTGA AAACTTCCGT AGACTGAGCT TAGAAGTCAA GTGGTTGATG
GACTTGGCCA AGCTGAAGTT CCAACAGATC AACCAGCATA GCGACTTGTG CGCTGAATAC
GGCATCCGCA TCAAGATAAT AGGTAATAAG AAGTTGATAC CACCAGACGT TGCTAAAATT
CTACAACAAA CAGAGGAGAT TACTAAGGAC AACAAGAGAG CATTGTTGAA TATCTGTTTC
CCATATACCT CTAGAGACGA GATGACAAAT CTGATCAAAT GTGCCGTAGA CCAGTCCACA
ATAGATCACG ACTTTGTAAT AGATGAGGAC ACTCTCGAAA GCTTATTCTA CACCCATGAT
GCGCCTCCTT TAGACTTATT GGTGAGAACA TCTGGTACCT TCAGATTATC TGATTTCTTA
TTGTGGCAGT GCGTTTCGCC AGACTGTTCA ATCGTATTTG TAGATAAATT GTGGCCCGCT
TTCACTCCCT TCGACATGGC CAAGATTTTG TTCAACTGGG GATTCAACAT GTACTGGTAT
GGCAAAGGCA ATGGCTACAG CACAACCCAA ATCTCTACCA AGAACTTTAA TCTAGCCGAG
TACGACTCTA ATGTAGACTT GAATGATGCT ACTGGATCTA GTGGATTTCA ACGTTTCGCC
AGCTCAGAAA GCGAAGAGGC CGAAGATGAA GATGATGTCG TCACTGAAGA AAGTAGTCAA
TCTGGTGGAG TCGATGAACT CGACACTGTT ACTTCAGAAG AGGAATCCGA CTCGAACAAA
AAGGGAAGGT AG
 
Protein sequence
MSDWLSTFPG YRQALTTAKR AFGRFIQTGP TPKHVGIIMD GNRRYAKNHK IEIKEGHNLG 
FDSMANVLEI LYESGVKCAS VYAFSIENFR RSSLEVKWLM DLAKSKFQQI NQHSDLCAEY
GIRIKIIGNK KLIPPDVAKI LQQTEEITKD NKRALLNICF PYTSRDEMTN SIKCAVDQST
IDHDFVIDED TLESLFYTHD APPLDLLVRT SGTFRLSDFL LWQCVSPDCS IVFVDKLWPA
FTPFDMAKIL FNWGFNMYWY GKGNGYSTTQ ISTKNFNLAE YDSNVDLNDA TGSSGFQRFA
SSESEEAEDE DDVVTEESSQ SGGVDELDTV TSEEESDSNK KGR