Gene PICST_40540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_40540 
Symbol 
ID4836985 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1929186 
End bp1930451 
Gene Length1266 bp 
Protein Length421 aa 
Translation table12 
GC content37% 
IMG OID640388300 
Productpredicted protein 
Protein accessionXP_001382597 
Protein GI150863943 
COG category[R] General function prediction only 
COG ID[COG2520] Predicted methyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATACT TTCGCATAAA AACTGAAGAT GCCACTCTTG TCAAGACGTT GAAAAACAAG 
CTTGAATCGA CTTCCAACCT TAATAAATCT TTGAAGATTG CAAAGTCTAA TGGTGTTTTC
CAGATTCATA CAACGTTAAC AGAAATTGAT GAACTCGAGC CAATTGTTTC CCAATACGAA
TCCAAAGTAA TTTACGACAC ATATTTAGCA GAAACTGGGC TCAATACTCC AACTACGTTA
GTTTCAATAG TAGAACAATA CTGTTCTTCT CATACTATAC CTTTAACACA AGATTTGATC
CAACTCATAC CGAAGAAATG GTCAGTATAC CCACCAATGA TACTTTTCGG AAGCAATACT
TTCGATTCAC AAATATGGCA ATCTCTTTTA TTAAGCATAG ACTCCAATGA CTTCTTCTTG
TACATACTTT CACTGTCTTT GTTTGCGTCC AATCATAGCA AGTTGACCCA TATTGCCATT
AATAAACCTA TCATAGAATC TGATGTCATG AGGAGGCCCT TTAATATCCG ACCCATATTT
GGGGACTTTG GTCCATTACC CACGAATTAC GACAGTCCAG TAGATACTGA CTTTTCCAAT
GCTTTCTGGT GTACAGTTGT TCAAAATGGA ATTTATCAGA CCTGGGCACC TTATTATACA
ATGTTCTCCA GAGGAAACAT CAAAGAGAAA GCAAGAATTC TCGATACTTA TACAGACATA
AAGAATAATG TAGTGGTTGA CTTATACTGT GGAATAGGGT ATTTCTCGCT TTCATACTTA
AAGAGAGGTG CCAAACAGCT ATTTTGTTGG GACTTGAATC CGTGGTCCAT AGAAGGATTC
AGAAGAGCCA TTAATGGGAA GTACCTTTAT AAGATCTTTA GAAGAGAAGA TAACTTTGAC
TATCAGATAT ACAAAGAGGC GACCGAGGAT GGTACTCAGG TGTTTATATT CCAAGAAAGT
AACGAACATT GTCTTGAAAG ATTGACAACG TTCCCAAAGA ACAGTTTGCC TATCGCTCAC
ATAAATTTGG GTTTATTGCC CAGTTCACAA AACTCGTGGA GAATTACGCA GAAGTTGAAA
GACGATCATT CGAGTCAATA CATTTCAACT TTTGTTCACA TTCACGAGAA TGTTCACGTA
GCTGAATTCG ACACTTTCAA GTCTCTAGTA TCAGACAGAT TTCCAACCGC CTCAATCGGG
CATTTAGAAA CAGTGAAAAC ATTTGCTCCT GATGTTTGGC ATATTGTCTT GGATATTAAA
TTATGA
 
Protein sequence
MEYFRIKTED ATLVKTLKNK LESTSNLNKS LKIAKSNGVF QIHTTLTEID ELEPIVSQYE 
SKVIYDTYLA ETGLNTPTTL VSIVEQYCSS HTIPLTQDLI QLIPKKWSVY PPMILFGSNT
FDSQIWQSLL LSIDSNDFFL YILSSSLFAS NHSKLTHIAI NKPIIESDVM RRPFNIRPIF
GDFGPLPTNY DSPVDTDFSN AFWCTVVQNG IYQTWAPYYT MFSRGNIKEK ARILDTYTDI
KNNVVVDLYC GIGYFSLSYL KRGAKQLFCW DLNPWSIEGF RRAINGKYLY KIFRREDNFD
YQIYKEATED GTQVFIFQES NEHCLERLTT FPKNSLPIAH INLGLLPSSQ NSWRITQKLK
DDHSSQYIST FVHIHENVHV AEFDTFKSLV SDRFPTASIG HLETVKTFAP DVWHIVLDIK
L