Gene PICST_37533 details

Gene Information

Locus tag: PICST_37533
Symbol:
ID: 4851543
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 2092075
End bp: 2093532
Gene Length: 1458 bp
Protein Length: 485 aa
Translation table:
GC content: 41%
IMG OID: 640393251
Product: predicted protein
Protein accession: XP_001388029
Protein GI: 126274801
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
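
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, that the gene is unspliced (as the record states), and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(2092075, 2093532)
print(length)                           # 1458
print(expected_protein_length(length))  # 485
```

Both results agree with the Gene Length and Protein Length fields of this record.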


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.143167
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGGGTA AAGGAGTTAC CGTATTAAAT GATCCACGAA ATCGTGTATC TTCTAGTTTT 
GATGCGAAAA CTGTCCCTAC ATCCGATCCA GAGGTCCGCG CAGAGCTCAG AAAGCTTGGA
GAGCCTGTAA CTTATTTTGG AGAAGATGAT TCTGATCGTA GACAGAGACT TATAAAGCTT
CTCTCCGAGA AGAAACATAC TAATTTTGAC TTTGACTATG AAATGGAAGA AGATGAATTG
TTAGAAAACG AAGAAAGCGA TGAAAACGAG GATGACGAAG ACTTTTATAC ACCTGGAGAA
CAGGAATTGT ACACAGCCAG AGAAGAAATA CTCCATTCTT CGTTGGAAAG GGCAAGGAAG
AGAATAGAGA AACAACAAAA GAAAACAAAA GACCAGAGCT TTATCAAGTA TCTCAAACAT
AGGCGACACA TTAATTCACA ACTAGCCAAA TACGAGCTTT TTGGCACTCA GCTTATACAA
GGTAATACTA GAGCTCTATC AGCCGTGAGA TTCTCAAAAG ATAGTGAATT GATAGCGACA
GGTTCGTGGG ATGGAGGAAT CTACATTTTA GATTCTGGAG ATTTATCAAC TAAGTTTAAG
CTGGCATCTG GGTATCATTC AGAGAAAGTA AGTGCTATAG ACTGGGATGT CTATACAGAC
AGCAATCTTC TTGTCTCAGG AGGTAATGAA GGAAACATAA ACTTCTGGAA AGTTAATAAG
GAGTCAGAGA CGAAAGTAAT AAAGCCCGTC GTATCTATAA AGGCAGCCCA TGATAATCGT
ATTACTAAGA CGTTGTTTCA TCCTAGTGGT AGATTCGTAA CCTCGACATC ATGTGACCAG
ACATGGAAGC TTTGGGATGT CAATCGTCCC GAAAATGCAC TATTGCAACA AGAGGGTCAT
TCTAAAGAAG TTTTTGCTGG GTCTTTTCAT CCTGACGGCA GTTTATTTGC CTCTGGAGGT
TTTGATGCTA TTGGAAGAAT ATGGGATATG CGGTCAGGAA GATCAATAGT TACACTTGAA
AGACATATAA AAGGTATTTA CAGTATGGAC TGGTCGCCAA ACGGGTACCA TTTGGCTACA
GCTAGTGGAG ACTGCTCGGT GAAGATTTGG GATCTTCGAA AACTCCAACG AGACTTCAAG
GAGATATTTT CAATTCCAGT GCATACGAAG CTCGTAAGTG ACGTGCGGTT TTTTAACAGG
AGATCTGTGT CTAATGTACT TTCGACCGAA GTTGCAAATG AGAATGGAGA CAATCCTGAA
GTTCTCGACT CCGATGGCTC TTTTCTCGTT ACCTCTTCTT TTGATGGACT TGTAAATATC
TGGTCAGCTG ACAATTTCAT CAAGGTTAAG ACCCTTAGAG GACACAACGA CAAAGTGATG
AGTTGTGATA TTAGTTGCGA TGGAAGTACA ATAGTATCGT CGGGATGGGA CAGAACCGTC
AAGTTGTGGA AGAGCTAG
 
Protein sequence
MGGKGVTVLN DPRNRVSSSF DAKTVPTSDP EVRAELRKLG EPVTYFGEDD SDRRQRLIKL 
LSEKKHTNFD FDYEMEEDEL LENEESDENE DDEDFYTPGE QELYTAREEI LHSSLERARK
RIEKQQKKTK DQSFIKYLKH RRHINSQLAK YELFGTQLIQ GNTRALSAVR FSKDSELIAT
GSWDGGIYIL DSGDLSTKFK LASGYHSEKV SAIDWDVYTD SNLLVSGGNE GNINFWKVNK
ESETKVIKPV VSIKAAHDNR ITKTLFHPSG RFVTSTSCDQ TWKLWDVNRP ENALLQQEGH
SKEVFAGSFH PDGSLFASGG FDAIGRIWDM RSGRSIVTLE RHIKGIYSMD WSPNGYHLAT
ASGDCSVKIW DLRKLQRDFK EIFSIPVHTK LVSDVRFFNR RSVSNVLSTE VANENGDNPE
VLDSDGSFLV TSSFDGLVNI WSADNFIKVK TLRGHNDKVM SCDISCDGST IVSSGWDRTV
KLWKS
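
The sequences above are printed in blocks of 10 characters separated by spaces, so the whitespace has to be removed before any analysis. The following is a minimal sketch of that step, with a GC fraction helper of the kind that could be used to check the GC content field. The helper names are hypothetical, and the sample string is only the last line of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Last line of the gene sequence shown above (an excerpt only).
sample = "AAGTTGTGGA AGAGCTAG"
seq = clean_sequence(sample)
print(len(seq))             # 18
print(seq.endswith("TAG"))  # True: the excerpt ends with a stop codon
print(gc_fraction(seq))     # GC fraction of the excerpt only
```

Applying the same two helpers to the full gene sequence would give a value to compare against the reported GC content.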