Gene PICST_33419 details

Gene Information

Locus tag: PICST_33419
Symbol:
ID: 4840442
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009047
Strand:
Start bp: 591080
End bp: 592927
Gene Length: 1848 bp
Protein Length: 615 aa
Translation table: 12
GC content: 43%
IMG OID: 640391757
Product: predicted protein
Protein accession: XP_001386125
Protein GI: 150866498
COG category:
COG ID:
TIGRFAM ID:
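
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length of an unspliced CDS includes the stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(591080, 592927)
print(length_bp)                  # 1848
print(protein_length(length_bp))  # 615
```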


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.307671
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATCCAC CAAACAAATT GGACCCCCTT GGTTTCTTGG ATCCTTCTCC TCAAAGCTCA 
ATTGATTCTA TTGAATTTCG AAAACATCTT GCCTCCATGG CCGAGGATCA TCAGTCGAAG
AAGGCAGCCG CCTCTGAGTT TTCATACCTG GCCTCAGAGA ATTTGAATAC GCCCCAATCG
TCTAAGATTC GGACCCAGGA GACTGAAAAT CCCTCCTTTG ATTCCAGCAA AAACGATTCT
CAGCTCCTCC ACTTGAGAAA CCAATCCGGA GCCAGTGGAT ACGAACAATC AAAGCACAGA
TTGACAACTG ATTCATACGG ATTACAAACA CCAGTGCTCC AAGGCGAAGA TTTTCTCGCT
AACCTTCATC TGCCTGCTCT GGACTTCAGC AAGTTCAAGG CTCCCAGGAC AAGAGAATCA
TACTTGTCCC AGTACTCTGG CAAAGTAGAT AGAGTTGACG ATATGGGTGC TCAAGCGACT
GTTAAACTTG TCAGGCACGA CAGCACCAAG AAGGAAGAAA AATCCAAAGA TGTTTCAACT
AACTCAGTTT CACCAAGCCA ACTGTCCCTT CAATCTGGAA AGGAACCTTC CATCAAACTA
GGAAAGCTTC CAACTGTAAC AAAGTCCTAC TCACCAGAAG AAGACTCTAA GCTGCCTGAT
ATTTCATACA CAACAAATAT TGAAGGATCT GTCCCTCCTA GATCCAATAG AAGACCTCTT
TCACTGGCAG TTTCATCAAG CATAGATAAC GATCACCTGA ACGAAGATTT GGAAAACAGC
ATTGTCGAGA ACGTGGAAAA GACCCCAGAT ACAAAGTTTA AACAAAGACG TTCCAGAGCA
TTCTCAGGTA CCTTAAACCA TGATCTCGAC CAGTTAATGC TTAGTGCCAA CTCCTTGAAA
TCAGAGGAGT CTCATGACAT CCCAGAAGAA CAAAAACCCA CGATTTCTGA CACTTCTGCA
AACCTCGCTA GAGAAACAGA AGCTGGACCA TCGGCGAAAG AAGACTTCCA AGGCTTTGGA
GGAAATTTGC CAGGATCCAA CATCACTCTT GAACCAATAG AGCGTCCACA ATTGCAAGAA
ATCCCACACG ATGAACGAGA AATAAGTAAC ACCAAGTCCA TCAAATCTCA TAACAGTGTA
CGGTCTGGAC ATCTGGAACC TGATCTTCAT CCAAAAGATT CTACGGACAC ATTTCAGACT
GCTGAAAGTC CTGGCGAATT ACACTCCGAA AAATTGAGAT CAAAGACTCC GACTTCGAGT
CTACCACCAA GACCATCGCT CGACAACATA ATCAGAGCCA GAGAAGCTTC TGCCAGTTAC
CAAATTAGGG AGCTGCCTGA ATCAGAAGAA GACGAGAATG CCACTGCCAA GTTAGAAGAA
ATCGACAACC TGAAAGAGGT GACTCAGCCA GAACAGGAAT TGGATGTTGC CGGTGAACCA
ATTGTTACAC CACATATTAA TCAAGGAAAT GACGATAGTT ATATTGACAT TGAAGAGCCT
AGATTGGTCC ACAAACCATC AAGAGGGAAA TCTGTCAAAG ACTCCACCAG ACGACACACT
CACAAGAAGT CCAAGACTGT CAAAACAAAA CAGGCTAAGG GAACATCAGC AAACTTGAAA
CCTTTTTCAT ACAATACCTT GATTAATTTG CTTGAAAGTA TGAACGGAAC AATCATAGGT
GAGGAGTTCT CCCAATTGAA TTTGCCTATG AAAGAGAAGC AATTAATAGA AAAAATCGTA
GACTCCTTGT CCCGATTGAC ACTGGATATG GTTCTTGATC AAAGTAGATA TGAAATTGGT
ATTCAGAGAT TGGAGAAGGC ATTGCGTGTA CTCGAAGGCT TTATGTGA
 
Protein sequence
MDPPNKLDPL GFLDPSPQSS IDSIEFRKHL ASMAEDHQSK KAAASEFSYS ASENLNTPQS 
SKIRTQETEN PSFDSSKNDS QLLHLRNQSG ASGYEQSKHR LTTDSYGLQT PVLQGEDFLA
NLHSPASDFS KFKAPRTRES YLSQYSGKVD RVDDMGAQAT VKLVRHDSTK KEEKSKDVST
NSVSPSQSSL QSGKEPSIKL GKLPTVTKSY SPEEDSKSPD ISYTTNIEGS VPPRSNRRPL
SSAVSSSIDN DHSNEDLENS IVENVEKTPD TKFKQRRSRA FSGTLNHDLD QLMLSANSLK
SEESHDIPEE QKPTISDTSA NLARETEAGP SAKEDFQGFG GNLPGSNITL EPIERPQLQE
IPHDEREISN TKSIKSHNSV RSGHSEPDLH PKDSTDTFQT AESPGELHSE KLRSKTPTSS
LPPRPSLDNI IRAREASASY QIRESPESEE DENATAKLEE IDNSKEVTQP EQELDVAGEP
IVTPHINQGN DDSYIDIEEP RLVHKPSRGK SVKDSTRRHT HKKSKTVKTK QAKGTSANLK
PFSYNTLINL LESMNGTIIG EEFSQLNLPM KEKQLIEKIV DSLSRLTSDM VLDQSRYEIG
IQRLEKALRV LEGFM
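
The record lists translation table 12 (the alternative yeast nuclear code), in which CTG is read as serine rather than leucine. The sketch below shows how the protein sequence can be reproduced from the gene sequence under that assumption. It is a minimal illustration, not the database's own pipeline; the names `STANDARD_CODE`, `TABLE_12`, and `translate` are hypothetical.

```python
# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
STANDARD_CODE = dict(zip(CODONS, AMINO_ACIDS))

# Translation table 12 differs from the standard code in one codon: CTG -> Ser.
TABLE_12 = dict(STANDARD_CODE)
TABLE_12["CTG"] = "S"


def translate(dna: str, code: dict = TABLE_12) -> str:
    """Translate a coding sequence, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = code[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, in the record's 10-base block layout.
print(translate("ATGGATCCAC CAAACAAATT GGACCCCCTT"))  # MDPPNKLDPL

# A fragment from the third sequence line that contains a CTG codon.
print(translate("TTTTCATACCTGGCCTCA"))                 # FSYSAS
print(translate("TTTTCATACCTGGCCTCA", STANDARD_CODE))  # FSYLAS
```

The last two calls show why the table matters here: the same fragment yields S under table 12, as in the protein sequence above, and L under the standard code.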