Gene PICST_43096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_43096 
SymbolDIE2 
ID4838305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp1135486 
End bp1136883 
Gene Length1398 bp 
Protein Length465 aa 
Translation table12 
GC content39% 
IMG OID640389620 
Productglucosyltransferase 
Protein accessionXP_001383848 
Protein GI150864855 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.708454 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGTTGG TTATACCTAG GGGTGGCTCC ATAGTCCGCA AGAGACTCAT TACACTCTTG 
CTAACATTGA GTACAGTTGC ATTCTGCGGA TATGTCCATC ATCAAGTCAG TCTCAAGGTC
AAGAACCCAT ACATAGACGA ATACTTCCAT ATCCGCCAGT GCCAGAAGTA CTGCCAACAT
AAATTCCACG AATGGGACAA CAAGATCACA ACTCCACCAG GGTTGTACGT ATTGGGGTTC
TTATACACGA ATGCCATTCA GAAGCTTTCT GGGGCAGAAT CGAACTACTA TTGTGGAAAC
TACGATATCT TGAGATCAGT GAATTTACTA GGTTTCTTTG CTCTTTTGGC AATTGCTCAT
CGTTTCAAGA AATCTTATGG AAACCAATAC CTTTCGATCA ACATAGCGTC CCAGCCGTTG
CTATTCACCT ACTATTTCTT GTTCTATACT GACATATGGT CAACAGTATT TGTAGTGCTA
GCACTTACAA TTGTCATGTC CAAACCTGTG AGAGACTACC AGGCTTATTG CAGTGGATTA
CTTGGTTTGT TGAGTTTGTG GTTCAGACAA ACAAATATCG TTTGGGTCGC CTTCATTCTT
GCTGTACTTG TGGAAAGAAG TGTAGTCAGA AAAAGAGGCG AGAGTCCTAA CTTCTTGGCT
CAGACACTGC TGTTCATATC TCTGTTCTTT AAGAACTGGT TCAAGATAAT TCCATTCGTA
ATAAATGCTG TGCTCTTTGC GATTTTTCTC AAGATCAATG GTGGTATTAC CTTTGGCGAT
AAAGAGAATC ACGAAATCCA ATTGCACGTG GTTCAAGTCT TTTACTGTTT CACATTTATT
GTTCTCTTCA CTTGGCCAGT ATGGTTTGAT GTTCATTGTT TGAAGAGATA CCTCAAATTT
GTCTTTGTTC AAAACTATGG TCTTAATTTC GGTTTGAATG TGGTGAGTTT GTGTGCTATA
AAATACGTCA TAGACAATTT TACTGTTGTC CATCCATTCT TGTTGGCTGA TAATAGACAT
TATACATTCT ACATATTCAA GCGACTCATT AGCCATCCAA AGAGTTACAT CATAGCTGTG
CCATTATACC ACTTTGCAAC CTATTCTATA ATTAGTTCAT TGTCCCAAAG TGATAAGATC
AACATGAGAT TTGTCACCAT TGTGTGCTAC TTGGCCGCAG TGTGCTTGAC TATCATTCCT
TCGCCATTAT TTGAACCACG ATACTACATT GTTCCATTGG TGATATTCAG ACTCTTCATA
AAGCCTGTCA ACACAAAGAG ACACTACTTG GAGTTTATTT GGTTAAACAC TATAAACGTT
GTTACTACAT TAGTATTCTT AAACTATGAG TTTACATGGG CAAGTGAGCC GGGTAGCATT
CAGAGAATAA TATGGTAA
 
Protein sequence
MALVIPRGGS IVRKRLITLL LTLSTVAFCG YVHHQVSLKV KNPYIDEYFH IRQCQKYCQH 
KFHEWDNKIT TPPGLYVLGF LYTNAIQKLS GAESNYYCGN YDILRSVNLL GFFALLAIAH
RFKKSYGNQY LSINIASQPL LFTYYFLFYT DIWSTVFVVL ALTIVMSKPV RDYQAYCSGL
LGLLSLWFRQ TNIVWVAFIL AVLVERSVVR KRGESPNFLA QTSSFISSFF KNWFKIIPFV
INAVLFAIFL KINGGITFGD KENHEIQLHV VQVFYCFTFI VLFTWPVWFD VHCLKRYLKF
VFVQNYGLNF GLNVVSLCAI KYVIDNFTVV HPFLLADNRH YTFYIFKRLI SHPKSYIIAV
PLYHFATYSI ISSLSQSDKI NMRFVTIVCY LAAVCLTIIP SPLFEPRYYI VPLVIFRLFI
KPVNTKRHYL EFIWLNTINV VTTLVFLNYE FTWASEPGSI QRIIW