Gene PICST_34595 details


Gene Information

Locus tag: PICST_34595
Symbol: HUT1
ID: 4851774
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 2795847
End bp: 2796902
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table:
GC content: 42%
IMG OID: 640393482
Product: UDP-galactose transporter
Protein accession: XP_001386871
Protein GI: 126275562
COG category:
COG ID:
TIGRFAM ID:
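
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS whose length includes one trailing stop codon; the function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(2795847, 2796902)
print(length_bp)                  # 1056
print(protein_length(length_bp))  # 351
```

Both results agree with the Gene Length and Protein Length fields of the record.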


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.147925
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAC ACGGCCTGTT GTTGACATTG ACAATATGTG TATTAGGACT CTATGGGTCC 
TTCCTCAGTT GGTCTGTTTT GCAGGAAAGA ATCAACACAA AGCCTTATGG AGAAAACGAA
AACGAAATTG AGTTCTTCAA GGCTCCTCTT ATCATCAATA TAGTGCAAGC GTTCTTTGCC
TCAATAGTAG GTTTTGGCTA TTCGCTTGTG ACAACGAAAG TGAATCCGTT CAAGATATTC
ACAGCAAACG AGAAATCAGT TGCAAGAAAG TACATGTTGT CGCTATTGTT AATTTCCATC
ACCTCCAGCT TGTCTTCTCC CTTGGGATAC CAGTCCCTTA AACATGTAGA TTATTTGGCC
TACTTGTTAG CCAAGTCGTG CAAGTTAATT CCTGTGATGA TCATCCATCT TGTTTTCTAT
AGAACGAGAT TCCCTGTGTC GAAATACATC GTAGCATCGT CGGTCACTTT CGGAGTGACT
CTCTTCACTT TGGCACATTC ATCTAAGTCT TCCAAATCAA GCATAAACGA CGGCAAAACT
CTCCTTGGAA TGGCTCAGCT AATTGGCTCC ATGCTTTTAG ACGGTCTTAC AAATTCTACC
CAGGACCAGA TGTTCAAGTT GCTGTCACCT AGTGGCAGCC AAAATATGGT AAAAATAACA
GGCGCAAAGT TGATGTGTAT TCTCAACTTG TTTGTGTGCG CTTTGACGTT GGCATACACC
GTCATATTTG CATATGAAAG TGAAGTCGTC TATACGCTTA ACTTTTTCCA CAAGCACCCA
GAGGTGTTGT ACAATATCTT GGAGTTTTCT GTCTTTGGAG CCGTGGGCCA GGTGTTTATC
TTCATCATCT TAGAGAAGTT TGACTCGTTA ATTCTCGTCA CAGCAACTGT TACAAGAAAG
ATGATCAGTA TGATCCTCAG TGTCGTATTG TTTGGTCACT TCTTGTCCAG CATCCAGTGG
TGTGGAGTTG GTCTCGTTTT TGGAGGCATA GGCTACGAAG CATTGGTCAA ATTGAACTCA
AATAAAAAGG TCTCAAAGGA GAAAAAGAGC CAATGA
 
Protein sequence
MKKHGLLLTL TICVLGLYGS FLSWSVLQER INTKPYGENE NEIEFFKAPL IINIVQAFFA 
SIVGFGYSLV TTKVNPFKIF TANEKSVARK YMLSLLLISI TSSLSSPLGY QSLKHVDYLA
YLLAKSCKLI PVMIIHLVFY RTRFPVSKYI VASSVTFGVT LFTLAHSSKS SKSSINDGKT
LLGMAQLIGS MLLDGLTNST QDQMFKLLSP SGSQNMVKIT GAKLMCILNL FVCALTLAYT
VIFAYESEVV YTLNFFHKHP EVLYNILEFS VFGAVGQVFI FIILEKFDSL ILVTATVTRK
MISMILSVVL FGHFLSSIQW CGVGLVFGGI GYEALVKLNS NKKVSKEKKS Q
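
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be computed. The following is a minimal sketch under that assumption; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first display line of the gene sequence is used as example input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First display line of the gene sequence, used here as example input.
first_line = "ATGAAAAAAC ACGGCCTGTT GTTGACATTG ACAATATGTG TATTAGGACT CTATGGGTCC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence instead of `first_line` gives a value to compare against the GC content field of the record.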