Gene PICST_32526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_32526 
SymbolNAG5 
ID4840116 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp16507 
End bp18159 
Gene Length1653 bp 
Protein Length550 aa 
Translation table12 
GC content41% 
IMG OID640391431 
Producthexokinase I 
Protein accessionXP_001385689 
Protein GI150866184 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG5026] Hexokinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTATGA TCCACAACTC TTCCAAAGTC GCATCACCAA CAGAAGAAAG CATAATACTT 
CCTATGAAGG GCCTTCATGG AGATGTCACT ATAAGTCCAA CACCCATTCT CATAAACCAG
AGCGAGGAAA GTGACTTCAT AGACGAGTTG CTGTCTTCTA CTTCTACTTC CAGCGAACCT
TCGCCCAACT CATCCATTTC CACTGATAGC TCGTGTCTTT TATCTTCCGT AGTAAACGAT
TTTGTTTATG ATCTCACCAG TCAGAACTTC CTCGAACAGA CGGAGTTTCT TGTCGCCGAC
TTGAATGAGT CGTTATCTAG AAACTCCAAG ATAACCATGT TGCCCAACTA CAATATTTCT
CCAACAGGAC AGGAGTCAGG TGAGTTTTTG GTCATTGACT TGGGAGGCTC CACCTTGAGA
ATCGCTGTTA TCAAGATAGA CCAAGCGTCG GATTCAGACG ACGAAGACAG ATCAAAAAGA
ATACACATCT TGATGGAGAA GAACTGGACT ATCGATAACA GCTTCAAGAC TCTCGATCTT
AACTTCTTCA AGTTCATAGG CTCCAAGATC CACGAGATAT TGTGCCATCA GGATTTGATT
GATATCCGCA ACAATATAAA GACTGGGATC ACCTGGTCGT TCCCTTTGAA GACCACATCG
TACAATAACG GTAAGATCGT CCATGTCTCT AAGGGCTACA CCATTCATCC AGAAATCTAC
AACCAGGACT TGAAGTCCAT TTTAGAATCA GTTTTACTAA ACGAGTTTGA TTTACACATT
GACGTCAAAA GCATTTTGAA CGATTCCTTG GCCGTGTACT CGGCAGGTGC CTTCATTGAT
AAGTATACCA AGTTGGCACT TGTTTTGGGA ACGGGATTCA ACATGTGCTG CTCATTATCT
ACATCGGACA AGATGCATTC TGACAAGACG TTGGAAAGCT GCGACAAGAT CTTATTCAAC
ACTGAACTCA GTTTGTTTGG TGAACATTTG ATTAAGAGCA TAGCTACCAA ATATGATTCG
TTGATTGACG AGAGATTCAA AACCTTTGAC TTTCATTTCA AGCCATTCAT GTCAACTGAT
CCTAATACCC ATTCCATTTT CCAACCTAAC GAGTTGATGA CAAGTGGTAG ATACTTGCCA
GAGTTGACTC GTTTGGTGTT GGTAGATTTA GTTGAAGCTA AAGAAATCTT TGTCAACATA
AGCCAAAAGG AGGAACTTTT ATCTTCGGCC TATGATGGCT TCAGTGGTGA GTTGATGTGC
TTCATCAACG AATCGACAAA CGTTGACGCC ATTACCGAAA AATTGTGTGC TCAATATGGT
TGGTCTGCTT CTGAAGTCAC CATCGGAGAT GTTTTGACGT TGAAGAAGAT TGTTCAAAGT
ATTGTTGAAA GAGCAGCCTT CATTGTTTCC GTCTCGATTG TGTCCTTCAT TAAGTTGCTC
CAACAGCACA ATGATGATCA CTTTGACTCA TCCATCATCA ACATTGGATA TGTTGGCTCA
GTGTTGAAGC ATTTCAATGT CTACAGAGAC TTGGTTAAAC AATATGTTAA CGATAATGAC
GATATTAAAA GGTTAGGAGT CCAGGTTGAT TTTAAGTTGA TTGAGAATAG TTCAATCATT
GGTGCTGCTA TCGGTGCAGC ATACTATTCA TAA
 
Protein sequence
MAMIHNSSKV ASPTEESIIL PMKGLHGDVT ISPTPILINQ SEESDFIDEL SSSTSTSSEP 
SPNSSISTDS SCLLSSVVND FVYDLTSQNF LEQTEFLVAD LNESLSRNSK ITMLPNYNIS
PTGQESGEFL VIDLGGSTLR IAVIKIDQAS DSDDEDRSKR IHILMEKNWT IDNSFKTLDL
NFFKFIGSKI HEILCHQDLI DIRNNIKTGI TWSFPLKTTS YNNGKIVHVS KGYTIHPEIY
NQDLKSILES VLLNEFDLHI DVKSILNDSL AVYSAGAFID KYTKLALVLG TGFNMCCSLS
TSDKMHSDKT LESCDKILFN TELSLFGEHL IKSIATKYDS LIDERFKTFD FHFKPFMSTD
PNTHSIFQPN ELMTSGRYLP ELTRLVLVDL VEAKEIFVNI SQKEELLSSA YDGFSGELMC
FINESTNVDA ITEKLCAQYG WSASEVTIGD VLTLKKIVQS IVERAAFIVS VSIVSFIKLL
QQHNDDHFDS SIINIGYVGS VLKHFNVYRD LVKQYVNDND DIKRLGVQVD FKLIENSSII
GAAIGAAYYS