Gene PICST_32069 details

Gene Information

Locus tag: PICST_32069
Symbol: HEX1
ID: 4839099
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 609304
End bp: 611148
Gene Length: 1845 bp
Protein Length: 614 aa
Translation table: 12
GC content: 44%
IMG OID: 640390414
Product: Mannosyl-glycoprotein endo-beta-N-acetylglucosamidase
Protein accession: XP_001384784
Protein GI: 150865529
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3525] N-acetyl-beta-hexosaminidase
TIGRFAM ID:
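
The coordinates and lengths listed for this gene agree with each other if one assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single stop codon. The record does not state either convention, so the following Python sketch is only a consistency check under those assumptions; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 609304
end_bp = 611148

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is unspliced (the record says "No") and its
# last codon is a stop codon that is not counted in the protein.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under these assumptions the computed values match the Gene Length (1845 bp) and Protein Length (614 aa) fields.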


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.28283
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGTTGA CTAGCTTGGT GGTTACTATC GCTCCATTGT TGGCATTGAC CCAGGCGGTT 
AAAGTAAATC CACTCCCAGC TCCTAGACTG ATCGACTGGC TCGATGAAAA CCCAATTTCT
GTGAATTTGG ATAAGTTGAA CTTGGAAATT GGCGCCGAAA ACTCGATTAT TTCAGAAGCC
TTCTACAGAA CCGTTTCAAC ATTGAGAAAG TTGAAATGGT ATCCAGCCGC TACTGAAGCT
CCTATCTCCA GCTTTGTTCC ATTTCCTACT GCTGAAGCTG CTGTTGACGC CAAAAAGAAA
AAAAGAGACA GCCAGCGTAC CTTTGACTTG TCTGGCTTGA GCGTTGTGGA AGTCACAGTT
AACGATTATG CCGCTGACCT TCAAATGGGA GTCAATGAGA CATATACCTT GTCCGTTTCC
CCCTCCAGTA TTATAATTGA ATCTGAAACC GTTTGGGGGG TTCTCCACGC TTTCACTACC
TTGCAACAAT TGATCATCTA CGACAATAGC AAGTTCGTCA TTGAGGGATC AGTAAACATA
TGGGACGCCC CTCTTTACCA ACATCGTGGT GTGATGGTTG ATACTGGTCG TAATTACTTG
AGCATTGACT CCATCTTGGA TCAAATCGAC ATGATGGCTC TTTCCAAGTT GAACTCTTTG
CACATTCACC TAGACGATGC TCAGAGTTGG CCATTGTTAT TGAACTCGTA CCCAGAAATG
ATCATGGATG CCTACAGTGA ACGTGAAATC TACACTATCC AAGACCTTCA ACACATCATC
AAGTATGCAA AGAACAGAGG TGTGAGAGTT ATACCAGAAA TCGACCTTCC AGGACATGCT
CGCGCTGGTT GGAGACAGAT CAACCCTGAT TTGGTTGCTT GTGGTGACTC ATGGTGGTCT
AACGACGTCT GGGCTTCCCA TACTGCTGTA GAGCCACCTC CAGGTCAGTT GGACATCATG
AATGATGAAG TATACGAAGT CATTGCTGAT GTTTATAATG AATTGAGTGA GATTTTCACT
GATAATGTAT TTCACGTTGG CGCCGATGAG ATCCAAACTG GATGTTACAA CATGTCGACC
TTGATTCAAA ACTGGTTCAA GGAAGATCCT TCAAGATCCT GGAATGACTT AAGTCAGTAC
TATGTTGACA AGGCATACCC AATCTTCATG AACAAGACTA ACAGACGTTT GATGATGTGG
GAAGATATAC TCTTGACTCC AGAAGGTGCC CACACTTTGC CTACCGATGT TATTTTGCAA
TCTTGGAACA ACGACTTGGT TAACATTCAA AACTTGACTT CTCGTGGATA CGACGTCATT
GTTTCGTCGT CTTCGCACTT CTACTTGGAC TGTGGTTTTG GTGGATGGGT TTCCAACGAT
CCAAGATACA TTGACGACTA CTCGAACGAT GTGTTCAACA CCGGTTTAGG AGGTTCTTGG
TGTGCTCCTT ACAAGACCTG GCAAAGAATC TACGACTACG ATTTTACTGC CAACTTGACA
GATGCTCAGG CTGAACACGT TATTGGTGCC GAAGTGGCCT TGTGGTCCGA GCAAGTCGAC
TCTACTGTTT TAACCCAAAA GATCTGGCCA AGAGCTGCTG CATTGGCTGA ATCCACTTGG
TCTGGTAACC GTAACTCTGA AGGATACTTG AGAACCAACG AGTTGACTCA AAGAATCTTG
AACTTCAGAG AATATTTGGT TGCTCTTGGT TTCGGTGCTT CACCTCTTGT GCCAAAGTAC
TGTTTGCTTA ACCCTCATGC TTGTGATTTG TACCAAAATC AAACTGTTCT TGAGCAGTAT
GGTACACACA ACGATAAGAA CTCCACTATT GCTGTTCTTA ACTGA
 
Protein sequence
MKLTSLVVTI APLLALTQAV KVNPLPAPRS IDWLDENPIS VNLDKLNLEI GAENSIISEA 
FYRTVSTLRK LKWYPAATEA PISSFVPFPT AEAAVDAKKK KRDSQRTFDL SGLSVVEVTV
NDYAADLQMG VNETYTLSVS PSSIIIESET VWGVLHAFTT LQQLIIYDNS KFVIEGSVNI
WDAPLYQHRG VMVDTGRNYL SIDSILDQID MMALSKLNSL HIHLDDAQSW PLLLNSYPEM
IMDAYSEREI YTIQDLQHII KYAKNRGVRV IPEIDLPGHA RAGWRQINPD LVACGDSWWS
NDVWASHTAV EPPPGQLDIM NDEVYEVIAD VYNELSEIFT DNVFHVGADE IQTGCYNMST
LIQNWFKEDP SRSWNDLSQY YVDKAYPIFM NKTNRRLMMW EDILLTPEGA HTLPTDVILQ
SWNNDLVNIQ NLTSRGYDVI VSSSSHFYLD CGFGGWVSND PRYIDDYSND VFNTGLGGSW
CAPYKTWQRI YDYDFTANLT DAQAEHVIGA EVALWSEQVD STVLTQKIWP RAAALAESTW
SGNRNSEGYL RTNELTQRIL NFREYLVALG FGASPLVPKY CLLNPHACDL YQNQTVLEQY
GTHNDKNSTI AVLN
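
The record specifies translation table 12 (the alternative yeast nuclear code), which reads the codon CTG as serine rather than leucine. This matters here: codon 30 of the gene sequence is CTG, and the protein sequence has S at position 30. The Python sketch below illustrates the difference on the first 90 bases of the gene sequence. The helper `translate` and the table names are hypothetical, written for this example only; the codon table is built from the standard code with the single CTG change, and alternative start codons are ignored.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
STANDARD_TABLE = dict(zip(CODONS, STANDARD_AAS))

# Table 12 as used here: identical to the standard code except CTG -> Ser.
TABLE_12 = dict(STANDARD_TABLE)
TABLE_12["CTG"] = "S"


def translate(dna, table):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 90 bases of the gene sequence, copied from the record.
first_90 = (
    "ATGAAGTTGA CTAGCTTGGT GGTTACTATC GCTCCATTGT TGGCATTGAC CCAGGCGGTT "
    "AAAGTAAATC CACTCCCAGC TCCTAGACTG"
)

print(translate(first_90, TABLE_12))        # ends in ...APRS, as in the record
print(translate(first_90, STANDARD_TABLE))  # ends in ...APRL under the standard code
```

Translating with the standard code would therefore give a protein that differs from the one in this record at every CTG codon.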