Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_32069 |
Symbol | HEX1 |
ID | 4839099 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 609304 |
End bp | 611148 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640390414 |
Product | Mannosyl-glycoprotein endo-beta-N-acetylglucosamidase |
Protein accession | XP_001384784 |
Protein GI | 150865529 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.28283 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTGA CTAGCTTGGT GGTTACTATC GCTCCATTGT TGGCATTGAC CCAGGCGGTT AAAGTAAATC CACTCCCAGC TCCTAGACTG ATCGACTGGC TCGATGAAAA CCCAATTTCT GTGAATTTGG ATAAGTTGAA CTTGGAAATT GGCGCCGAAA ACTCGATTAT TTCAGAAGCC TTCTACAGAA CCGTTTCAAC ATTGAGAAAG TTGAAATGGT ATCCAGCCGC TACTGAAGCT CCTATCTCCA GCTTTGTTCC ATTTCCTACT GCTGAAGCTG CTGTTGACGC CAAAAAGAAA AAAAGAGACA GCCAGCGTAC CTTTGACTTG TCTGGCTTGA GCGTTGTGGA AGTCACAGTT AACGATTATG CCGCTGACCT TCAAATGGGA GTCAATGAGA CATATACCTT GTCCGTTTCC CCCTCCAGTA TTATAATTGA ATCTGAAACC GTTTGGGGGG TTCTCCACGC TTTCACTACC TTGCAACAAT TGATCATCTA CGACAATAGC AAGTTCGTCA TTGAGGGATC AGTAAACATA TGGGACGCCC CTCTTTACCA ACATCGTGGT GTGATGGTTG ATACTGGTCG TAATTACTTG AGCATTGACT CCATCTTGGA TCAAATCGAC ATGATGGCTC TTTCCAAGTT GAACTCTTTG CACATTCACC TAGACGATGC TCAGAGTTGG CCATTGTTAT TGAACTCGTA CCCAGAAATG ATCATGGATG CCTACAGTGA ACGTGAAATC TACACTATCC AAGACCTTCA ACACATCATC AAGTATGCAA AGAACAGAGG TGTGAGAGTT ATACCAGAAA TCGACCTTCC AGGACATGCT CGCGCTGGTT GGAGACAGAT CAACCCTGAT TTGGTTGCTT GTGGTGACTC ATGGTGGTCT AACGACGTCT GGGCTTCCCA TACTGCTGTA GAGCCACCTC CAGGTCAGTT GGACATCATG AATGATGAAG TATACGAAGT CATTGCTGAT GTTTATAATG AATTGAGTGA GATTTTCACT GATAATGTAT TTCACGTTGG CGCCGATGAG ATCCAAACTG GATGTTACAA CATGTCGACC TTGATTCAAA ACTGGTTCAA GGAAGATCCT TCAAGATCCT GGAATGACTT AAGTCAGTAC TATGTTGACA AGGCATACCC AATCTTCATG AACAAGACTA ACAGACGTTT GATGATGTGG GAAGATATAC TCTTGACTCC AGAAGGTGCC CACACTTTGC CTACCGATGT TATTTTGCAA TCTTGGAACA ACGACTTGGT TAACATTCAA AACTTGACTT CTCGTGGATA CGACGTCATT GTTTCGTCGT CTTCGCACTT CTACTTGGAC TGTGGTTTTG GTGGATGGGT TTCCAACGAT CCAAGATACA TTGACGACTA CTCGAACGAT GTGTTCAACA CCGGTTTAGG AGGTTCTTGG TGTGCTCCTT ACAAGACCTG GCAAAGAATC TACGACTACG ATTTTACTGC CAACTTGACA GATGCTCAGG CTGAACACGT TATTGGTGCC GAAGTGGCCT TGTGGTCCGA GCAAGTCGAC TCTACTGTTT TAACCCAAAA GATCTGGCCA AGAGCTGCTG CATTGGCTGA ATCCACTTGG TCTGGTAACC GTAACTCTGA AGGATACTTG AGAACCAACG AGTTGACTCA AAGAATCTTG AACTTCAGAG AATATTTGGT TGCTCTTGGT TTCGGTGCTT CACCTCTTGT GCCAAAGTAC TGTTTGCTTA ACCCTCATGC TTGTGATTTG TACCAAAATC AAACTGTTCT TGAGCAGTAT GGTACACACA ACGATAAGAA CTCCACTATT GCTGTTCTTA ACTGA
|
Protein sequence | MKLTSLVVTI APLLALTQAV KVNPLPAPRS IDWLDENPIS VNLDKLNLEI GAENSIISEA FYRTVSTLRK LKWYPAATEA PISSFVPFPT AEAAVDAKKK KRDSQRTFDL SGLSVVEVTV NDYAADLQMG VNETYTLSVS PSSIIIESET VWGVLHAFTT LQQLIIYDNS KFVIEGSVNI WDAPLYQHRG VMVDTGRNYL SIDSILDQID MMALSKLNSL HIHLDDAQSW PLLLNSYPEM IMDAYSEREI YTIQDLQHII KYAKNRGVRV IPEIDLPGHA RAGWRQINPD LVACGDSWWS NDVWASHTAV EPPPGQLDIM NDEVYEVIAD VYNELSEIFT DNVFHVGADE IQTGCYNMST LIQNWFKEDP SRSWNDLSQY YVDKAYPIFM NKTNRRLMMW EDILLTPEGA HTLPTDVILQ SWNNDLVNIQ NLTSRGYDVI VSSSSHFYLD CGFGGWVSND PRYIDDYSND VFNTGLGGSW CAPYKTWQRI YDYDFTANLT DAQAEHVIGA EVALWSEQVD STVLTQKIWP RAAALAESTW SGNRNSEGYL RTNELTQRIL NFREYLVALG FGASPLVPKY CLLNPHACDL YQNQTVLEQY GTHNDKNSTI AVLN
|
| |