Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_48829 |
Symbol | NAG2 |
ID | 4840115 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | - |
Start bp | 13338 |
End bp | 14600 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640391430 |
Product | N-acetyl-glucosamine-6-phosphate deacetylase |
Protein accession | XP_001385688 |
Protein GI | 150866183 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1820] N-acetylglucosamine-6-phosphate deacetylase |
TIGRFAM ID | [TIGR00221] N-acetylglucosamine-6-phosphate deacetylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTCCT TCACCAGATT CACCAACTGC CACTTGATCG ACAACGGCCA GCTCTACGAA TACACCGACC TTTATGTCAA CAACAAAACC AACAAAATTT CACATCCTCC AGCAGACTCC AGCCTTATCA CCATAACCGT CGATGTCCAC GGCAATATCC TTGCTCCAGG TTTCTTGGAT ATCCAGAACA ACGGAATCTA TGGCTTGAAC TTTTCCAATC TAAACGCAAA CTCAACCCCT CAGGATGTCG CTGCTTTCGA CAAGTTCTAC AAAGATGCCA TGACAAAGTA CTTGTCTACT GGTGTCACCG CTACTTGCCC TACTGTGACT TCTAACTTCC CCGAAGTGTA CGAGAAGGTA TTACCATTTT ACAAGAAGTC TAGATTGTCA ACCCAAACGG ACTCATTGGG TGCTCACATT GAAGGTCCTT TCATCAACTT GAAGAAGAAA GGCTGCCATC CTGTAGAAAC CTTTGTCGAT GCTAAAGAGG GTGAAGCTAA GTTGTACCAT ATTTACGGTG AGAGCAACTT GATTGACAAC GTATGCATCT TAACAGCTGC TCCTGAGATT CCAGGAGTGT TGGACCTTAT TCCTCTTGTT AAACTGAAGA ACATTGTCTT CTCACTCGGC CACACCATGG CTGACTACAA AACTGGCATT AGAGCTGTAG AATGCGGTGC CTCCATGATC ACTCACTTGT ACAACGCCAT GCCGCAACCT CACCATAGAG ACGCTGGCGT CGTTGGTTTG ATCAACTCTC CTGAACTTGG AGAAGAGAAT ACCCCATACT TCGGCTTGAT CGTAGACGGA GTCCACGTTG ATCCTTCCAT GGTGAACTTG GCTTATAGAT CAAATCCCGA CAAGTGTGTC TTGGTCACCG ATGCCATGCA TTTGATTGGT TTGCCGGATG GAACTTACAA ATGGGACGAC CAATACATTG TAAAGACCGG TGACAGATTG TACTTGAAGG GCACGAAAAC CTTGGCTGGG GCTGCTACTA CTTTGCCCCA ATGTGTTAGA AACTTGATAA AGTGGTCCAA CATCACTTTG CCAGAAGCTG TGAAGACTGT GACTAATAAT GCTGCTGTTT CTGTTGGTCT TGAGCATCAA AAAGGTTTCT TGAACGTTGG GTGTGATGCC GACTTGGTCG TTTTGGACAG AGATGGTTAT ATTCAGAAGG TCTACAAGTT GGGTAGGGAA ATCCAATCAA GTGATATCAA CTTGTCCAAC GAAAAGAATC CAAAAGTGAT GGCTGTTTTG TAA
|
Protein sequence | MASFTRFTNC HLIDNGQLYE YTDLYVNNKT NKISHPPADS SLITITVDVH GNILAPGFLD IQNNGIYGLN FSNLNANSTP QDVAAFDKFY KDAMTKYLST GVTATCPTVT SNFPEVYEKV LPFYKKSRLS TQTDSLGAHI EGPFINLKKK GCHPVETFVD AKEGEAKLYH IYGESNLIDN VCILTAAPEI PGVLDLIPLV KSKNIVFSLG HTMADYKTGI RAVECGASMI THLYNAMPQP HHRDAGVVGL INSPELGEEN TPYFGLIVDG VHVDPSMVNL AYRSNPDKCV LVTDAMHLIG LPDGTYKWDD QYIVKTGDRL YLKGTKTLAG AATTLPQCVR NLIKWSNITL PEAVKTVTNN AAVSVGLEHQ KGFLNVGCDA DLVVLDRDGY IQKVYKLGRE IQSSDINLSN EKNPKVMAVL
|
| |