Gene PICST_48829 details

Gene Information

Locus tag: PICST_48829
Symbol: NAG2
ID: 4840115
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 13338
End bp: 14600
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 12
GC content: 45%
IMG OID: 640391430
Product: N-acetyl-glucosamine-6-phosphate deacetylase
Protein accession: XP_001385688
Protein GI: 150866183
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1820] N-acetylglucosamine-6-phosphate deacetylase
TIGRFAM ID: [TIGR00221] N-acetylglucosamine-6-phosphate deacetylase
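
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the function name cds_lengths is illustrative and not part of the record.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon, so it does not contribute a residue.
    return gene_len, gene_len // 3 - 1


print(cds_lengths(13338, 14600))  # (1263, 420)
```

Under these assumptions the result agrees with the Gene Length (1263 bp) and Protein Length (420 aa) fields.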


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTCCT TCACCAGATT CACCAACTGC CACTTGATCG ACAACGGCCA GCTCTACGAA 
TACACCGACC TTTATGTCAA CAACAAAACC AACAAAATTT CACATCCTCC AGCAGACTCC
AGCCTTATCA CCATAACCGT CGATGTCCAC GGCAATATCC TTGCTCCAGG TTTCTTGGAT
ATCCAGAACA ACGGAATCTA TGGCTTGAAC TTTTCCAATC TAAACGCAAA CTCAACCCCT
CAGGATGTCG CTGCTTTCGA CAAGTTCTAC AAAGATGCCA TGACAAAGTA CTTGTCTACT
GGTGTCACCG CTACTTGCCC TACTGTGACT TCTAACTTCC CCGAAGTGTA CGAGAAGGTA
TTACCATTTT ACAAGAAGTC TAGATTGTCA ACCCAAACGG ACTCATTGGG TGCTCACATT
GAAGGTCCTT TCATCAACTT GAAGAAGAAA GGCTGCCATC CTGTAGAAAC CTTTGTCGAT
GCTAAAGAGG GTGAAGCTAA GTTGTACCAT ATTTACGGTG AGAGCAACTT GATTGACAAC
GTATGCATCT TAACAGCTGC TCCTGAGATT CCAGGAGTGT TGGACCTTAT TCCTCTTGTT
AAACTGAAGA ACATTGTCTT CTCACTCGGC CACACCATGG CTGACTACAA AACTGGCATT
AGAGCTGTAG AATGCGGTGC CTCCATGATC ACTCACTTGT ACAACGCCAT GCCGCAACCT
CACCATAGAG ACGCTGGCGT CGTTGGTTTG ATCAACTCTC CTGAACTTGG AGAAGAGAAT
ACCCCATACT TCGGCTTGAT CGTAGACGGA GTCCACGTTG ATCCTTCCAT GGTGAACTTG
GCTTATAGAT CAAATCCCGA CAAGTGTGTC TTGGTCACCG ATGCCATGCA TTTGATTGGT
TTGCCGGATG GAACTTACAA ATGGGACGAC CAATACATTG TAAAGACCGG TGACAGATTG
TACTTGAAGG GCACGAAAAC CTTGGCTGGG GCTGCTACTA CTTTGCCCCA ATGTGTTAGA
AACTTGATAA AGTGGTCCAA CATCACTTTG CCAGAAGCTG TGAAGACTGT GACTAATAAT
GCTGCTGTTT CTGTTGGTCT TGAGCATCAA AAAGGTTTCT TGAACGTTGG GTGTGATGCC
GACTTGGTCG TTTTGGACAG AGATGGTTAT ATTCAGAAGG TCTACAAGTT GGGTAGGGAA
ATCCAATCAA GTGATATCAA CTTGTCCAAC GAAAAGAATC CAAAAGTGAT GGCTGTTTTG
TAA
 
Protein sequence
MASFTRFTNC HLIDNGQLYE YTDLYVNNKT NKISHPPADS SLITITVDVH GNILAPGFLD 
IQNNGIYGLN FSNLNANSTP QDVAAFDKFY KDAMTKYLST GVTATCPTVT SNFPEVYEKV
LPFYKKSRLS TQTDSLGAHI EGPFINLKKK GCHPVETFVD AKEGEAKLYH IYGESNLIDN
VCILTAAPEI PGVLDLIPLV KSKNIVFSLG HTMADYKTGI RAVECGASMI THLYNAMPQP
HHRDAGVVGL INSPELGEEN TPYFGLIVDG VHVDPSMVNL AYRSNPDKCV LVTDAMHLIG
LPDGTYKWDD QYIVKTGDRL YLKGTKTLAG AATTLPQCVR NLIKWSNITL PEAVKTVTNN
AAVSVGLEHQ KGFLNVGCDA DLVVLDRDGY IQKVYKLGRE IQSSDINLSN EKNPKVMAVL
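
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The following is a minimal sketch of how the protein sequence relates to the gene sequence under that table. The names BASES, TABLE_12_AAS, CODON_TABLE_12 and translate are illustrative, and the sketch assumes the input is an in-frame coding sequence containing only A, C, G and T.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 12, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ..., GGG. The only difference from the
# standard code is CTG, which is read as Ser (S) instead of Leu (L).
TABLE_12_AAS = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group bases in blocks of ten.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE_12[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above.
first_row = "ATGGCTTCCT TCACCAGATT CACCAACTGC CACTTGATCG ACAACGGCCA GCTCTACGAA"
print(translate(first_row))       # MASFTRFTNCHLIDNGQLYE

# Start of the eleventh row, which contains an in-frame CTG codon.
print(translate("AAACTGAAGA AC"))  # KSKN
```

The second call shows the table 12 effect: the CTG codon yields the S in the KSKN stretch of the protein sequence, where the standard code would give L.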