Gene Snas_5673 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_5673 
Symbol 
ID8886888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp6032757 
End bp6034406 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content70% 
IMG OID 
ProductUDP-N-acetylglucosamine--lysosomal-enzyme N-acetylglucosamine phosphotransferase 
Protein accessionYP_003514396 
Protein GI291303118 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0913502 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAAGTG TGAACTACAT GTCGTCGAGC CCGGTCGCGT GGGATCCCCC ACTGATCGAG 
GTCCCCACCG TCACCAAGGC CGAACTGACC GGTCTGCCGG TGCGGGGTGT CCGCCACAAC
TTCGCCGAGT ATCGGGGGCG CGCGCTCGAC GCCGCCGCGC CGCTGGCGGT CCGGCAGCAC
AACCTGACGC TGGTGAGCGC CGCCTTCGAC GAGGCCGGGG TGCCGTACTT CGCGGTGCCG
GGACTGGACG ACCTGACCTC CTGCCTGGCC GTGACGCTGA TGGACCGGGA CCGGGCCTGC
GCGGTGCTGC GGCGGCTGTG CCAGGAGACC GACGGCTATA TGTCGATCCT GCACCCGGTG
CCGCCGGTGC GGCAGGAGCC GCGCGCGGGT GGCGACAAGG AGGCGTGGGA GCAGGCCGGT
GACCCACGGG TGGTGCGGCT CAACTGGTAC TGGACCGACC CGGACCACAA GCTCAGCTTC
GGGCCTGAAC ACGGCTGCGA CGTCGAGTTC TGGCGCCGCG ACGCCAAGGT GCGGCTGATC
TCGCCGCGCC CCAACCGGGT GACCCGGGTG GTGCCGGTCG ACGGCGCGAG CGTCGAGGTG
GAGGCGCGCC GGTTCACTCG GCTGCTGGAC GGCGCCGCGA CCACGCTGCC GCCGGTGCGG
TCGCGACAGG AGTTCTCGCA CACCACCCCC GACGCGGTGG AGTTCCCCGT CGACGTCGTC
TACACCTGGG TCGACGGCAC CGACGCGGCC TGGCAGCGCC GCCGCGCCGA GTGCTCCGGC
GAGGTCTACC ACGTCGAGGC GGCCAGCGAC GCGCGCTACA TCAGCCGCGA CGAGCTGAAG
TACTCGCTGC GCTCGGTGCA CCAGAACGCG CCGTGGGTGC GCAACGTCTA CATCGTCACC
GACGACCAGA CGCCGCCGTG GCTCAACACC GACGACCCCC GGGTGCGGGT CGTCGACCAC
CGCGAGATCT TCTCCGACCC GTCGGTGCTG CCGGTGTTCA ACTCGCACGC GATCGAGTCC
CAGCTGCACC ACATTCCCGG GCTGTCGGAC CAGTTCCTGT ACTTCAACGA CGACATGTTC
CTGGGCCGTC CGCTCACCCC GCAGCGGTTC TTCGAGGCCA ACGGACTGTC CCGGTTCTTC
TTCGCGGGCT CGCACGTGCC GCTGGGGCCG ATCACCGAGA ACGACACCCC GGTGGACGCC
GCCTGCAAGA ACAACCGGGA ACTGTTGCGC GACAAGTTCG GCAAGACGAT CTCGCAGACC
TTCCAGCACG TGCCGTACCC GCTGCGGCGC GACGTCATGT TCGACATCGA GAAGGACTTC
GAGGAGGCGC ACCAGCGCAC CGCCGCCAGC CGGTTCCGAG CCCTGACCGA CCTGTCGATC
CCGTCCTCGT TCCAGCACTA CTACGCGTAC TTCACCGGCC GGGCCACGCC CGGGAAGCTG
CAGTCGGTGT ACATCCAGCT GGCCGTCGCC GACCTGCGGG AGCGGCTGGA CCGGCTGCTG
GCCCGCCGCG ACGCCGACGC GTTCTGCCTC AACGACGCCT ACTCGACCCC CGAGGACATG
GAGCGGCAGA ACTCGCTGCT GCTGCCGTTC CTGGAGTCGT ACTTCCCGGT GCCGTCGCCG
TTCGAGAAGA ACCCGGGTGC GTCGCCGTGA
 
Protein sequence
MGSVNYMSSS PVAWDPPLIE VPTVTKAELT GLPVRGVRHN FAEYRGRALD AAAPLAVRQH 
NLTLVSAAFD EAGVPYFAVP GLDDLTSCLA VTLMDRDRAC AVLRRLCQET DGYMSILHPV
PPVRQEPRAG GDKEAWEQAG DPRVVRLNWY WTDPDHKLSF GPEHGCDVEF WRRDAKVRLI
SPRPNRVTRV VPVDGASVEV EARRFTRLLD GAATTLPPVR SRQEFSHTTP DAVEFPVDVV
YTWVDGTDAA WQRRRAECSG EVYHVEAASD ARYISRDELK YSLRSVHQNA PWVRNVYIVT
DDQTPPWLNT DDPRVRVVDH REIFSDPSVL PVFNSHAIES QLHHIPGLSD QFLYFNDDMF
LGRPLTPQRF FEANGLSRFF FAGSHVPLGP ITENDTPVDA ACKNNRELLR DKFGKTISQT
FQHVPYPLRR DVMFDIEKDF EEAHQRTAAS RFRALTDLSI PSSFQHYYAY FTGRATPGKL
QSVYIQLAVA DLRERLDRLL ARRDADAFCL NDAYSTPEDM ERQNSLLLPF LESYFPVPSP
FEKNPGASP