Gene Namu_0961 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_0961 
Symbol 
ID8446553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1054322 
End bp1055476 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content75% 
IMG OID645040097 
ProductN-acetylglucosamine-6-phosphate deacetylase 
Protein accessionYP_003200360 
Protein GI258651204 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1820] N-acetylglucosamine-6-phosphate deacetylase 
TIGRFAM ID[TIGR00221] N-acetylglucosamine-6-phosphate deacetylase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value0.781725 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGATCA CCGCCGACAC CATCGCCACC GGGGCGGAGC TGCTCCGGCC GGGCTGGATC 
GAGGTCGCCG ACGGACGGGT GGTCGCCCTG GGGGACGGGG CACCGCCACG GTCAGCCGAC
CAGCCGGCCG ACCGGGACCT GGGCGCGGTG ACGATCGTCC CCGGCTTCGT GGACATGCAT
GTGCACGGCG GCGGGGGAGG GGCGTTCCCG GAGGCCAGCT TCGCCACCAC CAAGGCCGCG
GTCGAGCTGC ACCGGCGGCA CGGCACCACC ACGATGGTCG CCTCCCTGGT CACCGCGACC
GGGCCGGAAA TGCTGCGGCA GGTCGGCATC CTGGCCGAGC AGGTGCAGGA CGGCCTGGTC
GCCGGTGTGC ATCTGGAGGG CCCGTGGCTC TCCCAGCATC GCTGCGGGGC CCATGAGCTC
TCGGCCCTGC GTGACCCCGA CCCGGCCGAG CTCGACCGGG TGCTGGCGGC CGGCCAGGGC
ACCATCCGGA TGGTCACCCT GGCGCCCGAG CGGGCCGGCG GGCTGGCCGC CATCGGCCGG
CTGGTCGACG CCGGGGTGAT CGCCGCGATC GGCCACACCA ACGCCACCTA CGAGCAGGCC
CGAGCCGCGA TCGAGGCCGG CGCCACCGTC GGCACCCACC TGTTCAATGC GATGCGGCCG
GTGCACCACC GCGAGCCGGG TCCGGTGATC GCGCTGCTGG AGGACCCGCG GGTGACCGTG
GAGATGATCA CCGACGGGGT GCACCTGCAT CCGGCGCTGT ACCGGGACGT CACGTCCAAC
GTCGGTCCGG ACCGGATCGC CCTGATCACC GACGCGATGG CCGCGGCCGG CATGGCCGAC
GGTGCCTACC GGCTCGGCGC GCTCGACGTC GACGTCCGGG ACGGGGTCGC CCGGGTCGCC
GGCACCGACA CCATTGCCGG CAGCACCGCG ACCATGGACC AGGTGTTCCG GTTCGCCGTG
CTGCACAGCG CCCGGCCACG GGACGAGGCC CTGCTGGTCG CGGTCCGGCA GTCCTCGGTC
AACCCGGCCC GCGCGCTGGG CCTGCCGCCG GCCGGTCTGG CCCCCCAGGC GGCCGCCGAT
CTGGTGGTCC TGGATGAGGC GCTGACCGTG AGCGGGGTGC TGCAGGCCGG CTCCTGGGTG
GTTCAGCCCG GCTGA
 
Protein sequence
MLITADTIAT GAELLRPGWI EVADGRVVAL GDGAPPRSAD QPADRDLGAV TIVPGFVDMH 
VHGGGGGAFP EASFATTKAA VELHRRHGTT TMVASLVTAT GPEMLRQVGI LAEQVQDGLV
AGVHLEGPWL SQHRCGAHEL SALRDPDPAE LDRVLAAGQG TIRMVTLAPE RAGGLAAIGR
LVDAGVIAAI GHTNATYEQA RAAIEAGATV GTHLFNAMRP VHHREPGPVI ALLEDPRVTV
EMITDGVHLH PALYRDVTSN VGPDRIALIT DAMAAAGMAD GAYRLGALDV DVRDGVARVA
GTDTIAGSTA TMDQVFRFAV LHSARPRDEA LLVAVRQSSV NPARALGLPP AGLAPQAAAD
LVVLDEALTV SGVLQAGSWV VQPG