Gene Namu_0597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_0597 
Symbol 
ID8446181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp657842 
End bp658858 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content75% 
IMG OID645039730 
Product2-nitropropane dioxygenase NPD 
Protein accessionYP_003200001 
Protein GI258650845 
COG category[R] General function prediction only 
COG ID[COG2070] Dioxygenases related to 2-nitropropane dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones50 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGGAC TGGTCGATCT CGCGGTGGCC GCGACGGTCC TGGCCGCACC GATGGCCGGC 
GGGCCGAGCA CGCCCGATCT GGTGACGGCC ACCGCCGCGG CCGGCAGCCT GGGGTTCCTG
GCCGGCGGAT ACCGGACCGC GGCCCAGCTG GCCGCGCAGA TCGCCGAGGT TCGGGCGATC
ACCCCGACGT TCGGGGTGAA TCTGTTCGCG CCCAACCCGA TTCCGGTCGA CCCCCAGGCC
TACGCGCAGT ACGCCGCCCG GTTGGCCGAG CGGGCGGACC ACTTCGGTGT CGTGTTGCCG
CCCCGGCCCA TCGAGGACGA CGACGGCTGG CCGGACAAAC TCGACCTGCT GATCGAGGAC
CCGGTGCCGC TGGTCAGCTT CACGTTCGGG CTACCGCCGG CCAGGGCGAT CCGGGCCCTG
CAGCGCGCCG GCAGCGCGGT CGCCCAGACG GTGACCGGCC CCGCGGAAGC GCGCTGGGCG
CTGGACGCCG GAGCCGACCT GCTCATCGTG CAAAGCGCCG ACGCCGGCGG GCATTCCGCG
GTCTTCGATC CCTCGGTCCG CCCGCCATCC CCGGCGCTGC CGGACCTCAT CCGGCAGATC
GCGGCGACCA CACCGCGGCC GCTGATCGCG GCCGGCGGGT TGTCCTCGGC CGACCGGGTG
GCGGCGGCGC TCCGGGCGGG CGCGGCCGCC GTCATGGTCG GCACGGCCTT GCTGCTGGCG
GACGAGGCCG GGACCTCGGC CGTGCATCGG GCGGCGATCG CCGGGCGTCC CGGTCCGACC
GTGATCACCC GGGCGTTCAC CGGTCGCCCG GCCCGCGGAC TGGTCAACGA GTTCATCGTG
CAGTTCGAAC CACGGGCGCC GCTGGGCTAC CCGGCCCTGC ACCACCTGAC CAGCCCGCTG
CGCAAGGCCG CGGCCGCCGC CGGCGATCCG GAATGGGTGC ACCTGTGGGC CGGAACCGGC
CACGGCGCCG TCACTCCCGG GCCGGTCGCC GACATCCTGC GGCGCCTGGC CGTCTGA
 
Protein sequence
MSGLVDLAVA ATVLAAPMAG GPSTPDLVTA TAAAGSLGFL AGGYRTAAQL AAQIAEVRAI 
TPTFGVNLFA PNPIPVDPQA YAQYAARLAE RADHFGVVLP PRPIEDDDGW PDKLDLLIED
PVPLVSFTFG LPPARAIRAL QRAGSAVAQT VTGPAEARWA LDAGADLLIV QSADAGGHSA
VFDPSVRPPS PALPDLIRQI AATTPRPLIA AGGLSSADRV AAALRAGAAA VMVGTALLLA
DEAGTSAVHR AAIAGRPGPT VITRAFTGRP ARGLVNEFIV QFEPRAPLGY PALHHLTSPL
RKAAAAAGDP EWVHLWAGTG HGAVTPGPVA DILRRLAV