Gene Namu_4911 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4911 
Symbol 
ID8450542 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp5481257 
End bp5482582 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content71% 
IMG OID645043950 
ProductDyp-type peroxidase family 
Protein accessionYP_003204174 
Protein GI258655018 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2837] Predicted iron-dependent peroxidase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR01412] Tat-translocated enzyme
[TIGR01413] Dyp-type peroxidase family 


Plasmid Coverage information

Num covering plasmid clones49 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGAGC AGGGCACCTC ACGTCGGCGG TTCTTCACCG GGGCCGCCGG CCTGGGCGTG 
GCCGCCGGCG TCGGGGTCGG CGTCGGGGTG GCGACCGGGT ACGGCATCCG GGCTGCGACC
GAGGCCAACG CCGCGTCGAC CGACCCCGGC TCGACCGGCG CAGCCTCGAC CGACCCCGAT
GCGCAGGCCA ACGCGATCGT CCCGTTCTAC GGAGCGCGGC AGGCCGGGAT CGTCACCCCG
CAGCAGGAGC GGCTGATGTT CGCCGCCTTC GACGTGAGCA CCACCGACGT CGAGGAACTC
AAGCGGATGC TCGGCCGGTG GGCGGCGATG GCCGCGCGGA TGACGCAGGG CAAGCAGGTC
AGCGACTCGC CGACCAAGCC GGCCCAGCCG CCGTTCGACA CCGGCGAGGC GATGGATCTG
GGCGCGCACT CGCTGACCAT CACCGTCGGC TTCGGCCCCA GCCTGTTCGA CGACCGCTTC
GGGCTGGCCG ACCGGATGCC GCCCGAGCTG ACCGCCTTCG GCACCATTCC CGGTGACGCG
GTGATGCGGG CCGAGCTGTC CGACGGCGAC CTGTGCGTGC AGGCCTGCGC GGACGATCCC
CAGGTGGTCT TCCACGCCAT CCGCAACCTG GCCCGGGCGG CCCGCGGCAC CGCCACCCTG
CGCTGGTCGC AGCTGGGCTT CGGGCGGGCG TCCTCGACCG GGTCGCAGCA GGTCACCCCG
CGCAACCTGA TGGGCTTCAA GGACGGCACC CGCAACGTGC GGGCCGACGA CACCGCGACC
CTGGACGCGC ACGTGTGGGT GGGCGCGAAC GGCGCGTCCC TGGCCCCCGA GCACGAGTGG
ATGCGGGGCG GCTCCTACCT GGTCGCCCGC AAGATCCGGA TGGAGATCGA GTCCTGGGAC
ACCGATCCCC TGGAGGACCA GGAGAAGATC TTCGCCCGGT TCAAGGACAC TGGGGCGCCG
CTGACCGGGG GTGACGAGTT CACCGCGCCC GACTACGCCA AGCTCGGCGA CAACGGTCAG
CCGGTGATCG ACATCGACGC CCACATCCGG CTGGCCTCGC CGGAGCAGAA CAACGGCCTG
ACCATCCTGC GTCGCGGCTA CAACTACACC GACGGCCAGG ACCCGGCCAC CGGCAAGCTC
GCCGCCGGCC TGTTCTTCAT CGCCTACCAG CGGGACCCGC AGACCCAGTT CAAGGTGCTG
CAGACCCGGC TGGGCAAGAG CGATCTGCTC AACGAGTACA TCGCCCACAT CGGCGGCGGC
CTGTGGGGCT GCCCGCCGGG AGTCAGCGCG CCGGGCGACT GGTTCGGCAA GTCTCTTTTC
ACCTGA
 
Protein sequence
MTEQGTSRRR FFTGAAGLGV AAGVGVGVGV ATGYGIRAAT EANAASTDPG STGAASTDPD 
AQANAIVPFY GARQAGIVTP QQERLMFAAF DVSTTDVEEL KRMLGRWAAM AARMTQGKQV
SDSPTKPAQP PFDTGEAMDL GAHSLTITVG FGPSLFDDRF GLADRMPPEL TAFGTIPGDA
VMRAELSDGD LCVQACADDP QVVFHAIRNL ARAARGTATL RWSQLGFGRA SSTGSQQVTP
RNLMGFKDGT RNVRADDTAT LDAHVWVGAN GASLAPEHEW MRGGSYLVAR KIRMEIESWD
TDPLEDQEKI FARFKDTGAP LTGGDEFTAP DYAKLGDNGQ PVIDIDAHIR LASPEQNNGL
TILRRGYNYT DGQDPATGKL AAGLFFIAYQ RDPQTQFKVL QTRLGKSDLL NEYIAHIGGG
LWGCPPGVSA PGDWFGKSLF T