Gene Namu_4236 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_4236 
Symbol 
ID8449862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp4696648 
End bp4698045 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content75% 
IMG OID645043285 
Productperoxidase, putative 
Protein accessionYP_003203514 
Protein GI258654358 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2837] Predicted iron-dependent peroxidase 
TIGRFAM ID[TIGR01413] Dyp-type peroxidase family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.0422414 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCGCCG GCGAGCCCGA CCTCGATCAG CTGCAGGGGC TGCTGACCAG CGCATTTCCC 
CGCTCGCCGG CCGGCCGGTA CGTGCTGGTC GCCCTGCCGG ATGCCGAACG CGGGCGCGCC
TGGCTGCGCT CGCTGCTGCC CATGATCACG TTCTCGGACG AGGTCGACCA GCAGATCCGG
GCCCGGCGCG CCGCGGACCG GCCGGCCGTC AACGTGGCCT TCACCGCCGC CGGCCTCGCC
GCCCTGGGCG TGCCGGCGGA CCGCACGGCG GACTTCTCCC GGGAATTCCG TGAGGGCATG
GTCACCCCGC ACCGCCAGCG CATCCTGGGC GATCTGGACG GCTCGCCCAG CGACCCGCGC
GGCTGGCGCT GGGGCGGGCC GGGCACCGAT CCGGTCCATG CGGTGCTGCT GCTCTTCGGG
GCGGACGAGG CCGCCCTGGA CGACGTCGTA GGCGAGCTGC TCGGCGCGGC CACCGGGGTC
CGGGTCGTGC ACACCGTCCC GACCGTGTCG ATCGCGGACG GCCGCGAGCA CTTCGGGTTC
CGGGACGCGA TCGCCAGCCC CTGGGTGCCC GGGTTGCACC GGGATCGCGC GAAACGGGAC
CGGGTCGCGG CCGGCGAGCT CGTCCTCGGC CGGCCCGACC TGACCGGGCA GCCGGAACCC
TTCCCGCCGG TGGGCCGGGA CGGCAGCTAC CTGGTGATCC GCCAGCTGGC CCAGGACGTG
CCCGGCTTCT GGACGGCCCT GCGGCAGTCG GTGGGCGACG CGCAGGCCGT GCGGTGGGCC
GCGAAGATGA CCGGCCGCTG GCCGGACGGC ACCGCGCTGA TCCGCTCCCC CGGCGGCGCG
GCGGCCGACC CGTCCGATGA TTTCGGTTAC CACGACGACC CGGACGGTGT CCGCTGCCCG
CTGGGCGCCC ACATCCGGCG GGCCAACCCC CGCGACGGGT TGGGGACCCG GCCGGACGAG
TCGATCCGGC TGGTGAACCG GCACCGGATC TTCCGCCGGG GCCGGCCGTT CGGCGCGGCG
GCACCCTGGC CCACCTGGCC TGCCGGCATC GACCCGGTCG TCGTGGACAG CGGGCCGCCG
GACGACAGCG GTGAGCGGGG GGTCGTTTTC GTCTGCCTCG GCGCCAGCCT GGCCCGGCAG
TTCGAGTTCG TCACGCAGTC TTGGGTGAAC AACCCGAAGT TCGCCGGGCT CTACGACGAA
GCCGACCCGA TCACCGGCGC ACCCCACCGG CGGATGTCCG GGTCGCGCGG GTCGGCGATC
GGATTCGAGT TCACCGCGCC CGGGCCCGTC CTCAACGAGC GGATCGACCG GCCGGCCACC
TACGTGCGCT GCGTCGGCGG CGGCTACTTC TTCCTGCCCG GCCGCCGCGG ACTGGCGCTG
ATCGCCGCGG AGGCCTGA
 
Protein sequence
MTAGEPDLDQ LQGLLTSAFP RSPAGRYVLV ALPDAERGRA WLRSLLPMIT FSDEVDQQIR 
ARRAADRPAV NVAFTAAGLA ALGVPADRTA DFSREFREGM VTPHRQRILG DLDGSPSDPR
GWRWGGPGTD PVHAVLLLFG ADEAALDDVV GELLGAATGV RVVHTVPTVS IADGREHFGF
RDAIASPWVP GLHRDRAKRD RVAAGELVLG RPDLTGQPEP FPPVGRDGSY LVIRQLAQDV
PGFWTALRQS VGDAQAVRWA AKMTGRWPDG TALIRSPGGA AADPSDDFGY HDDPDGVRCP
LGAHIRRANP RDGLGTRPDE SIRLVNRHRI FRRGRPFGAA APWPTWPAGI DPVVVDSGPP
DDSGERGVVF VCLGASLARQ FEFVTQSWVN NPKFAGLYDE ADPITGAPHR RMSGSRGSAI
GFEFTAPGPV LNERIDRPAT YVRCVGGGYF FLPGRRGLAL IAAEA