Gene Namu_3232 details

Gene Information

Locus tag: Namu_3232
Symbol:
ID: 8448846
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 3560568
End bp: 3561797
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 74%
IMG OID: 645042311
Product: arsenite-activated ATPase ArsA
Protein accession: YP_003202552
Protein GI: 258653396
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0003] Oxyanion-translocating ATPase
TIGRFAM ID: [TIGR00345] arsenite-activated ATPase (arsA)
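
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function name `feature_lengths` is hypothetical and not part of any database API.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Coordinates taken from the record above.
print(feature_lengths(3560568, 3561797))  # (1230, 409)
```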


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.00988964
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000117185
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGAATTG CCCTGCACAC CGGCAAGGGC GGGGTCGGCA AGACGACCAT CTCGGCGGCC 
ACCGCGATCG CGTGTGCCGC CGGCGGGGCG CGCACGCTGC TGGTCTCCAC CGACCCCGCA
CACTCCATCG CCGACGTGCT CGGCACCCCG GTCAGCGGCG ACCCGACCCC GGTCGTGGGC
GTCCCCGGGC TGTGGGCCGC CCAGGTCGAC ACCCGCGGCC GCTTCGAGCA GAGCTGGTCG
CACATCCGCG ACTATCTGGT CGGGGTGCTG GCCGCCCGCG GCATGGCCGA GGTGCAGGCC
GAGGAACTGG TCGTGCTGCC CGGCGCCGAG GAGATCGTCG CGCTGCTGGA GCTGCGCCGG
CTGGCCGCCT CCGGTGACTT CGATTCGATC GTGGTCGACT GCGCGCCGAC CGGCGAGACG
CTGCGGCTGC TGGCGCTGCC CGAGACCATC GGTTTCTACG CCCAGCGCCT GCTCGGCGCG
CCGCAGCGGG TCCTGCGCAG CATCGCCGCG TCCTTCACCG GCATGCCCGG CGGGCCCAGC
GCCACCGTGC GGGACGCGGT GGGGGAGCTG CTCTCCGACC TGATGGCCGC GCGGGCGTTG
CTGGCCGACC CGGAGATCAC CGGGGTCCGG CTGGTGCTGA CTCCCGAACG GATGGTGGTG
GCCGAGGCCC GCCGGCTGTT CACCGCGCTG TCCCTGCACG GATTCGCCGT CGAGGCGGTC
ACCGTCAACC GGCTGCTGCC CCGCGGGGTG GGCGGTGATT TCCTGCGCCG TCAGCGGGAG
AGCCAGCGTG AGGCGATGGT CCAGGTCGAG GAATCGTTCC AGGGCCTGCC CATCCACCGG
GTCCGGCAAA AGCCCGAGGA GCCCATCGGC GTCGACCAGC TGTCCGAGCT GGCGACCGAC
ATCTTCGGCT CGGTCGACCC CCTCGCCGTC GCGCCACCCG GTCCGGCGAT CGAGGTCAGC
GGGTCCGACG GGTGGTACCG CCTGTCGCTG CCGCTGCCGC TGGTCCAGCG CGGCGACATC
GCGCTTTCCC GGTCCGGCGC CGACCTGGTG GTCACCGTCG GCGACGTCCG CCGGCGGATC
GCCCTGCCGT CGGTGCTGCA GCGGTGCACG ACCGAGGGCG CCAACTTCGA GGCCGGCCGC
CTGATCATCG ACTTTGCCGC CGATCCCGCG CTGTGGCCGG CCGCCCTCAC CTCCGGCCTG
ACCGGGGCGG CGCTGGCCGG TGCCGGGTGA
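
The GC content reported for this gene can be recomputed from the sequence as listed. The sketch below is one minimal way to do that, assuming the sequence is pasted as plain text in space-separated blocks of ten bases; the function name `gc_fraction` is hypothetical.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence.

    Whitespace (the spaces and line breaks of the block layout) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Example on the first two blocks of the gene sequence only;
# paste the full sequence to check the value for the whole gene.
print(f"{gc_fraction('ATGAGAATTG CCCTGCACAC'):.0%}")
```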
 
Protein sequence
MRIALHTGKG GVGKTTISAA TAIACAAGGA RTLLVSTDPA HSIADVLGTP VSGDPTPVVG 
VPGLWAAQVD TRGRFEQSWS HIRDYLVGVL AARGMAEVQA EELVVLPGAE EIVALLELRR
LAASGDFDSI VVDCAPTGET LRLLALPETI GFYAQRLLGA PQRVLRSIAA SFTGMPGGPS
ATVRDAVGEL LSDLMAARAL LADPEITGVR LVLTPERMVV AEARRLFTAL SLHGFAVEAV
TVNRLLPRGV GGDFLRRQRE SQREAMVQVE ESFQGLPIHR VRQKPEEPIG VDQLSELATD
IFGSVDPLAV APPGPAIEVS GSDGWYRLSL PLPLVQRGDI ALSRSGADLV VTVGDVRRRI
ALPSVLQRCT TEGANFEAGR LIIDFAADPA LWPAALTSGL TGAALAGAG