Gene Namu_4072 details


Gene Information

Locus tag: Namu_4072
Symbol:
ID: 8449692
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 4487633
End bp: 4489399
Gene Length: 1767 bp
Protein Length: 588 aa
Translation table: 11
GC content: 75%
IMG OID: 645043116
Product: DNA repair protein RecN
Protein accession: YP_003203351
Protein GI: 258654195
COG category: [L] Replication, recombination and repair
COG ID: [COG0497] ATPase involved in DNA repair
TIGRFAM ID: [TIGR00634] DNA repair protein RecN
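
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4487633, 4489399)
print(length)                     # 1767
print(protein_length_aa(length))  # 588
```

Both values agree with the Gene Length and Protein Length fields of this record.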


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.0367674
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.136098
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGCAAG AGCTGCGGAT TGCCGACCTC GGGGTGATCG ACGAGGCGTT GATCGAGCCC 
GATCGTGGCT TCACCGTCGT GACCGGTGAG ACGGGTGCGG GCAAGACGAT GGTGGTCACC
GCGCTCGGCC TGATCGGCGG GCGCCGGGGT GACGCCAGCA AGGTCCGGGC CGGGGCCGAG
CGGGCCACCG TCGAGGTCCG CTGGTCGCCG CCCGACGGCG AGAGCGAGTC TCCCGCCCAG
GAACTGGTGT CCTCGGTCGG CGGCCGCTTC GACGAGGACG GCACGCTGAT CGCCGCCCGT
TCCGTGGGCA CCGACGGCCG ATCCCGCGCG CACGTCGGCG GCCGGTCCGT GCCGCTGGCC
ACGCTGGCCG AGCTGGCCGA GCCGCTCATC GCCGTGCACG GCCAGTCCGA GGCCATCTCG
TTGCTGCGAC CGGGCCCGCA GCGGGCCGTC CTGGACCGCT TCGCCGGGCT CACCGCGCAG
GTCGGTCGGT ATCGCGAGCT ACGCAGCCGC TGGCACCGGA TGGCCGCCGA TCTGGCCGAC
CGGCGGGCCC GGGCCCGGGA GCGGGCCCAG CGCGAGCAGC TGTTGCGCAT CGGCCTGGCC
GAGATCGAGG CGGCCGCGCC GGTGCCCGGC GAGGATCGGG ACCTGGTCGA GGAGGTGCGC
CGGCTGCAGA ACCTGGACGG GCTGCGGGCG GCGGCGGCGG GTGCTCACGA GTCGCTGACC
GGGTCGGAGG ACGCGGCCGC CGCACCGGCC GCGCTGGCCC TGGTGCACGG CGCCCAGCAT
CTGCTGGACA CGGCGGAGGA TCCGCGGTTG GCCGAACTGG GTGGTCAGCT GCAGCAGGCC
GCGCTGGTCC TGGCCGATGT CGGATCCGAG CTGTCAGTCT TCCTCTCCGG GCTGGACGAC
GAGCCGGGCC GGCTGACCCA GGTGCTCGAG CGGCAGGCGA CCCTGCGGGC GCTGACCCGC
CGCTACGGCG ACGACGTCGA CGCCGTCTGC GCCTGGGCTC GGTCGGCCGG CCAGGAGCTG
CTCGAGCTGG ATTCCTCCGA CGACCGGCTG GCCCGGATGC AGGCCGACCT CGACGAGGTG
CGCGGCGAGT TGGGCCGGTT GGCCGCGCGG CTGTCCGGCG AGCGGTCCGC GGCGGCCGAG
CGGCTGGGTC GCCTGGTCAC GGCCGAGCTG GCCTCCCTGG CCATGGCCCG GGCCACCGTC
CGGGTGCGGG TCAGCCAGCA GGCGGCCGAC CCGCACGACC CGCAGGCGGT GCCGGTCGAC
CACAGCTGGC TGCTGGCCGG CCCAGACGGG GTGGACCAGG TGGAGATCGT CATGGTCGCG
CACGCCGGTG CCCCCGAACT GCCGATCGCC AAGGGCGCCT CGGGTGGCGA GCTGTCCCGG
GTGATGCTGG CCCTGGAGGT GGTGCTGGCC GACTCCGATC CGGTCTCGAC CATGGTCTTC
GACGAGGTCG ACGCCGGGGT CGGCGGCCGG GCCGCGACCG AGATCGGGGA GCGGCTGGCC
GCGCTGGCCC GGACCCACCA GGTCATCGTC GTGACCCACC TGGCTCAGGT GGCCGCCCAC
GCCGATCGTC ACTACATCGT CGACGCCGAC TCCTCCGGCC GGATCGGCAC CTCGAACGTG
CGGCTGGTCA CCGGACGCGA ACGCGAACGG GAGCTGGCCC GGATGCTGGG CGGGACGAAC
GGCCCCGCCG CCCGGGCGCA CGCCCGGGAC CTGCTCGCCG CGGCCAAGGG TGCGACCGGC
ACCACCCCCT TGCGCCGGGC GGGCTGA
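
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be joined before it can be measured. The following is a minimal sketch, assuming a plain A/C/G/T alphabet; `normalize_sequence` and `gc_fraction` are hypothetical helper names. Applied to the full sequence, the same functions could be used to compare against the Gene Length and GC content fields; only the first two blocks are used here.

```python
def normalize_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First two 10-base blocks of the gene sequence above.
seq = normalize_sequence("ATGTTGCAAG AGCTGCGGAT")
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.5
```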
 
Protein sequence
MLQELRIADL GVIDEALIEP DRGFTVVTGE TGAGKTMVVT ALGLIGGRRG DASKVRAGAE 
RATVEVRWSP PDGESESPAQ ELVSSVGGRF DEDGTLIAAR SVGTDGRSRA HVGGRSVPLA
TLAELAEPLI AVHGQSEAIS LLRPGPQRAV LDRFAGLTAQ VGRYRELRSR WHRMAADLAD
RRARARERAQ REQLLRIGLA EIEAAAPVPG EDRDLVEEVR RLQNLDGLRA AAAGAHESLT
GSEDAAAAPA ALALVHGAQH LLDTAEDPRL AELGGQLQQA ALVLADVGSE LSVFLSGLDD
EPGRLTQVLE RQATLRALTR RYGDDVDAVC AWARSAGQEL LELDSSDDRL ARMQADLDEV
RGELGRLAAR LSGERSAAAE RLGRLVTAEL ASLAMARATV RVRVSQQAAD PHDPQAVPVD
HSWLLAGPDG VDQVEIVMVA HAGAPELPIA KGASGGELSR VMLALEVVLA DSDPVSTMVF
DEVDAGVGGR AATEIGERLA ALARTHQVIV VTHLAQVAAH ADRHYIVDAD SSGRIGTSNV
RLVTGRERER ELARMLGGTN GPAARAHARD LLAAAKGATG TTPLRRAG
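
The protein sequence can be related back to the gene sequence by translating codons. The sketch below is an illustration rather than the method used to produce this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons), and it does not handle alternative start codons such as GTG or TTG, which is harmless here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids in the conventional TCAG codon order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore a trailing partial codon, if any.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]  # raises KeyError for non-ACGT bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence.
print(translate("ATGTTGCAAG AGCTGCGGAT TGCCGACCTC"))  # MLQELRIADL
```

The output matches the first 10 residues of the protein sequence above.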