Gene Namu_4623 details


Gene Information

Locus tag: Namu_4623
Symbol:
ID: 8450251
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 5144962
End bp: 5147394
Gene Length: 2433 bp
Protein Length: 810 aa
Translation table: 11
GC content: 60%
IMG OID: 645043664
Product: type I restriction-modification system, M subunit
Protein accession: YP_003203891
Protein GI: 258654735
COG category: [V] Defense mechanisms
COG ID: [COG0286] Type I restriction-modification system methyltransferase subunit
TIGRFAM ID: [TIGR00497] type I restriction system adenine methylase (hsdM)
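
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5144962
end_bp = 5147394
gene_length_bp = 2433
protein_length_aa = 810

# Assumption: coordinates are 1-based and inclusive, so both ends count.
computed_length = end_bp - start_bp + 1

# Assumption: one 3 bp codon per residue, plus one untranslated stop codon.
computed_protein_length = computed_length // 3 - 1

print(computed_length, computed_protein_length)  # 2433 810
```

Both computed values agree with the Gene Length and Protein Length fields of the record.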


Plasmid Coverage information

Num covering plasmid clones: 42
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.218097
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTACTAA GGAAGTCAGA TCTGTACGGC TCGTTGTGGA AGAGTTGCGA CGAGCTGCGC 
GGTGGCATGG ATGCGAGCCA GTACAAGGAC TATATTCTCA CGCTCTTGTT CGTCAAGTAT
GTGTCGGACA AGGCCAAGAC AGACCCGAAC ACCTTGATCG ATGTCCCGAG AGGTGGCTCG
TTTGATGACA TGCTCGCCGC CAAGGGCGAC AAAGAGATCG GCGACCGGCT CAACAAGATA
ATTGCCAAGC TGGCCGAAGC CAACGGCCTT CGGAACGTCA TCGACCAGGC CGACTTCAAT
GACGAAGAGA AGCTTGGCAA GGGCAAAGAG ATGCAAGACC GCCTCTCCAA GCTCGTCACG
ATCTTCAACG ACTTGGACTT CCGCGGCTCC CGTGCTGAGG GCGACGACCT GCTTGGCGAC
GCATACGAGT ACCTTATGCG ACACTTCGCC ACGGAGTCGG GCAAGAGCAA GGGCCAGTTC
TACACGCCGG CCGAGGTCTC CCGGATCCTG GCGAAGGTCG TAGGTATCAA TAGCCGGACC
AGGCAGGACA AGACCGTCTA CGACCCGACC TGTGGCTCCG GGTCGTTGCT GCTCAAAGCT
GCCTCCGAAG CGCCGCGCGG CATGACGATC TACGGCCAGG AGAAAGACAA CGCTACCTGG
GCGCTGTCCA AGATGAACAT GATCCTGCAT GGCAACGAGA TCGCTGACAT CGCCAAAGGC
GACACCATCA CCAATCCGCA GTTCGTGAGT GGTAACCACC TCAGGACCTT TGACTTTGTG
GTCATGAACC CGCCGTTTTC GCTGAAGTCC TGGAGCAATG GGCTCGAAAA CGACTACGGC
CGATTTGAAT ACGGTCGTCC GCCGGAGAAG AACGGCGACT ACGCCTTCCT TTTGCACGCC
CTGAAGTCCC TCAAGAGCGT CGGGAAAGCC GCGATCATCC TGCCGCACGG CGTGCTGTTC
CGTGGCCACG CCGAGGCGAC GGTCCGGCAG CGGTTGCTCA AGCAAGGGTT CATCAAGGGC
ATCATCGGTT TGCCGCCTAA CCTCTTTTAT GGAACTGGAA TACCCGCGTG CATCGTCATC
CTCGACAAGG AGAATGCTGT CGCCCGCACT GGCGTGTTCA TGATCGACGC CTCGAAGGGG
TTCATGAAGG ACGGCAACAA GAACCGCCTA CGCAGCCAAG ACATCCACAA GATCGTTGAC
ACCTTCAACA AGCAGCTCGA GGTCGAACGC TACTCCCGCA TGGTTCCGCT GTCCGAAATC
TCGGACCCGA AGAACGACTT CAACCTCAAC ATCCCGCGCT ACATCGACTC ATCCGAACCG
GAAGACCTCC AGGACCTGCA CGCCCACCTG CATGGCGGCA TCCCGGACCG CGACCTGGAC
GCCCTGGGTG TCTACTGGGA TGCCTTCCCC AGCTTGCGCG CCACCCTGTT CAAACCGAAC
CGGCCCGGCT ACAGCGACCT GACTATTGAC GTCAGCGACG TCCAGCAATC AGTACTCGAC
TCCGACGAGT TCACCAAATT CGCCGCAGAC GTCCGAAGCG CAGTCGATGA CTGGTTCGTG
ACACATCGAC CGATTCTGCA GGCCATTAAC GAGCAAACGG TCCCGAACGA GCTGATCTCC
CGCATCGCCG ACGACCTGCT GGCACGCTTC AAGGACACGG CACTGCTCGA TGAGTACGAC
GTATACGAGC AGCTCATGAC CTACTGGCAC GAGACCATGC ACGACGACGT GTTCCTCGTG
ATGAACGACG GGTGGCTCGA CGCCGCGAAG CCACGCAAGG CGATCGAGGA CAAGGAGCGC
AAGCTCTCAG AAACCCCCGA CCTCGTCGTC GGCTCCGGCC GTGGCGCCAA TAAGTACAAG
ATGGATCTCA TCCCGCCGGC CCTCGTCGTG GCCCGATTCT TTGCCAACGA GCAGGCCAAG
GTTGACGAGC TCAGCCTCGT TGCCGATGAG GCAGCTCGCG CCGTCGAGGA ATACATCGAG
GAGCACGCCG TCGACGACGG ACTACTCGCT GACGCCATGG ACGACGACAA GATCAGCAAG
GCGTTGGTGA GTGCTCGCCT GAAGGTTGCC AAGCACGAAG GGGCGGAACC CGAGGAGACT
CAGGCGCTGC AGCACCTGCT CGGCCTGTAC AACTATGAAG CCGTCACAAG GAAGGCGGCC
AAGGATGCCC AGGCCGCACT TGACGCCGCC ACCCTCAAGA AGTACGGCGA TCTGACGCTG
CCCGAAATAA AAGGCCTCGT GCTCGACGAC AAGTGGCACG CCACCATCGC GGAGGGCGTA
GCCAGCGAAG TCACCGGACT CACTCAGAAC CTGGTCGCAC GGATCCAGCA GCTCGGCGAG
CGGTACGCGG AGACCGTCGA CGACCTCGAC GCCGAACTGA GGGCGTCAGA AGTGCTTGTC
TCCCAACACC TCGCCGCAAT GGGAGTTCGT TGA
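
The record reports a GC content of 60% for this gene. How the database derives that figure is not stated here; the sketch below shows one common way to compute such a value, as the fraction of G and C bases after removing the spaces and line breaks used in the listing. The function name `gc_fraction` is hypothetical, and the excerpt is only the first line of the gene sequence, so its result is not expected to match the whole-gene value.

```python
def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (the 10-base grouping and line breaks) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, used only as a short excerpt.
excerpt = "GTGGTACTAA GGAAGTCAGA TCTGTACGGC TCGTTGTGGA AGAGTTGCGA CGAGCTGCGC"
print(f"{gc_fraction(excerpt):.0%}")
```

Passing the full gene sequence instead of the excerpt would give the value to compare against the GC content field.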
 
Protein sequence
MVLRKSDLYG SLWKSCDELR GGMDASQYKD YILTLLFVKY VSDKAKTDPN TLIDVPRGGS 
FDDMLAAKGD KEIGDRLNKI IAKLAEANGL RNVIDQADFN DEEKLGKGKE MQDRLSKLVT
IFNDLDFRGS RAEGDDLLGD AYEYLMRHFA TESGKSKGQF YTPAEVSRIL AKVVGINSRT
RQDKTVYDPT CGSGSLLLKA ASEAPRGMTI YGQEKDNATW ALSKMNMILH GNEIADIAKG
DTITNPQFVS GNHLRTFDFV VMNPPFSLKS WSNGLENDYG RFEYGRPPEK NGDYAFLLHA
LKSLKSVGKA AIILPHGVLF RGHAEATVRQ RLLKQGFIKG IIGLPPNLFY GTGIPACIVI
LDKENAVART GVFMIDASKG FMKDGNKNRL RSQDIHKIVD TFNKQLEVER YSRMVPLSEI
SDPKNDFNLN IPRYIDSSEP EDLQDLHAHL HGGIPDRDLD ALGVYWDAFP SLRATLFKPN
RPGYSDLTID VSDVQQSVLD SDEFTKFAAD VRSAVDDWFV THRPILQAIN EQTVPNELIS
RIADDLLARF KDTALLDEYD VYEQLMTYWH ETMHDDVFLV MNDGWLDAAK PRKAIEDKER
KLSETPDLVV GSGRGANKYK MDLIPPALVV ARFFANEQAK VDELSLVADE AARAVEEYIE
EHAVDDGLLA DAMDDDKISK ALVSARLKVA KHEGAEPEET QALQHLLGLY NYEAVTRKAA
KDAQAALDAA TLKKYGDLTL PEIKGLVLDD KWHATIAEGV ASEVTGLTQN LVARIQQLGE
RYAETVDDLD AELRASEVLV SQHLAAMGVR
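
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below is a minimal illustration, assuming the standard codon assignments (which table 11 shares with the standard code) and assuming the first codon is a valid initiator that is read as methionine, as with the GTG start here. The names `CODON_TABLE` and `translate_cds` are hypothetical, and the sketch does not validate start codons.

```python
# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        # Simplification: the first codon is assumed to be an initiator
        # (for example GTG) and is therefore read as methionine.
        protein.append("M" if pos == 0 else residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGTACTAA GGAAGTCAGA TCTGTACGGC"))  # MVLRKSDLYG
```

The output matches the first ten residues of the protein sequence listed above.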