Gene Namu_1102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNamu_1102 
Symbol 
ID8446698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNakamurella multipartita DSM 44233 
KingdomBacteria 
Replicon accessionNC_013235 
Strand
Start bp1223864 
End bp1225510 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content66% 
IMG OID645040239 
ProductSite-specific DNA-methyltransferase (adenine- specific) 
Protein accessionYP_003200498 
Protein GI258651342 
COG category[V] Defense mechanisms 
COG ID[COG0286] Type I restriction-modification system methyltransferase subunit 
TIGRFAM ID[TIGR00497] type I restriction system adenine methylase (hsdM) 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.358797 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCCC GCGGACGGAA GGCAAGTACG GCGGTCACGC CGGCTGTTTC GACACCACCC 
GCTGCCTCCA CGATGAAGGA ACTCAAGGAC ACCCTGTGGA AGGCCGCCGA CAAGCTCCGC
GGCTCCATGG ACGCCTCGCA GTACAAGGAC GTGATCCTTG GCCTGGTCTT TCTTAAGTAC
GTCTCCGACG CGTTCGACGA GCGGCGGGAG CAGATCCGGG CCGAGCTCGA GGCCGACGGG
ATCGATGAAG ACCAGATCGA CGGTTTCCTC GATGACGTCG ACGAGTACCG CGGCCACGGG
GTGTTCTGGG TCAACCGGGA CGCCCGCTGG TCGTACCTGG CTCAGCACGC CAAGGGCATC
CCGGCTGTCG GCAACGAGCC GCCTAAGCAG GTCGGACAGC TGATCGATGA GGCCATGGAC
TACCTGATGG ACGCGAACCC ATCGCTGCGG GCCACGCTGC CGCGGATCTA CAACCGCGAC
AACGTTGATC AGCGCCGCCT CGGTGAACTG CTCGACCTGT TCAACAGCGC CCGGTTCACG
GGACAAGGCG CAACCAAAGC CCGCGACCTG TTGGGTGAGG TGTACGAGTA CTTCCTGGAG
AAGTTCGCCA AGGCCGAGGG CAAGCGGGGT GGCGAGTTCT ATACGCCGGC CAGCGTGGTC
CGGGTGCTGG TCGAGGTGCT GGAGCCGACC CGGGGCCGGG TGTACGACCC GTGTTGCGGG
TCCGGCGGCA TGTTCGTGCA GACCGAGAAG TTCCTGGAGG CCCACCACCG GGAGGGCTCC
GAGATCTCCG TCTACGGGCA GGAACTCAAC GAGCGCACCT GGCGGATGGC CAAGATGAAC
CTGGCCATCC ACGGGCTCAG CGGCAACCTC GGACCGCGGT GGGGCGACAC TTTCGCCCGT
GACATCCACC CCGATGTACA GGCCGACTAC GTGCTGGCGA ACCCGCCGTT CAACATCAAG
GACTGGGCCC GCAACGATAA GGATCCCCGC TGGAAGTTCG GCGTTCCGCC GGCCGGCAAC
GCCAACTACG CCTGGATCCA GCACATCATC TCCAAGCTTG CCCCCGGCGG CTCCGCCGGC
GTGGTGATGG CCAACGGCTC CATGTCCACC CAATCCGGCG GGGAAGGTGC CATCCGCGCC
CAACTGGTCG AGGCCGACCT GGTGTCCTGC ATGGTCGCCC TGCCCACCCA GCTATTCCGG
TCCACCGGGA TTCCGGTCTG CCTGTGGTTC TTCGCCAAGG ACAAGACCGT CGGTACCGGC
GGGTCGGTGG ACCGGTCCGG GCGGGTGCTG TTCATCGACG CGCGGTCAAT GGGGAACATG
GTGGATCGGG CCGAGCGGTC GCTGTCCGAC GACGACATCG GTTTGATCGC CGGGACATTC
CATGCGTGGC GGGGGACAGC CTCGGCGCGA GCGGCGGGGC TGGAGTACGC CGACAAGCCC
GGGTTCTGTT ACTCGGCGAC GCTGGCTGAG ATCAAGGCCG CCGATTACGC GCTGACGCCG
GGGCGGTATG TGGGGGCGGC GGAGGTCGAG GACGACGGGG AGCCGATCGA GGAGAAGATC
GATCGCCTGA CGAAGCAGTT GTTCGACCAG TTCGACGAGT CGGAGCGACT GGCCAAGGTC
GTTCGCCAAC GGCTGGGACG GCTGTGA
 
Protein sequence
MPPRGRKAST AVTPAVSTPP AASTMKELKD TLWKAADKLR GSMDASQYKD VILGLVFLKY 
VSDAFDERRE QIRAELEADG IDEDQIDGFL DDVDEYRGHG VFWVNRDARW SYLAQHAKGI
PAVGNEPPKQ VGQLIDEAMD YLMDANPSLR ATLPRIYNRD NVDQRRLGEL LDLFNSARFT
GQGATKARDL LGEVYEYFLE KFAKAEGKRG GEFYTPASVV RVLVEVLEPT RGRVYDPCCG
SGGMFVQTEK FLEAHHREGS EISVYGQELN ERTWRMAKMN LAIHGLSGNL GPRWGDTFAR
DIHPDVQADY VLANPPFNIK DWARNDKDPR WKFGVPPAGN ANYAWIQHII SKLAPGGSAG
VVMANGSMST QSGGEGAIRA QLVEADLVSC MVALPTQLFR STGIPVCLWF FAKDKTVGTG
GSVDRSGRVL FIDARSMGNM VDRAERSLSD DDIGLIAGTF HAWRGTASAR AAGLEYADKP
GFCYSATLAE IKAADYALTP GRYVGAAEVE DDGEPIEEKI DRLTKQLFDQ FDESERLAKV
VRQRLGRL