Gene Namu_1962 details

Gene Information

Locus tag: Namu_1962
Symbol:
ID: 8447571
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 2162370
End bp: 2163773
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 55%
IMG OID: 645041093
Product: DNA-cytosine methyltransferase
Protein accession: YP_003201339
Protein GI: 258652183
COG category: [L] Replication, recombination and repair
COG ID: [COG0270] Site-specific DNA methylase
TIGRFAM ID: [TIGR00675] DNA-methyltransferase (dcm)
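
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration: the variable names are made up for this example, and it assumes that the start and end coordinates are inclusive and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 2162370
end_bp = 2163773
gene_length_bp = 1404
protein_length_aa = 467

# Assumption: coordinates are inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # prints: 1404 468 467
```

Under those assumptions the listed gene length (1404 bp) and protein length (467 aa) agree with the start and end coordinates.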


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0166825
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCTGAGG CTGAGTACAG CTTCCGCTTC GTCGACCTCT TTGCAGGTTT AGGGGGCTTT 
CACGTGGCGC TGCGAGAACT CGGAGGAGCG TGCGTGTTTG CTGCCGAGTT GGACCCAACC
CTTAATGCGC TCTACGCGGA GAACTATCAA CTCGAGGCTT GGAAGGATAT AAATGACCTC
GCATCTTCAC GAATCATATC ACAGGAAGTT CCGGACCATG ACGTTCTGAC AGCGGGATTT
CCCTGTCAAC CGTTCTCGAA AGCGGGGGAA CAACTGGGAT TCAAGGACAC GACACAGGGC
CATCTCTTTT TCAAGGTAAT CCAGATCTTG CAGACAAAGA AGCCCAGACA TTTCCTGCTT
GAGAATGTTC CAAACATTCT CAAGCATTCC GGCGGTGGGA CCCTTCAAAC AATACTGGCA
GAGTTAGAAG CAATCGGCTA CTCGGTCGGG GTTCGTCGCT TATCGCCCCA CGAGTTCGGA
ATACCTCAAA TTCGAGACCG GGCGTACTTC GTGGGGTCCC GCGATGGCCT TGAACAATTT
CGTTGGCCTG AGACCGAAAA GAGCAGCACT GACATAAAAT CAGTGCTGAA GCACGATCTC
GTCGATGTGC GGCCCATCCC CGCACAGACT ACACATGCGA TTAACATGTG GGATGACTTT
CTCAAACGTT CTCCAGCGAG GGTGAAGCTT CCATGGTTCC CAATCTGGGC GATGGAGTTT
CGAGCGACGT ACCCCTTTGA AGAGGCCACC CCATCCGCGA TATGGGCTGA AAAGGGAAGT
CGTGGTTTGA GTCGGCATCT AGGAAGCTTC GGATTTGAAT TGAGAGGCCT CGACCGTGCC
GCTCAATTCG AGCGCCTTCC AAGTCATGCC CGTCGCGCAG ACGACTTCAA GTTTCCCGAC
TGGAAGAAAG ACTTCATTCG ACAGAACCGT GAATTCTATT GCGAGAACCG GAAATGGATC
GATCCTTGGC TCGCGAAGTG GGAACCTTGG CGCATGGTCT CAAGCTACCA GAAGTTCGAG
TGGAATGCCC AGGGTGCGGA ACGTAAGATC GACAAGCACG TGATTCAGGT TCGCGCATCC
GGGCTACGTG TAAAGCGCAC GACGACAGCG CCAAGTCTAA TTGCCATGAC TAACACCCAG
GTTCCAATAC TTGGCAGGCA CCTAGTCGGC GTTAGGCGGT ATATGACGCC GCAGGAATGT
GCCGAACTCC AGTGCCTAGG AGATATCGAG TTGCCGAGGA ACGATCTCCA AGCATATAAG
GCCTTGGGGA ATGCCGTCAA CGCCCGGGTA GTGAAGGCGA TCGCAGAACC GTTGCTGGGC
GAGCTGACGC GCGCGGGCGG TGCACGCATA CCCGTGCCAA AGTCGCGACG CAAAGCAATT
GGCGGTAGCG TACCGTCTCA CTAG
 
Protein sequence
MSEAEYSFRF VDLFAGLGGF HVALRELGGA CVFAAELDPT LNALYAENYQ LEAWKDINDL 
ASSRIISQEV PDHDVLTAGF PCQPFSKAGE QLGFKDTTQG HLFFKVIQIL QTKKPRHFLL
ENVPNILKHS GGGTLQTILA ELEAIGYSVG VRRLSPHEFG IPQIRDRAYF VGSRDGLEQF
RWPETEKSST DIKSVLKHDL VDVRPIPAQT THAINMWDDF LKRSPARVKL PWFPIWAMEF
RATYPFEEAT PSAIWAEKGS RGLSRHLGSF GFELRGLDRA AQFERLPSHA RRADDFKFPD
WKKDFIRQNR EFYCENRKWI DPWLAKWEPW RMVSSYQKFE WNAQGAERKI DKHVIQVRAS
GLRVKRTTTA PSLIAMTNTQ VPILGRHLVG VRRYMTPQEC AELQCLGDIE LPRNDLQAYK
ALGNAVNARV VKAIAEPLLG ELTRAGGARI PVPKSRRKAI GGSVPSH
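
The gene sequence starts with GTG rather than ATG, yet the protein sequence starts with M. That fits translation table 11, where certain alternative start codons are read as methionine when they initiate translation. The sketch below shows one way to reproduce the ends of the protein sequence from the gene sequence. It is a minimal illustration, not the database's own method: `translate_cds`, `has_start` and `START_CODONS` are hypothetical names, and `START_CODONS` lists only three common bacterial start codons rather than the full table 11 set.

```python
# Codon table in the conventional TCAG ordering. Table 11 uses the same
# codon-to-amino-acid assignments as the standard code; it differs in
# which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: an illustrative subset of start codons, not the full list.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str, has_start: bool = True) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Whitespace (such as the blocks of ten used in the record) is ignored.
    If has_start is True and the first codon is in START_CODONS, it is
    reported as M.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        if pos == 0 and has_start and codon in START_CODONS:
            amino_acid = "M"
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 24 bases of the gene sequence above.
head = "GTGTCTGAGG CTGAGTACAG CTTCCGCTTC"
tail = "GGCGGTAGCG TACCGTCTCA CTAG"

print(translate_cds(head))                   # prints: MSEAEYSFRF
print(translate_cds(tail, has_start=False))  # prints: GGSVPSH
```

Both results match the first and last residues of the protein sequence listed above, and the final TAG codon is read as the stop.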