Gene Ndas_2689 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2689 
Symbol 
ID9246540 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3203005 
End bp3205134 
Gene Length2130 bp 
Protein Length709 aa 
Translation table11 
GC content77% 
IMG OID 
ProductN-6 DNA methylase 
Protein accessionYP_003680610 
Protein GI297561636 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.227134 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCTCCG GTGAGGTGGA GGTGAGCTCG GCCGACATCG CCAGGATCGC GGGCGTGAAG 
CCGACGGCCG TGAGCAACTG GCGGCGCCGC CACGACGACT TCCCCAGACC GGTGGGGGGC
ACCGACCGCA GCCCCCGGTT CGACCTGGGG CAGGTGGAGG AGTGGCTGTC CACCCACCGC
CGGGCCCCGA GCATCGACCC CGACCAGCGC CTGTGGCAGG CGCTGGACTC CCTGCGCGGC
TCCGTCCCCG TCGACGCGGC GCTCGTCCGG GCCGGGGTGC TGCTGTGGCA CCTGAGCCGC
GACCCCGGCA CCGGGGGACA CAGCGCCGGT GAACCCGGGA AGCAGGACGG ATCCGCGCTG
GCGGGGCGCG CGTGGCGGGA CTTCGAGGGC CTCCTCGAAG GCGCGGGTCT GGCCGCTCTG
GCCGAGCACC GCCCCGAACC CGTTCCGGAC GCCCGACTGG CGCCGCTGGC CGACGCGGCG
GTGCGCGCCG TGCGGGAGAG CGATCCGCGG ACGGCCTTCG AACGCCTGCT CAGCCGCCTG
GACCAGCGCT CCCCCTCCGG TTCGCACACG GTCCCGCCCG AGCTGGCCGA CCTCATGGTG
GTTCTGGCCG GAATCGCGAA CGCCGGATCC GGCCCCGAGG ACACCGTGGC CGACCCCGCC
TGCGGGCGGG GCGGGCTGCT GCTCGCCGCC GCGCGCGGTG GACGGCGCGC CCTGCTGGGG
CAGGACCGGG ACGCCGCCTC GGTGTGGCTG GCCGCCCTGC GCCTGGCCTT CGCCGGGGCC
CTGACCGGGG AGGCGGACCT GCGGGTGTCC GACGCGCTGC GCCTGCCCGC GTTCGCCCCG
GACGCACCGG ACGGCGCGGA CGGCGCCGAC GCGGTCGTGT GCGCCCCGCC CTTCGGCGAG
CGCAACTGGG GCGTCGAGGA GCTCGCGGAG GACCCCCGGT GGACGTACGG GGTCCCCCCG
CGCCTGGAGT CCGACCTGGC CTGGGTCCAG CACTGCCTGT CCCTGGTCCG GCCGGGCGGC
TCGGCGGTCG TGCTGATGCC CCCGGGCGCC GCCCAGCGCC CCTCGGGTCG GCGCATCCGC
CGGTCGCTGC TGCGCGCGGG CGCCTTCCGC GCGGTGGTCT CGCTCCCGCC CGGGTTCGCG
GCCCACTACG CCGTTCCCCT CCAGCTGTGG GTGCTGCGCC GACCGGAGCG GGACGCGGTC
CCCGCTCCCG TGCTGCTCGT GGACACCGCC CCGCACGAGC CCGGGCAGAC CTGCCCGCCG
GGCGAGGTCC TCGGCCGGAT CGACCGGCTC TGGCGGGAGT ACCTGGCCGA CCCGGAGGAC
TTCGACGAGC ACCCGGGTGT GGCGCGGACG CTGGAGGTGG CCGACCTGCT GGACGAGGAC
GTCGACCTGA CCCCCCGCGC GCGCCTTCCG GTTCCGCGCG CCGCGGCGGG CGACCTGGAC
CTGTTCGCCG AGCGGCGCAC GCGCCTGGAG GGCGCGCTGC GTACGCTGCG CACGCTCCTG
CCCGAGGCGC CGCGGGCCGA GCCCGGCGGT GAGGCGGTCG GGGTGGTCAC GCTCGGCGAG
CTGGCCCGGG CCGGGTCGGT GACGATCCGG CGGCCCCTCC CCCGGCGTTC GGCGGAGGAG
GCGGGCCCGC GCACCAGCGC CAGGGTGGTC ACCGGTGAGG ACGTGGCCCG CGGAGCGGGA
GCCTCGCGCA CGGAGGAGGT GGACGCCGAC CCGATGCGCA ACCCCCCGAT CCAGGAGGGC
GACGTCCTCA TCCCCGCGGT CGCGCCGAGG CCCGCCGCGC GGGTGGCGAC CGGGGACGAC
GCGGGCGCCT ACCCCGGCGG CGGCCTGTAC GTGGTGCGCA CGAACCCGGG CGCGGTGGAC
CCGTGGTTCC TGGCGGGGTT CGTCACCAGC TCCGGGGAGA GCCGGAACAT CACCCGGATG
AGCAGCAGCA TGCGGGGCCG TCTGAGGATC GACCCGTCCC GGATGCGGCT CCCGGTGCTG
CCGGTGGAGG AGCAGAGGCG GTACGGGGAG CTCTTCCGGA GGGCGGCGCG GTTCCGGGAG
GCCCTGCTGG AGTTCCAGGG ATGGGGCGAG GAGCTGGCCG ACCAGGCCGT CGACCTGGTC
GCCGGGCGGT GGGAGACCTC CTCTTCCTAG
 
Protein sequence
MGSGEVEVSS ADIARIAGVK PTAVSNWRRR HDDFPRPVGG TDRSPRFDLG QVEEWLSTHR 
RAPSIDPDQR LWQALDSLRG SVPVDAALVR AGVLLWHLSR DPGTGGHSAG EPGKQDGSAL
AGRAWRDFEG LLEGAGLAAL AEHRPEPVPD ARLAPLADAA VRAVRESDPR TAFERLLSRL
DQRSPSGSHT VPPELADLMV VLAGIANAGS GPEDTVADPA CGRGGLLLAA ARGGRRALLG
QDRDAASVWL AALRLAFAGA LTGEADLRVS DALRLPAFAP DAPDGADGAD AVVCAPPFGE
RNWGVEELAE DPRWTYGVPP RLESDLAWVQ HCLSLVRPGG SAVVLMPPGA AQRPSGRRIR
RSLLRAGAFR AVVSLPPGFA AHYAVPLQLW VLRRPERDAV PAPVLLVDTA PHEPGQTCPP
GEVLGRIDRL WREYLADPED FDEHPGVART LEVADLLDED VDLTPRARLP VPRAAAGDLD
LFAERRTRLE GALRTLRTLL PEAPRAEPGG EAVGVVTLGE LARAGSVTIR RPLPRRSAEE
AGPRTSARVV TGEDVARGAG ASRTEEVDAD PMRNPPIQEG DVLIPAVAPR PAARVATGDD
AGAYPGGGLY VVRTNPGAVD PWFLAGFVTS SGESRNITRM SSSMRGRLRI DPSRMRLPVL
PVEEQRRYGE LFRRAARFRE ALLEFQGWGE ELADQAVDLV AGRWETSSS