Gene Ndas_3080 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3080 
Symbol 
ID9246936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3683797 
End bp3685446 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content66% 
IMG OID 
ProductSite-specific DNA-methyltransferase (adenine-specific) 
Protein accessionYP_003680995 
Protein GI297562021 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.801077 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGACCA CGAAGGCGGC CAAGAAGCCG ACCAGAACCG GGTCCAAGGA CTTGAAGGAC 
ACGCTGTGGA AAGCGGCGGA CAAACTGCGC GGCAGCATGG ACGCGGCCGA GTACAAGCAC
TTCGTGCTCG GGCTCATCTT CCTGAAGTAC GTGTCCGACG CGTTCGCCGA GCGCCGGGTA
CACATCGAGA AGGAGCTTCG CGAGGAGGGC GGGTACTCCG AGACGGACAT CGCCGAGACC
CTGGAGGACC GGGAGGAGTA CATCGGCTAC GGCGTCTTCT GGGTGCCGCA GGCCGCGCGC
TGGGAGGCGA TCGCCGAGCG CGCCAAGACC GGTGCGGGCG AGGACGGTGT CGGCAAGCTC
CTCGACGACG CCATGAAGGC CGTCGCCAAC ACCAACCCCA GTCTGCGCAA CTCATTGCCG
CAAGGGCTGT TCAACGCGCG GGGGGTGGAC GAGCGGCGTC TGGGCGAACT GGTCGACCTC
ATCAACCGGA TCGGGTTCGG AGACCAGCTG GACCCCGACG GCAACCGCCG CAGCGCCCGG
GATGTCCTGG GTGAGGTGTA CGAGTACTGC CTGGGCAAGT TCGCCCTGGC GGAGGGCCGT
CGGGGCGGCG AATACTACAC ACCCGCGTGT GTGGTCGAGC TGATCGTCGC GATGCTCGAA
CCCCAGAAGG GCGAGCGTGT CTATGACCCG GCGTGCGGCT CGGGCGGGAT GTTCGTCCAG
GCGGAGAAGT TCGTCGAGAG CCACGGCGGC AACGCTCGGG ACATCGCCGT GTACGGTCAG
GAGCTCAACC AGAACACCTG GCGGCTGGCC AAGATGAACC TCGCCATCCA CGGGATCAGT
GCCGATCTCG GCACCAAGTG GGACGACACC TTCCACAACG ACCACCACCC CGACCTGCGA
GCGCACGTGG TGATGGCCAA TCCGCCGTTC AACATCTCCG ACTGGGGCGG TGACCGGCTG
GTCATGGACC CGCGTTGGCA ATGGGGCGTG CCTCCGGTGG GCAACGCCAA TTACGCCTGG
CTCCAGCACA TGGCCTACAA GCTGGCGCCG AAGGCGGGAC GGGCGGGCAT CGTGCTGGCC
AACGGGTCGA TGAGCAGTAA GCAGTCCGGC GAGGGCGACA TCCGCCGAGC CATGGTTGAG
GACGGACTCG TCGCCTGCAT GGTGGCACTG CCCGGACAGC TGTTCCGGTC CACACAGATT
CCCGCGTGTG TGTGGATCCT GGCCAAGGAC AGGGGCGCGA AAGGTGGTCG GGGCTCGATT
GACCGGACCG GCCAGGTGCT GTTCATCGAC GCGCGCGAAC TCGGTGAGAT GGTCACGCGC
ACCGAGAAAC AGCTCACCGA GGACGAGATC AAGCAGATCT CGAACACCTT CCACGCCTGG
CTCGGAACTT CGTCCGCCAA GCGGAACGGT CTCACCTATG AGGACATCGG CGGGTTCTGC
AAGTCCGTGA GCTTGGACGA GATCCGTGAG CACGACTTCA TCCTGACCCC GGGACGCTAC
GTCGGCGCCG CCGAAGTCGA GGAAGATCCG GACGCCGAGC CCCTGGACGA GAAGGTCGCC
CGTCTACAGA AGGAGCTTTT TGAGCACTTC GATGCGTCCG CCCGCCTGGA AGCCGTCGTT
CGCGAGCAGC TCGGGAGGGT CGATGCCTGA
 
Protein sequence
MATTKAAKKP TRTGSKDLKD TLWKAADKLR GSMDAAEYKH FVLGLIFLKY VSDAFAERRV 
HIEKELREEG GYSETDIAET LEDREEYIGY GVFWVPQAAR WEAIAERAKT GAGEDGVGKL
LDDAMKAVAN TNPSLRNSLP QGLFNARGVD ERRLGELVDL INRIGFGDQL DPDGNRRSAR
DVLGEVYEYC LGKFALAEGR RGGEYYTPAC VVELIVAMLE PQKGERVYDP ACGSGGMFVQ
AEKFVESHGG NARDIAVYGQ ELNQNTWRLA KMNLAIHGIS ADLGTKWDDT FHNDHHPDLR
AHVVMANPPF NISDWGGDRL VMDPRWQWGV PPVGNANYAW LQHMAYKLAP KAGRAGIVLA
NGSMSSKQSG EGDIRRAMVE DGLVACMVAL PGQLFRSTQI PACVWILAKD RGAKGGRGSI
DRTGQVLFID ARELGEMVTR TEKQLTEDEI KQISNTFHAW LGTSSAKRNG LTYEDIGGFC
KSVSLDEIRE HDFILTPGRY VGAAEVEEDP DAEPLDEKVA RLQKELFEHF DASARLEAVV
REQLGRVDA