Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3080 |
Symbol | |
ID | 9246936 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 3683797 |
End bp | 3685446 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | Site-specific DNA-methyltransferase (adenine-specific) |
Protein accession | YP_003680995 |
Protein GI | 297562021 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.801077 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGACCA CGAAGGCGGC CAAGAAGCCG ACCAGAACCG GGTCCAAGGA CTTGAAGGAC ACGCTGTGGA AAGCGGCGGA CAAACTGCGC GGCAGCATGG ACGCGGCCGA GTACAAGCAC TTCGTGCTCG GGCTCATCTT CCTGAAGTAC GTGTCCGACG CGTTCGCCGA GCGCCGGGTA CACATCGAGA AGGAGCTTCG CGAGGAGGGC GGGTACTCCG AGACGGACAT CGCCGAGACC CTGGAGGACC GGGAGGAGTA CATCGGCTAC GGCGTCTTCT GGGTGCCGCA GGCCGCGCGC TGGGAGGCGA TCGCCGAGCG CGCCAAGACC GGTGCGGGCG AGGACGGTGT CGGCAAGCTC CTCGACGACG CCATGAAGGC CGTCGCCAAC ACCAACCCCA GTCTGCGCAA CTCATTGCCG CAAGGGCTGT TCAACGCGCG GGGGGTGGAC GAGCGGCGTC TGGGCGAACT GGTCGACCTC ATCAACCGGA TCGGGTTCGG AGACCAGCTG GACCCCGACG GCAACCGCCG CAGCGCCCGG GATGTCCTGG GTGAGGTGTA CGAGTACTGC CTGGGCAAGT TCGCCCTGGC GGAGGGCCGT CGGGGCGGCG AATACTACAC ACCCGCGTGT GTGGTCGAGC TGATCGTCGC GATGCTCGAA CCCCAGAAGG GCGAGCGTGT CTATGACCCG GCGTGCGGCT CGGGCGGGAT GTTCGTCCAG GCGGAGAAGT TCGTCGAGAG CCACGGCGGC AACGCTCGGG ACATCGCCGT GTACGGTCAG GAGCTCAACC AGAACACCTG GCGGCTGGCC AAGATGAACC TCGCCATCCA CGGGATCAGT GCCGATCTCG GCACCAAGTG GGACGACACC TTCCACAACG ACCACCACCC CGACCTGCGA GCGCACGTGG TGATGGCCAA TCCGCCGTTC AACATCTCCG ACTGGGGCGG TGACCGGCTG GTCATGGACC CGCGTTGGCA ATGGGGCGTG CCTCCGGTGG GCAACGCCAA TTACGCCTGG CTCCAGCACA TGGCCTACAA GCTGGCGCCG AAGGCGGGAC GGGCGGGCAT CGTGCTGGCC AACGGGTCGA TGAGCAGTAA GCAGTCCGGC GAGGGCGACA TCCGCCGAGC CATGGTTGAG GACGGACTCG TCGCCTGCAT GGTGGCACTG CCCGGACAGC TGTTCCGGTC CACACAGATT CCCGCGTGTG TGTGGATCCT GGCCAAGGAC AGGGGCGCGA AAGGTGGTCG GGGCTCGATT GACCGGACCG GCCAGGTGCT GTTCATCGAC GCGCGCGAAC TCGGTGAGAT GGTCACGCGC ACCGAGAAAC AGCTCACCGA GGACGAGATC AAGCAGATCT CGAACACCTT CCACGCCTGG CTCGGAACTT CGTCCGCCAA GCGGAACGGT CTCACCTATG AGGACATCGG CGGGTTCTGC AAGTCCGTGA GCTTGGACGA GATCCGTGAG CACGACTTCA TCCTGACCCC GGGACGCTAC GTCGGCGCCG CCGAAGTCGA GGAAGATCCG GACGCCGAGC CCCTGGACGA GAAGGTCGCC CGTCTACAGA AGGAGCTTTT TGAGCACTTC GATGCGTCCG CCCGCCTGGA AGCCGTCGTT CGCGAGCAGC TCGGGAGGGT CGATGCCTGA
|
Protein sequence | MATTKAAKKP TRTGSKDLKD TLWKAADKLR GSMDAAEYKH FVLGLIFLKY VSDAFAERRV HIEKELREEG GYSETDIAET LEDREEYIGY GVFWVPQAAR WEAIAERAKT GAGEDGVGKL LDDAMKAVAN TNPSLRNSLP QGLFNARGVD ERRLGELVDL INRIGFGDQL DPDGNRRSAR DVLGEVYEYC LGKFALAEGR RGGEYYTPAC VVELIVAMLE PQKGERVYDP ACGSGGMFVQ AEKFVESHGG NARDIAVYGQ ELNQNTWRLA KMNLAIHGIS ADLGTKWDDT FHNDHHPDLR AHVVMANPPF NISDWGGDRL VMDPRWQWGV PPVGNANYAW LQHMAYKLAP KAGRAGIVLA NGSMSSKQSG EGDIRRAMVE DGLVACMVAL PGQLFRSTQI PACVWILAKD RGAKGGRGSI DRTGQVLFID ARELGEMVTR TEKQLTEDEI KQISNTFHAW LGTSSAKRNG LTYEDIGGFC KSVSLDEIRE HDFILTPGRY VGAAEVEEDP DAEPLDEKVA RLQKELFEHF DASARLEAVV REQLGRVDA
|
| |