Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3434 |
Symbol | |
ID | 9247301 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4111150 |
End bp | 4112250 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | N-acetylmuramoyl-L-alanine amidase family 2 |
Protein accession | YP_003681345 |
Protein GI | 297562371 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0021339 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.405491 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAAGGC GTACCCTCCT GGCCGGTGCC ACCGCCGCTG CGGGACTGAC CGCGCTGGGC ACCGCGCTCA CCTCCGCCCG CCCCGCACTG GCCGACACCG GCGGCGACCG CACGGTCCCC CGCGTGCTGG CGGAGCCCTC CGCCTCACAC GGACTGGTCC GCCCCGACCT GCGCTTCGAC ATGGTGGCCG TCACCGGTGA CCCCGGAGAG GCCGACGCGG CGGTCCGCTT CGAGACCGCC GACGGCCTGG GCTCGTGGAA CCCGGTGCAC CTGCACACCG GGGGCCGCGA CGACCGGGAC CCGGTCGCCG CCGCGCTGGT GCGCGCGCCC GAGGGCGCCA CCGGTTACGA GGTGCGTTCG AGAGGGGGGA CCGCGGCCGT GAACCTGCGC GACGGAGAGG GTCTGCGCTT CGGCGGCCCC GCACAGGCGT CGCTGTCCGC GGAGGCCTCC GGTACGCTGC GCGGCCGGAC CAGCGTCCCG TTCCGCACCC GCGCGGGCTG GGGCGCCGAC GAGTCCTGGC GTTTCGACGA CCAGGGCGAC AACCTGTGGG AGGCGGAGTT CCACCCCGTG CAGGCGCTGA CCGTGCACCA CACCGCGATG CCGACCGGGG ACGACCACGC GGCGGACGTG CGGGCGGTGT ACTACCTGCA CGCGGTGCAG CAGCTGTGGG GGGACATCGG CTACCACGTG CTCATCGACC CCGACGGGGT GGTCTACGAG GGCCGCCACT CGGGCGAGGA CGGCGTGCCG GTCTTCTCCG GGATCCCGCG GCCGGGGCGG GCCGAGTCGG TGACCGCGGG GCACGCCTAC GGGTTCAACC AGGGCAACGT GGGCGTGTGC CTGCTCGGGG ACTTCACCGA CGAGCTGCCC ACGCGGGCGG CGCAGGACTC CCTGGTCGAC GTGCTGCGCG TGCTGTGCGC GGTGACCGGC GTGGACCCGG CCGGGCAGAT CGAGTACGTC AACCCGGGCA CGGGCGTGGT CACGCCGGGC GACGCGATCT CCCGGCACCG CGACTGGCTG GAGACCGAGT GCCCGGGCAA CGCCTTCTCC GAGGTGTTCG ACAGCGCGGT CCGCCAGCGC GTCATCGCGG GCCTGGCCTA G
|
Protein sequence | MRRRTLLAGA TAAAGLTALG TALTSARPAL ADTGGDRTVP RVLAEPSASH GLVRPDLRFD MVAVTGDPGE ADAAVRFETA DGLGSWNPVH LHTGGRDDRD PVAAALVRAP EGATGYEVRS RGGTAAVNLR DGEGLRFGGP AQASLSAEAS GTLRGRTSVP FRTRAGWGAD ESWRFDDQGD NLWEAEFHPV QALTVHHTAM PTGDDHAADV RAVYYLHAVQ QLWGDIGYHV LIDPDGVVYE GRHSGEDGVP VFSGIPRPGR AESVTAGHAY GFNQGNVGVC LLGDFTDELP TRAAQDSLVD VLRVLCAVTG VDPAGQIEYV NPGTGVVTPG DAISRHRDWL ETECPGNAFS EVFDSAVRQR VIAGLA
|
| |