Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1112 |
Symbol | |
ID | 9244962 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1364575 |
End bp | 1365756 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | |
Product | MoeA domain protein domain I and II |
Protein accession | YP_003679059 |
Protein GI | 297560085 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.363631 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.36117 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAGCGG GCGGCTGCAC CGGGGGACAC GGCGGGCACC GGTCGTGGCC GCGGGCGCGG GAGGCCGCCC GGCTCCTGGG CGGCGGCAGG CCCCCGGCTC CGCGGGAGCT CCCCCTGGAG GACGCCCTGG GCGCCGTCCT CGCCGCGGAC GTGCACGCGT TGACGGGCCT GCCCGCCTTC GACGCCTCGG CCATGGACGG CTTCGCGGTG TCGGGCCCGG GGCCCTGGCG GCTGGTCGGC AGGCGGCTGG CCGGGGCGGC GGAGGCACCC GTGGGCCTGC GCCCGGGCGA GGCCGTCGAG ATCGCCACCG GCGCCCGTGT GCCCAAGGAC ACGGAGTGCG TCGTGCCCTA CGAGCTGGCC GCCGTTCGGG ACGGGACGGT GGACGGGCCC GCGGAGGCGG GCCGCCACGT GCGCTGGGCG GGGGAGGAGA CCGCCCCCGG CGAGACCGTG CTCGAACGGG GAGCGGTGGT CGGCCCCGCC GCGCTGGGTC TGGCCGCCAG CCTGGGCCAC GACACCCTGC CGGTGCTGCG CCCCCGGGTC TCGGTCCTGG TCACCGGTGA GGAGATCACC ACATCGGGGC TGCCCGGCGA CGGCAGTGTG CGCGACGCCA TCGGTCCGGC GCTGCCCGGG GTCGTCGAGC GCGCGGGCGG CCGGATCGCG TCCCTGCGCC ACCTGGGCGA CGAGCGGCGT CCGCTGCTCG ACGCGCTGGA GGGCGCGGAC GGCGACGTGG TCGCGGTGTG CGGTTCCTCG TCCCGGGGGC CGGCCGACCA CCTGCGTTCG GTCCTGGAGG AGCTGGGCGC CGAGGTCGCG GTGGACGGGG TCGCCTGCCG CCCGGGCCAC CCGCAGCTGC TCGCGCACAC CGACCGGACC GTGTTCGTGG GCCTTCCCGG CAACCCCGGG GCCGCCCTGG TCGCCGCGGC CACCCTGCTG GTCCCCCTGC TGGCCGCCAT GACCGGACGC CCGGACCCCG GAACGGGTCT GGCCCGAGCC GTCCTGGAGG GCGCCGTCAC CGCTCACCCC CGGGACACCC GGCTGGTCGC GGTGCGCCTG GACGGCGGCC GGGCGCGGCC GGTGGGCCAC GACCGTCCGG GCAGTCTGCG CGGCGCCGCG CTGGCCGACG CCTACGCGGT GGTGCCGCCG GACTGGGACG GCGGCGAGGT GGAACTGCTC CGCGTGCCCT GA
|
Protein sequence | MGAGGCTGGH GGHRSWPRAR EAARLLGGGR PPAPRELPLE DALGAVLAAD VHALTGLPAF DASAMDGFAV SGPGPWRLVG RRLAGAAEAP VGLRPGEAVE IATGARVPKD TECVVPYELA AVRDGTVDGP AEAGRHVRWA GEETAPGETV LERGAVVGPA ALGLAASLGH DTLPVLRPRV SVLVTGEEIT TSGLPGDGSV RDAIGPALPG VVERAGGRIA SLRHLGDERR PLLDALEGAD GDVVAVCGSS SRGPADHLRS VLEELGAEVA VDGVACRPGH PQLLAHTDRT VFVGLPGNPG AALVAAATLL VPLLAAMTGR PDPGTGLARA VLEGAVTAHP RDTRLVAVRL DGGRARPVGH DRPGSLRGAA LADAYAVVPP DWDGGEVELL RVP
|
| |