Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2236 |
Symbol | |
ID | 9246086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2676015 |
End bp | 2676995 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | D-alanine--D-alanine ligase |
Protein accession | YP_003680164 |
Protein GI | 297561190 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0429667 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.174652 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAGCAG CACAGAAGGA GGTCCGCGCC GTGGCAGACC TTGACCGGGT TCTCGTCCTG GCCGGGGGCC TGTCCCCCGA ACACGAGGTG AGCGTCCACT CCGGGCGCGG CGTCGCCGAG GCGCTGCGCA GGCTCGGCGT GGAGGTCCAG GTCGCCGACG TGGACTCCAC CCTGCTGGAC CGCCTCGTCC AGGACCCGCC GCAGGTGGTC TTCCCCGTCC TGCACGGGGC CGCGGGCGAG GACGGCGCCA TCCGGGAGGT CCTGGAACTG GTCGGGGTGC CCTACGTGGG CGCCCGGCCG GGGGGCTGCC GACTGGCCTA CTCCAAGCCC GCCGCCAAGG CGCTGCTGGC CGCCGAGGGC GTGCGGGTGC CGCGCGGCGC CGCCCTGCCC AAGTCGGCCT TCCACGACCT GGGCGCTCCG GCGCTGATGG ACCGGCTCGC CGACCGGCTG GGCCTGCCGC TGTTCGTCAA GCCCGACCGG GGCGGCTCCG CGTTCGGCGC GGCGCCGGTC GGCTCGGTGC AGGAGCTGTC GGCGGCCCTG GTGTCCTGCT TCGCCTACAG CGACTCGGCC CTGGTCGAGG AGCAGGTGCA GGGCACCGAG CTGGCCGTGG GCGTCCTCGA CACCGGCGAC GGGCCGGTGG CGCTGCCGCC GGTGGAGATC GTGCCCGACG GCGGGGTGTA CGACTACGCC GCCCGCTACA CCGCCGGGCG CACCGAGTTC TTCTGCCCGG CGCGCCTGGA CCCCGCCACG GTCGCCGAGG CCACCGAGGT GGCGCTGACC GCGCACCGCG TCCTGGGGCT GCGGGACCTG TCCCGCACCG ACGTCATCGT GGGGACCGAC GGCCGCGTCA CCTTCCTGGA GACCAACGTG GCCCCGGGCC TGACCGAGAC CTCCACCTTC CCGATGGGGG CCGCCGCCGC GGGGCTGGAC TTCGCGGTCG CGTGCCGGGA GCTGGCCCAC CAGGCGCTGC TGCGCGGCTG A
|
Protein sequence | MTAAQKEVRA VADLDRVLVL AGGLSPEHEV SVHSGRGVAE ALRRLGVEVQ VADVDSTLLD RLVQDPPQVV FPVLHGAAGE DGAIREVLEL VGVPYVGARP GGCRLAYSKP AAKALLAAEG VRVPRGAALP KSAFHDLGAP ALMDRLADRL GLPLFVKPDR GGSAFGAAPV GSVQELSAAL VSCFAYSDSA LVEEQVQGTE LAVGVLDTGD GPVALPPVEI VPDGGVYDYA ARYTAGRTEF FCPARLDPAT VAEATEVALT AHRVLGLRDL SRTDVIVGTD GRVTFLETNV APGLTETSTF PMGAAAAGLD FAVACRELAH QALLRG
|
| |