Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3850 |
Symbol | |
ID | 9247721 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4620922 |
End bp | 4622040 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | UDP-N-acetylglucosamine 2-epimerase |
Protein accession | YP_003681753 |
Protein GI | 297562779 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.275001 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.265812 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGAGCCG TGGCAACGAG TGCCCCCCTG GGGAGCCAGG ACACCGGGAT CGTGCACGTC GTCGGCGCGC GCCCCAACTT CGTCAAGGCA GCGCCCGTGG TCTCCGCCCT GCGCGGACGC GGCGCGCACC AGGCGGTCGT GCACACCGGC CAGCACTACG ACGACCGCAT GTCCGCGGTG TTCTTCCGCG ACCTGGGCCT GCCCACCCCC GACGTCGACC TCGGCGTCGG CTCGGGCTCC CACGCCGCCC AGACCGCCGC GCTCATGGTG GGCCTGGAGA AGGAGTTCAC CGAGCGCAGG CCCGGCATGG TGGTCGTCTA CGGCGACGTC AACTCCACCG TGGCCGCGGC CCTGGTCGCC GCCAAGCTGC ACGTCCCCGT CGCCCACGTG GAGGCGGGGC TGCGCTCCTT CGACAACACC ATGCCCGAGG AGATCAACCG CCGCGTCACC GACCAGCTCA GCGACGTGTG CTTCGCCACC AGCCCGGAGG CCGTCGGCCA CCTGGCCGCC GAGGGCGTCC CGCCCTCCCG CGTCCACCTG GTCGGCAACC CCATGATCGA CACCCTCCTG GGCAACCTCG ACCGCTTCGA CGCCGACGCC CTGCGCGAGC GCCTGGACCT GCCCGAGCGC TACGTCGCCG CCACCCTGCA CCGGCCCGCG AACGTGGACG ACCCCGACAC CGTCGCCCGC CTCGCCGCGC GCCTGCACGA GATCGCCGAC CTCGCCGACG TGGTCATGCC CGTGCACCCG CGCGGCAAGG CCGCCTTCGA CCGGGCCGGG CTCGGCGACC ACCCGCGCGT GCGGCTCCTC GAACCCCTGG GCTACCTCGA CTTCGTCGCG CTCACCCGCG GCGCCGCCGC CGTGGTCACC GACTCCGGCG GCGTCCAGGA GGAGACCACG ATCCTCGGGG TCCCCTGCCT GACCCTGCGC CCCAACACCG AGCGCCCCGT CACCATCACC CACGGCACCA ACCAGCTCGT CACGGAGGCC GACCTCCTCC AGGCCGTCAC CAAGGTCCTG CACGGGCGGA GCCCGGAGCG GATCGGCGAC ACCCCGCCCC TGTGGGACGG CCGCGCGGGG GAGCGCATCG CCTCCGTGCT CACCCAGTGG TCGAGGTGA
|
Protein sequence | MRAVATSAPL GSQDTGIVHV VGARPNFVKA APVVSALRGR GAHQAVVHTG QHYDDRMSAV FFRDLGLPTP DVDLGVGSGS HAAQTAALMV GLEKEFTERR PGMVVVYGDV NSTVAAALVA AKLHVPVAHV EAGLRSFDNT MPEEINRRVT DQLSDVCFAT SPEAVGHLAA EGVPPSRVHL VGNPMIDTLL GNLDRFDADA LRERLDLPER YVAATLHRPA NVDDPDTVAR LAARLHEIAD LADVVMPVHP RGKAAFDRAG LGDHPRVRLL EPLGYLDFVA LTRGAAAVVT DSGGVQEETT ILGVPCLTLR PNTERPVTIT HGTNQLVTEA DLLQAVTKVL HGRSPERIGD TPPLWDGRAG ERIASVLTQW SR
|
| |