Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3714 |
Symbol | |
ID | 9247583 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4458506 |
End bp | 4459639 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | N-acetylglucosamine-6-phosphate deacetylase |
Protein accession | YP_003681618 |
Protein GI | 297562644 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCACA CCCCCGTCAC ACTCACCAAC GCGCGCGTCG TGACCCCCGA CGGGGTCCAC GAAGGATGGT TGAGGATCGA GAACGGCCGC GTGGCCGCGC TGGGTTCGGA CGGACCCGTT CCCGGCGGCC GCGACCTCGG CGGCGCCTGG GTCGTCCCCG GCATGGTGGA CACCCACGTC CACGGCGGCG CCGGGCACGC GTTCACCGAG ACCGACCCCG AGGGCGTCCG TGAGACGATC GCGTTCAACC GCTCCCGGGG CGTGACCGCG CTGGTGGGCG GCCTCGTGGC GGCCACCCCG GAGGACACCC TGCGCCAGGT GGCGGCGCTG GCCGAACTGT GCGACGCGGG CGAGCTGGCG GGCATCTACC TGGAGGGCCC CTTCATCTCC CGGGCCAGGT GCGGGGCGCA CGACCCCGAC CTGCTCCGCG ACCCCGACAC CGCCGAGTTC GACCGCTGGC TCAAGGCCGG GCGGGGGCAC GTGCGCATGG TCACGGTGGC CCCGGAGCTG CCCGGAGCGC TGGACCTGAT CGGCGCGGCG GCCTCCTCCG GCGTGGTGGC GGCGGTGGGG CACACCGAGG CCACCTACGA GCAGACGCTG GCGGCCTTCG ACGCGGGGGC GTCGGTGGCC ACCCACCTGT ACAACGCGAT GCGTCCGCTG GGCCACCGCG ACCCGGGGCC GATCGCCGCC GCGCTGGGCG ACGAGCGCGT GACGGTCGAG CTGATCCTGG ACAACGTGCA CGTCCACCCG GGCGCGGCCG GGCTGGTCTT CGACGCCGCG GGCGCGGACA GGGTGTCCCT GGTGACGGAC GCGATGTCGG CGACGGGCCT GGGCGACGGC GAGTACACGC TGGGCGACCT GCGGGTGCGG GTGAGCGGCG GCGAGGCGCG CCTGGCGGAG AGCGGCACGA TCGCCTCCAG CACGATCGTC CTGCCCCAGG CGGTGCGCAA CGCCGTGCGG AGCCTGGGCG TCGGCGTGCC CGAGGCCGTG CGATCGGCGT CGTCGGTTCC GGCCGCGGCG CTGGGGCTGG ACGGGGTGGG CCGGATCGAG GTCGGAGGGC GCGCCGACCT CGTCGTGCTC GACGACGACC TCGGAGTCCG CGAGGTGGTG TACGAGGGCG CCTGGGTGGA GTAA
|
Protein sequence | MAHTPVTLTN ARVVTPDGVH EGWLRIENGR VAALGSDGPV PGGRDLGGAW VVPGMVDTHV HGGAGHAFTE TDPEGVRETI AFNRSRGVTA LVGGLVAATP EDTLRQVAAL AELCDAGELA GIYLEGPFIS RARCGAHDPD LLRDPDTAEF DRWLKAGRGH VRMVTVAPEL PGALDLIGAA ASSGVVAAVG HTEATYEQTL AAFDAGASVA THLYNAMRPL GHRDPGPIAA ALGDERVTVE LILDNVHVHP GAAGLVFDAA GADRVSLVTD AMSATGLGDG EYTLGDLRVR VSGGEARLAE SGTIASSTIV LPQAVRNAVR SLGVGVPEAV RSASSVPAAA LGLDGVGRIE VGGRADLVVL DDDLGVREVV YEGAWVE
|
| |