Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3990 |
Symbol | |
ID | 9247861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4772267 |
End bp | 4773727 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | UDP-N-acetylglucosamine pyrophosphorylase |
Protein accession | YP_003681893 |
Protein GI | 297562919 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.797377 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGTGA ACCGTCCGGC TGCCGTCATC GTCCTCGCGG CGGGCGAGGG CACCCGAATG AAGTCGAAGC TTCCCAAGGT CCTCCACGAA CTCAACGGCC GCAGCATGCT CGGCCACGTG CTCGCGGCGG CGCGGGAACT CGACCCCCAG CACGCGGTCG TGGTCGTCGG CCACGCGCGT GAGCAGGTCA GATCCCATCT GGAGGAGATC GCCCCCCAGG CCGCCACCGC GGTCCAGGAG GAGCAGAACG GCACCGGCCA CGCCGTGCGC ATGGCGATCG AGGACCTGGC CGCCAAGGGC GTCAAGCTCA GCGGCACCGT GGTCCTGACC TGCGGTGACA CCCCCCTCCT GCGCGGTTCC ACGCTCGCGG AGCTCGTCGC GGCCCACGAC GAGGAGGGCA ACGCCGTCAC GGTGCTCTCC GCGCGCGTAC CCGACCCCCA CGGTTACGGC CGCATCGTCC GCGACGCCGA CGGCGACTTC ACCGGCATCG TCGAGCACGC CGACGCCACC CCCGAGCAGC ACGCGATCGA CGAGATCAAC TCGGGCATGT ACGCCTTCGA CGGCGCCCTG CTGTCCGAGG TCGTCCAGCG CCTGTCCACC GACAACGCCA AGGGCGAGGA GTACGTGACC GACGCGGTCT CCCTGCTGCG CGGCGACGGC CACCGGGTCG GCGCCTGGGC GGCGGACGAC TGGCACGAGG TCCAGGGCGT CAACAACCGC GTCCAGCTCT CCGAGGCCCG CCGCGTCCTC AACGACCGGC TGGTCAACGA GCACATGCTC GCCGGGGTCA CCGTCGTGGA CCCCGCCACC ACCTGGATCG ACGCCCAGGT CACCATCGGC CGCGACACCG TGATCGAACC GGGGACCCGG CTGCTGGGCG CCACCTCCGT CGGCGAGGAC GCCGTCGTCG GCCCCCGCGC GGACCTGAGG GACACGGTCG TCGGCGCGGG CGCCACGGTG CGCGAGACCA CGGCGGACCG GGCCGAGATC GGCCCCGGGG CCTCCGTCGG CCCCTACACC TACCTGCGGC CGGGCACCCG CCTGGCCGAG CGGTCCAAGG CCGGAGCCTT CGTCGAGGTC AAGAACTCGA ACGTCGGCGC CGAGTCCAAG ATCCCGCACC TGACCTACGT GGGCGACGCG GACATCGGCG TGGGCAGCAA CATCGGCTGC TCCTCGGTGT TCGTCAACTA CGACGGGGTC AACAAGTCCC GGAGCGTCAT CGGCGACCAC GTCAGGATCG GCAGCGACAA CACCATCGTC GCCCCGGTCC GCGTGGGCGA CGGCGCCTAC TCCGGGGCGG GGACCGTGGT CCGCGACGAC GTGCCGCCCG GTGCCCTCGC CGTTTCCGAG GGGCACCGCC AGCGCAACGT CGAGGGCTGG ACCCGGCGCA AGCGCCCGGG CACGCCCTCC GCCGAGGCGG CGGAGCAGGC CGATCGGCAC AGAGCCGACG ACAAGCAGTG A
|
Protein sequence | MSVNRPAAVI VLAAGEGTRM KSKLPKVLHE LNGRSMLGHV LAAARELDPQ HAVVVVGHAR EQVRSHLEEI APQAATAVQE EQNGTGHAVR MAIEDLAAKG VKLSGTVVLT CGDTPLLRGS TLAELVAAHD EEGNAVTVLS ARVPDPHGYG RIVRDADGDF TGIVEHADAT PEQHAIDEIN SGMYAFDGAL LSEVVQRLST DNAKGEEYVT DAVSLLRGDG HRVGAWAADD WHEVQGVNNR VQLSEARRVL NDRLVNEHML AGVTVVDPAT TWIDAQVTIG RDTVIEPGTR LLGATSVGED AVVGPRADLR DTVVGAGATV RETTADRAEI GPGASVGPYT YLRPGTRLAE RSKAGAFVEV KNSNVGAESK IPHLTYVGDA DIGVGSNIGC SSVFVNYDGV NKSRSVIGDH VRIGSDNTIV APVRVGDGAY SGAGTVVRDD VPPGALAVSE GHRQRNVEGW TRRKRPGTPS AEAAEQADRH RADDKQ
|
| |