Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4334 |
Symbol | |
ID | 9248209 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5169249 |
End bp | 5170232 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | protein of unknown function DUF523 |
Protein accession | YP_003682229 |
Protein GI | 297563255 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.928853 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.590252 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCACAC GGCTTCCTCC GCTGCCCGAC ACGCCGGTCC GCCCCAGGGT CGGGGTGTCC AGCTGCCTGC TCGGCGCTCC GGTCCGCTAC AACGGCGGGC ACTCACGTTC GCGTTTCCTC ACCGACGAGC TCGACCGGCA CGTGGACTGG CTGCCGGTCT GCCCGGAGGC GGAGATCGGC CTGGGCGTCC CGCGCCCCAC CCTGCGCCTG CAGCGCCGGG AGGGGCTGGA CCGGGTGGTC TCCAGCGCGG ACGGCGCCGA CCGCACCGAG GAGCTGGCCG AGGTCGCCGA CCACCACCTG GCCCAGCTGC GCCACCTGGA CGGGTACGTG CTCAAGAACA AGTCGCCCAG TTGCGGCCTG TTCGCCCTGC CCGTGTTCGA CCAGGGCGGC GGCCGGGTGG ACGGCAGGGG CCGTGGCGCC TTCGCCCAGC GGCTCACCGA GCTGCTGCCC TCCCTTCCGG TGGAGGAGCA GGGCCGCCTG ATGGACCCCG TGCTGCGTGA GCTGTTCGCC CAGCGCGTCT TCGCGCACGC GCGGCTGCGC CACCTGTGGG AGTCGGACTG GCGTCCGCGC GACCTGGTGG CGTTCCACAG CCGCCAGAAG CTCCAGCTGA TGTCGCACTC CCCGGAGGGC TACCGGGAGA CGGGAAGGAT CGTGGCCCGC GCGGGCGCAG ACGACCCCGA GGAGGTCCGG GCGGCCTACA CCGACGCCTT CCACCGGGCG ATGGCCGTGC GGCCGAGCCG GGGCAAGCAC GTCAACGCCC TCCAGCACGC CTTCGGGATG CTGAGCGCGC TGCTGGACGA CGCCCGCAGG CACGACCTGC TGGGGGCGAT CGAGGACTAC CGGCGGGAGC AGGTGCCGCT GAGCGTCCCG GTGGCGCTGC TGCGCCACCA CTGCGCGGCG GAGGACGTCG AGTGGGCCCG CGACCAGACC TACCTGCGCC CGTACCCGGA CGACCTCCGG CTGCGGCACG CCGTCACGGT CTGA
|
Protein sequence | MTTRLPPLPD TPVRPRVGVS SCLLGAPVRY NGGHSRSRFL TDELDRHVDW LPVCPEAEIG LGVPRPTLRL QRREGLDRVV SSADGADRTE ELAEVADHHL AQLRHLDGYV LKNKSPSCGL FALPVFDQGG GRVDGRGRGA FAQRLTELLP SLPVEEQGRL MDPVLRELFA QRVFAHARLR HLWESDWRPR DLVAFHSRQK LQLMSHSPEG YRETGRIVAR AGADDPEEVR AAYTDAFHRA MAVRPSRGKH VNALQHAFGM LSALLDDARR HDLLGAIEDY RREQVPLSVP VALLRHHCAA EDVEWARDQT YLRPYPDDLR LRHAVTV
|
| |