Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4416 |
Symbol | |
ID | 9248291 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5254623 |
End bp | 5255783 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | integral membrane sensor signal transduction histidine kinase |
Protein accession | YP_003682311 |
Protein GI | 297563337 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.495846 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCTA TGGCCGGGCG GTGCGGCGCG CGCGACTGGA TCGTGGACTC CTCCATCTTC CTGATCGCCC TCCTCTCCGT ACTGATGAAC GCCGCCGAGT TCCCCCACCT GCCCGTCCGG GAACCGGTCC TGCCCGTGTG GCTCAAGGCC GCCGACGCGG CCCTCTCCCT GCTGGCCTGC CTGGCGCTGT GGTGGCGGCG GCGCTGGCCG GTCCAGATCG CCGTCGTGCT CGTCCTGTAC TCCGCCGTCT CCGGTCTGGC CTCGGGCGCG ATGCTCATCG CCCTGTTCTC CCTGGCCGTG CGCCGTCCGC CCCGCACCAG CCTGGCGGTG TACGGGCTGA GCGTGGGCGC CTCGCTCGTG CACGCCGCGC TGTGGCCCGA CCCGCACGCC CCGTTCCTGG TGATCCTCCT GCTGGGGGCC GCCCTCCAGG GCGCCGTGAC CGGCTGGGGG CTCACCGTCC AGCACCGGCG CGAGCTGGTG GAGTCGCTGC GCGACCGGGC CCTGCACGCC GAGACGGAGG CGCAGCTGCG CGCCGAGCAC GCCCAGCACC AGGTCCGCGA GGCCATGGCC CGCGAGATCC ACGACGTGCT CGGGCACCGG CTGTCGCTGC TGAGCGTGCA CGCGGGCGCC CTGGAGTACC GGCCCGACGC CCCCGCCGAG GAGGTGGCCC GGTCGGCGAA GGTGATCCGC GAGAGCGCCC ACCAGGCCCT CCAGGACCTG CGGGAGGTGA TCGGCGTGCT GCGCGCGCCC GTCGGGGAGC TGCCGCAGCC GACCATGGCC GACCTGCGGC AGCTGGTGGA GGAGGCCGAC GAGGCCGGGA CCCGGGTGGA GTTCGTGCAG GAGTGCGCCG GGACGGTCCC CGAGCGCACC GGGCGCACCG CCTACCGGAT CGTCCAGGAG GGGCTGACGA ACGTGCGCAA GCACGCCCCG GGCGCCACCA CGCGCGTACT GGTCCGGGGA GCCCCGGGCG ACGGCCTGCT GGTGGAGGTG GGCAACGACC CCTCCCCCGG CGCCCCTCCC GCGGCGTCGG GCGGGGACGG CGACGGCCAG GGCCTGGTCG GGCTCGCCGA GCGGGTGTCC CTGGCCTCGG GGCGGCTGGA GCACGGCCCG GACGGTCGGG GTGGCTGGCG GCTGGCGGCA TGGCTACCGT GGCCGACATG A
|
Protein sequence | MNAMAGRCGA RDWIVDSSIF LIALLSVLMN AAEFPHLPVR EPVLPVWLKA ADAALSLLAC LALWWRRRWP VQIAVVLVLY SAVSGLASGA MLIALFSLAV RRPPRTSLAV YGLSVGASLV HAALWPDPHA PFLVILLLGA ALQGAVTGWG LTVQHRRELV ESLRDRALHA ETEAQLRAEH AQHQVREAMA REIHDVLGHR LSLLSVHAGA LEYRPDAPAE EVARSAKVIR ESAHQALQDL REVIGVLRAP VGELPQPTMA DLRQLVEEAD EAGTRVEFVQ ECAGTVPERT GRTAYRIVQE GLTNVRKHAP GATTRVLVRG APGDGLLVEV GNDPSPGAPP AASGGDGDGQ GLVGLAERVS LASGRLEHGP DGRGGWRLAA WLPWPT
|
| |