Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_0443 |
Symbol | |
ID | 9244282 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 535672 |
End bp | 537597 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_003678396 |
Protein GI | 297559422 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.324888 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGACCG AGGGATCCGG GCCCGCCCCG GACACCGGCG TGCACGGCGC GGCCTCGACC GGCGGCACGC ACTTCGTCAA CGGCACCAAC CGCGAGGGGA AGCGCTCGGC CCACCAGTCG TGGAGCCTGC GGCGGCGCGT CACGAGCCTG CTGGCCGTGG TCGCCGTCGT CCTGGTCGTG GCGGTGTCCG TCATCACCCT GGCGGCGTTC CAGGCCCGCG AGTCCCTGGC GCTCCAGGTC GACTCCCTGA CACCCGCCGT GAGCGCGGTC GAGCAGACGC ACTCGGCCTA CCTCACCCAG GACCACGCCC TGCGCGGCTA CATCCTCACC CAGGACCGCG AGTTCCTCCA GCCCTACGTC GAGAGCCAGC TGACGCTGAC GGAGAACCAG GCCGTCCTGG CCCAGCTGGC CGAGGACAAC CCCGAGGTGG CCACCAACGT CGACGACCTG CTCACCGCGG GCCGGGTGTG GACCGAGGAG TTCGCCGAGC CCGCCTTGGA GCGGGTCAGC GGCGGCCAGG AGGTCACCCA GGAGGAGCTG CGGCGCGGCC GGGTCCTCTT CCTGGAGCTC AGCCGGATCA GCGACGCCAC CACCAGCCAG CTGGAGGCGG AGATCAAGGA GGCCCGCGAG GGCCTGACCC TGGCCACCCA GCAGGTCGTG GCCCTGCTGG TCCTGGTCGG CCTGGTCGTG GTGTTCCTGT CGGTGTTCCT GTGGGTGATG CTCCAGCAGT GGGTTCTGCG CCCCCTGGAG GAACTCGCCG GGCACATGCG CCAAGTGTCG GAGGGCTACT ACGCCCACCG GATCTCCCTG CACGGCCCGC CCGAGATCGT CCGGCTCGGC CAGGACGTGG ACGCCATGCG CGAGCGCATC GTGCAGGACC TGGACGAGGT CGCCTCCGCG CGGCGCAAGC TCCAGGAGCA GTCCGTCCTC ATGGAGAACC AGACCGAGGA ACTGCGCCGC TCCAACCTGG AGCTGGAGCA GTTCGCCTAC GTCGCCTCCC ACGACCTCCA GGAGCCGCTG CGCAAGGTGG CGAGCTTCTG CCAGCTGCTC CAGCGCCGCT ACCAGGGGCA GCTGGACGAG CGCGCCGACG CCTACATCGA CTTCGCGGTC GAGGGCGCCA AGCGCATGCA GACCCTCATC AACGATCTAC TGGCCTTCTC CCGGGTCGGC CGGGTCAGGA ACTTCGCGCC GGTCGCCCTC GACGACGCGC TGGACGACGC CCTGAGCAGC CTGTCCACCC GTCTGGAGGA GGCCGACGCC GAGGTCACCC GGGACCCGTT GCCGACCGTG CAGGGCGACC GCACCCTCCT GACCCAGGTG TTCTTCAACC TCGTGGGCAA CGCCGTGAAG TTCCGCGGCG AGGAGGACCC CCGGGTCCAC ATCAGCGTCG AACGGCGCGG TGACGAGTGG GTGTTCTGCT GCTCCGACAA CGGGATCGGA ATCGAACCGC AGTACGCGGA GCGCATCTTC GTGATCTTCC AGCGGTTGCA TACCAGGGAC AAGTACACGG GAACCGGCAT CGGCCTGGCG ATGTGCAAGA AGATCGTGGA GTTCCACGGG GGACGGATCT GGCTGGAAAC CGGCTCCCGG GACCCCGGGG AATCCGAAAC CTCAGGTGAC CGGGACTCTG GTCGAACCGG AACGCGCATA TGCTGGTCCT TGCCCGCCGA CCCCGCGGAG GACGAGGACC CGGCCCCCGA CAGGGGAACC GCCGAGCCCC TCGCCGTCGA TAACGGGGAC GAGGGCACCG AGGACGCCGA GACCACCCCC GAGGACTCGG CTCCGACGGC CGCACGACAG CCCACGGGTA CGGACAACCC TGGGGAGGAC GGCGCCGAAC CGGGCGACAG GCCCGGCGGC GTCCGCTCCG CCCAGCCCGA CACCGGTGGC GGTACGGTTC CCCCCGGTCA CGGGGCGGGA CCCTGA
|
Protein sequence | METEGSGPAP DTGVHGAAST GGTHFVNGTN REGKRSAHQS WSLRRRVTSL LAVVAVVLVV AVSVITLAAF QARESLALQV DSLTPAVSAV EQTHSAYLTQ DHALRGYILT QDREFLQPYV ESQLTLTENQ AVLAQLAEDN PEVATNVDDL LTAGRVWTEE FAEPALERVS GGQEVTQEEL RRGRVLFLEL SRISDATTSQ LEAEIKEARE GLTLATQQVV ALLVLVGLVV VFLSVFLWVM LQQWVLRPLE ELAGHMRQVS EGYYAHRISL HGPPEIVRLG QDVDAMRERI VQDLDEVASA RRKLQEQSVL MENQTEELRR SNLELEQFAY VASHDLQEPL RKVASFCQLL QRRYQGQLDE RADAYIDFAV EGAKRMQTLI NDLLAFSRVG RVRNFAPVAL DDALDDALSS LSTRLEEADA EVTRDPLPTV QGDRTLLTQV FFNLVGNAVK FRGEEDPRVH ISVERRGDEW VFCCSDNGIG IEPQYAERIF VIFQRLHTRD KYTGTGIGLA MCKKIVEFHG GRIWLETGSR DPGESETSGD RDSGRTGTRI CWSLPADPAE DEDPAPDRGT AEPLAVDNGD EGTEDAETTP EDSAPTAARQ PTGTDNPGED GAEPGDRPGG VRSAQPDTGG GTVPPGHGAG P
|
| |