Gene Information |
Locus tag | Ndas_3946 |
Symbol | |
ID | 9247817 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4718774 |
End bp | 4720060 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | |
Product | putative CheA signal transduction histidine kinase |
Protein accession | YP_003681849 |
Protein GI | 297562875 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
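The coordinate and length fields in the gene record are mutually consistent, and a short sketch can check this. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a stop codon, which is not translated. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4718774, 4720060)
print(length)                  # 1287
print(protein_length(length))  # 428
```

Both values match the Gene Length and Protein Length fields listed for Ndas_3946.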
Plasmid Coverage Information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGCGCC GCTTGGTAAC ACCCACAGTG CTGACCGCCG GACTCGCGGC GGCGATGGTG AGCGCCGCGG TGGCGCCCGC CTCGGCTGAC GCCCAACCCC AACCGCCCGA GGCCCAGGCG ATCGCCGAGG AACTCGGGAA CGACAACGTC TTCATCGACC CGAGCATCGA CGCGATCCCC GAGGCCGAGC AGGGAATCCT GGAGTCGACC GCCGCCGAGG CGGACGTCCC CGTCTACTAC GTGATCCTGC CCACCGACAG CATCTCCTCC CAGGCCGGTC TCGACGCCCT GATGAACCCC GTCATGGACG AGGTCGGGGA CGGGGTCTAC GGCGTGTTCG CGGGCAGCCA GACCTTCCAG GTGCTGTCCC CCAACGTCGA GGACACCGAG GCCATCCAGC AGCTCGCCGT CCGGGAGGGC GGCGGCAACC AGGTCGACAC CCTCGTCGCC ATCCCCGACG CCGCCACCCA GGTCGAGGAG GCCGAGGCTG CCGGGGCCAC CTCCGGGTTC GTGCTCCTCG GACTCCTGCT GGCGGTCGTC GCCGCGGGCG CGTGGTTCGT CCACCGCAGC CGCAAGAAGC GCGAGGCCGA GAAGGCCAAG CAGCTGGAGG AGATCAGGCA GATGGCCACC GAGGACGTCG TGCGCCTCGG TGAGGACGTC GCCCGGCTGG AGATCGACGT CTCCAAGGTC GACGACGCCA CCCGCAACGA CTACTCCCAG GCCATGGACG CCTACGACCA GGCCAAGGCC CAGCTGGACA ACATCCGCGA GCCCGAGCAG GTCAGGCTGG TCACCAGTGC CCTGGAGGAC GGCCGTTACT ACATGACCGC CACCCGGGCC CGCCTCAACG GCGACCCGGT GCCCGAGCGG CGCGGCCCCT GCTTCTTCAA CCCGCAGCAC GGCCCGTCCG TGGAGGACGT GACCTGGGCC CCGCCCGGCG GCGCGCCCCG CGAGGTCACC GCCTGCGCCG ACTGCGCGCG TGCGGTGCGC ACCGGCGGCC AGCCCGACGT CCGCCTGGTC GAGGTGGACG GCGAGCGCCG CCCGTACTAC GACGCCGGTC CGGCCTACTC GCCCTACGCG AGCGGCTACT TCGGCATGAA CATGATGATG GGCATGTTCA CCGGCATGAT GATGGGCTCC ATGATGGGGT CGATGATGGG CATGGGTATG GGCATGGGCG CCGGTGAGGT CGGCGCCGGA GAGGACTTCG GAGGCGGGGA CTTCGGGGGC GGCGACTTCG GGGGCGGCGA CTTCGGCGGA GGCGACTTCG GCGGCTTCGA CTTCTGA
|
Protein sequence | MLRRLVTPTV LTAGLAAAMV SAAVAPASAD AQPQPPEAQA IAEELGNDNV FIDPSIDAIP EAEQGILEST AAEADVPVYY VILPTDSISS QAGLDALMNP VMDEVGDGVY GVFAGSQTFQ VLSPNVEDTE AIQQLAVREG GGNQVDTLVA IPDAATQVEE AEAAGATSGF VLLGLLLAVV AAGAWFVHRS RKKREAEKAK QLEEIRQMAT EDVVRLGEDV ARLEIDVSKV DDATRNDYSQ AMDAYDQAKA QLDNIREPEQ VRLVTSALED GRYYMTATRA RLNGDPVPER RGPCFFNPQH GPSVEDVTWA PPGGAPREVT ACADCARAVR TGGQPDVRLV EVDGERRPYY DAGPAYSPYA SGYFGMNMMM GMFTGMMMGS MMGSMMGMGM GMGAGEVGAG EDFGGGDFGG GDFGGGDFGG GDFGGFDF
|
| |
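The Gene sequence and Protein sequence fields can be cross-checked with a minimal sketch like the one below. It strips the spaces that group the sequence into 10-base blocks, computes GC content, and translates codons using the amino acid assignments of NCBI translation table 11. It is an illustration with hypothetical helper names, not the tool used to produce this record. It also does not special-case alternative start codons such as GTG or TTG, which is safe here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acid assignments for NCBI translation table 11,
# with codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases and normalise case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
prefix = "ATGTTGCGCC GCTTGGTAAC ACCCACAGTG"
print(translate(prefix))  # MLRRLVTPTV
```

The translated prefix agrees with the first ten residues of the listed protein sequence. Passing the full Gene sequence value to `translate` and `gc_percent` would allow the same comparison against the complete Protein sequence and the GC content field.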