Gene Information |
Locus tag | Ndas_5232 |
Symbol | |
ID | 9249125 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 384690 |
End bp | 385892 |
Gene Length | 1203 bp |
Protein Length | 400 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | putative signal transduction histidine kinase |
Protein accession | YP_003683118 |
Protein GI | 297564145 |
COG category | |
COG ID | |
TIGRFAM ID | |
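The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based, inclusive positions and that the gene length counts the terminal stop codon (the gene sequence in this record does end in TGA). The function names are hypothetical and are not part of any database tool.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that includes its stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 384690, 385892
length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1203
print(protein_length_aa(length))  # 400
```

Both results agree with the Gene Length (1203 bp) and Protein Length (400 aa) fields in the record.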
| |
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0452492 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00276259 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGCAGCAA CGGCCGCGGA CACCCCGCGG CTCACCCGCG CGGTCAGGGG CAGGCTCCTC GGCGGCGTCG CCGCCGGGCT CGCACGCCAC CTGGGCCTGG ACCCGGTGGT GGTCCGCCTC GCCTTCATGG CGCTGTCCGT GGGCGGCATC GGCATCGCCG TCTACGCCGC CCTGTACTTC TTCGTGCCCA GCGACTCCTC GGAGGAGGAG GCCGCCGAGC CCGAGAGCCG ACGCAAGGGC CGCGACCTCT CCCAGCTCAT CGCCTACGTC GGACTCGCCG CCGGACTCGG CCTGCTCTTC CTGCTCTTCG GCGGCGTCTT CGACCCGCTG CTGTGGTTCG TGGTCTTCGG CACCCTCGGC GGCATCATCC TCTGGCAGCA GGCCAACCCG TCCCAGCGCG ACCAGTGGAT GACCAGCACC GTCAGCCGCT CCTCACCCAA GGGCATCATG CGCGCGGGCG CGGGCGTGCT GCTCGTCGTC GTCGGCGCCA TCGGCTTCCT CGTCTTCCAG GAACAGCTCC AGAACGCCCG CGCCGGCCTG ACCTTCGCCT TCACCGTGCT CGCGGGCATC GCGCTCATCG TGGCCCCCTG GATCATCGGG CTGATCCGCG AACGCGACCA GGAGCGCCGC GAGCGCATCC GCAACGCCGA GCGCGCCGAA CTCGCCGCCC ACATCCACGA CTCCGTCCTG CACACCCTGA CCCTCATCCA GCGCCGGGCC GAGGACCCCC GCGAGGTGCA GCGCCTCGCC CGCGTCCAGG AGCGCGCCCT GCGCAGCTGG CTCTACCAGC GGCCCGCCGA CGCCGACACC ACCGTCTCGC CCGCGCTCGA ACGCGTCGCC GCCGAGGTCG AGGAGGAGCA CGGGGTCCCC ATCGAAGTGG TGTGCGTGGG CGACTGCCCC ATGGACGACG CGCTGGCCGC CATGCTCCGC GCCGCCCGCG AGGCCATGGT CAACGCCTCC AAGTACGCCG GGACGGACAG CATCTCGGTG TTCGGCGAGG TCGACCAGGA GGAGGTGCTG GTGTTCGTCC GCGACCGCGG CGCCGGATTC GACATGGACG CCGTCCCCGA GGACCGCATG GGCGTGCGCG GCTCCATCCT GGGCCGGATG GACCGCCACG GCGGCTCCGC GCGCATCCGC ACCGGCCCCG GCGAGGGCAC CGAGGTGCAG CTGCGCATGC CGCGCGTCCC CGACCTGTTG TGA
|
Protein sequence | MAATAADTPR LTRAVRGRLL GGVAAGLARH LGLDPVVVRL AFMALSVGGI GIAVYAALYF FVPSDSSEEE AAEPESRRKG RDLSQLIAYV GLAAGLGLLF LLFGGVFDPL LWFVVFGTLG GIILWQQANP SQRDQWMTST VSRSSPKGIM RAGAGVLLVV VGAIGFLVFQ EQLQNARAGL TFAFTVLAGI ALIVAPWIIG LIRERDQERR ERIRNAERAE LAAHIHDSVL HTLTLIQRRA EDPREVQRLA RVQERALRSW LYQRPADADT TVSPALERVA AEVEEEHGVP IEVVCVGDCP MDDALAAMLR AAREAMVNAS KYAGTDSISV FGEVDQEEVL VFVRDRGAGF DMDAVPEDRM GVRGSILGRM DRHGGSARIR TGPGEGTEVQ LRMPRVPDLL
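The sequences above are printed in space-separated blocks of ten characters, so the spaces have to be removed before any calculation. The sketch below shows one way to do that and to compute a GC fraction. It is an illustration only: the helper names are hypothetical, and the example uses just the first three blocks of the gene sequence, so its result is not the 73% figure reported for the whole gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three display blocks of the gene sequence in this record.
first_blocks = "GTGGCAGCAA CGGCCGCGGA CACCCCGCGG"
seq = clean_sequence(first_blocks)
print(len(seq))                    # 30
print(round(gc_fraction(seq), 2))  # 0.8
```

Passing the complete Gene sequence field to the same two functions would give the value to compare with the GC content field.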