Gene Information |
Locus tag | Ndas_4783 |
Symbol | |
ID | 9248666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5670142 |
End bp | 5671356 |
Gene Length | 1215 bp |
Protein Length | 404 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | integral membrane sensor signal transduction histidine kinase |
Protein accession | YP_003682673 |
Protein GI | 297563699 |
COG category | |
COG ID | |
TIGRFAM ID | |
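The coordinates and lengths listed above are mutually consistent, and a short check can confirm that. The sketch below assumes 1-based inclusive coordinates and that the reported gene length includes the stop codon. Both assumptions are inferred from the numbers in this record, and the function names are illustrative, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but adds no residue to the protein.
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
length = gene_length(5670142, 5671356)
print(length)                  # 1215
print(protein_length(length))  # 404
```

The strand value does not affect this arithmetic, since the start and end positions are given on the replicon regardless of orientation.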
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.714313 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGAGGG GGAGGGGCGG CGGGGCCAGG GCCGGGCTCA CCCTGCGCGC CAGGCTGACC CTGGTCTACA CCGCCGTCTT CGCCGCGGGC GGGGCGGTGC TGCTCGGGGC CAACTACGCG ATGGTCTCCG CCAGTCTGGA GGCCCGGGGC GTCGACCTGG CCGCGACCAT CGTCGCCTCG GGGGACGGAG GGGAGCCCTC CATCGCCGTC GGGCCCGCCG AGCTCGTGGA GTCGGCGGGG CCGTTCGATG AGGTGGAGAG GGCGTCCGGG CCGACGAGTC CGCTGGTCGA CTACCAGGAC GGTGTGCTCT CCGACCTGCT GACCACGTCG GTCGCCGTGC TGGTGGCCGT CGCCGCCCTG GCGGCCCTGG CCGGGTGGCT GATCGCGGGC AGGCCGATGC GCAGGCTGCA CGCGGTCACC GAGACGGCCC GCCGCATCTC CGAGCGCGAC CTGCACAGCA GGCTGGCGCT GACCGGTCCC GACGACGAGT TCCGGGAGCT CGGCGACACC TTCGACGGGA TGCTCTCCCG GCTGGAGCGG GCCTTCGACG CGCAGCGGCG GTTCGTCGCC AACGCCTCAC ACGAGCTGCG CACCCCGCTG GCGGTCCAGA AGGCCGCCGT GGAGGTACCG CTGTCGCAGG GCCGGGTCCC CGAGGACCTC AGGCCCGCGT TCACCCGCGT GCTGGACTCG GTGGACCGCA GCGAACTCCT CATCGCGGGG CTGCTCCTGC TCGCCCGCTC CGACCGCGGC CTCGCCCGCA CCGAGCCCGT GGACCTGGCC GGGGCGGCGG AGAACGCATC GGCGCTGCTC GAACGCGACG CGCGGGAGGC CGGGGTCGAC CTGCGGACCG ACCTCGCGGC CGCGGAGGTG GAGGGGGATC CGGTCCTGCT GGAGCACCTC GTGCGCAACC TGGTGGACAA CGCGGTCCGC TACAACGTCC CGGGCGGGTG GGTCGAGGCG CGGGTGCGCA CGGCGGCCGG TCGCACCGTG CTGAGGGTGG CCAACACGGG CGGGAACGTC ACCGACCCCG AGAGCCTGTT CGAGCCCTTC CACCGGGGCG GCGACGCCCG GTTGCGCACC GCGCGGCCCG GCAGCGGCCT GGGCCTGGCC ATCGTCCGCT CGATCGCCTC CGCCCACGGC GCGCGGGCCG AGGCCCGCGC GCGCCCGGGC GGGGGACTGG TGGTGACGGT CGGGTTCCCG GCCGCCGGGG GCTGA
|
Protein sequence | MGRGRGGGAR AGLTLRARLT LVYTAVFAAG GAVLLGANYA MVSASLEARG VDLAATIVAS GDGGEPSIAV GPAELVESAG PFDEVERASG PTSPLVDYQD GVLSDLLTTS VAVLVAVAAL AALAGWLIAG RPMRRLHAVT ETARRISERD LHSRLALTGP DDEFRELGDT FDGMLSRLER AFDAQRRFVA NASHELRTPL AVQKAAVEVP LSQGRVPEDL RPAFTRVLDS VDRSELLIAG LLLLARSDRG LARTEPVDLA GAAENASALL ERDAREAGVD LRTDLAAAEV EGDPVLLEHL VRNLVDNAVR YNVPGGWVEA RVRTAAGRTV LRVANTGGNV TDPESLFEPF HRGGDARLRT ARPGSGLGLA IVRSIASAHG ARAEARARPG GGLVVTVGFP AAGG
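The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, checked for GC content, and translated. It assumes the displayed gene sequence is already the coding strand (it starts with ATG and ends with TGA even though the gene lies on the minus strand), and it uses the standard codon assignments, which translation table 11 shares for amino acids. The helper names are hypothetical.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI "TCAG" order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean_sequence(text)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(text: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on codons with characters other than A, C, G, T.
    """
    seq = clean_sequence(text)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record above.
print(translate("ATGGGGAGGG GGAGGGGCGG CGGGGCCAGG"))  # MGRGRGGGAR
```

The printed residues match the first block of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the reported GC content.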
|
| |