Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_4008 |
Symbol | |
ID | 9247880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4793980 |
End bp | 4795554 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | integral membrane sensor signal transduction histidine kinase |
Protein accession | YP_003681911 |
Protein GI | 297562937 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00577691 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.384071 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGTGGC ACGAGGCCCT CCGGCCCGCG CGTTCCTATC CTGGGGGAAT GCCCGACCTG TCCCCACGCT CCCTCCTGGA CCGCGTGACG CTGTGGGCCC GCGGCCACGT GCTCGCCGTG GACGCGCTCT GGGCCGTCGT CTGGTTCGCC ATGTCGATGG CGACCTGGCC GAGGGCCAGC TTCGACACCG CCGAGTCGTG GGTCTACCTC GTGCTGGCCA CCGCGTGCTG CGCCGCCCTG GCCCTGCGCC GCGTCCGCCC GTTCGCGTGC CTCGCGGTGC TGGGCGTCCT GCTGGCGTTC CACATCCTCT GGTTCGACCA GCCCACGGCC CCGGTGGGCA TCTGCGCCCT GGTCGCGTCC TACACGGCGC AGGCCGAGCT CCCGCGCCCG TGGCGCGCCG TCGGTCTCCT CCTGCTCCTG GCCGGAGCGG CCTGGGCCGT CCTCTCCATC CCGCCGGAGA ACCTCTCCGC GGACCTGGAG CTGCGCCTCA ACAGCGTCGT CTCGGCGTGG ACGGCGGTCG CGCTGTTCTC CCTGCTCGGA GCGTTCCGCA GGCGCAACCG CGAGGAGTTC GCCCGCGTCG TGGAGCACGC CCGGCTGCTG GAGACCCAGC GGGAGCAGGA GGTGCGCCTG GCCGCGCTCG ACGAGCGGAC GCGCATCGCC CGGGAGATGC ACGACATCCT CGCGCACTCG CTGAACGTCA TCGTCGCCCA GGCCGACGGC GGCCGCTACG CCGCGAAGGC CGCCCCGGAG CGCGCGGTGG CCGCCCTGGC CACCGTCGCC CAGGTGGGAC GCGAGTCGGC GGCGGAACTG CACCAGCTCC TGGGCGTCCT GCGCGACGGC GAGGAGCGCG GGGCCGCCCC GGCCCCCGGG GTCGGCGACC TGCCCGGCCT CGTGGAGGAG TACCGCCGCG CCGGTCTGCG GATCCGCCTG GTCCAGCACG GGTCCCCGGC CGCCCCGCGC GGCGGCCGGG CGGACACCGG CGCCCCGGCG ACCCTGCCGG CGACCGCGTC CCTGACGGTC TACCGCGTGG TGCAGGAGTC GCTGGCCAAC GCGCTCAAGC ACGGGGGACC CGCCGCCGCG CGGGTCGAGC TGACGTGGTC GCCCGGGCGG GTCGGGATCG ACGTGGCCAA CTCCGTCCGC GAGGCCGCAC CCGCGGCCCT CACCACACCC GCGGGCCCCT CAGGCCGCTC GGAATCCACA GGTCCGTCCG CGTCTGGAGG CCCCTCGGCT CCCCCGGTGT TCGCGGGCCC CTCCATGCCC ACGGCCCTCT CCGCACCCGA GGGGGTTCCT CCCACGGGCC GCTCGGGTCC ATCGACACCC GCGGACGCCT CGGGCCCCTC GGCGCCCGCG GGTTCCTGCG GCACCGGAAG GCCCTCCGGT ACCGGGAGGC ACTCCGCCGC CAAGGCGCCC TCCGCCGTCG GGGCGGTCCC GGGCGGCGCT CGGCGGGGCC CCGGCCACGG CCTGGTCGGC ATGCGCGAGC GTGTGGGCCT GCACGGCGGC ACCCTGGAGG TCGGCGCCGA CGACGCCACC GGCACCTGGC GGGTGCGCGC GGTGGTCCCC TGGGAGGAGG CGTGA
|
Protein sequence | MPWHEALRPA RSYPGGMPDL SPRSLLDRVT LWARGHVLAV DALWAVVWFA MSMATWPRAS FDTAESWVYL VLATACCAAL ALRRVRPFAC LAVLGVLLAF HILWFDQPTA PVGICALVAS YTAQAELPRP WRAVGLLLLL AGAAWAVLSI PPENLSADLE LRLNSVVSAW TAVALFSLLG AFRRRNREEF ARVVEHARLL ETQREQEVRL AALDERTRIA REMHDILAHS LNVIVAQADG GRYAAKAAPE RAVAALATVA QVGRESAAEL HQLLGVLRDG EERGAAPAPG VGDLPGLVEE YRRAGLRIRL VQHGSPAAPR GGRADTGAPA TLPATASLTV YRVVQESLAN ALKHGGPAAA RVELTWSPGR VGIDVANSVR EAAPAALTTP AGPSGRSEST GPSASGGPSA PPVFAGPSMP TALSAPEGVP PTGRSGPSTP ADASGPSAPA GSCGTGRPSG TGRHSAAKAP SAVGAVPGGA RRGPGHGLVG MRERVGLHGG TLEVGADDAT GTWRVRAVVP WEEA
|
| |