Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5265 |
Symbol | |
ID | 9249163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | + |
Start bp | 429716 |
End bp | 431701 |
Gene Length | 1986 bp |
Protein Length | 661 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | multi-sensor signal transduction histidine kinase |
Protein accession | YP_003683151 |
Protein GI | 297564178 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTTCCG AGCGCGCGCC CCTCAGCGCC GAAGCCGTGC TGGCCGCCTC AGCGGACGCG GTGGTCTGCG TCGACAGCGA ACTCCGAGTC ACCCTCTGGA ACCCGGCCGC CGAGCGGTTG TTCGGTTGGC GTGCGGAGGA GGTGCTCGGG GGCGAGCTGC CGATCGTCCC CGCCGAGCTG AGGGCCGAGC ACGGCGCGGT GCTCGAACAC GTGCGCGCCG GAACCCCGCT GTCCATCCAC ACCCGCCGGG TGCGCAGGGA CGGCGGGGTC GTCGACGTGC GGATCAACAC CAACCGGGTG CCCGACGCCG ACGGCGGCGT GGCCGGGTGG GTGCTCAACC TCTACCCCTC CGACAGCGAC GTGCAGGCCC GCTCCTCGGC GATGGAGCGG GCCCGTCTCG TGCGGCGGCT CACCGACGTG GTCGTCGACA TCAACGCCGA CCTCAGCCTC AAGACCGTGC TCGACCGCAT CTCCCGCGGC ATCACCGAGC TCACCGGGGC GGACGCGGGC GGCTTCGTCC TGCTCAACGA GGACCGCGTG GAGCTGGTCA GCATCTCGGA GCTGTCGGAG GACCTCCAGG GGTTCAGCTC GGCGCTGGAC GACAGCCTGT TCGGCGAGCT GCTGCGCAGC GGCAAGTCCG TGCTGCTGGC CAACGAGGAC ACCCGCGGCC TCCAGGACCT GGTCTGGGCC GACCTGCCCG GGCTGCACAC CATCGCGCTG GGCGTGTCCA ACGTGCACGG CCGCCCCTAC GGCGCCCTGT ACGCGCTCTA CAGCCAGCGC AAGGTCGGGC ACGTGGAGCT GGAGCTGCTC GAACTGCTCG CCGCGCACGC CGGGGTGGCG ATCGGCAACG CGCTGGCCTA CGAGGAGCTC AACCGCCAGC GCGTCCACGA GCAGGCCGTG GCCGACTCCA GCGCCGACGG GATCGCCGTG CTCGACTTCG GCGGCCGGGT GCGCAAGTGG AACCGGTCGG CGGTGGAGCT CACCGGTTAC ACGGCCGAGG TGATGGAGGG GCGCCACCCG CCCTTCCCGC TGCCCGCCTG CCACGGCCAG CCCGTCAAGC ACAAGCTCCG CGACGGCCGC TGGCTGGAGA TCCTGATGGC GCAGATCCCG CGCACCCACG AGTGGGTGGT GGACTTTCGC GACATCACCG CGCAGAAGGC CATGGAGGAC GAGCGGGAGG AGTTCCTGGC CACGAGCGGG CACGAGCTGC GCACCCCCAT CACCGTTATC CACGGCTACG CCACCACGCT CCTGCGCAAG TGGTCCCGGC TCAGCCCGGA CTCGCAGTAC CAGGCCGTCG GCACCATCGC GGAGCGCTCC TCGGCCCTGG CCGCGCTGGT GGACCGGCTC CAGCTGGGCT CGGACGTCGC CCGCGGCGAG ATGCGGGTCG GCCGCGACCG GTTCGACCTG CCCGAGGTGC TGCGGCAGGC GGTCAGCGCC TTCCGCCCTC TGTCCGATCG GCACGACGTC ACCCTGGACG ACCTGCCCGA CCTACCGCAC ACGGCGGGCG ACCCGCTGGC CACGGGCATG ATCATGGACC AGCTGCTGGA CAACGCCCTC AAGTTCTCGC CCCGGGGCGG GGCCGTGCGC GTGAGCGCGC GGGAGGAGGG CGACGCCGTC GCGGTCATGG TCGACGACGA GGGCGTGGGG CTGCGTCAGG GCGACGAGGA GCGGATCTTC GACCGCTTCG TGCAGTCCGG GGTGCGCGGT GAGGAGTCCC GCTTCGGGGG GCTCGGGCTC GGTCTCTACA TCGTGCGCCA GCTGGCCAGG GACCAGGGCG GCGACGTGAC CGCCCAGCGC CTGGAGCGAG GGACGCGCAT GCGGTTCACC GTGCCGCTGC ACTCCGGGGA GGATCCGGAG CCCTCTCCAC GCGGCGAACC GCTCCACAGG CAACCTGAGT CCCAGGAAAC AACAAAAACC TCATCAACCC CTGAACCCCT GTCCCGTCAG CGATCCCCGC GCCGACGTGC GAGTCGTACG CCGTGA
|
Protein sequence | MSSERAPLSA EAVLAASADA VVCVDSELRV TLWNPAAERL FGWRAEEVLG GELPIVPAEL RAEHGAVLEH VRAGTPLSIH TRRVRRDGGV VDVRINTNRV PDADGGVAGW VLNLYPSDSD VQARSSAMER ARLVRRLTDV VVDINADLSL KTVLDRISRG ITELTGADAG GFVLLNEDRV ELVSISELSE DLQGFSSALD DSLFGELLRS GKSVLLANED TRGLQDLVWA DLPGLHTIAL GVSNVHGRPY GALYALYSQR KVGHVELELL ELLAAHAGVA IGNALAYEEL NRQRVHEQAV ADSSADGIAV LDFGGRVRKW NRSAVELTGY TAEVMEGRHP PFPLPACHGQ PVKHKLRDGR WLEILMAQIP RTHEWVVDFR DITAQKAMED EREEFLATSG HELRTPITVI HGYATTLLRK WSRLSPDSQY QAVGTIAERS SALAALVDRL QLGSDVARGE MRVGRDRFDL PEVLRQAVSA FRPLSDRHDV TLDDLPDLPH TAGDPLATGM IMDQLLDNAL KFSPRGGAVR VSAREEGDAV AVMVDDEGVG LRQGDEERIF DRFVQSGVRG EESRFGGLGL GLYIVRQLAR DQGGDVTAQR LERGTRMRFT VPLHSGEDPE PSPRGEPLHR QPESQETTKT SSTPEPLSRQ RSPRRRASRT P
|
| |