Gene Information |
Locus tag | Ndas_0781 |
Symbol | |
ID | 9244626 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 960936 |
End bp | 962546 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | integral membrane sensor signal transduction histidine kinase |
Protein accession | YP_003678731 |
Protein GI | 297559757 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
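The length fields above can be cross-checked against the coordinates. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons; the terminal stop codon,
    if present, does not contribute an amino acid.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Ndas_0781.
length = gene_length_bp(960936, 962546)
print(length)                     # 1611
print(protein_length_aa(length))  # 536
```

Both results agree with the Gene Length (1611 bp) and Protein Length (536 aa) fields.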
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACCTCT CCGTACCGCG CGCGGGCCTC CGGCGTGCGC TGACCCCGCT CGGGGGGTGG CCCACCCGGG GCGCCGTCTG GCGCGACGTG CTGTTCGGGG CCGCGATCGC CGTCCTGTGC CTGGGGGAGA TGCTCATCCG CTGGCGCGCC GGTGAGACGT CGGCCGCCGA GGCGCTCACC ACCGCGTCCG CCGCGGTGGC GCTGACCGCG CTGACCGTGG TGCTCTGCCG CCTCTACCCG CTGCTCGCCC TGACCGTGGC GCTGCTCGGC TCGTTCTGGG TGTACGGCTT CGGCGTCCTG CTGCTGTGCG TCGCCTACCT GGTGGGCCGC CGCCTCCCCA CGGTGTGGTC GGCGCTGGCG CTCTTCACGT CCGTGACCGT CCTGTGGACG GCGGTCGGAG CGCTGCTGTG GCCCGAGGCG CCCGCGGCGT GGCCCGCCAC CGTGAGCACC GTCGTGTTCA CCCTCGTGCT CCCCTGGCTG GTGGGCGTCT ACCGGCGCCA GCACGTGGCC CTGGCCGAGG CCGGATGGGA GCACGCCCGC CAGCTCCAGC GCGAGCACCG CCTCACCGTC GACGAGGCCA GGCTCCGCGA GCGCTCCCGC ATCGCCCAGG ACATGCACGA CTCCCTAGGC CACGAGCTCA GCCTCATCGC CCTGCGCGCC GGGGCCCTGG AGGTCAGCCC CGACCTGGAC CAGGAGCACC GCCGCTCCGC GTCCGAGCTG CGGATCACCG CGGTCGAGGC CACCCGCAGC CTGCGGGAGA TCATCGGCGT GCTGCGCGCC GACACCGAAC CCGCGCCCAT GTCCCCCGCG GGCGAGGGCG TTCCCGCGCT GGTCGAGCGC GTCCGCGACT CCGGCATGCG GGTCGTGCTG GTGCGCGACG GGGACACCGG CGGCCTGCCG CCGATGGTGG ACCGCGCGGT CCACCGGGTC GTCCAGGAAT CCCTCACCAA CGCCGCCAAG TACGCCCCCG GCGCCGACGT GACCGTGTGG CTGACCGCGA GCGGGGAGCG GGTGGAGGCC AGGGTCACCA ACTCCGCGCC CCCCGAGCCG TGCCCCGACA CCGGTCCGGG CGGCAGGCGC GGACTCATCG GGCTGCGCGA GCGCGTCCGC CTGACCGGGG GCTCCTTCGC CGCCGGGCCG AACGGCGACG GCGGCTGGGA GGTCTGCGCG ATCATGCCGC TGGACGGCAC CGCCGGGGCG GGGCCCGACG AGGACGGGGA GGCGGCCCAG ATCGACCAGC TCCAGCTCGC CGCCCACCGC CGGGTGCGCG GCTGGACGAT GGCCCTGGCC CTGGTACCGG TCACCGCCGC CGTGGCGCTC GTGCTCGGGC TGGGCTGGAT CGGCGTCACG AACATGGAGA GCAGCACCCT GACCCCCGAG GAGTTCGCCG ACCTGACCGT CGGCGACCCC CAGACCGAGG TCGAGGAGGC CCTGCCGCCG AACTCGGTGT ACATGGACGC GAGCGTCCTG GCGCAGGACG CCGTCCCCCC GGGCGCCACA TGCAGCTACT ACCGCAGCAA CGGCGTGTTC CTCGGCGAGG AGACGGTCTT CTACCAGCTG TGCTTCTCCG AGGGGCGCCT GCTGACCAAG GGCGAGGCGC CGCTACGGTG A
|
Protein sequence | MDLSVPRAGL RRALTPLGGW PTRGAVWRDV LFGAAIAVLC LGEMLIRWRA GETSAAEALT TASAAVALTA LTVVLCRLYP LLALTVALLG SFWVYGFGVL LLCVAYLVGR RLPTVWSALA LFTSVTVLWT AVGALLWPEA PAAWPATVST VVFTLVLPWL VGVYRRQHVA LAEAGWEHAR QLQREHRLTV DEARLRERSR IAQDMHDSLG HELSLIALRA GALEVSPDLD QEHRRSASEL RITAVEATRS LREIIGVLRA DTEPAPMSPA GEGVPALVER VRDSGMRVVL VRDGDTGGLP PMVDRAVHRV VQESLTNAAK YAPGADVTVW LTASGERVEA RVTNSAPPEP CPDTGPGGRR GLIGLRERVR LTGGSFAAGP NGDGGWEVCA IMPLDGTAGA GPDEDGEAAQ IDQLQLAAHR RVRGWTMALA LVPVTAAVAL VLGLGWIGVT NMESSTLTPE EFADLTVGDP QTEVEEALPP NSVYMDASVL AQDAVPPGAT CSYYRSNGVF LGEETVFYQL CFSEGRLLTK GEAPLR
|
| |
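The sequences above are printed in blocks of ten characters, so the spaces have to be removed before the sequence is used. The sketch below shows one way to do that and to compute the GC fraction for comparison with the GC content field. The helper names are hypothetical, and the GC content field in the record may have been rounded or computed differently.

```python
def clean_sequence(raw: str) -> str:
    """Remove the block spacing (any whitespace) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence, as printed in the record.
fragment = clean_sequence("ATGGACCTCT CCGTACCGCG CGCGGGCCTC")
print(len(fragment))
print(round(gc_fraction(fragment) * 100))
```

Running the same two functions on the full Gene sequence field gives a length and a GC percentage that can be compared with the Gene Length and GC content values.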