Gene Information |
Locus tag | Ndas_3600 |
Symbol | |
ID | 9247469 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4313728 |
End bp | 4314939 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | |
Product | integral membrane sensor signal transduction histidine kinase |
Protein accession | YP_003681506 |
Protein GI | 297562532 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
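The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a hypothetical helper, not part of the database. It assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 4313728
END_BP = 4314939


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1212, matching "Gene Length"
print(protein_length(length_bp))  # 403, matching "Protein Length"
```

Under these assumptions both derived values agree with the reported 1212 bp and 403 aa.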
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.242845 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCAGCAC TACTCCTCCG CCCCGCGCGC ACCGACGTCT GGCTCGCGTT CGGCCTGCTC GCCGCCTCCA CGGGCAGCAC GGCGCTGGTC AGCGCGCTCC AGCGGGAGGC GGGGGCGGTC AACCCCTGGC CGTGGGGCCA CGCGCTGATC CTGGTGGCCT GCCTGGCCGT CCTGTGGCGC ACCGCCTACC CGTGCGTGGC CGGGCTGGTG TCGGTCGTCG CCTCCACCGT GTACTACCCC CTGGGCTTCC CCGACGGCAT GGTCATGCTG TGCACCGCCG TCATGCTCTA CACCCTGGTG CGCTGGGGGT ACCGCGTGTT CGGCTGGACG CTGGGCGTGG GCCAGTTCGT GGCCGTCAAC AGCTACGAGT TCCTCATGAC GGGGACCTTC CGGCCCGAGG CGATCGGCAT CGTCGGCTGG GTGCTCGTCC TGCTGTGCAC GGGCGAGGTC GTGCGCTGGC GGCAGGAGTA CCTGCGGGCC GACCACGAGC GCGAGGCCGA GTCCCTGCGC ACCCGGGAGG AGGAGCTGCT GCGCCGGGCC TCCGAGGAGC GCGTCCGGCT GGCCCGGGAC GTGCACGACA CGGTCGCGCA CAACATCTCC CTCATCAACG TGCAGGCGGG CACCGCCCTG TACCTGATGG AGACCGAGCC GCAGCGGGCC GCGGAGGCGC TGGCCACCAT CAAGCAGACC AGCAAGGACA CCCTGGTCGA GCTGCGCGGC ATGCTCGGCG TGCTGCGCGC CGTGGACGAG GCCGCGCCGA GGTCGCCGGT CCCCGGGCTG GACCGCCTGG AGGAGCTGGC CGAAGGCACC CGCCGCGCGG GGATCGACGT GGTCGTGGAG GTCAGCGGTG TGCCCGGGCA CCTGCCCGTG AGCACCGAGT CGGCCGCCTA CCGCACGGTC CAGGAGGCGC TGACGAACGT GGTCCGCCAC TCCGGGGCCT CCTCGGCGCG GGTGGGCATC GAGCACCGCG CGTCCTGGCT GGTCGTGGAG GTCACCGACG ACGGCGCCGG GACCGTCGGG CCGCCGGCGG CGGGCAACGG CATCACCGGG ATGCGCGAAC GCGCCGCCCT GGTGGGCGGC ACGGTGGACG CGGGCCCCCT GCGGGAGGGA GGATTCCGGG TGCGCGCCCG GCTCCCGCTC GACGGCGCCG GACCGGACCG CGGGGCACAG GCCGTCCCCT CCCCCGACAC CGACGAAACG AGGCCCCGTT GA
|
Protein sequence | MAALLLRPAR TDVWLAFGLL AASTGSTALV SALQREAGAV NPWPWGHALI LVACLAVLWR TAYPCVAGLV SVVASTVYYP LGFPDGMVML CTAVMLYTLV RWGYRVFGWT LGVGQFVAVN SYEFLMTGTF RPEAIGIVGW VLVLLCTGEV VRWRQEYLRA DHEREAESLR TREEELLRRA SEERVRLARD VHDTVAHNIS LINVQAGTAL YLMETEPQRA AEALATIKQT SKDTLVELRG MLGVLRAVDE AAPRSPVPGL DRLEELAEGT RRAGIDVVVE VSGVPGHLPV STESAAYRTV QEALTNVVRH SGASSARVGI EHRASWLVVE VTDDGAGTVG PPAAGNGITG MRERAALVGG TVDAGPLREG GFRVRARLPL DGAGPDRGAQ AVPSPDTDET RPR
|
| |
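The sequences above are displayed in space-separated blocks of 10 characters, so they need cleaning before any analysis. The following is a minimal sketch, assuming plain Python with no external libraries. The helper names are hypothetical, and the start codon set is a commonly used subset of the initiators allowed by translation table 11, not the full list. Note that this gene starts with GTG, which is read as methionine when it initiates translation, consistent with the protein sequence beginning with M.

```python
# Commonly used bacterial start codons (a subset of table 11 initiators; an assumption).
START_CODONS = {"ATG", "GTG", "TTG"}
# Stop codons under translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def has_start_and_stop(seq: str) -> bool:
    """Check that a CDS begins with a start codon and ends with a stop codon."""
    return len(seq) >= 6 and seq[:3] in START_CODONS and seq[-3:] in STOP_CODONS


# First and last blocks of the gene sequence, copied from the record above.
head = clean_sequence("GTGGCAGCAC TACTCCTCCG")
tail = clean_sequence("AGGCCCCGTT GA")

print(head[:3])                   # GTG
print(tail[-3:])                  # TGA
print(gc_fraction("GTGGCAGCAC"))  # 0.7
```

Applying `clean_sequence` and `gc_fraction` to the full gene sequence should give a value comparable to the reported GC content, although that calculation has not been run here.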