Gene Ndas_0443 details

Gene Information

Locus tag: Ndas_0443
Symbol:
ID: 9244282
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 535672
End bp: 537597
Gene Length: 1926 bp
Protein Length: 641 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_003678396
Protein GI: 297559422
COG category:
COG ID:
TIGRFAM ID:
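
The coordinate and length fields in this record agree with each other, and a short sketch can make the arithmetic explicit. It assumes 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(535672, 537597)
print(length)                  # 1926
print(protein_length(length))  # 641
```

Both values match the Gene Length and Protein Length fields of the record.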


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.324888
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGACCG AGGGATCCGG GCCCGCCCCG GACACCGGCG TGCACGGCGC GGCCTCGACC 
GGCGGCACGC ACTTCGTCAA CGGCACCAAC CGCGAGGGGA AGCGCTCGGC CCACCAGTCG
TGGAGCCTGC GGCGGCGCGT CACGAGCCTG CTGGCCGTGG TCGCCGTCGT CCTGGTCGTG
GCGGTGTCCG TCATCACCCT GGCGGCGTTC CAGGCCCGCG AGTCCCTGGC GCTCCAGGTC
GACTCCCTGA CACCCGCCGT GAGCGCGGTC GAGCAGACGC ACTCGGCCTA CCTCACCCAG
GACCACGCCC TGCGCGGCTA CATCCTCACC CAGGACCGCG AGTTCCTCCA GCCCTACGTC
GAGAGCCAGC TGACGCTGAC GGAGAACCAG GCCGTCCTGG CCCAGCTGGC CGAGGACAAC
CCCGAGGTGG CCACCAACGT CGACGACCTG CTCACCGCGG GCCGGGTGTG GACCGAGGAG
TTCGCCGAGC CCGCCTTGGA GCGGGTCAGC GGCGGCCAGG AGGTCACCCA GGAGGAGCTG
CGGCGCGGCC GGGTCCTCTT CCTGGAGCTC AGCCGGATCA GCGACGCCAC CACCAGCCAG
CTGGAGGCGG AGATCAAGGA GGCCCGCGAG GGCCTGACCC TGGCCACCCA GCAGGTCGTG
GCCCTGCTGG TCCTGGTCGG CCTGGTCGTG GTGTTCCTGT CGGTGTTCCT GTGGGTGATG
CTCCAGCAGT GGGTTCTGCG CCCCCTGGAG GAACTCGCCG GGCACATGCG CCAAGTGTCG
GAGGGCTACT ACGCCCACCG GATCTCCCTG CACGGCCCGC CCGAGATCGT CCGGCTCGGC
CAGGACGTGG ACGCCATGCG CGAGCGCATC GTGCAGGACC TGGACGAGGT CGCCTCCGCG
CGGCGCAAGC TCCAGGAGCA GTCCGTCCTC ATGGAGAACC AGACCGAGGA ACTGCGCCGC
TCCAACCTGG AGCTGGAGCA GTTCGCCTAC GTCGCCTCCC ACGACCTCCA GGAGCCGCTG
CGCAAGGTGG CGAGCTTCTG CCAGCTGCTC CAGCGCCGCT ACCAGGGGCA GCTGGACGAG
CGCGCCGACG CCTACATCGA CTTCGCGGTC GAGGGCGCCA AGCGCATGCA GACCCTCATC
AACGATCTAC TGGCCTTCTC CCGGGTCGGC CGGGTCAGGA ACTTCGCGCC GGTCGCCCTC
GACGACGCGC TGGACGACGC CCTGAGCAGC CTGTCCACCC GTCTGGAGGA GGCCGACGCC
GAGGTCACCC GGGACCCGTT GCCGACCGTG CAGGGCGACC GCACCCTCCT GACCCAGGTG
TTCTTCAACC TCGTGGGCAA CGCCGTGAAG TTCCGCGGCG AGGAGGACCC CCGGGTCCAC
ATCAGCGTCG AACGGCGCGG TGACGAGTGG GTGTTCTGCT GCTCCGACAA CGGGATCGGA
ATCGAACCGC AGTACGCGGA GCGCATCTTC GTGATCTTCC AGCGGTTGCA TACCAGGGAC
AAGTACACGG GAACCGGCAT CGGCCTGGCG ATGTGCAAGA AGATCGTGGA GTTCCACGGG
GGACGGATCT GGCTGGAAAC CGGCTCCCGG GACCCCGGGG AATCCGAAAC CTCAGGTGAC
CGGGACTCTG GTCGAACCGG AACGCGCATA TGCTGGTCCT TGCCCGCCGA CCCCGCGGAG
GACGAGGACC CGGCCCCCGA CAGGGGAACC GCCGAGCCCC TCGCCGTCGA TAACGGGGAC
GAGGGCACCG AGGACGCCGA GACCACCCCC GAGGACTCGG CTCCGACGGC CGCACGACAG
CCCACGGGTA CGGACAACCC TGGGGAGGAC GGCGCCGAAC CGGGCGACAG GCCCGGCGGC
GTCCGCTCCG CCCAGCCCGA CACCGGTGGC GGTACGGTTC CCCCCGGTCA CGGGGCGGGA
CCCTGA
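
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be cleaned before any calculation. The following is a minimal sketch, with hypothetical helper names, of how the blocks could be joined and the GC fraction computed for comparison with the GC content field. It is shown on a short toy string rather than the full sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Toy input in the same blocked layout as the record (not the real gene).
example = "ATGC GGCC"
print(clean_sequence(example))  # ATGCGGCC
print(gc_fraction(example))     # 0.75
```

Pasting the full gene sequence into `gc_fraction` would give a value to compare against the reported GC content.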
 
Protein sequence
METEGSGPAP DTGVHGAAST GGTHFVNGTN REGKRSAHQS WSLRRRVTSL LAVVAVVLVV 
AVSVITLAAF QARESLALQV DSLTPAVSAV EQTHSAYLTQ DHALRGYILT QDREFLQPYV
ESQLTLTENQ AVLAQLAEDN PEVATNVDDL LTAGRVWTEE FAEPALERVS GGQEVTQEEL
RRGRVLFLEL SRISDATTSQ LEAEIKEARE GLTLATQQVV ALLVLVGLVV VFLSVFLWVM
LQQWVLRPLE ELAGHMRQVS EGYYAHRISL HGPPEIVRLG QDVDAMRERI VQDLDEVASA
RRKLQEQSVL MENQTEELRR SNLELEQFAY VASHDLQEPL RKVASFCQLL QRRYQGQLDE
RADAYIDFAV EGAKRMQTLI NDLLAFSRVG RVRNFAPVAL DDALDDALSS LSTRLEEADA
EVTRDPLPTV QGDRTLLTQV FFNLVGNAVK FRGEEDPRVH ISVERRGDEW VFCCSDNGIG
IEPQYAERIF VIFQRLHTRD KYTGTGIGLA MCKKIVEFHG GRIWLETGSR DPGESETSGD
RDSGRTGTRI CWSLPADPAE DEDPAPDRGT AEPLAVDNGD EGTEDAETTP EDSAPTAARQ
PTGTDNPGED GAEPGDRPGG VRSAQPDTGG GTVPPGHGAG P
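
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below is a hand-rolled illustration, not the tool used by the source database. It builds the codon table in the conventional T, C, A, G base order and relies on the fact that table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons are permitted as starts). The name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGAGACCG AGGGATCCGG GCCCGCCCCG"))  # METEGSGPAP
```

The output matches the first block of the protein sequence, and the final six bases `CCCTGA` translate to `P` followed by a stop, which matches the last residue listed.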