Gene Ndas_0489 details

Gene Information

Locus tag: Ndas_0489
Symbol:
ID: 9244330
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 589633
End bp: 592131
Gene Length: 2499 bp
Protein Length: 832 aa
Translation table: 11
GC content: 76%
IMG OID:
Product: histidine kinase
Protein accession: YP_003678442
Protein GI: 297559468
COG category:
COG ID:
TIGRFAM ID:
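
The start, end, gene length, and protein length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the gene length includes one terminal stop codon. The record states neither assumption explicitly, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 589633
END_BP = 592131
GENE_LENGTH_BP = 2499
PROTEIN_LENGTH_AA = 832


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def protein_length_from_cds(cds_length_bp):
    """Residue count of a complete CDS, assuming one stop codon is included."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print("span matches gene length:",
          span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print("CDS matches protein length:",
          protein_length_from_cds(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions the fields agree with one another: 2499 bp is 833 codons, one of which is the stop codon.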


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCATCATC CCGTGCATCC GCACTCGCAG AATTCCGGCC GCGCGTCGGA CCGGGACAGC 
GCCCAGCCCA CGGGCGGGTC CGCTCCCGAG TCGGGCGCGA CGGAGCGCGG CGACGCCGCG
CCCCCCGCGA CCGGCCCCTT CTCCGGCTCC GCCGAGAACG GGTACCGACC CGCCTCCAGG
ACCAGCGTCC CCAACCCCTA CACGACCCGG CGGGTGAACG TCGACGACGT GGACACCGGG
GCCTCCCCGG TCAACCAGTG GCGCGGCGTG GGCCTGGACG AGCCCGAGCC GCGTACCCCA
CCCATCACCG CCACCACGGA CACCATGACT GACACCTCCT CACCGACAGA GCGCGCGGAG
CGGCCCCAGG CCCCCGCGGC GCAGACGCAG GCGGCGCCCG CGGACACCAC GCCGCAGCCG
GCCCCGCGGC CCGAGCAGGC GGGGCGGCAG CGGGCCGAGG CCCCGCACGG GCGGACGCGG
CTCCCCGAGG AGCAGGGCCG CTCGCCCCGG CCCGAACAGG CGGACTCCCA CCGACAGGAC
CCGCCGCAGG CCGCTGACGA CCACCGCCGG ACCGAGCAGC CGCAGGCCGA CGCGCCCCAG
GCCGACGACC ACCGCCAGCC GGAGCAGCCG CAGGCCGCCG CGCCCGCCCA GGGCCACCAG
CAGCCTCCCC AGGCGTGGCC CGCGCCCCAG CCCTACCAGG TGCCGGGCGA GCGCCCCGCG
GCCCCGCAGG TGCAGGGCCA GCACGGGTAC GCCCAGCAGC CGGTCCCCGG TCCGGTCGGC
CACGACCCCT ACGCGGCCCC CGCGCCCTAC CAGGCGCCCC CCGGCGCGCA GTACGGCCCC
TACCACCCGG CCGGGCAGGA GCCCCACCAC CCCTACCCGC CGCACCCGCA GGCCGCCCCG
GCGCCCTACC CCGCCGCGCC GCCGCAGCAC GGCGGTTACG CGCCCCAGCC GCAGGGCCAG
CACCCCTACC CGGCGCAGCC CGCGCACTAC CCGGCGCAGT ACCCGCCCCA GCCGCTGGTC
ACCTACCAGA GCGCCTACCC GGCGCCCTTC CAGCCCCCCG GCTACGTCAT GCACGGCCAG
CCGATGTTCT ACCCGGGCCA GTACAACCCC TACGGCCAGC CGGTCCAGCA CATCATCGTG
CTCACCGCCG GACAGCAGCC GCAGGTGATC AGCGCCGAGT CGGCCAGCAC GCTCTTCGCC
GAGCACGCCG AGACCCGGAC CGACTCCGCC GAGCGCGAGG ACCGGGCCCT GACCCGCGCG
GGCTCCGGCA CGGACGCGGG CAAGGAGCAG CCCGAGCGGG AAGAGGAGCG GGAGGCCCCG
GCCGAGGCCG CCGCGGAGCC CGCCGAAGCG GCTCCCGAGG CCCCCGCCGA GACGCTGACC
GAGCGCAGGG CGGCCCTCAT GGACGCGATC GCCCCCGCCG CCGCGGCCGC CCCCGCGCCC
TCCGAGGGCG GCGACAGCCT CCTCAACGAG GCCCTCGCGG GTCTGGCCAT GCGCGACCTG
TCGCTCGTGG ACGCCCTGCT GGAGATGGTC GAGGAGCTGG AGACCGACGC CCAGGACCCC
GACCTGCTCG ACAAGCTCTT CCAGATCGAC AACTTCGCGA CCCGTATGCG GCGCAACGGG
GAAAACTTCC TGGTGCTCAC CGGACATGAC GGCGGCGAGT CCGACGCCCA CGACGAGATC
GTCCCCCTGC TCGACGTGGC CCGCGCCGCC ACCTCCGAGA TCAAGGACTA CCCCCGCGTC
CGGATGGGCA AGATCCCGCA GACCTCCATC ACCGGTATGG CGGCCGACGA CATCAGCCAC
CTGCTCGCCG AACTGCTGGA CAACGCCACC GCCAACTCGC CCGAGCACTC CCAGGTCGTC
ATCAGCGCCC AGCAGATGAC CGACGGGCGC CTCATGATCG TCGTCGAGGA CGAGGGCGTG
GGCATCCCCG AGGGGCAGCT CGCCGAACTC AACGACCGCC TCGCCGGCGA ACCCGTCCTG
GACGCCGACG TGCCCCGCCA CATGGGCCTG TACGTCGCCA GCCGGATCGC CAAGAAGCAC
GGGCTGGAAG CCAGGCTCGA ACCCCGATCC TTCCGCGGGG TGAGCGCCTA CGCGATCATC
CCCAAGGAGC TGCTGCGCGT GGCCACGCCG CGAACCCCCG GACAGGCCCG CACCTCGACC
GTCCCGGCGA GCGCCCCGCC CGCCAAGCCC GTCGCGCCCG CCCCGGCTCC CGCCCGCCCC
GCCGCCAACG GCACGGGCAG GCCCTCCGCG GCGAGCGGTT CGGCGGTGAC CTCCGCCGGG
CTGCCCCGGC GCAGCGCCAC CCCGCACGGA TCGCCCCTGC GGATGATGCC CCGCCCCGGG
CAGAAGCCGG ACGCCGCGCC CCAGGCCCGC GAGGACACCC CGCCCAGGCT CACCGGCGAG
GCCCGGGCCG AGCAGATCCG CGACGAGCTC GGCGACTTCT TCGACGGCGA GCGCGAGGCC
CGCGAGGGCG GCGACGAGCC CAAGGACGAC AAGAAGTGA
 
Protein sequence
MHHPVHPHSQ NSGRASDRDS AQPTGGSAPE SGATERGDAA PPATGPFSGS AENGYRPASR 
TSVPNPYTTR RVNVDDVDTG ASPVNQWRGV GLDEPEPRTP PITATTDTMT DTSSPTERAE
RPQAPAAQTQ AAPADTTPQP APRPEQAGRQ RAEAPHGRTR LPEEQGRSPR PEQADSHRQD
PPQAADDHRR TEQPQADAPQ ADDHRQPEQP QAAAPAQGHQ QPPQAWPAPQ PYQVPGERPA
APQVQGQHGY AQQPVPGPVG HDPYAAPAPY QAPPGAQYGP YHPAGQEPHH PYPPHPQAAP
APYPAAPPQH GGYAPQPQGQ HPYPAQPAHY PAQYPPQPLV TYQSAYPAPF QPPGYVMHGQ
PMFYPGQYNP YGQPVQHIIV LTAGQQPQVI SAESASTLFA EHAETRTDSA EREDRALTRA
GSGTDAGKEQ PEREEEREAP AEAAAEPAEA APEAPAETLT ERRAALMDAI APAAAAAPAP
SEGGDSLLNE ALAGLAMRDL SLVDALLEMV EELETDAQDP DLLDKLFQID NFATRMRRNG
ENFLVLTGHD GGESDAHDEI VPLLDVARAA TSEIKDYPRV RMGKIPQTSI TGMAADDISH
LLAELLDNAT ANSPEHSQVV ISAQQMTDGR LMIVVEDEGV GIPEGQLAEL NDRLAGEPVL
DADVPRHMGL YVASRIAKKH GLEARLEPRS FRGVSAYAII PKELLRVATP RTPGQARTST
VPASAPPAKP VAPAPAPARP AANGTGRPSA ASGSAVTSAG LPRRSATPHG SPLRMMPRPG
QKPDAAPQAR EDTPPRLTGE ARAEQIRDEL GDFFDGEREA REGGDEPKDD KK
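
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any computation such as re-deriving the reported GC content. The helpers below are a minimal sketch of that step; the function names are hypothetical, and the sample string is only the first row of the gene sequence, not the whole gene.

```python
def clean_sequence(text):
    """Remove all whitespace (block spaces, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_percent(text):
    """Percentage of G and C bases in a whitespace-formatted sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First row of the gene sequence as printed in this record.
    first_row = ("GTGCATCATC CCGTGCATCC GCACTCGCAG "
                 "AATTCCGGCC GCGCGTCGGA CCGGGACAGC")
    print(len(clean_sequence(first_row)), "bases in the first row")
    print(f"GC content of the first row: {gc_percent(first_row):.1f}%")
```

To compare against the GC content field, pass the complete gene sequence text to `gc_percent` instead of the single sample row.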