Gene Ndas_1124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1124 
Symbol 
ID9244974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1380390 
End bp1381658 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content77% 
IMG OID 
Productintegral membrane sensor signal transduction histidine kinase 
Protein accessionYP_003679071 
Protein GI297560097 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.146653 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACACGA CACCGGCCGC GGACGGCGGC ACCGCGTCCG GCAGCGCGAC CGGGGAGCGC 
CGGGGCCCGC GGGGATGGGC GGCGGACGCC CTGCTCTTCC TCGTCGCCGG GCTGGTGTGG
GCCTACTACG TCCTGGCCTA CAGCGTGGGC CTGCACCCGT ACGTGCCCTC CTGGCTCGTC
CTCCCGGACC TCGCCCTGGG CGCGGTCGGC TGCCTCGCGC TGTGGTGGCG CCGCGGCCAC
CCCCTGGGCG TGGCCGTCCT GCTGTGCCTG GTCCAGAGCG CGTCCAGTTC GGTGACCGCG
GCCCTGGTCG TCGCCCTGTT CAACCTCGCC GTCCGCCGAC CCTGGCGGCA GGCCGTGGTG
ACGGCGATGG CGAGCCTCAT CCTGGGCCTG CCCTGGATCG TCATGGTCCC GCCCACCCGG
GGCGACAGCG TCGTCGTCCT GATCATCGCC GCACTGCTCT TCACGGGCAG TATCGGCTGG
GGTGTGGCCA TCCGGACCCG GAGCCAGCTC ATCGAGCGGC TGCGCGCGGA CGTGCGGCGC
GAACGCGAGG ACCGGGGGCG GCGCCTGGCC GCCGCCCGGA CCGAGGAGCG CCAGCGCATC
GCCCGCGAGA TGCACGACGT GGTCGCCCAC CGCATGTCGC TGCTGTCGGT GCACGCCGGT
GCGCTCGCCT ACCGGACCGA GCGCGCGGAG CGGGGCGAGG CGCCGCCGCT GGAGAACGCC
GAACTCGGCG CGGCCGTGCG CGTCATCCGC GACAACGCCC ACCAGGCGCT GGAGGAACTG
GGCGGCGTCC TGTCGGTGCT GCGCGCGGCC GACACCGCCC CGGGGGAGGA GGGCGACCAC
GCGGGCACCG CCGCGCCCCA GCCGCCCGCC CTGGCCGAGG TGACCCGCCT GGTGGAGGAG
GCGGTCCGGG CGGGGCAGCG CGTGCGGTCC GTCCACGACG TTCCCGAAGG AGCCGAACCG
CCCGGCCAGG TGCGCCGCAC CGCCTACCGG CTGGTCCAGG AGGGGCTGAC CAACACCCGC
AAGCACGCGC CCGGGGCCCG GGTGGACGTG CGGATCAACG GTGCCCCCGG TCGCGGCCTG
GAGGTGTCGG TGGTCAACCC GCTGCCGGTG GGGGTGGCGC CCGCCGAGAT CCCCGGGGCC
GGGGCGGGCC TGACCGGCCT GTCCGAGCGG GTCGCCCTGG ACGGCGGAAC CCTGCGCCAC
GGGCCCGAGG GCGGGGAGTT CCGCCTGCTG GCCACGCTGC CCTGGCCCGA GACGGCGGAG
GTGCCATAG
 
Protein sequence
MDTTPAADGG TASGSATGER RGPRGWAADA LLFLVAGLVW AYYVLAYSVG LHPYVPSWLV 
LPDLALGAVG CLALWWRRGH PLGVAVLLCL VQSASSSVTA ALVVALFNLA VRRPWRQAVV
TAMASLILGL PWIVMVPPTR GDSVVVLIIA ALLFTGSIGW GVAIRTRSQL IERLRADVRR
EREDRGRRLA AARTEERQRI AREMHDVVAH RMSLLSVHAG ALAYRTERAE RGEAPPLENA
ELGAAVRVIR DNAHQALEEL GGVLSVLRAA DTAPGEEGDH AGTAAPQPPA LAEVTRLVEE
AVRAGQRVRS VHDVPEGAEP PGQVRRTAYR LVQEGLTNTR KHAPGARVDV RINGAPGRGL
EVSVVNPLPV GVAPAEIPGA GAGLTGLSER VALDGGTLRH GPEGGEFRLL ATLPWPETAE
VP