Gene Ndas_3125 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3125 
Symbol 
ID9246981 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3740918 
End bp3741973 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content77% 
IMG OID 
Productputative signal transduction histidine kinase 
Protein accessionYP_003681040 
Protein GI297562066 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCGTGT GGACGGCGTT CACCACCCGC GCCTACGCCC GCCCGCACCT GCGCGACTGG 
CGCGTGCTCA CCGCCGACCT CGCGGTGGCC TTCGCGTGCC TGTTCGCCAC CGCCCTGGCG
GCCACGCCGT TCTACCTCAC CCAGGCCCCG CCGCTGAGCG GCTACTGGTT CGCGGGCACC
GCGCTGGCGG CCAGCGTCAT CTGGGGCGGG CGCGCCGCCG CGGCCGTCGC CGTGCTCTAC
GGTCTCGCCG ACCTCACCCT GCGCACGGTC ATGGACGCCG CCGTGACCGC CGCGACCGCC
CGCGGCGTGG TCCTGCTGCT CCTGGCCGGC CTGGCGGTCG GATACATGTC CTGGATGTCG
GAGCGGGCCG AACGGCGGTT CGCGCAGGCG GTCGCGCTGG AGGCGCGCAC CCGCGAGCGC
GAGCAGCTGG CCCGCTCGAT CCACGACTCG GTGCTCCAGG TGCTGTCCCT GGTGAGCAGG
CGCGGCGCCG AGGCCGGGGG AGAGGCCGCC GAACTGGGCC GGATGGCCGG TGAGCAGGAG
GCGCGCCTGC GCGCGCTGGT CGCGATCGGA TCGTCGGAGG ACGCCTCCGG CGGGACGGAC
GGGATGAACG GGACCGGCGG GGCGGCGCCG CCTCCCGCGG GGCGCACTGC CGCGGCAGGA
ACCGGGGACG CGGTGGACCT GCGCGAGCCG CTGCGCCGCG CCGAGTCGGC CCGCGTGTCG
GTGTCCGCGC CCGCCACGCC CGTCGTCCTG CCCGCGCACA CCGCCGCCGA ACTCGCCGCC
GCCGTGCTCG CCGCCCTGGA CAACGTGGAG CGGCACTGCC CCGAGGGCAC GCGCGCCTGG
GTGCTGGTGG AGGACGAGGA CGACGCGGTG ACCGTGTCCG TGCGCGACGA GGGCCCCGGC
ATCGAGCCCG GCCGCCTGGA GCGGGCCCGC TCCGAGGGCC GCATCGGCGT GGCCCAGTCC
GTGCGGGGCC GTGTGCGCGA CCTGGGCGGC ACCGTCGAGT ACGTGTCGGT CCCCGGCCAG
GGCACCGAGG TGGAGATGCG GGTCCCGCGC CGCTGA
 
Protein sequence
MVVWTAFTTR AYARPHLRDW RVLTADLAVA FACLFATALA ATPFYLTQAP PLSGYWFAGT 
ALAASVIWGG RAAAAVAVLY GLADLTLRTV MDAAVTAATA RGVVLLLLAG LAVGYMSWMS
ERAERRFAQA VALEARTRER EQLARSIHDS VLQVLSLVSR RGAEAGGEAA ELGRMAGEQE
ARLRALVAIG SSEDASGGTD GMNGTGGAAP PPAGRTAAAG TGDAVDLREP LRRAESARVS
VSAPATPVVL PAHTAAELAA AVLAALDNVE RHCPEGTRAW VLVEDEDDAV TVSVRDEGPG
IEPGRLERAR SEGRIGVAQS VRGRVRDLGG TVEYVSVPGQ GTEVEMRVPR R