Gene Ndas_4921 details


Gene Information

Locus tag: Ndas_4921
Symbol:
ID: 9248808
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 54525
End bp: 55763
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: integral membrane sensor signal transduction histidine kinase
Protein accession: YP_003682810
Protein GI: 297563837
COG category:
COG ID:
TIGRFAM ID:
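
The Start bp, End bp, Gene Length, and Protein Length fields above are consistent with each other, and the arithmetic can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a stop codon that is not counted in the protein length; neither assumption is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 54525
END_BP = 55763


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Length in aa, assuming the final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, "bp")                   # compare with Gene Length
print(protein_length(length_bp), "aa")   # compare with Protein Length
```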


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0892179
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGAC GGAGCACCGG GGCGGACGCC GTGCGGGACT GGGACCGGTG GTGGGACGGG 
TTCCTCGCCG CCGCCCCCTG GGCCCTGCTG CTGCCCTCGG CGGCGCTGGT CCTGTCCCGG
CCCGTGGCGG ACTGGACCGA GCACGCGGTC ACCGTCGGCC TGACCGCGCT CACCGCCGCC
TGGGTGCTCC TCGGGTACAC GCCGCCGTGG CGGGACGGCC GACCGCTGCC CGCCGGGATC
CACTTCGCGG TGCTCCTCGC GCTGGCCTCG GCGCTGATGG CGCACGACAC CCTGTTCATG
ATGTTCACGG TCTCGCTGTT CTTCCGCGCG ATGGCGCTGC CCCGGCGGCT GACCCTCGTC
GGGATCGCGG CCACCGCGGT CGCGCTGTAC ACGAACACGA TGGGATTCCC CGGAACGGAC
ACGGAGCGGC CGTTCCTGGA GCACCTGTTC GTCTACGTCG GCGTGATCAC CATCCAGACC
GTGGCCGTCG GCGGAGGGCT CGTGATCGCG TCCAAGGCGG CCGAACAGCA CCGGGAGCGC
CGGGTGACGG TGGCCAGGCT GGAGGCGGCC CTGGAGGAGA ACGCCGGGCT GCACGCGCAG
CTCCTCACCC AGGCCCGGGA GGCCGGGGTG TTGGACGAGC GCCAGCGGAT GGCACGGGAG
ATCCACGACA CCCTGGCCCA GGGGCTCGCC GGGATCGTGA CGCAGATCCA GGCGGCGCAG
CGGGTGTGGG AGGACCCCGG GGCGGCGCGT CCGCACGCGG ACCGCGCGCT CGGCCTGGCG
CGGGAGAGCC TGGCCGAGGC CCGGCGCTCC GTGCAGGCCC TGCGCCCCGG GCAGCTGGCG
GAGAGCCAGC TGCCCGAGGC CCTGGGCGAA CTCACCCGCC GCTGGGCGGA GGAGCACGGC
GTCCGTCCCG ACCTCGACGT CACCGGCGAA CGCGTCGCAC TGAGCCCGGC GATCGAGGTG
GTCCTCTTCC GGGTGGCCCA GGAGGCGCTC ACCAACGTCG CCAGGCACGC CGACGCCTCG
CGCGTGGGCG TGACCCTGTC CTACTCCGAC GACGTCGTCC TGCTCGACGT GCGCGACGAC
GGCAGGGGCA TCACGGGACA CAACCAGCAC GGGTTCGGGC TCAGCAGCAT GCGCCAGCGC
GTGCGCGGGA TCGGAGGAGC GTTGGAGATC GAGAGCGGCG AGGGGGAGGG CACCGCCGTC
AGCGCCACGG TGCCCGCGAT TCCGGTGGGG GCGGCGTGA
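
The GC content reported for this gene can be recomputed from the gene sequence. The sketch below is one straightforward way to do it: it ignores the spaces and line breaks used to group the bases and reports G plus C as a percentage of all bases. The function name `gc_percent` is hypothetical, and the record does not say how its own figure was computed or rounded, so small differences are possible.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# Example on the first line of the gene sequence only; paste the full
# sequence here to compare the result with the GC content field.
first_line = "ATGAGCCGAC GGAGCACCGG GGCGGACGCC GTGCGGGACT GGGACCGGTG GTGGGACGGG"
print(f"{gc_percent(first_line):.1f}%")
```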
 
Protein sequence
MSRRSTGADA VRDWDRWWDG FLAAAPWALL LPSAALVLSR PVADWTEHAV TVGLTALTAA 
WVLLGYTPPW RDGRPLPAGI HFAVLLALAS ALMAHDTLFM MFTVSLFFRA MALPRRLTLV
GIAATAVALY TNTMGFPGTD TERPFLEHLF VYVGVITIQT VAVGGGLVIA SKAAEQHRER
RVTVARLEAA LEENAGLHAQ LLTQAREAGV LDERQRMARE IHDTLAQGLA GIVTQIQAAQ
RVWEDPGAAR PHADRALGLA RESLAEARRS VQALRPGQLA ESQLPEALGE LTRRWAEEHG
VRPDLDVTGE RVALSPAIEV VLFRVAQEAL TNVARHADAS RVGVTLSYSD DVVLLDVRDD
GRGITGHNQH GFGLSSMRQR VRGIGGALEI ESGEGEGTAV SATVPAIPVG AA
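
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that translation, not the tool used to build this record. It uses the standard codon assignments, which translation table 11 shares; table 11 differs from the standard table only in which codons may act as start codons, and this sketch does not model that. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in the conventional TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGAGCCGAC GGAGCACCGG GGCGGACGCC"))
```

Applied to the full gene sequence, the output can be compared against the protein sequence listed above.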