Gene Ndas_4858 details

Gene Information

Locus tag: Ndas_4858
Symbol:
ID: 9248744
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 5758783
End bp: 5759931
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: integral membrane sensor signal transduction histidine kinase
Protein accession: YP_003682747
Protein GI: 297563773
COG category:
COG ID:
TIGRFAM ID:
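
The length fields above follow from the coordinates by simple arithmetic. The sketch below is only an illustration of that relationship, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(5758783, 5759931)
print(length)                  # 1149
print(protein_length(length))  # 382
```

Both values agree with the Gene Length and Protein Length fields in this record.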


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.935263
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.172343
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCAGTG ACCGCCGGGG CGTCGTGGTC GACGCGCTCT TCGGGTTCGC CGTCTTCGCC 
GTGGTGGCGA TCGCGGTCGC CTCCGACGTC AGCGACACGG GGAACCTGCC GCTCGGGTAC
GGGGTGGCCG GGGTCCTCGG CGCGCTCATG CTGGTCCGGC GCCGCCGACC GGTCGCGGTC
CTGGTCGCCA CCTCGGTCCT GGTCGTCGCC AACTACGTGC TCCAGCTGCC CGTCATCGGC
CTGGCCGTCC CGGTGGCCGC AGCGCTGTAC TCGGCCGCGG AGGCGGGACG CACCCTGTGG
TCGGTGGGCC TGGCCGCCGC GCTGGTCCTG GTCTCCACGT TCACCCGGTT GCAGCAGGGG
CAGGACCCCG CCTACCTCCT GGGGTACGAG CTGCCCACCA CGGTGGCCGT CATGGGCGCG
GCCACGGCCC TGGGCCACGT GCAGTGGCAG CGGCGCCGGG CCGAGCACCA GCGCGTGCGG
ATCGAGGTCC TCGACCGGCA GGCGCGCGAG TCCGAGGCGG CCGAGCGCCA GGCGCTCGAA
CGCACCCGTA TCGCCCGCGA CCTGCACGAC GCCGTGGGGC ACCACCTCTC CGTGGTGTCC
CTGCACGCGG GGGTGGCCGC GGAGGCCCTG GACGACGACC CGGCGGACGT GCCCGTCGCG
CGAGCCGAGC TGGATCACGT GGCCAGGGCC TCGCGGGCGG GTCTGAGCGA GCTGCGGGCC
ACCGTGCGCG CGCTGCGCGA GGCCGACCCC GGCGGCGACC GGGTCTCCTC CCTCGCGCAC
CTGGACGAAC TCGTGGCGAC GGTGCGCGCG GCCGGGGTGG ACGTGGAGGT GACCGGGGTT
CCCGCTCCCG GCGAGGTGCC CGGGATGGTG GACGCGACCG CCTACCGGAT CGTCCAGGAG
GCGCTGACGA ACACGCTGCG CCACGCGCGC GCCCCCCGGG CGCGGGTGGC CTTCTCCAGG
AGGGACGGGA TGCTCGACGT GACGGTGACC GACGAGGGGA CGGCCGTGCC GGTCGCGTCC
CCCGGGGGCA GCGGGCTGGC GGGGATGCGC GAACGGGTGC GCCTGGTCGG CGGATCGGTG
GAGGCCGCGC CCCGGCCCGG CGGCGGGTTC GGGGTGCGTG CGCTGCTGCC GCTCGGGGAC
GACCGGTGA
 
Protein sequence
MPSDRRGVVV DALFGFAVFA VVAIAVASDV SDTGNLPLGY GVAGVLGALM LVRRRRPVAV 
LVATSVLVVA NYVLQLPVIG LAVPVAAALY SAAEAGRTLW SVGLAAALVL VSTFTRLQQG
QDPAYLLGYE LPTTVAVMGA ATALGHVQWQ RRRAEHQRVR IEVLDRQARE SEAAERQALE
RTRIARDLHD AVGHHLSVVS LHAGVAAEAL DDDPADVPVA RAELDHVARA SRAGLSELRA
TVRALREADP GGDRVSSLAH LDELVATVRA AGVDVEVTGV PAPGEVPGMV DATAYRIVQE
ALTNTLRHAR APRARVAFSR RDGMLDVTVT DEGTAVPVAS PGGSGLAGMR ERVRLVGGSV
EAAPRPGGGF GVRALLPLGD DR
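
As a rough cross-check of the two sequences, the gene can be translated and its GC content computed. This is a minimal sketch, not the database's own method: the helper names are hypothetical, whitespace in the 10-base blocks is stripped before processing, and the codon table is the standard amino acid assignment used by NCBI translation table 11. Alternative start codons that table 11 allows are not handled, which is harmless here because the gene begins with ATG.

```python
from itertools import product

# Amino acids in NCBI order for codons enumerated over T, C, A, G
# (same assignments for translation tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGCCCAGTG ACCGCCGGGG CGTCGTGGTC"))  # MPSDRRGVVV
```

The ten residues printed match the start of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.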