Gene Ndas_3744 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3744 
Symbol 
ID9247613 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4495751 
End bp4497568 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content72% 
IMG OID 
Productintegral membrane sensor signal transduction histidine kinase 
Protein accessionYP_003681648 
Protein GI297562674 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.404155 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCAG CCACACCAGA GGAGGAGCGG GGGACCGCCT CCGACACCCA GATCCCGCCC 
TCCGGGCCGT CCGCGGCCGG AGGGCGCTGG CGCGACGGCG GCACGGGGCT CACCCGCTCC
TACGCGCTCG CCCAGCGCGC CGCCCGCGCC CTGGTGCGCG GCGTCCACAG CAACTGGCGC
CGCTCGCTGC ACCTGCGGGT CATCTCCACC ACCCTGGTGC TGTCCCTGGT GGTCATGGTG
GGCCTGGGGT ACGTGCTCAT CTCCCAGGTG CGCGGCGGCC TGCTCGACGC CAAGATCTCC
ACGGCCGTCA CCGACCACCG CGCCGGGCTC AACTACGCGT CCGCCGAGCT CCAGGAGAAC
GAGTCGGGCG ACCGCAACCG GCTCATGTAC AACCTCGCCA ACGAGCTCAC CAGCCGCAGC
GGCGACACCG GGCTGTACAG CGTGGTCATC CTGCCCTCGG TGGGCGGAGA GGTCGGCTGG
GCCACCGGCG AGGGCAGCGT CCCGCAGCGG CTCATCAACC AGGTCCACGA GTCCGAGGTG
GACGAGCAGC AGTACCACAC GTACACGCGC ATCACCACCG ACCAGGGCGA GGAGCCCGCG
CTGGTGGTCG GCGCGCAGCT CACGCGCGCC TACGAGCTGT ACTACATCTT CCCGCTCCAG
CACGAGCAGC AGATCCTCGA CCTGGTCCAG GGCACGGTCG GCCTGGTGGG CGTGCTGCTG
GTCATCCTCC TGGGCCTGAT CGCCTTCGTC ATCACCCGGC AGGTGGTCAG CCCGGTCCGC
TCGGCCGCCC AGTCCGCCGA ACGCCTCTCC TCGGGCGACC TCACCGAACG CATGGCCGTG
CACGGCGAGG ACGACCTCGC CCGGCTCGCC CTGTCCTTCA ACGACATGGC GGGCAACCTC
CAGGAGAAGA TCCAGGAGCT GGAGGAGCTG TCCAAGCTCC AGCGCCAGTT CGTCTCCGAC
GTCTCCCACG AGCTGCGCAC CCCGCTGACC ACCATCAAGA TGGCCGGGGA CGTGCTCTTC
GACGAGCGCG AGGAGCTCGA CCCCACCATG CGCCGCTCGG TGGAGCTGCT GCAGAGCCAG
GTGGAGCGGT TCGAGGAACT GCTCTCCGAC CTGCTGGAGA TCAGCCGCCA CGACGCCGGG
GCCGCCACCC TGGGCACCGA GTCGCTCGAC ATCCGCGACG CCGTCATGAA GGCCGTCGGC
GACGCCGAGC AGATCGCCGA GCGGCGCGGC ATCAAGGTCG TGCTGCGCCT GCCCACCGAC
CCCTGCACCG CCGAGTACGA CGGCCGCCGC ATCAACCGCA TCCTGCGCAA CCTGGTGGTC
AACGCCATCG AGCACAGCGA GGGCCGCGAC GTGGTGGTCA CCGCCGCGTG CGACCGCGAC
GCGGTCGCCG TGGCCGTGCG CGACTACGGG GTGGGCCTCA AGGAGGGGGA GGAGCACCTG
TGCTTCGACC GCTTCTGGCG CGCCGACCCG GCCCGGGTGC GCACCACCGG CGGCACCGGC
CTCGGGCTCT CCATCGCCAA GGAGGACGCC ACCCTGCACG GCGGCTGGCT CCAGGCGTGG
GGCCAGCCCG GCCAGGGCTC CCAGTTCCGC CTGTCCCTGC CGCGCCGCTC CGGCAGCGAA
CTGCGCGGCT CCCCCCTGCC GCTGGTGCCG CCGGAGTTCG CGCTGGGCCG CACCTACACC
ACCTTCGCCG ACAAGGGTGA GAACGGCAAC GGCGCCGCCC GCCCCGAGAG GGTGCCCGCC
AGGAACGGCG CGGACGCCGC GGAGGAGGGC GCGGCCGCCG CCCGCGCCGA GCAGCGCGAG
GACGAGGAGA AGCTGTGA
 
Protein sequence
MTAATPEEER GTASDTQIPP SGPSAAGGRW RDGGTGLTRS YALAQRAARA LVRGVHSNWR 
RSLHLRVIST TLVLSLVVMV GLGYVLISQV RGGLLDAKIS TAVTDHRAGL NYASAELQEN
ESGDRNRLMY NLANELTSRS GDTGLYSVVI LPSVGGEVGW ATGEGSVPQR LINQVHESEV
DEQQYHTYTR ITTDQGEEPA LVVGAQLTRA YELYYIFPLQ HEQQILDLVQ GTVGLVGVLL
VILLGLIAFV ITRQVVSPVR SAAQSAERLS SGDLTERMAV HGEDDLARLA LSFNDMAGNL
QEKIQELEEL SKLQRQFVSD VSHELRTPLT TIKMAGDVLF DEREELDPTM RRSVELLQSQ
VERFEELLSD LLEISRHDAG AATLGTESLD IRDAVMKAVG DAEQIAERRG IKVVLRLPTD
PCTAEYDGRR INRILRNLVV NAIEHSEGRD VVVTAACDRD AVAVAVRDYG VGLKEGEEHL
CFDRFWRADP ARVRTTGGTG LGLSIAKEDA TLHGGWLQAW GQPGQGSQFR LSLPRRSGSE
LRGSPLPLVP PEFALGRTYT TFADKGENGN GAARPERVPA RNGADAAEEG AAAARAEQRE
DEEKL