Gene Ndas_2086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_2086 
Symbol 
ID9245936 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2506518 
End bp2507687 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content77% 
IMG OID 
Productintegral membrane sensor signal transduction histidine kinase 
Protein accessionYP_003680018 
Protein GI297561044 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.724108 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCCGGT ACTGTCGGGC GATGCACCGC ATCACCGGCC GCCACGCGTA CGCCGCCGAC 
CTGGCGGCGG CCCTCGTCCT GACCGCGGTC TACGCCGGAT TCGCGCACCT GTCCCCGGTC
GACGGGCAAC CCGCCTACGA CGGCCCGGTC TGGCTCCCCT GGGCCGTCGC CGCGGCCGTG
GGCCTGCCCG TCGCGGTCCG CCGCCGCTGG CCGCTGCCCG TGCTGGGCAC GGTAATGGCC
GCCCTCACCG CCGCCACCCT CCTGGACCTG ACCCGGGAGC CCTACACCGC GGCGGGTCTG
GCCGCCTACC TGGTGGGGTT GGCCGAGCCG GCCCGCCGTG CGGTCCCGGC CCTGGTCGCC
GCGCTGGCGA CGGCCGCCGC CGGAGTGTAC GTCGGGGAGG CGGTCGTCAC CCCGGCGGGG
GACCGGCAGG ACGCGGTCGG CCTGGCCTCC CTGGTGGTGC TGGTGGTCGG CGGCGCCTGG
GCGGCGGGCT TCGCCGTGCG CTCCCACCGG GCCCGGGGGC GGCGGCGGGC CGAGCGGGCG
CTGACCGAGG AACGGCTGCG CATCGCCCGC GACCTGCACG ACGTCGTCTC GCACAACCTC
GGCCTGATCG CCGTCAGAGC GGGTGTGGCC GCACACGTGG CGGAGGCTGA CCCACGCGAG
GCCCGGGTCG CGCTCAGGGA CATCGAGGAG GCCAGCAGGT CCGCTCTGAC GGAGATGCGC
CGCGCCCTGG GGGTGCTGCG CACCGAACAG GCCCCGCTGG CCCCGGCGCC GGGCCTGGAC
GGTCTCGACG GACTCGCGCG GGACGCCCGC AGGGCCGGGG TCGACGTGCG CCTGACGGTC
CGCGGCATGC GGGGTGTTCC GGAGGGCACC CGCCTCATGG CGTACCGGAT CGTGCAGGAG
GCCCTGACCA ACGCGGTCCG GCACGCGGCT CCGACCCGGT GCGAGGTGAC CGTCGCCTCG
GACGGCGCGG CGGTCGACAT CGAGGTGGTC GACGAGGGGC CCGCGGAGGG CTCCCGTCGC
CCGCCCGGGG GTCCCACGGG CGGACACGGC CTCCTGGGCA TGCGGGAACG GGCGATGATG
TGCGGGGGCG CTTTCACCGC GGGACCCCGT CCGCAGGGCG GTTTCGCGGT GGCCGTACGA
CTGCCGACCG GACAGGAGAG CACGCCGTGA
 
Protein sequence
MGRYCRAMHR ITGRHAYAAD LAAALVLTAV YAGFAHLSPV DGQPAYDGPV WLPWAVAAAV 
GLPVAVRRRW PLPVLGTVMA ALTAATLLDL TREPYTAAGL AAYLVGLAEP ARRAVPALVA
ALATAAAGVY VGEAVVTPAG DRQDAVGLAS LVVLVVGGAW AAGFAVRSHR ARGRRRAERA
LTEERLRIAR DLHDVVSHNL GLIAVRAGVA AHVAEADPRE ARVALRDIEE ASRSALTEMR
RALGVLRTEQ APLAPAPGLD GLDGLARDAR RAGVDVRLTV RGMRGVPEGT RLMAYRIVQE
ALTNAVRHAA PTRCEVTVAS DGAAVDIEVV DEGPAEGSRR PPGGPTGGHG LLGMRERAMM
CGGAFTAGPR PQGGFAVAVR LPTGQESTP