Gene Ndas_1971 details

Gene Information

Locus tag: Ndas_1971
Symbol:
ID: 9245821
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2394666
End bp: 2395847
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: integral membrane sensor signal transduction histidine kinase
Protein accession: YP_003679904
Protein GI: 297560930
COG category:
COG ID:
TIGRFAM ID:
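
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the record does not state either convention, so both are assumptions. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2394666, 2395847)
print(length)                  # 1182
print(protein_length(length))  # 393
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields listed in the record.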


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 3.29191e-07
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 4.14728e-08
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
GTGGTCGTGG ACCTGGTGCT GGGTGTGGTG CTGGGCGCGG TGGCGGTGGG GTTCGTGCGC 
GGCCAGGTGG TGTGGCAGCG GCGCGGCGGG GGCGGACACG GCCCCTGGCG GGGCGGGCCT
CCCCCGGGGT GGGACGTGGA CTGGGGTGCG GTGGTGGTGC CTTGGTCGGT GTGGGTGGGT
GTGGGGGTGT TGGTGTGCGC GGTGGCGGTG CGTCGTGTGT GGCCGGTGGC GGCGTTGGTG
GTGGCCGCGG TGGGTGCGGC GGTGTTCGTG GGGGCGGGTC ACCCGGTGGG TCCGGTGTTG
GTGTTGCCCG CGCTGGTGGT GTTCTCGGCC GCGATCCGGC TGGGCGCGGG GGTGTTCTTG
GCGTGGACGG CGCCGTTGGT GGTGGGGGTG GGGTTGCTCT CGCGGGTGCG TGAGCCCTGG
TGGGGGTTGG TTGACGGGGG CGCGCTGGTG GGGGTGGTGT TCGCGGTGGG GGTGATGGCC
TTTCCCGCGG GCGGTGGGGT GTTCGTGCGT GCGCGGCGTG AGGCGCGTCG GCGCGAGCGT
CAGGAGGAGG TGGACCGGCA CCGCTACGAG GAGCGGTTGC GTATCGCCCG GGAGGTGCAC
GATCTGGTGG GGCACAGTCT GTCGGTGATC AGTATGCAGG CGTCGGTGGC CCTGCACGTG
GTGGATCGTC GGCCCGAGCA GGCGTCGGTG GCGTTGGCCG CGATCCGGGA CAGCAGCCGG
TCGGCCCTGG AGGAGTTGCG GGGCACGCTG GCGGTGTTTC GGGGCGGGGC CGAGGTGGCG
GGTCGGGGTC CGTTGCCGGG GTTGGGGCGT GTGGAGGCGT TGGTGGGGGA GTTGCGCGGG
GCGGGACGTC GGGTGGAGGT GGTGTGGGAG GGGGAGGCGG TGCGGGTGCC CGCGGCGGTG
GACCACGCGG CGTTTCGGAT CGTGCAGGAG GCGTTGACGA ACGTGGTGCG CCATGGCGGT
GAGGGCGCGT CGGCGCGGGT GCGGGTGGTG TACGGGGAGC AGGAGGTGCG GGTGTGGGTG
GTTGATGAGG GAGTGGGTGT GGTGGGGCCG GTGCGTGAGG GGTCGGGGAT CGCGGGGATG
CGTGAGCGCG CGCGGGCGGT GGGCGGGTCG GTGAGGGTGG GGTCCGGTGA GGGCGGCGGG
TTCGTGGTGG GGGCGTCTTT GCCGTTGGGG GGTGAGCGGT GA
 
Protein sequence
MVVDLVLGVV LGAVAVGFVR GQVVWQRRGG GGHGPWRGGP PPGWDVDWGA VVVPWSVWVG 
VGVLVCAVAV RRVWPVAALV VAAVGAAVFV GAGHPVGPVL VLPALVVFSA AIRLGAGVFL
AWTAPLVVGV GLLSRVREPW WGLVDGGALV GVVFAVGVMA FPAGGGVFVR ARREARRRER
QEEVDRHRYE ERLRIAREVH DLVGHSLSVI SMQASVALHV VDRRPEQASV ALAAIRDSSR
SALEELRGTL AVFRGGAEVA GRGPLPGLGR VEALVGELRG AGRRVEVVWE GEAVRVPAAV
DHAAFRIVQE ALTNVVRHGG EGASARVRVV YGEQEVRVWV VDEGVGVVGP VREGSGIAGM
RERARAVGGS VRVGSGEGGG FVVGASLPLG GER
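
The protein sequence and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch rather than the database's own pipeline: it assumes translation table 11 uses the same codon-to-amino-acid assignments as the standard code, and that an alternative start codon such as GTG is read as methionine when it opens the reading frame. The helper names (`clean`, `translate`, `gc_content`) are hypothetical.

```python
# Codon table built from the compact representation, bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed start codons for table 11 (bacterial, archaeal and plant plastid code).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # an initiating codon is read as methionine
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, copied with their block spacing.
print(translate("GTGGTCGTGG ACCTGGTGCT GGGTGTGGTG"))  # MVVDLVLGVV
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison with the protein sequence and the GC content field in this record.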