Gene Ndas_1809 details


Gene Information

Locus tag: Ndas_1809
Symbol:
ID: 9245659
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 2214625
End bp: 2215794
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 74%
IMG OID:
Product: integral membrane sensor signal transduction histidine kinase
Protein accession: YP_003679743
Protein GI: 297560769
COG category:
COG ID:
TIGRFAM ID:
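
The length fields above are consistent with the coordinates. A minimal sketch of the arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (both are assumptions, although they reproduce the listed values); the variable names are illustrative only:

```python
# Coordinates copied from the record above.
start_bp = 2214625
end_bp = 2215794

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")     # 1170 bp
print(protein_length, "aa")  # 389 aa
```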


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.13648
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAATC CGCCGCTGGT CGGCGCGCGC AGGTTCGAGA CCTACGTGCG CTGGTCCACC 
TACCTGGCCG TCGCGACTCC GGTCGGAACA CTCCTCCGGA GCGCGGTGGA GGCCGACCGG
CTCACTGGAG CTCCCCTGAC CGCGCTGGTC GCGGCGGCCG TGCTGTGCCT GCTCCTGGTC
GGCGGGAACG TCCTCGTCAC CAAGTGGAGC ATCGACACCG TCGTCGGACG CGCGCGGGGG
TTCCCCGTCA CCGCCGTCGT CGGGTGGGCC CTGGTCCTGG CGGCGTTCGT CGCCGTGGCC
CTGATGCTTC CCCTTCCCGC GATGGACATG ACGGTCGCGG CGGCGATCGC CTCGGCGGCC
GCGAGCCTCG CACCGGCCCT CCACGCCCGC CGGGCGCTGC TGCTCCACGC GGCGGCGCTG
GTGCTGAACA CCGCGCTCGT GGGATTCGCG GACGTCGTGG TCCTGCTGGT CGGGGCGCTG
ATGATCAGCG CCGTCCTGTG GTTGTGCTGG TCCAGCGCGT GGATGCTGCG GGTGCTGCTC
GAACTCCAGG CGGCCCACGA GGACCGCGCC GCGCTCGCCC TGGCCAACGA ACGCCTGCGC
ATCTCCCGCG ACCTGCACGA CGTGTTCGGC CGCACCCTGG CCGCGATCGC GGTCAAGAGC
TCGCTGGCCT CCGAACTCGT CCAGCGCGGT CACGGCGAAC GGGCCGCGAC GGAGATCTCC
GCGATCCGCG GACTGGCCGA GGAGGCGGGC ACCGAGGTCC GCCGCGTCGT GCGCGGCGAG
CTGCGCACAA CCTGGGAGGA CGAGGTCTCG GGCGCCCGTT CTCTGCTCAG GTCCGCCGGT
ATCCGCTGCA CGGTGACCGG GGATCCCGTC CCCGAGCGGT GCGCGGAGCC CCTGGCGTGG
GTCGTGCGCG AGGGCGTCAC CAACCTGCTG CGCCACGCCT CGGCCACCCA GGTCACCCTC
GCGACCGCGA ACGAGGACGG GGAGGTGCAC CTGACCGTCG CCAACGACGG AGCCGGTCCC
CCGCGGTCCG CGCGGGACGG CGAGGGCACC GGGCTGCGCG CGATGTCCGA GCGCCTGCAC
GCGCTCGGCG GGCACGTCAC GGCGCGCCGT GACGGAGACT GGTTCCTGCT CGACGCCGTG
CTCCCGCTTC CGAAGGACGC CCCCAGATGA
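
The GC content field of a record like this can be checked against the gene sequence itself. A minimal sketch, assuming the sequence is pasted as text in the 10-base blocks shown above; `gc_content` is a hypothetical helper, not part of any database tool:

```python
def gc_content(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string.

    Whitespace (spaces and line breaks between blocks) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Toy inputs, not the gene above.
print(gc_content("GGCC"))   # 100.0
print(gc_content("AT GC"))  # 50.0
```

Passing the full gene sequence to the function gives a value that can be compared with the GC content listed in the record.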
 
Protein sequence
MSNPPLVGAR RFETYVRWST YLAVATPVGT LLRSAVEADR LTGAPLTALV AAAVLCLLLV 
GGNVLVTKWS IDTVVGRARG FPVTAVVGWA LVLAAFVAVA LMLPLPAMDM TVAAAIASAA
ASLAPALHAR RALLLHAAAL VLNTALVGFA DVVVLLVGAL MISAVLWLCW SSAWMLRVLL
ELQAAHEDRA ALALANERLR ISRDLHDVFG RTLAAIAVKS SLASELVQRG HGERAATEIS
AIRGLAEEAG TEVRRVVRGE LRTTWEDEVS GARSLLRSAG IRCTVTGDPV PERCAEPLAW
VVREGVTNLL RHASATQVTL ATANEDGEVH LTVANDGAGP PRSARDGEGT GLRAMSERLH
ALGGHVTARR DGDWFLLDAV LPLPKDAPR
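
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. A minimal sketch, assuming the amino-acid assignments of the standard genetic code, which translation table 11 shares; the table's alternative start codons are not handled here, and this gene begins with ATG in any case. `translate` and `CODON_TABLE` are illustrative names:

```python
from itertools import product

# Codons in the conventional T, C, A, G order (first base varies slowest),
# paired with the standard amino-acid assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "ATGAGCAATC CGCCGCTGGT CGGCGCGCGC AGGTTCGAGA CCTACGTGCG CTGGTCCACC"
print(translate(first_line))  # MSNPPLVGARRFETYVRWST
```

The printed residues match the first 20 residues of the protein sequence listed above.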