Gene Ndas_0781 details


Gene Information

Locus tag: Ndas_0781
Symbol:
ID: 9244626
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 960936
End bp: 962546
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: integral membrane sensor signal transduction histidine kinase
Protein accession: YP_003678731
Protein GI: 297559757
COG category:
COG ID:
TIGRFAM ID:
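
The Start bp, End bp, Gene Length and Protein Length fields above are consistent with one another if the coordinates are read as 1-based and inclusive and the CDS is assumed to end in a single stop codon. Both readings are assumptions, not something the record states. A minimal sketch of that check, with hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(960936, 962546)
print(length)                     # 1611
print(protein_length_aa(length))  # 536
```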


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACCTCT CCGTACCGCG CGCGGGCCTC CGGCGTGCGC TGACCCCGCT CGGGGGGTGG 
CCCACCCGGG GCGCCGTCTG GCGCGACGTG CTGTTCGGGG CCGCGATCGC CGTCCTGTGC
CTGGGGGAGA TGCTCATCCG CTGGCGCGCC GGTGAGACGT CGGCCGCCGA GGCGCTCACC
ACCGCGTCCG CCGCGGTGGC GCTGACCGCG CTGACCGTGG TGCTCTGCCG CCTCTACCCG
CTGCTCGCCC TGACCGTGGC GCTGCTCGGC TCGTTCTGGG TGTACGGCTT CGGCGTCCTG
CTGCTGTGCG TCGCCTACCT GGTGGGCCGC CGCCTCCCCA CGGTGTGGTC GGCGCTGGCG
CTCTTCACGT CCGTGACCGT CCTGTGGACG GCGGTCGGAG CGCTGCTGTG GCCCGAGGCG
CCCGCGGCGT GGCCCGCCAC CGTGAGCACC GTCGTGTTCA CCCTCGTGCT CCCCTGGCTG
GTGGGCGTCT ACCGGCGCCA GCACGTGGCC CTGGCCGAGG CCGGATGGGA GCACGCCCGC
CAGCTCCAGC GCGAGCACCG CCTCACCGTC GACGAGGCCA GGCTCCGCGA GCGCTCCCGC
ATCGCCCAGG ACATGCACGA CTCCCTAGGC CACGAGCTCA GCCTCATCGC CCTGCGCGCC
GGGGCCCTGG AGGTCAGCCC CGACCTGGAC CAGGAGCACC GCCGCTCCGC GTCCGAGCTG
CGGATCACCG CGGTCGAGGC CACCCGCAGC CTGCGGGAGA TCATCGGCGT GCTGCGCGCC
GACACCGAAC CCGCGCCCAT GTCCCCCGCG GGCGAGGGCG TTCCCGCGCT GGTCGAGCGC
GTCCGCGACT CCGGCATGCG GGTCGTGCTG GTGCGCGACG GGGACACCGG CGGCCTGCCG
CCGATGGTGG ACCGCGCGGT CCACCGGGTC GTCCAGGAAT CCCTCACCAA CGCCGCCAAG
TACGCCCCCG GCGCCGACGT GACCGTGTGG CTGACCGCGA GCGGGGAGCG GGTGGAGGCC
AGGGTCACCA ACTCCGCGCC CCCCGAGCCG TGCCCCGACA CCGGTCCGGG CGGCAGGCGC
GGACTCATCG GGCTGCGCGA GCGCGTCCGC CTGACCGGGG GCTCCTTCGC CGCCGGGCCG
AACGGCGACG GCGGCTGGGA GGTCTGCGCG ATCATGCCGC TGGACGGCAC CGCCGGGGCG
GGGCCCGACG AGGACGGGGA GGCGGCCCAG ATCGACCAGC TCCAGCTCGC CGCCCACCGC
CGGGTGCGCG GCTGGACGAT GGCCCTGGCC CTGGTACCGG TCACCGCCGC CGTGGCGCTC
GTGCTCGGGC TGGGCTGGAT CGGCGTCACG AACATGGAGA GCAGCACCCT GACCCCCGAG
GAGTTCGCCG ACCTGACCGT CGGCGACCCC CAGACCGAGG TCGAGGAGGC CCTGCCGCCG
AACTCGGTGT ACATGGACGC GAGCGTCCTG GCGCAGGACG CCGTCCCCCC GGGCGCCACA
TGCAGCTACT ACCGCAGCAA CGGCGTGTTC CTCGGCGAGG AGACGGTCTT CTACCAGCTG
TGCTTCTCCG AGGGGCGCCT GCTGACCAAG GGCGAGGCGC CGCTACGGTG A
 
Protein sequence
MDLSVPRAGL RRALTPLGGW PTRGAVWRDV LFGAAIAVLC LGEMLIRWRA GETSAAEALT 
TASAAVALTA LTVVLCRLYP LLALTVALLG SFWVYGFGVL LLCVAYLVGR RLPTVWSALA
LFTSVTVLWT AVGALLWPEA PAAWPATVST VVFTLVLPWL VGVYRRQHVA LAEAGWEHAR
QLQREHRLTV DEARLRERSR IAQDMHDSLG HELSLIALRA GALEVSPDLD QEHRRSASEL
RITAVEATRS LREIIGVLRA DTEPAPMSPA GEGVPALVER VRDSGMRVVL VRDGDTGGLP
PMVDRAVHRV VQESLTNAAK YAPGADVTVW LTASGERVEA RVTNSAPPEP CPDTGPGGRR
GLIGLRERVR LTGGSFAAGP NGDGGWEVCA IMPLDGTAGA GPDEDGEAAQ IDQLQLAAHR
RVRGWTMALA LVPVTAAVAL VLGLGWIGVT NMESSTLTPE EFADLTVGDP QTEVEEALPP
NSVYMDASVL AQDAVPPGAT CSYYRSNGVF LGEETVFYQL CFSEGRLLTK GEAPLR
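
The two sequence blocks can be cross-checked by stripping the 10-character grouping, translating the gene sequence, and comparing the result with the protein sequence. The sketch below is illustrative only and its helper names are hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two tables differ in their permitted start codons), and it reports GC content as a simple fraction of G and C bases.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    dna = clean(dna)
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence as listed in this record.
first_row = "ATGGACCTCT CCGTACCGCG CGCGGGCCTC CGGCGTGCGC TGACCCCGCT CGGGGGGTGG"
print(translate(first_row))  # MDLSVPRAGLRRALTPLGGW
```

Passing the full gene sequence to `translate` and comparing against `clean` applied to the protein sequence gives a quick consistency check. `gc_content` on the full gene sequence can be compared with the GC content field.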