Gene Ndas_4008 details


Gene Information

Locus tag: Ndas_4008
Symbol:
ID: 9247880
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4793980
End bp: 4795554
Gene Length: 1575 bp
Protein Length: 524 aa
Translation table: 11
GC content: 77%
IMG OID:
Product: integral membrane sensor signal transduction histidine kinase
Protein accession: YP_003681911
Protein GI: 297562937
COG category:
COG ID:
TIGRFAM ID:
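
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the protein length excludes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4793980, 4795554)
print(length)                     # 1575, matches "Gene Length"
print(protein_length_aa(length))  # 524, matches "Protein Length"
```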


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00577691
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.384071
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGTGGC ACGAGGCCCT CCGGCCCGCG CGTTCCTATC CTGGGGGAAT GCCCGACCTG 
TCCCCACGCT CCCTCCTGGA CCGCGTGACG CTGTGGGCCC GCGGCCACGT GCTCGCCGTG
GACGCGCTCT GGGCCGTCGT CTGGTTCGCC ATGTCGATGG CGACCTGGCC GAGGGCCAGC
TTCGACACCG CCGAGTCGTG GGTCTACCTC GTGCTGGCCA CCGCGTGCTG CGCCGCCCTG
GCCCTGCGCC GCGTCCGCCC GTTCGCGTGC CTCGCGGTGC TGGGCGTCCT GCTGGCGTTC
CACATCCTCT GGTTCGACCA GCCCACGGCC CCGGTGGGCA TCTGCGCCCT GGTCGCGTCC
TACACGGCGC AGGCCGAGCT CCCGCGCCCG TGGCGCGCCG TCGGTCTCCT CCTGCTCCTG
GCCGGAGCGG CCTGGGCCGT CCTCTCCATC CCGCCGGAGA ACCTCTCCGC GGACCTGGAG
CTGCGCCTCA ACAGCGTCGT CTCGGCGTGG ACGGCGGTCG CGCTGTTCTC CCTGCTCGGA
GCGTTCCGCA GGCGCAACCG CGAGGAGTTC GCCCGCGTCG TGGAGCACGC CCGGCTGCTG
GAGACCCAGC GGGAGCAGGA GGTGCGCCTG GCCGCGCTCG ACGAGCGGAC GCGCATCGCC
CGGGAGATGC ACGACATCCT CGCGCACTCG CTGAACGTCA TCGTCGCCCA GGCCGACGGC
GGCCGCTACG CCGCGAAGGC CGCCCCGGAG CGCGCGGTGG CCGCCCTGGC CACCGTCGCC
CAGGTGGGAC GCGAGTCGGC GGCGGAACTG CACCAGCTCC TGGGCGTCCT GCGCGACGGC
GAGGAGCGCG GGGCCGCCCC GGCCCCCGGG GTCGGCGACC TGCCCGGCCT CGTGGAGGAG
TACCGCCGCG CCGGTCTGCG GATCCGCCTG GTCCAGCACG GGTCCCCGGC CGCCCCGCGC
GGCGGCCGGG CGGACACCGG CGCCCCGGCG ACCCTGCCGG CGACCGCGTC CCTGACGGTC
TACCGCGTGG TGCAGGAGTC GCTGGCCAAC GCGCTCAAGC ACGGGGGACC CGCCGCCGCG
CGGGTCGAGC TGACGTGGTC GCCCGGGCGG GTCGGGATCG ACGTGGCCAA CTCCGTCCGC
GAGGCCGCAC CCGCGGCCCT CACCACACCC GCGGGCCCCT CAGGCCGCTC GGAATCCACA
GGTCCGTCCG CGTCTGGAGG CCCCTCGGCT CCCCCGGTGT TCGCGGGCCC CTCCATGCCC
ACGGCCCTCT CCGCACCCGA GGGGGTTCCT CCCACGGGCC GCTCGGGTCC ATCGACACCC
GCGGACGCCT CGGGCCCCTC GGCGCCCGCG GGTTCCTGCG GCACCGGAAG GCCCTCCGGT
ACCGGGAGGC ACTCCGCCGC CAAGGCGCCC TCCGCCGTCG GGGCGGTCCC GGGCGGCGCT
CGGCGGGGCC CCGGCCACGG CCTGGTCGGC ATGCGCGAGC GTGTGGGCCT GCACGGCGGC
ACCCTGGAGG TCGGCGCCGA CGACGCCACC GGCACCTGGC GGGTGCGCGC GGTGGTCCCC
TGGGAGGAGG CGTGA
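
The gene sequence is displayed in groups of 10 bases, 60 per line, so the grouping whitespace has to be stripped before any computation. A minimal sketch of that cleanup, plus a GC-fraction helper of the kind that could be used to check the GC content field, is given below. The function names are hypothetical, and only the first display line is used as input here.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


first_line = "ATGCCGTGGC ACGAGGCCCT CCGGCCCGCG CGTTCCTATC CTGGGGGAAT GCCCGACCTG"
seq = clean_sequence(first_line)
print(len(seq))  # 60 bases on a full display line
print(f"{gc_fraction(seq):.0%}")
```

Running `gc_fraction` over the full 1575 bp sequence, rather than one line, gives the value to compare with the record.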
 
Protein sequence
MPWHEALRPA RSYPGGMPDL SPRSLLDRVT LWARGHVLAV DALWAVVWFA MSMATWPRAS 
FDTAESWVYL VLATACCAAL ALRRVRPFAC LAVLGVLLAF HILWFDQPTA PVGICALVAS
YTAQAELPRP WRAVGLLLLL AGAAWAVLSI PPENLSADLE LRLNSVVSAW TAVALFSLLG
AFRRRNREEF ARVVEHARLL ETQREQEVRL AALDERTRIA REMHDILAHS LNVIVAQADG
GRYAAKAAPE RAVAALATVA QVGRESAAEL HQLLGVLRDG EERGAAPAPG VGDLPGLVEE
YRRAGLRIRL VQHGSPAAPR GGRADTGAPA TLPATASLTV YRVVQESLAN ALKHGGPAAA
RVELTWSPGR VGIDVANSVR EAAPAALTTP AGPSGRSEST GPSASGGPSA PPVFAGPSMP
TALSAPEGVP PTGRSGPSTP ADASGPSAPA GSCGTGRPSG TGRHSAAKAP SAVGAVPGGA
RRGPGHGLVG MRERVGLHGG TLEVGADDAT GTWRVRAVVP WEEA
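
The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below is an illustration rather than the database's own pipeline: it builds the codon table from the conventional TCAG ordering, whose amino-acid assignments translation table 11 shares with the standard code (table 11 differs in which start codons it permits, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; display whitespace is ignored."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 bases and the final 15 bases of the gene sequence above.
print(translate("ATGCCGTGGC ACGAGGCCCT CCGGCCCGCG"))  # MPWHEALRPA
print(translate("TGGGAGGAGG CGTGA"))                  # WEEA*
```

Both fragments agree with the start (MPWHEALRPA) and end (WEEA) of the listed protein sequence, with the trailing `*` corresponding to the TGA stop codon.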