Gene Ndas_0046 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0046 
Symbol 
ID9243873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp56715 
End bp58712 
Gene Length1998 bp 
Protein Length665 aa 
Translation table11 
GC content75% 
IMG OID 
Producttranscriptional regulator, CdaR 
Protein accessionYP_003678004 
Protein GI297559030 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.541147 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0349089 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGCGCG AGCTGAACAC CACCACGTCC GGTTCGACCG GTGTCCCCGC CGACGACGGC 
GCCGCGGGCG CCGACGGCCA GGCCCTGTTC CTGGACCTGC TGCTGCGCGA CGCCCCGGCC
GTGGAGTACG AGCGCCCGGT GCTGCGCGCG CGGGCGCGCG GGGACGACCC GGAGGTGGTC
GCCGCCCTGG AGGCCGCGAA GATGACGGCG CTGCGGGTGC GGTCGGTGAT GCGCGACCGG
CGGCGGCGCG AGTCGGAACT GGGCGCGCTG TTCGAGACCG CCAACGACCT GGCGGGCATG
CGCAGTCTGG ACCAGGTGCT CCAGGCGATC GTGGAGCGGG CCCGCAACCT GCTGGGCACC
GACACCGCGT ACCTGACGCT GAGCGACCCG GAGGCGGGCG GCACGTACAT GCGGGTGACG
TCGGGGTCGG TGTCGGCGGC GTTCCAGCGG TTGCGGCTGG CCCCGGGCAA GGGGTTGGGC
GGGCTGGTGG CGACCACGGC CCTGCCGTAC GTGACGGCGA ACTACTTCGC CGATCCCCGG
TTCACGCACG CGGAGAACAT CGACCACGCG GTGCGCGACG AGGGCCTGGT GGCCATCCTG
GGCGTGCCGC TGAAGATGAA CGGACGCGAC GTGGGCGTGC TGTTCGCCGC CAACCGGCGC
GAGCGCCCCT TCGCCCACTC GGAGGTGGCG CTGCTGTCGT CGCTGGCCGC GCACGCGGCG
ATCGCGATCG ACAGCGCGAA CCTCATCGAC GACACCCGGC GTGCCCTGGA CGAGCTGCAC
ACCGTCAACG AGCGGTTGCA GCGGCACACG GCGTCGGTGG AGCGGTCGGC GGCGGCGCAC
GACCGGTTGA CCGACCTGGT GCTGCGGGGC GGCGGCGTGC GCGAGGTCGC GGCGGCGGTG
GCCGAGGTGC TGGGCGGCAC GGTGCTGATC CACGACGCGA CCTCGGACTC CTCGGTGACC
GCCAGCCCGG AGGGTGTGGT GCGGGAGGGC GTCCCGTGGG ACGCGGGCGA CGGGGACCTG
GCCGAGGCGG TGCGCTCGTC GCTGGCCAGC GGCCGCGCGG TGCGGGCGGG CCGGGCGTGG
GTGGCGACGG CGGCGGCCGG GACCGAGCCG CTGGGAACGC TGGTGCTGCG CGGGGTGGAG
CTGGACTCCA CCGACCAGCG CGTCCTGGAG CGGTCGGCGA TGGTGACGGC GCTGCTGCTG
CTCATCCGGC GTTCGGTGAG CGAGACCGAG CACAGGCTGC GCGGGGACCT GCTGGACGAG
CTGTTGGAGG TGCCCGCGCG CGATCCGGTT TCGCTGCGCC AGCGGGCCGC GCTGCTGCAC
GCGGACCTGG ACGCGCCGCA CGTGCTGGTG GTGGCCGAGG CCCCGGGCGG GGACCCGGGG
CGGTTGCGCT CGGCGGCGAC CCACGTCGCG GAGACCACCG GGGGACTGGC CGGGAGCCGG
TCGGGCCGCC TGGTGCTGGC GCTGCCCGGG AGGGACCCGT CGGCGGTGGG CCGACGGGTG
GCCGACGAGC TGTCGGGTGC GGTGAACGGC CCCGTGACGG CGGGGCTGGC CGGTCCGACG
GCCGGTCCGG CCTCGTTCGG GGACGCGTTC GCCGAGGCGG CCCGGTGCCT CCAGACGCTG
CGTGCGCTGG GACGGGAGGG GGACGTGGCG ACCACGGGGG ACCTGGGGTT CTCGGGGCTG
CTGCTGAGCC AGGACCGGGA CGTGCCGGGG TTCGTGTCGG CGACGCTCGG GCCGCTGCTG
GAGTACGACG CCAGGCGGGG AACGCTGCTG GTGGAGACTC TGCGGGCGTA CTTCGCCGCC
GGGGGCAACC TGTCGCGCGC CAAGGAGGAC CTGCACATCC ACGTGAACAC GGTGGCGCAG
CGCCTGGAGC GGATCGGTCA GCTGATCGGG GCGGACTGGC AGCGTCCCGG CCGGGCGCTG
GAGCTCCAGC TGGCCCTGCA CCTGCACGGC CTGCTGGACC GGGGCGTGGA CCTGCTGGGC
GACCAGAACG GCGGTTGA
 
Protein sequence
MERELNTTTS GSTGVPADDG AAGADGQALF LDLLLRDAPA VEYERPVLRA RARGDDPEVV 
AALEAAKMTA LRVRSVMRDR RRRESELGAL FETANDLAGM RSLDQVLQAI VERARNLLGT
DTAYLTLSDP EAGGTYMRVT SGSVSAAFQR LRLAPGKGLG GLVATTALPY VTANYFADPR
FTHAENIDHA VRDEGLVAIL GVPLKMNGRD VGVLFAANRR ERPFAHSEVA LLSSLAAHAA
IAIDSANLID DTRRALDELH TVNERLQRHT ASVERSAAAH DRLTDLVLRG GGVREVAAAV
AEVLGGTVLI HDATSDSSVT ASPEGVVREG VPWDAGDGDL AEAVRSSLAS GRAVRAGRAW
VATAAAGTEP LGTLVLRGVE LDSTDQRVLE RSAMVTALLL LIRRSVSETE HRLRGDLLDE
LLEVPARDPV SLRQRAALLH ADLDAPHVLV VAEAPGGDPG RLRSAATHVA ETTGGLAGSR
SGRLVLALPG RDPSAVGRRV ADELSGAVNG PVTAGLAGPT AGPASFGDAF AEAARCLQTL
RALGREGDVA TTGDLGFSGL LLSQDRDVPG FVSATLGPLL EYDARRGTLL VETLRAYFAA
GGNLSRAKED LHIHVNTVAQ RLERIGQLIG ADWQRPGRAL ELQLALHLHG LLDRGVDLLG
DQNGG