Gene Information |
Locus tag | Ndas_0046 |
Symbol | |
ID | 9243873 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 56715 |
End bp | 58712 |
Gene Length | 1998 bp |
Protein Length | 665 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | transcriptional regulator, CdaR |
Protein accession | YP_003678004 |
Protein GI | 297559030 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
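The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 56715
END_BP = 58712
GENE_LENGTH_BP = 1998
PROTEIN_LENGTH_AA = 665


def span_length(start, end):
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon and contributes no residue.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Both comparisons hold for this record: 58712 - 56715 + 1 = 1998 bp, and 1998 / 3 - 1 = 665 aa.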
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.541147 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0349089 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAGCGCG AGCTGAACAC CACCACGTCC GGTTCGACCG GTGTCCCCGC CGACGACGGC GCCGCGGGCG CCGACGGCCA GGCCCTGTTC CTGGACCTGC TGCTGCGCGA CGCCCCGGCC GTGGAGTACG AGCGCCCGGT GCTGCGCGCG CGGGCGCGCG GGGACGACCC GGAGGTGGTC GCCGCCCTGG AGGCCGCGAA GATGACGGCG CTGCGGGTGC GGTCGGTGAT GCGCGACCGG CGGCGGCGCG AGTCGGAACT GGGCGCGCTG TTCGAGACCG CCAACGACCT GGCGGGCATG CGCAGTCTGG ACCAGGTGCT CCAGGCGATC GTGGAGCGGG CCCGCAACCT GCTGGGCACC GACACCGCGT ACCTGACGCT GAGCGACCCG GAGGCGGGCG GCACGTACAT GCGGGTGACG TCGGGGTCGG TGTCGGCGGC GTTCCAGCGG TTGCGGCTGG CCCCGGGCAA GGGGTTGGGC GGGCTGGTGG CGACCACGGC CCTGCCGTAC GTGACGGCGA ACTACTTCGC CGATCCCCGG TTCACGCACG CGGAGAACAT CGACCACGCG GTGCGCGACG AGGGCCTGGT GGCCATCCTG GGCGTGCCGC TGAAGATGAA CGGACGCGAC GTGGGCGTGC TGTTCGCCGC CAACCGGCGC GAGCGCCCCT TCGCCCACTC GGAGGTGGCG CTGCTGTCGT CGCTGGCCGC GCACGCGGCG ATCGCGATCG ACAGCGCGAA CCTCATCGAC GACACCCGGC GTGCCCTGGA CGAGCTGCAC ACCGTCAACG AGCGGTTGCA GCGGCACACG GCGTCGGTGG AGCGGTCGGC GGCGGCGCAC GACCGGTTGA CCGACCTGGT GCTGCGGGGC GGCGGCGTGC GCGAGGTCGC GGCGGCGGTG GCCGAGGTGC TGGGCGGCAC GGTGCTGATC CACGACGCGA CCTCGGACTC CTCGGTGACC GCCAGCCCGG AGGGTGTGGT GCGGGAGGGC GTCCCGTGGG ACGCGGGCGA CGGGGACCTG GCCGAGGCGG TGCGCTCGTC GCTGGCCAGC GGCCGCGCGG TGCGGGCGGG CCGGGCGTGG GTGGCGACGG CGGCGGCCGG GACCGAGCCG CTGGGAACGC TGGTGCTGCG CGGGGTGGAG CTGGACTCCA CCGACCAGCG CGTCCTGGAG CGGTCGGCGA TGGTGACGGC GCTGCTGCTG CTCATCCGGC GTTCGGTGAG CGAGACCGAG CACAGGCTGC GCGGGGACCT GCTGGACGAG CTGTTGGAGG TGCCCGCGCG CGATCCGGTT TCGCTGCGCC AGCGGGCCGC GCTGCTGCAC GCGGACCTGG ACGCGCCGCA CGTGCTGGTG GTGGCCGAGG CCCCGGGCGG GGACCCGGGG CGGTTGCGCT CGGCGGCGAC CCACGTCGCG GAGACCACCG GGGGACTGGC CGGGAGCCGG TCGGGCCGCC TGGTGCTGGC GCTGCCCGGG AGGGACCCGT CGGCGGTGGG CCGACGGGTG GCCGACGAGC TGTCGGGTGC GGTGAACGGC CCCGTGACGG CGGGGCTGGC CGGTCCGACG GCCGGTCCGG CCTCGTTCGG GGACGCGTTC GCCGAGGCGG CCCGGTGCCT CCAGACGCTG CGTGCGCTGG GACGGGAGGG GGACGTGGCG ACCACGGGGG ACCTGGGGTT CTCGGGGCTG CTGCTGAGCC AGGACCGGGA CGTGCCGGGG TTCGTGTCGG CGACGCTCGG GCCGCTGCTG GAGTACGACG CCAGGCGGGG AACGCTGCTG GTGGAGACTC TGCGGGCGTA CTTCGCCGCC 
GGGGGCAACC TGTCGCGCGC CAAGGAGGAC CTGCACATCC ACGTGAACAC GGTGGCGCAG CGCCTGGAGC GGATCGGTCA GCTGATCGGG GCGGACTGGC AGCGTCCCGG CCGGGCGCTG GAGCTCCAGC TGGCCCTGCA CCTGCACGGC CTGCTGGACC GGGGCGTGGA CCTGCTGGGC GACCAGAACG GCGGTTGA
|
Protein sequence | MERELNTTTS GSTGVPADDG AAGADGQALF LDLLLRDAPA VEYERPVLRA RARGDDPEVV AALEAAKMTA LRVRSVMRDR RRRESELGAL FETANDLAGM RSLDQVLQAI VERARNLLGT DTAYLTLSDP EAGGTYMRVT SGSVSAAFQR LRLAPGKGLG GLVATTALPY VTANYFADPR FTHAENIDHA VRDEGLVAIL GVPLKMNGRD VGVLFAANRR ERPFAHSEVA LLSSLAAHAA IAIDSANLID DTRRALDELH TVNERLQRHT ASVERSAAAH DRLTDLVLRG GGVREVAAAV AEVLGGTVLI HDATSDSSVT ASPEGVVREG VPWDAGDGDL AEAVRSSLAS GRAVRAGRAW VATAAAGTEP LGTLVLRGVE LDSTDQRVLE RSAMVTALLL LIRRSVSETE HRLRGDLLDE LLEVPARDPV SLRQRAALLH ADLDAPHVLV VAEAPGGDPG RLRSAATHVA ETTGGLAGSR SGRLVLALPG RDPSAVGRRV ADELSGAVNG PVTAGLAGPT AGPASFGDAF AEAARCLQTL RALGREGDVA TTGDLGFSGL LLSQDRDVPG FVSATLGPLL EYDARRGTLL VETLRAYFAA GGNLSRAKED LHIHVNTVAQ RLERIGQLIG ADWQRPGRAL ELQLALHLHG LLDRGVDLLG DQNGG
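Although the gene is annotated on the minus strand, the gene sequence as listed begins with ATG and ends with TGA, so it appears to be given in the coding orientation. The sketch below is one way to spot-check the two sequences against each other: it translates the first and last stretches of the gene sequence using the amino acid assignments of translation table 11 and computes a GC fraction. Only short excerpts of the record are embedded here, so the 75% GC figure is not verified by this example. The helper names are illustrative.

```python
from itertools import product

# Amino acid assignments for translation table 11, in the conventional
# T, C, A, G codon order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# Excerpts copied from the Gene sequence field: the first 30 bases and
# the last 18 bases (both in frame, since 1998 - 18 is a multiple of 3).
first_bases = "ATGGAGCGCG AGCTGAACAC CACCACGTCC"
last_bases = "GACCAGAACG GCGGTTGA"

if __name__ == "__main__":
    print(translate(first_bases))  # MERELNTTTS
    print(translate(last_bases))   # DQNGG
```

The two results match the first ten and last five residues of the listed protein sequence. Running `translate` and `gc_fraction` on the full gene sequence would extend the check to the whole record.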