Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3668 |
Symbol | |
ID | 9247537 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 4403037 |
End bp | 4404641 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | transcriptional regulator, CdaR |
Protein accession | YP_003681572 |
Protein GI | 297562598 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.36524 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTTCCCA CCCTCGCCGA GGTCCTCCAG CTGCCCGCCC TCCAGCGCGC CCGCCCCCGG GTCGTGGCGC GCCCGGACCG GCTCGACGTG GCCGTGCGCT GGGTGCACAT CGCCGAGGTC ACCGACCTCG CCCACCTGCT CCGCGGCGGT GAGCTGGTCC TCAGCACCGG CATCGCCATG CCCGACGATC CCGAGGCGCT GCGCCGCTAC ATCGACGACC TCGCCGCGGC CGGGGTCAGC GGCATCGCGG TCGAACTGGG TCGCAAGTAC CACGCCGAGC TCCCGCCCGC CTTCCTCGAC GCCGCGCGCG AGGCGGGCGT GCCGGTGGTC CTGCTCGACC GCGACGCCCG CTTCGTGGAG ATCACCGAGG CCGTGCACGT CCGGGTGGTC AACGAACAGC TGGAGGAGCT GCGCGAGTCG GAGAAGCTGC ACTCGGTCTT CACCCAGCTC TCCGTGGAGG GCGCCGACAC CGGGCGCATC CTGCACGAGG TCGCCCAGCT CTCCGGCTGC CCCGTGGTCC TGGAGAACCT CACCCACCAG GTGCTGGCCT GCGACCTCAA CGGCGAGGAG CCCGAAACCC TGCTCACCTC CTGGGAGAGC CGCTCGCGCG CGGCGCGCTC GGGCTCGCGC ACCGTCCACA CGCCCGCCAA CGGCTGGCTG GTCACCACGG TCGGGGCGCG CGGCCAGGAC TGGGGGCGTC TGATCCTCGT CGTCGGGGCC AACCCCACCC CGCGCCAGAG CATGCTCATC GAACGCGCCG CCACCACCCT CGCGCTGGTC AGGCTGCTGG AGCGCCACCA GGAGAGCCTG GAGCGGCAGA CGCACCGCAC GATCATCGCG GGCATCATCG ACCGGGCCTA CTCCGACCCG GAGGAGGCGC TGGTTCGGGC CCGCGCCGTG GGCGTGCCGC TCAACGGCCG CGAACTGGTG GGGCTCGTGC TGCGCCTGCG TGAGAGCGGG ACCGGGCTGG CCGCCCAGGC GCGCCTGTCC GACACCGCCG AGGGCGCCGC GCGCGCCTGC CGCGAGCTGC GCCTGCCCGC CCTGGTGGGC TCGCTCGACG ACCTGCGCGT GGGCATCCTG CTCGCGCTGC CGGGCCGCCA GCGCCTGGAC CCGACCCTGC ACCTGCTCGC CGACCGCATC CGCACCGCCG TGGGCGGCGG CGCCGTGCTG GCCGTGGGCT CCTCCGCGGA CGGAGTCCGG GAGGTGCGGC GCTCCTTCCT GGAGGCGCGC CAGGTCGGCG ACGTGGCCAT CCGCCAGAGC GACCAGCGCC CCTTCTACCG GCTGCCCGAC CTGCACCTGC GCGGCCTGCT CCACCTGTTC CGCGACGACG AGCGGCTCCA GACCTACGTC GAGCGCGAGC TGGGGCCGCT GCTGGACCAC GACGCGCGGC ACGGCGGCGA CCTGACCGAC ATGCTCCGCC ACTACCTGGG CGCGGGCCGC AACAAGGCGC TCGCCGCCGG GCGGGCGCAC CTGTCCCGCC CGGCGTTCTA CGACCGGCTG CGGCGGGTCG CCCACGTCCT TGACGCGGAC CTGGACTCGG TGGAGACCTG CCTGTCCCTG CACGTGGCGC TGCTGTCGCT GGACTCGGTC CGCGACGAGC GCTGA
|
Protein sequence | MLPTLAEVLQ LPALQRARPR VVARPDRLDV AVRWVHIAEV TDLAHLLRGG ELVLSTGIAM PDDPEALRRY IDDLAAAGVS GIAVELGRKY HAELPPAFLD AAREAGVPVV LLDRDARFVE ITEAVHVRVV NEQLEELRES EKLHSVFTQL SVEGADTGRI LHEVAQLSGC PVVLENLTHQ VLACDLNGEE PETLLTSWES RSRAARSGSR TVHTPANGWL VTTVGARGQD WGRLILVVGA NPTPRQSMLI ERAATTLALV RLLERHQESL ERQTHRTIIA GIIDRAYSDP EEALVRARAV GVPLNGRELV GLVLRLRESG TGLAAQARLS DTAEGAARAC RELRLPALVG SLDDLRVGIL LALPGRQRLD PTLHLLADRI RTAVGGGAVL AVGSSADGVR EVRRSFLEAR QVGDVAIRQS DQRPFYRLPD LHLRGLLHLF RDDERLQTYV ERELGPLLDH DARHGGDLTD MLRHYLGAGR NKALAAGRAH LSRPAFYDRL RRVAHVLDAD LDSVETCLSL HVALLSLDSV RDER
|
| |