Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3536 |
Symbol | |
ID | 9247405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4246577 |
End bp | 4247533 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | transcriptional regulator, LysR family |
Protein accession | YP_003681443 |
Protein GI | 297562469 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.246229 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.724782 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCTATA GGAATGTCAA TGGTTCGGGC ACGGCGGCCG ATCCGCGGCC CGGCGGGCCG CTCGATCTGC AGCAGTTGAG GACCTTCCTG GCGGTACACC GGTCGGGGTC CTTCACCGCC GCCGCGCAGT TGCTGCGGCT GTCGCAGCCC ACGGTCACCA CCCAGATCCG TTCCCTGGAG CGGCGGCTCG GACACCCCCT CTTCGAGAGG CTGCCCCGGG GGGTGGCCCC GACCGCCGAG GCCAACGACC TGGCCGCGCG CGTGGCCGGG CCCCTGGACG CGCTGGAGGT GGTCGCCGGG CGCGACGGCG GCGGCGCGTC CGCGTCGCCC GCGCCGCTGC GCGTGGCGGG CCCCGCCGAG GCGCTGCACG CGCTGGTGCT GCCCGCGCTC GCGCCCCTGG TGGAGGGCGG CCTGCGGCTG CGCGTGCGCA CCGGTCCGGC CGAGGACCTG CTGGAGGAGC TGCGCCGGGA CCGGCACGAC CTGGTGGTGT CCGCGGTGCG GCCGCGGGGG CGGACCCTGG TCGCCGAACC CCTCGCCGAC GAGGAGTTCG TGCTGGTGGC GTCGCCGGCG TGGGCGGACC GGATCGGCCG CGACCGGCTC CTGCGCCAGG GCGCCTCCGC GCTGTGCGCG GTCCCGCTGG TGGCCCACGA CGACGAGTTG TCGGTCCTGC GCCGCTACTG GCGGCAGGTG TTCGGGGTGC GGCTGACGTG TGCGCCCGCG GTCACCGCGC CCGACCTGCG CGGGGTGGTG TCGGCCGTCA CGGCGGGGGC GGGGGTGACG GTGCTGCCGC GCCACCTGTG CCGGGAGACG CTGGCCGAGG GGCGCCTGGT GGCGCTCCTG GCCCCGGAGG AGCCGCCGGT CACCACGTTG TACCTGGTAC GGCGCCCGGG GGTGGCGGAC AACCCGTGCC TGGAGGAGGT GCACCGGCGC CTGGTGGCGG CCGCCCGCGC CTGTTGA
|
Protein sequence | MSYRNVNGSG TAADPRPGGP LDLQQLRTFL AVHRSGSFTA AAQLLRLSQP TVTTQIRSLE RRLGHPLFER LPRGVAPTAE ANDLAARVAG PLDALEVVAG RDGGGASASP APLRVAGPAE ALHALVLPAL APLVEGGLRL RVRTGPAEDL LEELRRDRHD LVVSAVRPRG RTLVAEPLAD EEFVLVASPA WADRIGRDRL LRQGASALCA VPLVAHDDEL SVLRRYWRQV FGVRLTCAPA VTAPDLRGVV SAVTAGAGVT VLPRHLCRET LAEGRLVALL APEEPPVTTL YLVRRPGVAD NPCLEEVHRR LVAAARAC
|
| |