Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2305 |
Symbol | |
ID | 9246155 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2754339 |
End bp | 2755388 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | transcriptional regulator, LacI family |
Protein accession | YP_003680233 |
Protein GI | 297561259 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGTCCACCA GGTCCCCCAA CGCACGCCGA GGCGTGGGTA TCACCGAGGT CGCCCGGCGC GCCGGGGTCT CGCCCGGAAC CGTGTCCAAC GTGCTCAACC GCCCCGAGCG CGTCGCCGAG GCCACCCGCC TGCGGGTCGA GGCGGCCATC ACCGAGCTCG ACTACGTGCG CAACTCCTCC GGCAGCAGCC TGCGCTCGGG CCGCAGCGAG TCCGTGGGGC TGCTCGTGCT GGACGTGACC AACCCGTTCT TCACCGAGGT CGCCCGGGGC GTGGAGGACG AGGCGGCCGG ATCCGGGCTC GCCGTGGTGC TGCTCAACTC CGCCGAGAAG CGCGAGCGCC AGCAGCGCAA CGTGCGCCTG CTGGCCGAGC AGCGCGCGGC CGGGGCCGTG GTGATGCCGG TGGACGACGA CCTGTCGGAC CTTCTGTGGC TGAGCCGCCA GGGGACCTTC TGGGTGGCCC TGGACCGGGG CGACGTCGCC GAGGACGTGG GGTGCAGCGT CAGCGTGGAC AACCACGCGG GCGGGATGGC GGCCGGGAGG CACCTCATCG GGCTGGGGCA CGAGACGGTG ACCTTCCTGA CCGGGCCGTT CGCCATCGAA CAGGTCAGGC GCAGGCACGA GGGGCTGCGC GACGCCTTCA CCGAGGCCGG GCTGGACCCG GACTCCTGCG TGCGGGTGGT CGAGCAGCCG CTGCTCAACC CCGAGCAGGG CGAGCGGGCG GTGGACGCGA TCCTGGGCGG GGGGCCGCGG CAGCGGCCGC GGGCGGTGTT CTGCGCCAAC GACCAGCTCG CGCTCGGCGT GATGAAGGGC CTGGGCCAGC GGGGCCTGCG GGTGCCCGAG GACATGTCGG TGGTGGGCTA CGACAACGTG GACTTCGCCG ACCTGGTGCA CCCGGGGCTG ACCACGGTGG CCCAGCCCAA GTACGAACTG GGCCGGGCGG CGATGCGCCT GCTGGAGTCG GAGCTGAACC ACGGCGAGCA CGTCCACGAG CGGGTGCTGT TCACGCCGGA GCTGGTGGTG CGGGGTTCGA CGGCGGTGTA CCGGGACTGA
|
Protein sequence | MSTRSPNARR GVGITEVARR AGVSPGTVSN VLNRPERVAE ATRLRVEAAI TELDYVRNSS GSSLRSGRSE SVGLLVLDVT NPFFTEVARG VEDEAAGSGL AVVLLNSAEK RERQQRNVRL LAEQRAAGAV VMPVDDDLSD LLWLSRQGTF WVALDRGDVA EDVGCSVSVD NHAGGMAAGR HLIGLGHETV TFLTGPFAIE QVRRRHEGLR DAFTEAGLDP DSCVRVVEQP LLNPEQGERA VDAILGGGPR QRPRAVFCAN DQLALGVMKG LGQRGLRVPE DMSVVGYDNV DFADLVHPGL TTVAQPKYEL GRAAMRLLES ELNHGEHVHE RVLFTPELVV RGSTAVYRD
|
| |