Gene Information |
Locus tag | Ndas_1161 |
Symbol | |
ID | 9245011 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1416091 |
End bp | 1417134 |
Gene Length | 1044 bp |
Protein Length | 347 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | transcriptional regulator, LacI family |
Protein accession | YP_003679108 |
Protein GI | 297560134 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
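The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming inclusive 1-based coordinates and a single terminal stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers introduced here, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1416091
END_BP = 1417134


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

Under those assumptions the result agrees with the listed Gene Length (1044 bp) and Protein Length (347 aa).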
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.193058 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGCCG TCCGCGGGTC CACCGGCCGC CCCACGATCT CCGACGTCGC CGCGCTGGCG GGGGTGTCGG TCTCGGCCGT GTCGAAGGTG GTCAACAACA AGGGCCGCAT CGCCGAGCCC ACCCGCCGGC GGGTCCTCGA CGCCGTGGAG AAGCTCCAGT GGTCGCCCTC GGCCGCGGCC GTGGCCCTGC GCGGGGCCCG CACCCGGGCC ATCGGCATGG TCTCGCGCCG CTCCCCCGAC CTGCTGTCCT ACGACCCGCA CTTCGGCGTG CTGATCTCGG GCATCGAGCG CGAACTCGCC CCCCTGGACT ACGGTCTGCT GCTGCACATC GTGGGCGAGG AGGAGGACGC CGAACGGCGC GCCTACCGGC GTCTGGCCGA GGAGCGCCGG GTGGACGGAG TCATCCTCAC CGAGAGCCTG GTCCGCGACC CCCGGTTCGA CCTGCTGCGG AGGCTCGGGC TCCCCTCGGT GCTGATCGGC ATGCCCTGGC GCGACGACCC GGTCGCCTCG GTGCTCGCCC CCGACCAGGA GCGGGGGCTG GTCGAGGCGG TCCGGCACCT GGCCTCGCTG GGGCACCGGC GGGTGGCCTA CGTGTGCGGC CCCGAGGACC GCGTGCACAC CGGGTTCCGC CGCCGGATCG TGGAGGAGCA GGTGCGGCGG CACGGGCTGT CCGCCGTCGT GCTGACGGTC CCCGACTTCA CCACCGAGGG GGCGGCCGCG GCCACCGACG AGGCGCTGTC GGCGGCCGAG CGGCCCACCG CGATCCTGTT CGCCAACGAC ATGATGGCGA TCGCCGGGAT CAGCGCCGCG CGCAGGCGGG GGCTGGAGGT GCCCCGGGAC CTGTCCGTCG TCGGCCACGA CGGCCTGCCC CTGGGCGCGC TCGTACAGCC TCGGCTGACC ACGGTCGGCC ACGACCTGGT GGGGCTCGGG CGGGCCGCCG CGCTGACGCT CGTGGCCGCG CTGGACGGGG TGCCGGCCGA CGTGCCCGCG ATCGCCCCTC CCGGGCTGGT CGTGCGCGAG TCCACGGCGC CGCCCGGAGC CTGA
|
Protein sequence | MTAVRGSTGR PTISDVAALA GVSVSAVSKV VNNKGRIAEP TRRRVLDAVE KLQWSPSAAA VALRGARTRA IGMVSRRSPD LLSYDPHFGV LISGIERELA PLDYGLLLHI VGEEEDAERR AYRRLAEERR VDGVILTESL VRDPRFDLLR RLGLPSVLIG MPWRDDPVAS VLAPDQERGL VEAVRHLASL GHRRVAYVCG PEDRVHTGFR RRIVEEQVRR HGLSAVVLTV PDFTTEGAAA ATDEALSAAE RPTAILFAND MMAIAGISAA RRRGLEVPRD LSVVGHDGLP LGALVQPRLT TVGHDLVGLG RAAALTLVAA LDGVPADVPA IAPPGLVVRE STAPPGA
|
| |
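The gene sequence as listed begins with ATG and ends with TGA, so it appears to be given in the coding orientation even though the gene lies on the minus strand. The sketch below translates the first ten codons and compares them with the start of the protein sequence. It assumes that translation table 11 assigns amino acids to codons exactly as the standard code does (the two tables differ in which start codons they permit); `translate` is a hypothetical helper written for this example, not a library function.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments shared by the
# standard code and translation table 11 ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base blocks of the Gene sequence field.
    first_30_bases = "ATGACGGCCG TCCGCGGGTC CACCGGCCGC"
    print(translate(first_30_bases))
```

The same function can be applied to the full Gene sequence field to check it against the full Protein sequence field.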