Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1195 |
Symbol | |
ID | 9245046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 1451653 |
End bp | 1452672 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | |
Product | transcriptional regulator, LysR family |
Protein accession | YP_003679142 |
Protein GI | 297560168 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.279757 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.378839 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACATCG CCCAGTTGCG GGACTTCATC GCGGTCATCG ACAGCGGGAG CTTCACCCGC GCCGCTTCGG TGCTGTTCGT CTCGCAGCCG GCGGTCAGCC AGCGGATGAA GCAGCTGGAG AGCGAACTGG GCGTGCGGCT CGTGCAGCGC GGCCCCCGCG GGGTGGTGCC CACACCGGCG GGGCGGACCC TGTACCGGGA CGCCCAGCAG CTCATCCGCC GGTTCGACCA GATCGCCGAG GACGTGGCCA AGGAGCCCCG GGCCATCCGC GGACCGGTGG CCGTCGGCCT GCCCACCGCG GCCGCCGTCC ACCTCGCCCC GGCGCTGTTC TCCTGGACGA AACGGCACTA CCCGGGGGTC CGCCTGCGGC TGTTCGAGTC GGTGAGCGGA TACATCCAGG AGCTGCTCAC GGTCGGGCGG ATGGACCTGG CCGTCCTCTA CCGCGACGAC GCGGCGCCCC GGCCGGCCGA GACGCCGCTG TACTCCGAGG AGCTGTACCT GGTCGGGCGC TCGGACGCCG AGGAGCCACC CCGGGGCGGC CGGGCCGCCG GGTCCGCCGG GACGGGCGCG GCCGCCGCCG ACACCGCGGC GTACGGGGAC ATCAGCCTGG CCGACATGCT CCGGGTGCCG CTGGTAGCCC CCGGGGCGCG CAGCAACCTG CGCGTGCTCA TCGACCGCGT CTTCACCGAA CACGGCGCGG CGCCCGTGAT CGCCGCCGAC GTGGAGTCCC TGGGCACGAT GGTGCGCATC GCCGAGAGCG GCGAGGCCTG CGCCCTGCTC CCGCTGTCCA GCGTCGAGGC GCTGCGCAGT ACCCCCGACC TCATGGTGCG GCGGGTCGTG GACCCCGTGA TCGAACGCCA CATCGCGGTG TGCGCCGGTT CGGACTACTA CGAGCCGCGG GACGCGGTGT CCGTCGTCCG GCACGGCATC GTGCAGGTGA CGACCCGGCT CGCCGAGCAG GGGGCCTGGC CGGGCATCCG CCCGGCGGCC CGGACCGAAC CGCGTGCCGG CCGACCCTGA
|
Protein sequence | MDIAQLRDFI AVIDSGSFTR AASVLFVSQP AVSQRMKQLE SELGVRLVQR GPRGVVPTPA GRTLYRDAQQ LIRRFDQIAE DVAKEPRAIR GPVAVGLPTA AAVHLAPALF SWTKRHYPGV RLRLFESVSG YIQELLTVGR MDLAVLYRDD AAPRPAETPL YSEELYLVGR SDAEEPPRGG RAAGSAGTGA AAADTAAYGD ISLADMLRVP LVAPGARSNL RVLIDRVFTE HGAAPVIAAD VESLGTMVRI AESGEACALL PLSSVEALRS TPDLMVRRVV DPVIERHIAV CAGSDYYEPR DAVSVVRHGI VQVTTRLAEQ GAWPGIRPAA RTEPRAGRP
|
| |