Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_5075 |
Symbol | |
ID | 9248964 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014211 |
Strand | - |
Start bp | 219870 |
End bp | 220922 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | |
Product | transcriptional regulator, LacI family |
Protein accession | YP_003682962 |
Protein GI | 297563989 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACCG GGAACACGGG TCGCAAGCGC CCCACGATCA GCGACGTCGC CCGCCGGGCG GGTGTGAGCA AGGGCGCCGT GTCCCGGGCG TTGAACACCG GTACCGGCAC CAGTGCCAGC ACCCGTGAAC GCATCCGCCA GGCGGCGGTC GAACTGGGCT GGAACCCGAG TTACGCCGCA CGCGCGCTCA ACGGCAAGTC GCTGGGCACC ATCGGACTGA TCGTGCGCCG CTCCCCCGAG ATCCTCGACT TCGACGCGTT CTTCCCCTCG TTCCTGTCCG GTATCGAGTC GGTGCTCTCC GGCGAGGAGC ACGCCACCGT CATCCGCTTC GTACCCGACG AGCGGACCGA GGCCGCCACC TACGAGCGGC TGTTCAACGA CCACTTCGTC GACGGCTTCC TGGTCACCGA CCTGCGGACG GACGACGGCC GTCCCGCGAT GCTGCGCCGC CTGGGCGCGC CCGCCGTGGT GGTGGGCGCG CCCGAGGGCT CCTCCGACTT CCCCACCGTC ACCAACGACT CCAGGGAGGC GATCCGCGGT CTGGTGCGCT GCTTCGCCGA GGCGGGGCAC CGGCGCATCG CCCACGTCCA GGGCGACCCC CACATGCTGC ACGCGCACCA GCGCCGGCGC CACTGGGAGG AGGCCGTGCG CGAGTTCGGC CTGGAGCCGG GGCCCGTGGA GGAGCACGGC GGCTACACCA TCGAGGGCGG GGCCCGGGCC ACGGAGCGCA TCCTCGCCAG GCCCGCCGCC GAACGCCCCA CCGCCGTCTT CTACGGCAGC GACCTCATGG CCATAGGCGG TTACTCCGTG CTGGGGGAGG CGGGTCTGAC CGTCCCCGAC GACATGGCGG TGGCCGGGTT CGACGACATC CCCCTGGCCT CGTTCGTCAC ACCGCCGCTG ACGACCGTCC GCAACAGGCA CCGCGCGCTG GGCAGTGTCG GGGCCCGCAT CCTGCTGGAC ATGCTCAAGG GGCAGGAGCC GCCGCTGTCG ACCGTGCTCG TCGGTGAGCT CCGCCCGAGG AAGTCGTCCG GACAGCCGAT CCAAGGCTTG TAA
|
Protein sequence | MATGNTGRKR PTISDVARRA GVSKGAVSRA LNTGTGTSAS TRERIRQAAV ELGWNPSYAA RALNGKSLGT IGLIVRRSPE ILDFDAFFPS FLSGIESVLS GEEHATVIRF VPDERTEAAT YERLFNDHFV DGFLVTDLRT DDGRPAMLRR LGAPAVVVGA PEGSSDFPTV TNDSREAIRG LVRCFAEAGH RRIAHVQGDP HMLHAHQRRR HWEEAVREFG LEPGPVEEHG GYTIEGGARA TERILARPAA ERPTAVFYGS DLMAIGGYSV LGEAGLTVPD DMAVAGFDDI PLASFVTPPL TTVRNRHRAL GSVGARILLD MLKGQEPPLS TVLVGELRPR KSSGQPIQGL
|
| |