Gene Information |
Locus tag | Ndas_2472 |
Symbol | |
ID | 9246322 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 2934047 |
End bp | 2935066 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | transcriptional regulator, LacI family |
Protein accession | YP_003680398 |
Protein GI | 297561424 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
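The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 2934047
end_bp = 2935066


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(n_bases: int) -> int:
    """Residue count for an unspliced CDS: whole codons minus one stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bases // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1020 bp) and Protein Length (339 aa) fields.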
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.63522 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGGAGA ACGGGACGGG CGCCCATCGG CACCCGACGA TGGCCGACGT GGCCGCGCAC CTGGGGGTCT CGCGGCAGCT GGTCTCGCTT GTGCTGGGCG ACCGCCCGGG CCCGAGCGCG CGGACCCGGG AGCGGGTCCT GCGCGCCGCC GAGGAGCTGG GGTACCGGGC CGACACCGCC GCACGGCTGC TGCGCCGGGC GCGCAGCCGA CAGGTGGGCG TGCTCTTCGG CCTGGAGTAC CCGATGGACG CCCACCTGGT CGAGGAGCTC TACCCCGCCG CCGCCGAACT CGGGTACGGC GTGGTGCTGA GCGCGATGGC CAGCACCCGC AGCGAACGCG AGGCGATCGA CGAACTGGTG GGGCTGCGCT GCGAGGCGCT GATCCTGATC GGCCTGTCCG CGGAGGCGCC CGCGGACCTG GCGCGGGTCG CCGAACGGGT CCCCGTGGTG GAGATCGGCC AGCGCACCGG CGCGGAGGGG ACCGACAGCG TGGGCACGGA CGACGCCGCG GGCGTGGGCC AGGCCGTCGG GCACCTGGTG GAACTGGGCC ACCGCGACAT CGTGCACGTC GACGGCGGCG GGCTCCCGGG GGCCGGCGGG CGCGCGCGGG GCTACGCCGA GGCGATGCGC GGGTACGGCC TGGGCGGGCG CGTCGAGGTC CTGCCCGGCG ACTACTCCGA GGAGGCGGGC GCGCGGGCGG CGCGGGTGCT CCTGGCGCGG GAGGCCCTGC CCACGGCCGT GGTCGCGGCC AACGACCTGT GCGCGTTCGG GCTGCTGGCG ACACTGGTCC GCGCCGGGGT GAGCGTGCCC GGGGACGTCT CGGTGGTGGG TTACGACGAC AGCCGGACCG CGCGGCTCTC CTTCCTCCAG CTCACGTCGG TGCGCCAGGA CGCGGCGCGC ATGGCGCGGC TCGCCGTCCG CTCCGCGTCC GAGCGCCTGG ACGGCGGGCG CACCGAGAGC AGGCATCTGC TGCTGGAGCC CGCCCTGACG GTGCGCGGCA GCACCGCTCC CCCGCGCTGA
|
Protein sequence | MSENGTGAHR HPTMADVAAH LGVSRQLVSL VLGDRPGPSA RTRERVLRAA EELGYRADTA ARLLRRARSR QVGVLFGLEY PMDAHLVEEL YPAAAELGYG VVLSAMASTR SEREAIDELV GLRCEALILI GLSAEAPADL ARVAERVPVV EIGQRTGAEG TDSVGTDDAA GVGQAVGHLV ELGHRDIVHV DGGGLPGAGG RARGYAEAMR GYGLGGRVEV LPGDYSEEAG ARAARVLLAR EALPTAVVAA NDLCAFGLLA TLVRAGVSVP GDVSVVGYDD SRTARLSFLQ LTSVRQDAAR MARLAVRSAS ERLDGGRTES RHLLLEPALT VRGSTAPPR
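The sequences above are printed in space-separated blocks of ten characters, so the spaces have to be removed before any analysis. The sketch below is a minimal illustration that uses only the first three blocks of the gene sequence. It builds the codon table by hand; translation table 11 assigns the same amino acids to codons as the standard code, and start-codon handling is ignored here. The helper names are hypothetical.

```python
# First three 10-base blocks of the gene sequence, copied from the record above.
fragment = "ATGTCGGAGA ACGGGACGGG CGCCCATCGG"

# Codon table in TCAG order; codon-to-amino-acid assignments are identical
# in translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate whole codons in frame 1; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


dna = clean(fragment)
print(translate(dna))      # first residues of the protein
print(gc_fraction(dna))    # GC fraction of this fragment only, not the whole gene
```

The translated fragment can be compared with the first block of the protein sequence in the record. Running the same functions on the full gene sequence would give the gene-wide GC fraction, with the final codon translating as a stop.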