Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_3567 |
Symbol | |
ID | 9247436 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 4277991 |
End bp | 4279160 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | |
Product | transcriptional regulator, MerR family |
Protein accession | YP_003681474 |
Protein GI | 297562500 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.728715 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTCTCCC AGCTAGAGTT CTACGCGTGT GGTGCGCGAC ACAAGACGCA CCGCTTTCGC AACGCTTGGG GCGCCATGGC CGACGCACTG ACCCCGGGGG CGACCGCGCG TCTGCTCGGG GTCTCGCCCT CCACGCTGCG CAGCTGGGAC CGGCGCTACG GCGTGGGGCC GCGCGAGCGC AGTCCGGGCG GGCACCGCCG CTACTCGCCC GCCGACGTGG CGCGTCTGCG CGAGCTGTGC CGCCTGGTCG GTGAGGGCCT GTCGCCCGCC TCGGCGGCGG AGTGCGTGCT GGTCCCGGCT CGCGGGGGTC CGGCGCCCGA TCCAGCCCCG CCGCGCGTTC CCCGGCCGCG GGGCGCGGGG GTGCCACGGG TGGGCGGAGG CGCCGAGGAG GAGGAACCGG GGACCGGGCA GGAGGGTCCG CGGGACTTCC GGAAGGGCTC GGGAGCCAGC GGGGAGGACG CGGGCAGCGC CCAACCGGGC GCGAAAGCTC CCGGGAGGGG TGCGCGGCCC CGCTGGCGCC CCGGCGGGGA CACCCTGCCG CTGGTTCCGG CCGGGCCCAC CCTCCAGGGG ATCGCCCGCG CCGCCATGCG CATGGACGCC GAACTCGTGG AGCGCCTGCT GGAGGAGGCC CTGGACGAGT ACGGCGTGGT GGCGGCCTGG GAGGACCTGG CGATGCCGCT GCTGTACGGG ATGGGCCGCA AGTGGGAGGA CACCCGGCGC TACGTGGAGG TGGAGCACCT TCTGTCGTGG TGCGTGTCCT CGGCGCTGCG CCGCGTGGCC GCCCCCGGGG ACGCGGACCC GGCGGGCCGC CCCACGGTCC TGGCCTGCGG CCCCGGCCAG ATGCACAGTC TCCCGATGGA GGCGCTGGCC GCCGCGCTGC GCGAACGGGG CGTGCCGCGC CGGGTGCTGG GGCCGTGCAC GCCCGTGGTG GCGACGGTGC GGGCGGTGCG CCGCACGGGT CCGCGCGCGG TGGTCCTGTG GTCCCACGCC GGAGACGCCG ACGACGTCGC GGCGCTGCGG GCGGCGGTGC GCGCGGCGGC GGGGTCGGCG CAGGCCACGG CCGTGTACAC GGCCGGGCCG GGCTGGCGGT CGCTGGGCGC GGCGCCGGGG CTGGCCGCCG GGCACCTGGG CTCGCTCACC GACGCCGTGC GGGCCCTGGC CCCCGGCTGA
|
Protein sequence | MFSQLEFYAC GARHKTHRFR NAWGAMADAL TPGATARLLG VSPSTLRSWD RRYGVGPRER SPGGHRRYSP ADVARLRELC RLVGEGLSPA SAAECVLVPA RGGPAPDPAP PRVPRPRGAG VPRVGGGAEE EEPGTGQEGP RDFRKGSGAS GEDAGSAQPG AKAPGRGARP RWRPGGDTLP LVPAGPTLQG IARAAMRMDA ELVERLLEEA LDEYGVVAAW EDLAMPLLYG MGRKWEDTRR YVEVEHLLSW CVSSALRRVA APGDADPAGR PTVLACGPGQ MHSLPMEALA AALRERGVPR RVLGPCTPVV ATVRAVRRTG PRAVVLWSHA GDADDVAALR AAVRAAAGSA QATAVYTAGP GWRSLGAAPG LAAGHLGSLT DAVRALAPG
|
| |