Gene Information |
Locus tag | Namu_1043 |
Symbol | |
ID | 8446639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 1153111 |
End bp | 1154571 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 645040181 |
Product | transcriptional regulator, GntR family with aminotransferase domain |
Protein accession | YP_003200440 |
Protein GI | 258651284 |
COG category | [E] Amino acid transport and metabolism; [K] Transcription |
COG ID | [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs |
TIGRFAM ID | |
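The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither convention is stated in the record. The function names are illustrative and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 1153111
END_BP = 1154571


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(n_bases: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus one stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bases // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1461
print(protein_length(length_bp))  # 486
```

Under these assumptions, both results agree with the Gene Length (1461 bp) and Protein Length (486 aa) fields.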
Plasmid Coverage information |
Num covering plasmid clones | 40 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTACCC ATCTGCTGTC GGCCGGCCGC CTGGCCCGGG ACCTGGCCGA CTGGCGCGAC GACGGGCAGC GGCCGCGGCC GGCCTTCCGG GCGTTGGCCG AACGGATCAG CGTGCTGGCC CAGGACGGCC GGCTGCCCAC CGGCAGCGGG TTGCCGGGGG AGCGCGAGCT GGCCACCGCC CTGCAGGTCA GCCGGACCAC GGTGACCGCC GCCTACGCGC TGCTGCGGGA ACGGGGCTAC CTGGACTCCC GGCAGGGCGC GCGGAGCACC GTGATGCTGC CGGTCACGCA GGCGGCCGGC TCCGCGGTCG GGTACCGGTT CGGGGTGATG AACGATCCGG ACGACGCCGT GATCGACCTG TCCTATGCGG CACCGCCCGC GTTGCCGGCG GTCGGCGTGG CCTACCGGGA GGCGCTGGCG TCGATGCCCG AGCAGCTGGC CGGGCACGGA TTGGGCGTGT TCGGCATCCG GCGGTTGCGC CAGGCGGTCG CCGACCGCTA CACCGCCCGC GGAGTGCCCA CCCGTCCGGA CCAGATCCTG ATCACCCACG GCGCGCAGCA GGCGATGTCG TTGGTCGTCG CGGTGCTCAC CGCCCCCGGC GACCGGGTGC TGATCGAACA TCCGACCTAC CCGCACGTGC TCGAATCGAT CGCCGCGGCC GGCGGGCGGG CGGCCCCGGT GCCGCTGCTC ACCGACGAGG GCGCCGCCGG GTGGGACCTG GAGGGGTTGC GCGCGGCCGT CCGGCAGTTG GCTCCGAGCC TGGCCTACCT GGTGGTCGAC CACCACAACC CGACCGGGTT GACGCTGGGC GAGGCCGGGC GGCGCGAGCT GGCCGGGATC GCCCGGGCCG GCCGGATGAC CCTGGTGGTG GACGAGTCGA TGGCCGAGAT CGTGCTCGAC GGCGAGCGGA TGCCGCCGAT GGCCGCATTC GGGCCGGCGA TCAGCATCGG CAGCGCCTCG AAGCTGTTCT GGGGCGGCCT GCGGGTCGGC TGGGTGCGGG CGGACGAGGC GACGATCACC CGGCTGGCCA CCGCCCGCGC GCCCCTGGAT CTGGGCGTGC CGCCGTTGGA GCAGCTGGCC GTCGCGCTGC TACTGGAGCA GGCCGACCCG CTGATCGCCG AACGCACCGC ACAGCTCCGC GGCCGGCGGG CGGCGTTGAC CGACGCGTTG CGCGCGCAGC TGCCGGACTG GCGCTGGCTG CCCGGCGTCG GGGGGATGTC GCTGTGGGTG CAACTGCCGC GCCCGGTGTC CTCCCGGCTC AGCGCGGTCG CGGTGGAGTT CGGGGTGGTG ACCACCGCCG GCCCGCGGTT CGGCATCAAC GGCGCGTTCG AACAGTGGGC CCGGCTGCCC TACGTGCACG AACCGGACCG GCTGCGGGCC GCCGTGGCCG GCCTGGCCGG CGCCTACCGG GCCGTCACCG CCGGCGCCGG CGTCCGGCCC GAGCCCTCGG TGATGGTCTG A
|
Protein sequence | MSTHLLSAGR LARDLADWRD DGQRPRPAFR ALAERISVLA QDGRLPTGSG LPGERELATA LQVSRTTVTA AYALLRERGY LDSRQGARST VMLPVTQAAG SAVGYRFGVM NDPDDAVIDL SYAAPPALPA VGVAYREALA SMPEQLAGHG LGVFGIRRLR QAVADRYTAR GVPTRPDQIL ITHGAQQAMS LVVAVLTAPG DRVLIEHPTY PHVLESIAAA GGRAAPVPLL TDEGAAGWDL EGLRAAVRQL APSLAYLVVD HHNPTGLTLG EAGRRELAGI ARAGRMTLVV DESMAEIVLD GERMPPMAAF GPAISIGSAS KLFWGGLRVG WVRADEATIT RLATARAPLD LGVPPLEQLA VALLLEQADP LIAERTAQLR GRRAALTDAL RAQLPDWRWL PGVGGMSLWV QLPRPVSSRL SAVAVEFGVV TTAGPRFGIN GAFEQWARLP YVHEPDRLRA AVAGLAGAYR AVTAGAGVRP EPSVMV
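The relationship between the two sequence fields can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), so a standard codon table is used here. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first three blocks and the final blocks of the gene sequence are used. The final fragment is treated as in frame on the assumption that every preceding block holds exactly ten bases, which places it at offset 1440.

```python
# Standard codon table built from the compact NCBI layout (base order T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_blocks = "ATGAGTACCC ATCTGCTGTC GGCCGGCCGC"  # start of the gene sequence
last_blocks = "GAGCCCTCGG TGATGGTCTG A"            # end of the gene sequence

print(translate(first_blocks))  # MSTHLLSAGR
print(translate(last_blocks))   # EPSVMV
```

The two outputs match the first and last residues of the protein sequence field. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.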