Gene Namu_1043 details

Gene Information

Locus tag: Namu_1043
Symbol:
ID: 8446639
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 1153111
End bp: 1154571
Gene Length: 1461 bp
Protein Length: 486 aa
Translation table: 11
GC content: 75%
IMG OID: 645040181
Product: transcriptional regulator, GntR family with aminotransferase domain
Protein accession: YP_003200440
Protein GI: 258651284
COG category: [E] Amino acid transport and metabolism; [K] Transcription
COG ID: [COG1167] Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs
TIGRFAM ID:
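
The coordinate and length fields above can be cross-checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1153111, 1154571)
    print(length)                  # 1461
    print(protein_length(length))  # 486
```

Both results agree with the Gene Length (1461 bp) and Protein Length (486 aa) fields listed in the record.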


Plasmid Coverage information

Num covering plasmid clones: 40
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTACCC ATCTGCTGTC GGCCGGCCGC CTGGCCCGGG ACCTGGCCGA CTGGCGCGAC 
GACGGGCAGC GGCCGCGGCC GGCCTTCCGG GCGTTGGCCG AACGGATCAG CGTGCTGGCC
CAGGACGGCC GGCTGCCCAC CGGCAGCGGG TTGCCGGGGG AGCGCGAGCT GGCCACCGCC
CTGCAGGTCA GCCGGACCAC GGTGACCGCC GCCTACGCGC TGCTGCGGGA ACGGGGCTAC
CTGGACTCCC GGCAGGGCGC GCGGAGCACC GTGATGCTGC CGGTCACGCA GGCGGCCGGC
TCCGCGGTCG GGTACCGGTT CGGGGTGATG AACGATCCGG ACGACGCCGT GATCGACCTG
TCCTATGCGG CACCGCCCGC GTTGCCGGCG GTCGGCGTGG CCTACCGGGA GGCGCTGGCG
TCGATGCCCG AGCAGCTGGC CGGGCACGGA TTGGGCGTGT TCGGCATCCG GCGGTTGCGC
CAGGCGGTCG CCGACCGCTA CACCGCCCGC GGAGTGCCCA CCCGTCCGGA CCAGATCCTG
ATCACCCACG GCGCGCAGCA GGCGATGTCG TTGGTCGTCG CGGTGCTCAC CGCCCCCGGC
GACCGGGTGC TGATCGAACA TCCGACCTAC CCGCACGTGC TCGAATCGAT CGCCGCGGCC
GGCGGGCGGG CGGCCCCGGT GCCGCTGCTC ACCGACGAGG GCGCCGCCGG GTGGGACCTG
GAGGGGTTGC GCGCGGCCGT CCGGCAGTTG GCTCCGAGCC TGGCCTACCT GGTGGTCGAC
CACCACAACC CGACCGGGTT GACGCTGGGC GAGGCCGGGC GGCGCGAGCT GGCCGGGATC
GCCCGGGCCG GCCGGATGAC CCTGGTGGTG GACGAGTCGA TGGCCGAGAT CGTGCTCGAC
GGCGAGCGGA TGCCGCCGAT GGCCGCATTC GGGCCGGCGA TCAGCATCGG CAGCGCCTCG
AAGCTGTTCT GGGGCGGCCT GCGGGTCGGC TGGGTGCGGG CGGACGAGGC GACGATCACC
CGGCTGGCCA CCGCCCGCGC GCCCCTGGAT CTGGGCGTGC CGCCGTTGGA GCAGCTGGCC
GTCGCGCTGC TACTGGAGCA GGCCGACCCG CTGATCGCCG AACGCACCGC ACAGCTCCGC
GGCCGGCGGG CGGCGTTGAC CGACGCGTTG CGCGCGCAGC TGCCGGACTG GCGCTGGCTG
CCCGGCGTCG GGGGGATGTC GCTGTGGGTG CAACTGCCGC GCCCGGTGTC CTCCCGGCTC
AGCGCGGTCG CGGTGGAGTT CGGGGTGGTG ACCACCGCCG GCCCGCGGTT CGGCATCAAC
GGCGCGTTCG AACAGTGGGC CCGGCTGCCC TACGTGCACG AACCGGACCG GCTGCGGGCC
GCCGTGGCCG GCCTGGCCGG CGCCTACCGG GCCGTCACCG CCGGCGCCGG CGTCCGGCCC
GAGCCCTCGG TGATGGTCTG A
 
Protein sequence
MSTHLLSAGR LARDLADWRD DGQRPRPAFR ALAERISVLA QDGRLPTGSG LPGERELATA 
LQVSRTTVTA AYALLRERGY LDSRQGARST VMLPVTQAAG SAVGYRFGVM NDPDDAVIDL
SYAAPPALPA VGVAYREALA SMPEQLAGHG LGVFGIRRLR QAVADRYTAR GVPTRPDQIL
ITHGAQQAMS LVVAVLTAPG DRVLIEHPTY PHVLESIAAA GGRAAPVPLL TDEGAAGWDL
EGLRAAVRQL APSLAYLVVD HHNPTGLTLG EAGRRELAGI ARAGRMTLVV DESMAEIVLD
GERMPPMAAF GPAISIGSAS KLFWGGLRVG WVRADEATIT RLATARAPLD LGVPPLEQLA
VALLLEQADP LIAERTAQLR GRRAALTDAL RAQLPDWRWL PGVGGMSLWV QLPRPVSSRL
SAVAVEFGVV TTAGPRFGIN GAFEQWARLP YVHEPDRLRA AVAGLAGAYR AVTAGAGVRP
EPSVMV
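
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup and of a GC percentage calculation. The helper names are hypothetical, and the short input used here is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, as formatted in the record.
    first_blocks = clean_sequence("ATGAGTACCC ATCTGCTGTC")
    print(len(first_blocks))         # 20
    print(gc_content(first_blocks))  # 50.0

    # Last line of the gene sequence: the CDS ends with a TGA stop codon.
    tail = clean_sequence("GAGCCCTCGG TGATGGTCTG A")
    print(tail.endswith("TGA"))      # True
```

Running `gc_content` on the complete cleaned gene sequence would give the value to compare against the GC content field in the record.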