Gene Namu_5023 details

Gene Information

Locus tag: Namu_5023
Symbol:
ID: 8450654
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nakamurella multipartita DSM 44233
Kingdom: Bacteria
Replicon accession: NC_013235
Strand:
Start bp: 5600924
End bp: 5603872
Gene Length: 2949 bp
Protein Length: 982 aa
Translation table: 11
GC content: 75%
IMG OID: 645044060
Product: transcriptional regulator, LuxR family
Protein accession: YP_003204284
Protein GI: 258655128
COG category: [R] General function prediction only
COG ID: [COG3899] Predicted ATPase
TIGRFAM ID:
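
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon. Neither convention is stated in this record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 5600924
end_bp = 5603872

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and adds no residue.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp, protein_length_aa)  # 2949 982
```

Under these assumptions the computed values agree with the listed gene length (2949 bp) and protein length (982 aa).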


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGGACC CGGCTGCCAC TATTGACCTC GTGGCACTGA CGGTCGACCT CCCGTTCGTC 
GGTCGCGCCG ACCAGGTGGA CCGGTTCCGC GCCGCGCTGG GCCGAGCCCG GGAGGGCGAG
CCGAGCGTCC TGCTGGTCTC CGGTGACGCC GGCGTCGGCA AGACGCGGGC GCTGACCCGG
ATGGCCGAGA TCGCCCGCGC CGAGGGGGCG TGGGTCGTCG TCAGCCACTG CGTGGATCTC
GGCGAGGTCG GGCTGCCCTA CCTGCCGTTC ACCGAGACCC TGCAGCAGTT GCGCGGGCGC
TGCGACGAGG TGGACCAGGC CATCGCCGCC CGGCCCGCCC TGGGCCGGCT GCTCGACGTC
GGGCTGGCCG AGCCGGCCGC CGGCGGGGCC GACCAGGCCG CCCGCGGGCA ACTGTTCGAC
GGCATCGCGT CGGCGATCGG GGCCGGCGGC CGGCCGGACC GGCCACTGGT GTTGATCATC
GAGGACCTGC ATTGGGCCGA CCCGTCCAGC CGGGACGTGC TGCGGTTCCT CATCGCTCGG
TTGCGCACCG AACCCCTGGT GGTGGTGGCC TCCTACCGCA CCGACGACCT GCACCGGCGC
CATCCGCTGC GCCCCACCCT GTCCGAACTG CTGCGGCATC CGCGGGTCGA TCATGTCGAG
CTGTCCCCGT TCACCCGGGA CGAGCTGGCC CAGTTCGGTG CCGCGATCAC CGGCCATCCG
CTGCCCGACC AGGTGCTGCA GCGGGTGCTG CGCCGCTCCG AGGGCAACGC CTATTTCGCG
CAGGAGCTGC TGGAGTCCGG ACCCGACACC GCCGCCCTGC CCGGCTCGTT GGCCGACGTC
CTGCACGCCC GATTGGAACG GCTGGACCCG GCGGTGCAGG CGCTGGCCGG CATCGCCTCG
GTCGCCGGTC GCCGGGTCTC CGGGGAACTG CTCACGGCGG TGGCCGGTGG ACGCCCGGAT
TTCGCCGACC CGGGCACGGT CGACGCGGCC CTGCGCGAGG CGATCGCGCA CCACGTGCTG
GCCACCGAGG ACGCCCGCTG GATCGTGTTC CGGCACGCCC TGCTGGCCGA GGCCGTCTAC
GGCGACCTGC TGCCCGGCGA GGTCACCGGC CTGCACCGGG CCTACCAGCA GCAGATCGCC
GCGAATCCGT CCCTCGGGTC GCCGGCCGAG CTGGCCTATC ACGCGCTTCG GGCCCACGAG
CTGCCGGTGG CGCTGACCGC ATCGTCCGCG GCGGCCCAGG AGGCGGTGGA CGTCCTCGCC
CCGGCCGAGG CGTTGCGGCA CCTGGAGACC GTCATCTCGC TCTGGGACGC CGTGCCGGAC
GCCGCGCAAC GGCTCGGGCG GGACCTGGTC GACGTGCAGA TGGCGGCGGC CGCCGCGGCC
AGCCGGGCGG GCCGGCCGGC TCGCGCCGCC GCGCTGGCGG TCAGCGCCCT GGACCGCAGC
GACGAGGCGC GCTCCGCTCG GCTCACCCCC GACGCGGCCT ACTACCTGAT CGATGATCAA
CGGGAGCGGG AGGCGCTCGA CCGCGCGGCC CGGGCCCTGC GGGTGCTGGA CGCCGAGGGC
CCGTCGGCGG ACCGGGCCCG GCTGCTGGCC GCCCAGGCCC GGTCGGCGCT GAACTGCGAT
CATGACGACG AGGCCCAGGC GATCGCCGAA CGGGCGGTGG CCGAGGCCCG GGCGTTCAGC
GTGCCGGCGG CCGAGGCCGA CGCGTTGACC ACGCTGGGCG TGCTGGCCGT GAACGAGGCC
GACCTGGCCG GCGACCTGTT CGGCCGGTCC CTGGAGTTGG CCCGCTCGGT CGGCGACCTC
ACCGCCGAGC TGCGCGCCAC CCACAATCTG ACCGCGAACC GTTACTACGC CGGCGATCTG
ACCGCCGCGG CCCAGATCTG CGCGGCCGGG ATCGACCGGG CGCGCTCGAC CGGCGTGCTC
TGGATGGGCT ACGGCGTCGG GCTCCTGCTG TACCGGGAAC TGATCCGGTA CCTGAGCGGG
GATCTGCGCC GGCCCGAGCC GAGCATGGAC ATGGTGCCCG AATCGGTGCG CACCATCCTG
TCCACCATCG AGTTGTACGC GGCGGTGGCG CGCGGCGACG AGGACGCGTT GGACCGGGGC
CGGGCAGTGG AGATCGACTG GTCGCGCGAC CCGATGATGG CCCTGACCTC CGGCGGTTGC
ACCATCGACG CGCTGACCTG GGCGGGCGAG CATCAGGCCG CCGTCGACCT GACCTTCCGG
CTCACCGATT TCATGAGCCG GGCCTGGAAC GACTACTTCC TGGGCGGCAT CTGGCTGTCC
GCGCTGGGCC TGGCCGCCCT GGCCGACCGG GCGACCCAGA CCCGATTGAC CGGCGGTGAT
CTGGCCGGCG ACCTGGCCAC CGGCGCCCAG CTCCTGGAGC GGATGGAGCA GACCGCCCGA
CGGGGACGGC CCCGTGGCGG CCAGCTGGGC CCCGAGGGTC GGGGGTGGGT GGCCCGGGCC
CGCGCCGAGT ACAGCCGGCT GATCGGCGAG CCCGACCCGG ACCTCTGGCG GGCCGCGGTC
ACCGAGTTCG CCTATGGCTA CCGGTACGAG GAGGCGCGAT CGCGGTGGCG GCTGGCCGAG
GCGCTGGCCG CCCGCGGCGA CCGATCGGGC GCGACCGTCG AGGCGAGCAC CGCGCTGCGC
GCGGCGCAGG ACATGGGGGC CCGGCCGTTG GCGGCGGCGC TGATCGACCT GGGCCGGCGG
GCCCGGCTCG ACCTGCCCGG GTCCACGTCC TCCGGCGGGG TGTTGACCGG CCGCGAGGAG
GAGGTGCTGC GGTTGGTCGC CGCCGGTCTG ACCAATCGGC AGATCGGCGA GCGCCTTTAC
ATCAGCGGCA AGACGGTGAG CGTGCACATC TCCAACGTGC TCGGCAAGCT CGGGGTCGGC
GGGCGCGCCG AGGCCGTCGC GGTGGCGCAC CGGCGCGGAT TGCTGCCCGA TCCGCCACCT
GCCGGCTGA
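
The GC content reported for this gene could be recomputed from the sequence as displayed. The following is a rough sketch, assuming the sequence is pasted as plain text with the same spaces and line breaks shown here. `clean` and `gc_fraction` are hypothetical helper names, not part of any tool referenced by this record.

```python
def clean(seq_text):
    """Remove spaces and line breaks and upper-case the bases."""
    return "".join(seq_text.split()).upper()


def gc_fraction(seq_text):
    """Return the fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed row of the gene sequence: six blocks of ten bases.
first_row = "GTGTCGGACC CGGCTGCCAC TATTGACCTC GTGGCACTGA CGGTCGACCT CCCGTTCGTC"
print(len(clean(first_row)))
print(f"{gc_fraction(first_row):.0%}")
```

Passing the full gene sequence instead of `first_row` gives a value that can be compared with the listed GC content.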
 
Protein sequence
MSDPAATIDL VALTVDLPFV GRADQVDRFR AALGRAREGE PSVLLVSGDA GVGKTRALTR 
MAEIARAEGA WVVVSHCVDL GEVGLPYLPF TETLQQLRGR CDEVDQAIAA RPALGRLLDV
GLAEPAAGGA DQAARGQLFD GIASAIGAGG RPDRPLVLII EDLHWADPSS RDVLRFLIAR
LRTEPLVVVA SYRTDDLHRR HPLRPTLSEL LRHPRVDHVE LSPFTRDELA QFGAAITGHP
LPDQVLQRVL RRSEGNAYFA QELLESGPDT AALPGSLADV LHARLERLDP AVQALAGIAS
VAGRRVSGEL LTAVAGGRPD FADPGTVDAA LREAIAHHVL ATEDARWIVF RHALLAEAVY
GDLLPGEVTG LHRAYQQQIA ANPSLGSPAE LAYHALRAHE LPVALTASSA AAQEAVDVLA
PAEALRHLET VISLWDAVPD AAQRLGRDLV DVQMAAAAAA SRAGRPARAA ALAVSALDRS
DEARSARLTP DAAYYLIDDQ REREALDRAA RALRVLDAEG PSADRARLLA AQARSALNCD
HDDEAQAIAE RAVAEARAFS VPAAEADALT TLGVLAVNEA DLAGDLFGRS LELARSVGDL
TAELRATHNL TANRYYAGDL TAAAQICAAG IDRARSTGVL WMGYGVGLLL YRELIRYLSG
DLRRPEPSMD MVPESVRTIL STIELYAAVA RGDEDALDRG RAVEIDWSRD PMMALTSGGC
TIDALTWAGE HQAAVDLTFR LTDFMSRAWN DYFLGGIWLS ALGLAALADR ATQTRLTGGD
LAGDLATGAQ LLERMEQTAR RGRPRGGQLG PEGRGWVARA RAEYSRLIGE PDPDLWRAAV
TEFAYGYRYE EARSRWRLAE ALAARGDRSG ATVEASTALR AAQDMGARPL AAALIDLGRR
ARLDLPGSTS SGGVLTGREE EVLRLVAAGL TNRQIGERLY ISGKTVSVHI SNVLGKLGVG
GRAEAVAVAH RRGLLPDPPP AG
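
The relationship between the two sequences above can be illustrated with a small translation routine. NCBI translation table 11 uses the same codon assignments as the standard code but allows additional start codons, which is why the leading GTG of the gene corresponds to M in the protein. This is a minimal sketch rather than the method used to produce this record. `translate` is a hypothetical helper, and reading an initial start codon as methionine is an assumption about how the protein was derived.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments,
# shared by translation table 11); "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate(seq_text):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq_text.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        # Assumption: an initial start codon is read as methionine.
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("GTGTCGGACC CGGCTGCCAC TATTGACCTC"))  # MSDPAATIDL
```

The output matches the first ten residues of the protein sequence. The last three codons of the gene, GCC GGC TGA, translate to AG followed by a stop, consistent with the end of the protein.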