Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_5023 |
Symbol | |
ID | 8450654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 5600924 |
End bp | 5603872 |
Gene Length | 2949 bp |
Protein Length | 982 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 645044060 |
Product | transcriptional regulator, LuxR family |
Protein accession | YP_003204284 |
Protein GI | 258655128 |
COG category | [R] General function prediction only |
COG ID | [COG3899] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGGACC CGGCTGCCAC TATTGACCTC GTGGCACTGA CGGTCGACCT CCCGTTCGTC GGTCGCGCCG ACCAGGTGGA CCGGTTCCGC GCCGCGCTGG GCCGAGCCCG GGAGGGCGAG CCGAGCGTCC TGCTGGTCTC CGGTGACGCC GGCGTCGGCA AGACGCGGGC GCTGACCCGG ATGGCCGAGA TCGCCCGCGC CGAGGGGGCG TGGGTCGTCG TCAGCCACTG CGTGGATCTC GGCGAGGTCG GGCTGCCCTA CCTGCCGTTC ACCGAGACCC TGCAGCAGTT GCGCGGGCGC TGCGACGAGG TGGACCAGGC CATCGCCGCC CGGCCCGCCC TGGGCCGGCT GCTCGACGTC GGGCTGGCCG AGCCGGCCGC CGGCGGGGCC GACCAGGCCG CCCGCGGGCA ACTGTTCGAC GGCATCGCGT CGGCGATCGG GGCCGGCGGC CGGCCGGACC GGCCACTGGT GTTGATCATC GAGGACCTGC ATTGGGCCGA CCCGTCCAGC CGGGACGTGC TGCGGTTCCT CATCGCTCGG TTGCGCACCG AACCCCTGGT GGTGGTGGCC TCCTACCGCA CCGACGACCT GCACCGGCGC CATCCGCTGC GCCCCACCCT GTCCGAACTG CTGCGGCATC CGCGGGTCGA TCATGTCGAG CTGTCCCCGT TCACCCGGGA CGAGCTGGCC CAGTTCGGTG CCGCGATCAC CGGCCATCCG CTGCCCGACC AGGTGCTGCA GCGGGTGCTG CGCCGCTCCG AGGGCAACGC CTATTTCGCG CAGGAGCTGC TGGAGTCCGG ACCCGACACC GCCGCCCTGC CCGGCTCGTT GGCCGACGTC CTGCACGCCC GATTGGAACG GCTGGACCCG GCGGTGCAGG CGCTGGCCGG CATCGCCTCG GTCGCCGGTC GCCGGGTCTC CGGGGAACTG CTCACGGCGG TGGCCGGTGG ACGCCCGGAT TTCGCCGACC CGGGCACGGT CGACGCGGCC CTGCGCGAGG CGATCGCGCA CCACGTGCTG GCCACCGAGG ACGCCCGCTG GATCGTGTTC CGGCACGCCC TGCTGGCCGA GGCCGTCTAC GGCGACCTGC TGCCCGGCGA GGTCACCGGC CTGCACCGGG CCTACCAGCA GCAGATCGCC GCGAATCCGT CCCTCGGGTC GCCGGCCGAG CTGGCCTATC ACGCGCTTCG GGCCCACGAG CTGCCGGTGG CGCTGACCGC ATCGTCCGCG GCGGCCCAGG AGGCGGTGGA CGTCCTCGCC CCGGCCGAGG CGTTGCGGCA CCTGGAGACC GTCATCTCGC TCTGGGACGC CGTGCCGGAC GCCGCGCAAC GGCTCGGGCG GGACCTGGTC GACGTGCAGA TGGCGGCGGC CGCCGCGGCC AGCCGGGCGG GCCGGCCGGC TCGCGCCGCC GCGCTGGCGG TCAGCGCCCT GGACCGCAGC GACGAGGCGC GCTCCGCTCG GCTCACCCCC GACGCGGCCT ACTACCTGAT CGATGATCAA CGGGAGCGGG AGGCGCTCGA CCGCGCGGCC CGGGCCCTGC GGGTGCTGGA CGCCGAGGGC CCGTCGGCGG ACCGGGCCCG GCTGCTGGCC GCCCAGGCCC GGTCGGCGCT GAACTGCGAT CATGACGACG AGGCCCAGGC GATCGCCGAA CGGGCGGTGG CCGAGGCCCG GGCGTTCAGC GTGCCGGCGG CCGAGGCCGA CGCGTTGACC ACGCTGGGCG TGCTGGCCGT GAACGAGGCC GACCTGGCCG GCGACCTGTT CGGCCGGTCC CTGGAGTTGG CCCGCTCGGT CGGCGACCTC ACCGCCGAGC TGCGCGCCAC CCACAATCTG ACCGCGAACC GTTACTACGC CGGCGATCTG ACCGCCGCGG CCCAGATCTG CGCGGCCGGG ATCGACCGGG CGCGCTCGAC CGGCGTGCTC TGGATGGGCT ACGGCGTCGG GCTCCTGCTG TACCGGGAAC TGATCCGGTA CCTGAGCGGG GATCTGCGCC GGCCCGAGCC GAGCATGGAC ATGGTGCCCG AATCGGTGCG CACCATCCTG TCCACCATCG AGTTGTACGC GGCGGTGGCG CGCGGCGACG AGGACGCGTT GGACCGGGGC CGGGCAGTGG AGATCGACTG GTCGCGCGAC CCGATGATGG CCCTGACCTC CGGCGGTTGC ACCATCGACG CGCTGACCTG GGCGGGCGAG CATCAGGCCG CCGTCGACCT GACCTTCCGG CTCACCGATT TCATGAGCCG GGCCTGGAAC GACTACTTCC TGGGCGGCAT CTGGCTGTCC GCGCTGGGCC TGGCCGCCCT GGCCGACCGG GCGACCCAGA CCCGATTGAC CGGCGGTGAT CTGGCCGGCG ACCTGGCCAC CGGCGCCCAG CTCCTGGAGC GGATGGAGCA GACCGCCCGA CGGGGACGGC CCCGTGGCGG CCAGCTGGGC CCCGAGGGTC GGGGGTGGGT GGCCCGGGCC CGCGCCGAGT ACAGCCGGCT GATCGGCGAG CCCGACCCGG ACCTCTGGCG GGCCGCGGTC ACCGAGTTCG CCTATGGCTA CCGGTACGAG GAGGCGCGAT CGCGGTGGCG GCTGGCCGAG GCGCTGGCCG CCCGCGGCGA CCGATCGGGC GCGACCGTCG AGGCGAGCAC CGCGCTGCGC GCGGCGCAGG ACATGGGGGC CCGGCCGTTG GCGGCGGCGC TGATCGACCT GGGCCGGCGG GCCCGGCTCG ACCTGCCCGG GTCCACGTCC TCCGGCGGGG TGTTGACCGG CCGCGAGGAG GAGGTGCTGC GGTTGGTCGC CGCCGGTCTG ACCAATCGGC AGATCGGCGA GCGCCTTTAC ATCAGCGGCA AGACGGTGAG CGTGCACATC TCCAACGTGC TCGGCAAGCT CGGGGTCGGC GGGCGCGCCG AGGCCGTCGC GGTGGCGCAC CGGCGCGGAT TGCTGCCCGA TCCGCCACCT GCCGGCTGA
|
Protein sequence | MSDPAATIDL VALTVDLPFV GRADQVDRFR AALGRAREGE PSVLLVSGDA GVGKTRALTR MAEIARAEGA WVVVSHCVDL GEVGLPYLPF TETLQQLRGR CDEVDQAIAA RPALGRLLDV GLAEPAAGGA DQAARGQLFD GIASAIGAGG RPDRPLVLII EDLHWADPSS RDVLRFLIAR LRTEPLVVVA SYRTDDLHRR HPLRPTLSEL LRHPRVDHVE LSPFTRDELA QFGAAITGHP LPDQVLQRVL RRSEGNAYFA QELLESGPDT AALPGSLADV LHARLERLDP AVQALAGIAS VAGRRVSGEL LTAVAGGRPD FADPGTVDAA LREAIAHHVL ATEDARWIVF RHALLAEAVY GDLLPGEVTG LHRAYQQQIA ANPSLGSPAE LAYHALRAHE LPVALTASSA AAQEAVDVLA PAEALRHLET VISLWDAVPD AAQRLGRDLV DVQMAAAAAA SRAGRPARAA ALAVSALDRS DEARSARLTP DAAYYLIDDQ REREALDRAA RALRVLDAEG PSADRARLLA AQARSALNCD HDDEAQAIAE RAVAEARAFS VPAAEADALT TLGVLAVNEA DLAGDLFGRS LELARSVGDL TAELRATHNL TANRYYAGDL TAAAQICAAG IDRARSTGVL WMGYGVGLLL YRELIRYLSG DLRRPEPSMD MVPESVRTIL STIELYAAVA RGDEDALDRG RAVEIDWSRD PMMALTSGGC TIDALTWAGE HQAAVDLTFR LTDFMSRAWN DYFLGGIWLS ALGLAALADR ATQTRLTGGD LAGDLATGAQ LLERMEQTAR RGRPRGGQLG PEGRGWVARA RAEYSRLIGE PDPDLWRAAV TEFAYGYRYE EARSRWRLAE ALAARGDRSG ATVEASTALR AAQDMGARPL AAALIDLGRR ARLDLPGSTS SGGVLTGREE EVLRLVAAGL TNRQIGERLY ISGKTVSVHI SNVLGKLGVG GRAEAVAVAH RRGLLPDPPP AG
|
| |