Gene Information |
Locus tag | Namu_4539 |
Symbol | |
ID | 8450167 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 5047100 |
End bp | 5049013 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 645043580 |
Product | thiamine pyrophosphate protein central region |
Protein accession | YP_003203807 |
Protein GI | 258654651 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3962] Acetolactate synthase |
TIGRFAM ID | |
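The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for Namu_4539.
length = gene_length_bp(5047100, 5049013)
residues = protein_length_aa(length)
print(length, residues)  # 1914 637
```

Under these assumptions the computed values match the reported Gene Length (1914 bp) and Protein Length (637 aa).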
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.366125 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCAGCG CAGCAGCGTC GCACGCGGGA CAGACCGTCC GACTGACCGT CGGGCAGGCG GTCGTGAAGT TCCTGGGCAA TCAGTACAGC GAGCGGGACG GTGTGCGGCG CAAGCTGTTC GCCGGCTGCT TCGGCATCTT CGGGCACGGC AACGTGGCCG GTCTGGGGCA GGCGCTGCTG CAGGCCGAGC TGGCGGAGCC GGAGCTGCTG CCCTATCACC AGGGTCGCAA CGAGCAGGCG ATGGTGCACA TTGCGGTCGG GTACGCCCGG CAGAAGGACC GGCTGGAGAC GTTCGCAGTG ACCGCGTCGG TCGGCCCGGG CTCGTCGAAC ATGCTGACCG GGGCCGCGCT GGCCACCATC AACCGGTTGC CGGTGCTGCT GCTGGCCAGC GACATCTTCG CCACCCGGGT GGCCTCGCCG GTCCTGCAGG AGCTGGAACA GCCTTTCGGG TACGACGTCT CGGTCAACGA CGCGTTCCGG CCGCTGAGCA AGTTCTTCGA CCGGGTGTGG CGGCCCGAGC AGTTGCCGGC GGCCCTGCTG GGCGCGATGC GGGTGCTGAC CGATCCGGCG GAGACGGGGG CGGTGACGCT GGCCCTGCCG GAGGACGTGC AGGCCGAGGC GTTCGACTGG CCGGTGGAGC TGTTCGCCGA GCGGGTCTGG CGGATCGGCC GTCCGGTGCC CGAGCCCGAG GTGATCGCCG CCGCCGCGCA GATCATCCGC AACGCCAAGG CGCCGCTGAT CGTGTCCGGG GGCGGGGTCA CCTACGCGGA GGCCAACGAC GAGCTGCGGG CGTTCGTGGA GGCGACCGGC ATCCCGATCA GCGAGACCCA GGCCGGCAAG GGATCGTTGC CGTTCGACCA CCCGCTGAAC CTGGGCGCGA TCGGCGCGAC CGGTTCCCCG GCGGCCAACC ACTTCGCCCG CCGGGCCGAC GTGGTGATCG GGATCGGCAC CCGGTACAGC GATTTCACCA CCGGGTCGAA GACGATCTGG CAGCACCCGG ATGTGCGGTT CGTGAACATC AACGTGGCCG GGTTGGACGC GTTCAAGCTG GCCGGGTACC CGGTGGTGGC CGACGCGAAG CGGGCGCTGC CGGCGTTGGC TCAGGCCCTG TCGGGGTACT CGTCCGGACC GGAGTTCCAG GCGGAGGTCA CCGCCCGGGC CAAGGCGTGG GACGACGAGG TGGTGGCCTC TCACCACAGC GGGTACGGCG AGACCCACGA GTTCCTGGCC CAGGCCGAGG TTCTCGGCGC GGTCGAGGCG GCCATGGAGC CGACCGACGT GGTGGTCTGC GCGGCCGGTT CGCTGCCCGG CGATCTGCAC GGCATGTGGC GCACCCGGGA ACGCAAGGGG TATCACGTCG AGTACGGGTA CTCGTGCATG GGCTACGAGA TTCCGGGAGC GATCGGGATC AAGCTGGCCG CCCCGGAGCG GGACGTGTTC GTCACCGTCG GCGACGGCTC GTACCTGATG ATGCCGACCG AGCTGGTGAC CGCGGTGCAG GAGGGCATCA AGATCATCGT GGTGCTCTTG CAGAACCATG GGTACGCCTC GATCGGCGCG CTGGCCCAGT CGCTGGGGGT GCAGCGGTTC GGCACCAAGT ACCGGTACCG CAACGCCGAA TCGGGCCGGC TGGACGGCGG GAAGCTGCCG GTCGACCTGG CCCTGAACGC CGAGTCCATG GGTGTCACGG TCTACCGGAC GAGGACCCTC GCCGAGCTCA AGGATGCGCT GGCCGCGGCG AAGGCCGGCG ACGAGCCGTG CCTGGTCCAC GTGGACACCG ACCTGGAGTT CCACTCGCCC 
AAGGGCGACG GCTGGTGGGA CGTGCCCGTC GCGCAGGTCT CCACACTGGA CTTCACCCAG GACGCCCGGA TCCGCTACGA GAAGTCCCGC GCCGACCAGA AGCCCTACCT GTAG
|
Protein sequence | MTSAAASHAG QTVRLTVGQA VVKFLGNQYS ERDGVRRKLF AGCFGIFGHG NVAGLGQALL QAELAEPELL PYHQGRNEQA MVHIAVGYAR QKDRLETFAV TASVGPGSSN MLTGAALATI NRLPVLLLAS DIFATRVASP VLQELEQPFG YDVSVNDAFR PLSKFFDRVW RPEQLPAALL GAMRVLTDPA ETGAVTLALP EDVQAEAFDW PVELFAERVW RIGRPVPEPE VIAAAAQIIR NAKAPLIVSG GGVTYAEAND ELRAFVEATG IPISETQAGK GSLPFDHPLN LGAIGATGSP AANHFARRAD VVIGIGTRYS DFTTGSKTIW QHPDVRFVNI NVAGLDAFKL AGYPVVADAK RALPALAQAL SGYSSGPEFQ AEVTARAKAW DDEVVASHHS GYGETHEFLA QAEVLGAVEA AMEPTDVVVC AAGSLPGDLH GMWRTRERKG YHVEYGYSCM GYEIPGAIGI KLAAPERDVF VTVGDGSYLM MPTELVTAVQ EGIKIIVVLL QNHGYASIGA LAQSLGVQRF GTKYRYRNAE SGRLDGGKLP VDLALNAESM GVTVYRTRTL AELKDALAAA KAGDEPCLVH VDTDLEFHSP KGDGWWDVPV AQVSTLDFTQ DARIRYEKSR ADQKPYL
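The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any calculation such as the GC content reported in the Gene Information table. The following is a minimal sketch of that step. The helper names are hypothetical, and the example runs only on the first ten-base block of the gene sequence, not on the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100 * gc / len(seq)


# First block of the gene sequence only, used as a small worked example.
first_block = "GTGACCAGCG"
print(gc_percent(first_block))  # 70.0
```

Passing the full gene sequence string, with its spacing intact, to `gc_percent` would give the value to compare against the GC content field.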