Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_3686 |
Symbol | |
ID | 8449305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 4043694 |
End bp | 4045493 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 645042750 |
Product | thiamine pyrophosphate protein TPP binding domain protein |
Protein accession | YP_003202986 |
Protein GI | 258653830 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.0906801 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0462514 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGAAT TGGTGGCGGA TGTGGTGTTG TCCCGGTTGC GAGAATGGGG GGTGCGGCAG GTCTTCGGGT ACCCGGGGGA CGGGATCAAC GGCCTGTTGG CGGCGTGGGG GAGGGCGAAG GACGACCCGC AGTTCGTGCA GGCCCGGCAC GAGGAGATGG CCGCGTTCGC CGCCGTCGGC TTCGCCAAGT TCAGCGGTCG GGTCGGGGTG TGCGTGGCGA CCAGCGGACC CGGCGCGATC CACCTGCTGA ACGGGCTGTA CGACGCGAAA CTCGATCACG TCCCGGTCGT CGCGATCGTC GGGCAGACGG CCCGCTCGGC CATGGGCGGC TCGTACCAGC AGGAGGTCGA CCTGCTCTCG CTGTTCAAGG ACGTCTGCAG CGACTACGTG CAGATGTGTA CCGTCCCACA GCAGCTGCCG AACCTGATCG ACCGGGCGAT CCGGATCGCT CAGACCGAGC ACGCCCCGAC CTGCGTGATC GTGCCGTCCG ACGTGTTCGA CCTGGACTAC GAACCGCCGG GGCACGAGTT CAAGCAGGTC CCGTCCAGCG TCGGCACCGC CTGGGCGACC GCCGCCCCGG ACCCGGACGC GGTCCGCGCC GCGGCGGACC TGCTCAACGC CGGGGAGAAG GTCGCGTTGC TGGTCGGTCA GGGGGCCAGA GGCTGCGAAG CCGAGCTGAC CGAAGTCGCG GACCTGCTGG GCGCCGGCGC CGCGAAGGCG TTGCTGGGCA AGGACGTGCT GCCCGACACC CTGCCCTGGG TCACCGGTTC GATTGGCCTG CTGGGCACCA CCGCCAGCTA CCGGCTGATG ATGGGCTGCG ACACGCTCCT GACCATCGGG TCGAACTTCC CGTACACCCA GTTCATGCCG GACCTCGGGC AGGCCCGGGC CGTGCAGATC GACCGGTCCG GCAAGTGGAT CGGCATGCGG TACCCCTACG AGATCAACCT CGTCGGGGAC GCGAAGGCCA CTCTCAAGGC GCTGATCCCG CTGTTGAACC GGAAGGCCGA CCGGAGCTGG CGGGACCGGG TGCAGGCCGA TGTCGCGGAC TGGTGGCAGA CCGCCGAGCG CCGCGCGTTG ACCGCCGCCG ATCCGGTCAA CCCGATGCGG ATCTTCCATG AACTGTCGCA GCGGTTGCCC GTCGACGCCA TCGTGGTCAG CGATTCGGGC AGCGCAGCGA ACTGGTACGC CCGGCACCTG CGCTTTCACG GCGACATCCG CGGGTCACTG TCCGGGACGC TGGCCACGAT GGGTCCGGGG GTGCCGTACG CGATCGGCGC GAAATGGGCG CACCCGGACC GACCGGTGAT CGCCCTGGTG GGAGACGGAG CGATGCAGAT GAACGGACTG GCCGAGCTCA TCACCATCTC GCACTACTGG TCGCAATGGG CCGACCCGCG GCTCATCGTC GCGGTGCTGC ACAACAACGA CCTCAACCAG GTCACCTGGG AGATGCGGGC CATGTCGGGT GCCCCCAAGT TCGCCGAATC GCAGACTCTC CCGGACGTCG ACTACGCCGG ATTCGCGACC GGTTTGGGTC TGTCCGGCGT CCGGATCGAC GACCCCGATG CGCTGGGCCC GGCCTGGGCG ACCGCGTTGG CGGCGACCCG GCCCACCGTG CTGGACGTGA TCTGCGACCC GGATGTGCCG CCGATCCCGC CGCACGCCAC CTTCGATCAG GTCAAGTCCG TCGCCGGGGC GGTGCTGCAC GGTGACGAGG ACGCCTGGGG TTTCGTCAAA CAGGGTGTGA AACAGAAGGT GCAGCAGTAT CTGCCCGGAA CCAAGGGGGG AACGTCATGA
|
Protein sequence | MAELVADVVL SRLREWGVRQ VFGYPGDGIN GLLAAWGRAK DDPQFVQARH EEMAAFAAVG FAKFSGRVGV CVATSGPGAI HLLNGLYDAK LDHVPVVAIV GQTARSAMGG SYQQEVDLLS LFKDVCSDYV QMCTVPQQLP NLIDRAIRIA QTEHAPTCVI VPSDVFDLDY EPPGHEFKQV PSSVGTAWAT AAPDPDAVRA AADLLNAGEK VALLVGQGAR GCEAELTEVA DLLGAGAAKA LLGKDVLPDT LPWVTGSIGL LGTTASYRLM MGCDTLLTIG SNFPYTQFMP DLGQARAVQI DRSGKWIGMR YPYEINLVGD AKATLKALIP LLNRKADRSW RDRVQADVAD WWQTAERRAL TAADPVNPMR IFHELSQRLP VDAIVVSDSG SAANWYARHL RFHGDIRGSL SGTLATMGPG VPYAIGAKWA HPDRPVIALV GDGAMQMNGL AELITISHYW SQWADPRLIV AVLHNNDLNQ VTWEMRAMSG APKFAESQTL PDVDYAGFAT GLGLSGVRID DPDALGPAWA TALAATRPTV LDVICDPDVP PIPPHATFDQ VKSVAGAVLH GDEDAWGFVK QGVKQKVQQY LPGTKGGTS
|
| |