Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4857 |
Symbol | |
ID | 8336211 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5523474 |
End bp | 5525285 |
Gene Length | 1812 bp |
Protein Length | 603 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644957957 |
Product | thiamine pyrophosphate protein |
Protein accession | YP_003115559 |
Protein GI | 256393995 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGACGA AGGTTTCCGA CTACGTCCTG CAGCGCCTGC GCGACTGGGG TGTGGAGCAC GTCTTCGCGT ATGCCGGCGA TGGCATCAAC GGGTTGCTCG CCGCCTGGGG GCGGGCGGAG AACAAGCCCA TGTTCGTCCA GTCCCGGCAC GAGGAGATGT CCGCCTTCGA GGCGGTCGGT TACGCGAAGT TCTCCGGCAA GGTCGGGGTG TGCGCGGCCA CCTCAGGTCC CGGCGCCATC CACCTGCTCA ACGGCCTGTA CGACGCCAAG CTGGACCACG TGCCGGTGGT GGCGATCGTC GGGCAGACGA ACCGCTCGGC GATGGGCGGC TCCTATCAGC AGGAGGTCGA CCTGCTGAGC CTGTACAAGG ACGTGGCGAG CGCCTACTGC GAGATGGTGA CCGTTCCGGA GCAGCTGCCG AACGTCCTGG ACCGGGCGAT GCGGGTCGCC GCCACCAAGC GCACCGTCAC CGCGGTGATC ATCCCCGCGG ACGTCCAGGA GTTGGAGTAC TCCGCGCCCG GGCACGCGTT CAAGATGGTC CCGTCCAGTC TCGGACTGCC CCACTCGACC GCGGTACCCC AGGCGGAGGA GCTCCGGAAG GCCGCGGATC TGCTGAACGC CGGACGGAAG GTGGCGATCC TGGCAGGCCA GGGCGCCCGC GCCGCGAGCC GAGAGGTGCA ACAGGTCGCC GACCTGCTGG GAGCCGGCGT GGCCAAGGCA CTGTTGGGCA AGGACGTCCT GTCCGACGAG CTGCCCTACG TCACGGGCTC GATCGGGCTG TTGGGAACCC GGCCCTCCTA CGAGCTGATG CGTGACTGCG ACACCCTGCT CGTGATCGGC TCCAGCTTCC CCTACACGCA GTTCCTGCCG GAGTTCGGTC AGGCGCGCGC GGTGCAGATC GACATCGATC CGGGCATGGT CGGGCTGCGC TACCCGTTCG AGGTCAACCT CGTCGGCGAC GCCCGCGAGA CCCTGCAAGC GCTGCTTCCC CTGCTGAACG CCAAGGACGA CCGCTCCTGG CGCGAGACCG TGGAGGAGAA CGCCGCGCGC TGGTGGGAAG TCATGCAGCG GCGGGCGGCG ACCGAGGCCG ACCCGATCAA CCCCGAGTAC GTCGTGCACG CTCTGGACGC CCTGTTGCCC GACGACGTCA TCGTCGCCGC CGACTCGGGT TCCTCAGCAA ACTGGTACGC GCGCCATCTG CGCTTCCGCG GCTCGATGCG CGGTTCGCTG TCCGGGACGC TCGCGACGAT GGGTCCCGGA GTCCCGTACG TGATCGGCGC CAAGTTCGCC CACCCGGACC GGCCGGCGAT CGCGCTGGTC GGTGACGGCG CCATGCAGAT GAACGGCATG GCCGAGCTCA TCACCGCCGC CAAGTACTGG GAGCGCTGGC AAGACCCGCG GCTGGTCGTG GCCGTCTTGA ACAACCACGA CCTGAACCAG GTCACCTGGG AGATGCGGGC CATGGCCGGC GCCCCGCAGT TCGAGCCCTC CCAGTCGCTG CCGGACGTGC GCTTTGCGGA CTTCGCGCGC TCCATCGGGC TGGAAGGCGT GCGGGTGGAG AAGCCCGAAC AGGTGGAACC CGCCTGGCGG CAGGCACTGG CCGCGGACCG GCCGTTCGTC ATCGACTTCC GCACCGACCC GGCGGTCCCG CCGATCCCGC CGCACGCCAC CCTCGACCAG ATCGAGGCCG CGGCGTCGGC GATCGTGCAC GGCGACAGCG ACCGCGTCTC GATGATCAAG CAGGGCATCA AGTCCAAGAT CCAGGAGTTC CTGCCGGGCG GGCCCGACGG CGACGACGCG TCAGATCACT GA
|
Protein sequence | MATKVSDYVL QRLRDWGVEH VFAYAGDGIN GLLAAWGRAE NKPMFVQSRH EEMSAFEAVG YAKFSGKVGV CAATSGPGAI HLLNGLYDAK LDHVPVVAIV GQTNRSAMGG SYQQEVDLLS LYKDVASAYC EMVTVPEQLP NVLDRAMRVA ATKRTVTAVI IPADVQELEY SAPGHAFKMV PSSLGLPHST AVPQAEELRK AADLLNAGRK VAILAGQGAR AASREVQQVA DLLGAGVAKA LLGKDVLSDE LPYVTGSIGL LGTRPSYELM RDCDTLLVIG SSFPYTQFLP EFGQARAVQI DIDPGMVGLR YPFEVNLVGD ARETLQALLP LLNAKDDRSW RETVEENAAR WWEVMQRRAA TEADPINPEY VVHALDALLP DDVIVAADSG SSANWYARHL RFRGSMRGSL SGTLATMGPG VPYVIGAKFA HPDRPAIALV GDGAMQMNGM AELITAAKYW ERWQDPRLVV AVLNNHDLNQ VTWEMRAMAG APQFEPSQSL PDVRFADFAR SIGLEGVRVE KPEQVEPAWR QALAADRPFV IDFRTDPAVP PIPPHATLDQ IEAAASAIVH GDSDRVSMIK QGIKSKIQEF LPGGPDGDDA SDH
|
| |