Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_1761 |
Symbol | |
ID | 8544143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 2432234 |
End bp | 2433913 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646386468 |
Product | thiamine pyrophosphate protein TPP binding domain protein |
Protein accession | YP_003266203 |
Protein GI | 262194994 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.15439 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00802646 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCATGC GGGGCAAGCA CGCGTTCATC GAGCAGCTTC GGCTCGAGGG CGTCCCATTC GTGTTCGGCA ATCCCGGCAG CACCGAGTTG GCCCTGCTCG ACGGCTTCCG CGACGCGCCC GTCACCTACA TGCTGGGGCT GCACGAGGCC GTGAGCATGG CCATGGCCAC CGGCTACGCC AAGGCCAGCG GCCGCCCCGC GGTGGTCAAC CTGCACTGCG CCGCCGGGCT CGGCAACGCC ATGGGCATGA TGCTCAACGC GGCGCGCGAA CGCGTCCCAC TCGTCGTCAC GGCCGGCCAG CAGGACACGC GTCATCTATT CGCGGATCCG CTGATCAGCG GCGACCTGCT CGAGCTGGCA CGACCCTTCG CCAAGTGGGC CACCGAGGTC CGGCAGGCCG ACGAACTTCC CATCGCGCTG CGCCGGGCCT TCAAGACTGC ACTCGAGGCG CCCGCCGGCC CGGTGGTGGT GTCGCTGCCG CAAAACCTCC TCGACCAGCC GGTGTCGCTG CCGATCCTGC CGACCGTGCA CACCGCGCCG CCCTCGCCCT CGCTGCGCGC GCTCGAGAGC GCGGCCGATG TGCTGGCGAA GGCCTCGAAT CCGCTGATCC TGTGCGGTGA CGGGCTGGCG GCCAGCGGCG GTCACGAACA GGTCGCGCGC CTGGCCGAGA TGCTGGGTGC GCCGGTCATG GCCAACACCT TGTCGGTGTG GAGCTTTCCC AACAATCACC CGCACTATCG CGGCGGACCG ATTCTCGCCC AGGTGGGCGA GGTGCTGCGC GACCGCGACT GCGTGCTCGC GCTGGGCGCC GGGCAGCTCT TCCGACAGCT GCCCTACGAG GGGCTCACGC CGCTGCCTGC GGAGTGCGCG CTGATTCAGA TCGACGTGGC GCCCGATCTC ATCGCCAAGA ATCACCCGGT GGCCCTGGGC ATCCACGGCG ACCCGCGCGA GGCCGCGACG CATCTGCTCA CGCTGTTGGA AGAGCGCCTG GACGAAGACC GTCGCCAGGC GGCTCGCGCG CGCGCCGCTG CGCTGGCCGA GCGACACGCG GCGCAGCGGG CGCAGGCGCG GGAACAGATG CACGCGCACT GGGACGACGA TCCGATTTCC CCGATGCGCG TGGCCGGCGA ATTGGCCCGC CACCTCACGC CGCAGAGCGT GCTGGTGGGC GCGACCGGTA CCGCCGTGCG CCAGGCCTTC GGCCAGCTTC TCGACTTCGG CGAGCCGCTG TCGTTCTTCG GCGGCAGCGA GGCGCTCGGC TTCTCGATTC CGGGCGCGTT GGGCATTCAG CTCGCGCTGC CCGAGCGCCA GGTGGTGTGC GTGGTGCGCG AGGGTGAGGG CATGTACACC ATCCAGGGGC TGTGGACCGC CAAACACTAT GATATCCCGG TCAAATACCT GGTTCTCAAC GATCTCGAAT ATCGCGCGCT GAAAAAAGGA ATGGTCAAGT ACTGGCGCGG CCACGGCAGC CCCGGCCCCT TTATCGCCAT GGACCTCGAC GACCCGCCCC TGGATTACGC CAAGCTCGCG GCCGCGTTCG AGATCCCGGC GCGTTCGATC GCCAAACCCA GCGACCTGCC CGCGGCCATG GATGAACTCT TTTCCACGCC TGGTGTGGCG TGTCTCGATG TTAGGATTGC CCGCTTTCAC GCAGCCGAGG GCGCACCGAC GCCGCGATGA
|
Protein sequence | MSMRGKHAFI EQLRLEGVPF VFGNPGSTEL ALLDGFRDAP VTYMLGLHEA VSMAMATGYA KASGRPAVVN LHCAAGLGNA MGMMLNAARE RVPLVVTAGQ QDTRHLFADP LISGDLLELA RPFAKWATEV RQADELPIAL RRAFKTALEA PAGPVVVSLP QNLLDQPVSL PILPTVHTAP PSPSLRALES AADVLAKASN PLILCGDGLA ASGGHEQVAR LAEMLGAPVM ANTLSVWSFP NNHPHYRGGP ILAQVGEVLR DRDCVLALGA GQLFRQLPYE GLTPLPAECA LIQIDVAPDL IAKNHPVALG IHGDPREAAT HLLTLLEERL DEDRRQAARA RAAALAERHA AQRAQAREQM HAHWDDDPIS PMRVAGELAR HLTPQSVLVG ATGTAVRQAF GQLLDFGEPL SFFGGSEALG FSIPGALGIQ LALPERQVVC VVREGEGMYT IQGLWTAKHY DIPVKYLVLN DLEYRALKKG MVKYWRGHGS PGPFIAMDLD DPPLDYAKLA AAFEIPARSI AKPSDLPAAM DELFSTPGVA CLDVRIARFH AAEGAPTPR
|
| |