Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2267 |
Symbol | |
ID | 5055561 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 2029063 |
End bp | 2030712 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640469819 |
Product | thiamine pyrophosphate binding domain-containing protein |
Protein accession | YP_001154463 |
Protein GI | 145592461 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.158148 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCGG CACAAGTCGC CGTATGGTGC CTTGAACAAC TAGGCGTCAA GAGGATCTAC GGCCTAATCG GCACATCGAT CCTGGACTTC ATAGACGCCG TCAAGGACAG CAGGATAAGA TACATCTCGA CACGCCACGA GCAGGTTGCA GTATCCATGG CAGACGCAGA GGGGAGGCTC ACCGGCAAAC CCGGCGTAGC TGTTGTCCAC GCCGGTCCGG GGTTTTTAAA CTCGCTCATC TCTGTGGCAA ACGCGTACAA GGACACCTCG CCTTTGTTGT TAATATCGGG GGCGGTCAAG AGGCGGCTTG CGGGGCTTGA TTCCTGGCTC GAAGTGCCTC AGAGGGATAT TATAAGGCCT ATTGTCAAGG CCGTTTTCCG AGTTGACAAG CCCCTCGATG TGGGTAAAAT TATCGCAGAG GCGTACTCAA CAGCGGCCTC CCCGCCACAG GGACCTGTAT TTGTGGAAGT TCCAGAAGAC GTGTGGTCCA TGTCCTCGGA GACTGCCCCT TGCAGGTTTA ATGTTAGACC TCCGCCAGTT GTCTGCGACG AAGATTTGAA AAAAGTGGCG GAACTGCTTT CAAAAACAAG GAGGCCGGTT TTGCTGGCTG GAGGGGGCAT AAACAACGAC GAAGGCGCCA GGCTACTTTT ACAACTCGCT GAGTGGTGGC AAATACCAGT TGCCGTAACT GGAAACGGGC GGGGGGCATT TCCCGAGGAT CACCCGCTGT TTCTAGGAAG GGCGGGCTTC GGCGGCGGCA ATCCAGTAGC CGACCAAGCA CTTATAAGAG CCGATCTTGT CCTCGCAGTG GGTGCCGGGC TCTCCGACAC AACGACTTAC GGGTATAACT ATGTGCCGAG GGGTGATATA ATTGTTGTCA ACCTAGACCC ATTAGCTGAG AAGAAGCCTA TCCCCTACAC TCTCCGCTTC TACGCCGACG CTGTAGATTT CCTCAGAAAA CTAGTAGCGG CTGGGATAGA TGTGAAAGTT GACCCGAATT GGCATAAAGA AATAGAAGAG ATGAGGAAAA GCTGGAATAC GTACCTAGAA GAGGCGCTCT CTAGAAGCTA CCACGGCTTT GTCAACCCGT CTAAGTTCTT CTACCACCTG GACAAGGCGT TGCCCCGAGA TATCGTGATG GTGGGAGGAC AAGGCATGCA CATTGTGTAC ACCTTCAGCT TTGTAAAAAT TAGAGCAGTG CGTGGTTACC TGGCGGCCTT TAACCTAGGC GCCATGGGAT TTGCCTTCCC CGCCGCGCTG GGGGCAAAAC TCTCTATGCC TGAACGCGAT GTATATGCAG TTGTTGGAGA CGGCGAGTTT ATGATGACCG TACAAGACCT AGAAACCGCG ACTAGGGAGA AAATCCCAGT GAAAATTATA GTTGTAAACG ACAATTCGTA CAGGGTGCTA TATGCAAGGC AAAGGGCGCA GAAAATGGGC AGAGTCTTCG GCACGCTACA CACAAATCCC GATTTCGTCA AGCTTGCAGA GGCGTTCGGC GTTGAGGCCA TGTCTATAAG CTCCGACGAC GATATCCCCA AAGCCGTGAA GTTCATAACT GAGCACTCTG AAAGACCTAA ACTCTTGGAA GTAAAAATCC ACCCCGACGA CTTCCCCCCA ATGAATATAG AAGCCGCGTT AAAATTCTAG
|
Protein sequence | MNAAQVAVWC LEQLGVKRIY GLIGTSILDF IDAVKDSRIR YISTRHEQVA VSMADAEGRL TGKPGVAVVH AGPGFLNSLI SVANAYKDTS PLLLISGAVK RRLAGLDSWL EVPQRDIIRP IVKAVFRVDK PLDVGKIIAE AYSTAASPPQ GPVFVEVPED VWSMSSETAP CRFNVRPPPV VCDEDLKKVA ELLSKTRRPV LLAGGGINND EGARLLLQLA EWWQIPVAVT GNGRGAFPED HPLFLGRAGF GGGNPVADQA LIRADLVLAV GAGLSDTTTY GYNYVPRGDI IVVNLDPLAE KKPIPYTLRF YADAVDFLRK LVAAGIDVKV DPNWHKEIEE MRKSWNTYLE EALSRSYHGF VNPSKFFYHL DKALPRDIVM VGGQGMHIVY TFSFVKIRAV RGYLAAFNLG AMGFAFPAAL GAKLSMPERD VYAVVGDGEF MMTVQDLETA TREKIPVKII VVNDNSYRVL YARQRAQKMG RVFGTLHTNP DFVKLAEAFG VEAMSISSDD DIPKAVKFIT EHSERPKLLE VKIHPDDFPP MNIEAALKF
|
| |