Gene Pars_2267 details


Gene Information

Locus tag: Pars_2267
Symbol:
ID: 5055561
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 2029063
End bp: 2030712
Gene Length: 1650 bp
Protein Length: 549 aa
Translation table: 11
GC content: 52%
IMG OID: 640469819
Product: thiamine pyrophosphate binding domain-containing protein
Protein accession: YP_001154463
Protein GI: 145592461
COG category: [E] Amino acid transport and metabolism
              [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID:
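
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that adds no residue to the protein; neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the gene information fields above.
start_bp = 2029063
end_bp = 2030712
gene_length_bp = 1650
protein_length_aa = 549

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the last codon
# is a stop codon, which contributes no amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length divisible by 3:", remainder == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions the three listed figures (coordinates, gene length, protein length) agree with one another.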


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.158148
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGCGG CACAAGTCGC CGTATGGTGC CTTGAACAAC TAGGCGTCAA GAGGATCTAC 
GGCCTAATCG GCACATCGAT CCTGGACTTC ATAGACGCCG TCAAGGACAG CAGGATAAGA
TACATCTCGA CACGCCACGA GCAGGTTGCA GTATCCATGG CAGACGCAGA GGGGAGGCTC
ACCGGCAAAC CCGGCGTAGC TGTTGTCCAC GCCGGTCCGG GGTTTTTAAA CTCGCTCATC
TCTGTGGCAA ACGCGTACAA GGACACCTCG CCTTTGTTGT TAATATCGGG GGCGGTCAAG
AGGCGGCTTG CGGGGCTTGA TTCCTGGCTC GAAGTGCCTC AGAGGGATAT TATAAGGCCT
ATTGTCAAGG CCGTTTTCCG AGTTGACAAG CCCCTCGATG TGGGTAAAAT TATCGCAGAG
GCGTACTCAA CAGCGGCCTC CCCGCCACAG GGACCTGTAT TTGTGGAAGT TCCAGAAGAC
GTGTGGTCCA TGTCCTCGGA GACTGCCCCT TGCAGGTTTA ATGTTAGACC TCCGCCAGTT
GTCTGCGACG AAGATTTGAA AAAAGTGGCG GAACTGCTTT CAAAAACAAG GAGGCCGGTT
TTGCTGGCTG GAGGGGGCAT AAACAACGAC GAAGGCGCCA GGCTACTTTT ACAACTCGCT
GAGTGGTGGC AAATACCAGT TGCCGTAACT GGAAACGGGC GGGGGGCATT TCCCGAGGAT
CACCCGCTGT TTCTAGGAAG GGCGGGCTTC GGCGGCGGCA ATCCAGTAGC CGACCAAGCA
CTTATAAGAG CCGATCTTGT CCTCGCAGTG GGTGCCGGGC TCTCCGACAC AACGACTTAC
GGGTATAACT ATGTGCCGAG GGGTGATATA ATTGTTGTCA ACCTAGACCC ATTAGCTGAG
AAGAAGCCTA TCCCCTACAC TCTCCGCTTC TACGCCGACG CTGTAGATTT CCTCAGAAAA
CTAGTAGCGG CTGGGATAGA TGTGAAAGTT GACCCGAATT GGCATAAAGA AATAGAAGAG
ATGAGGAAAA GCTGGAATAC GTACCTAGAA GAGGCGCTCT CTAGAAGCTA CCACGGCTTT
GTCAACCCGT CTAAGTTCTT CTACCACCTG GACAAGGCGT TGCCCCGAGA TATCGTGATG
GTGGGAGGAC AAGGCATGCA CATTGTGTAC ACCTTCAGCT TTGTAAAAAT TAGAGCAGTG
CGTGGTTACC TGGCGGCCTT TAACCTAGGC GCCATGGGAT TTGCCTTCCC CGCCGCGCTG
GGGGCAAAAC TCTCTATGCC TGAACGCGAT GTATATGCAG TTGTTGGAGA CGGCGAGTTT
ATGATGACCG TACAAGACCT AGAAACCGCG ACTAGGGAGA AAATCCCAGT GAAAATTATA
GTTGTAAACG ACAATTCGTA CAGGGTGCTA TATGCAAGGC AAAGGGCGCA GAAAATGGGC
AGAGTCTTCG GCACGCTACA CACAAATCCC GATTTCGTCA AGCTTGCAGA GGCGTTCGGC
GTTGAGGCCA TGTCTATAAG CTCCGACGAC GATATCCCCA AAGCCGTGAA GTTCATAACT
GAGCACTCTG AAAGACCTAA ACTCTTGGAA GTAAAAATCC ACCCCGACGA CTTCCCCCCA
ATGAATATAG AAGCCGCGTT AAAATTCTAG
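
The gene sequence above is printed in space-separated groups of ten bases, so it has to be joined before its length or GC content can be computed. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any tool referenced by this record. Only the first row of the sequence is used as a demo, so its GC fraction need not equal the whole-gene figure of 52% listed above.

```python
def clean_sequence(block: str) -> str:
    """Join a space- and newline-grouped sequence block into one string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 if seq is empty)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence above, used as a short demo input.
first_row = "ATGAACGCGG CACAAGTCGC CGTATGGTGC CTTGAACAAC TAGGCGTCAA GAGGATCTAC"
seq = clean_sequence(first_row)

print(len(seq))                    # number of bases in the demo row
print(round(gc_fraction(seq), 2))  # GC fraction of the demo row only
```

Passing the full sequence block to `clean_sequence` in the same way gives a string whose length can be compared with the listed gene length.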
 
Protein sequence
MNAAQVAVWC LEQLGVKRIY GLIGTSILDF IDAVKDSRIR YISTRHEQVA VSMADAEGRL 
TGKPGVAVVH AGPGFLNSLI SVANAYKDTS PLLLISGAVK RRLAGLDSWL EVPQRDIIRP
IVKAVFRVDK PLDVGKIIAE AYSTAASPPQ GPVFVEVPED VWSMSSETAP CRFNVRPPPV
VCDEDLKKVA ELLSKTRRPV LLAGGGINND EGARLLLQLA EWWQIPVAVT GNGRGAFPED
HPLFLGRAGF GGGNPVADQA LIRADLVLAV GAGLSDTTTY GYNYVPRGDI IVVNLDPLAE
KKPIPYTLRF YADAVDFLRK LVAAGIDVKV DPNWHKEIEE MRKSWNTYLE EALSRSYHGF
VNPSKFFYHL DKALPRDIVM VGGQGMHIVY TFSFVKIRAV RGYLAAFNLG AMGFAFPAAL
GAKLSMPERD VYAVVGDGEF MMTVQDLETA TREKIPVKII VVNDNSYRVL YARQRAQKMG
RVFGTLHTNP DFVKLAEAFG VEAMSISSDD DIPKAVKFIT EHSERPKLLE VKIHPDDFPP
MNIEAALKF
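
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. The record lists translation table 11, whose codon-to-amino-acid assignments are the same as those of the standard code (the tables differ in which codons may act as starts). The helper `translate` below is a hypothetical name, reads codons from the first base, stops at the first stop codon, and maps any unrecognized codon to `X`; it does not handle alternative start codons, which is not an issue here because the gene begins with ATG.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# The amino-acid assignments are those shared by the standard code
# and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence and the last 15 bases of the gene.
first_row = "ATGAACGCGG CACAAGTCGC CGTATGGTGC CTTGAACAAC TAGGCGTCAA GAGGATCTAC"
print(translate(first_row))          # MNAAQVAVWCLEQLGVKRIY
print(translate("GCGTTAAAATTCTAG"))  # ALKF
```

Both outputs match the corresponding ends of the protein sequence above: the first 20 residues and the final four residues before the TAG stop codon.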