Gene Pars_0051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0051 
Symbol 
ID5056063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp38939 
End bp39997 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content55% 
IMG OID640467631 
Productisopentenyl pyrophosphate isomerase 
Protein accessionYP_001152320 
Protein GI145590318 
COG category[C] Energy production and conversion 
COG ID[COG1304] L-lactate dehydrogenase (FMN-dependent) and related alpha-hydroxy acid dehydrogenases 
TIGRFAM ID[TIGR02151] isopentenyl-diphosphate delta-isomerase, type 2 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGATAG ACAAGAGGAA AAACGACCAC ATCTACCTCG CATCTTCTGA TCTATCCCAA 
GTAGGCACGG CTCTTTTCGA GGAGGTTGTC CTAATACACA ACGCGTTGCC GGAAATAGAC
TTCTCAGATA TCGACCTCTC CACTAATTTC CTTGGGGCCC CCGTTAAGGC GCCTTTTGGA
ATTGGCGCTA TGACCGGAGG AACGGAGTTG GCAGGTAAAA TCAACGCGGA GTTGGCAAAA
GCCGCAGAGG AGTTTGGCAT ACCTATGTAC GTCGGGTCAC AGAGAATTGC GTTAGTGAAG
CCCGAGGTCA GGTGGACTTT TGAAGTGGTT AAGCAAAACG CGCCATCCAT TCCTAAGATT
GCCAATCTCG GTGCGCCGCA GTTGGCTCAG CTTTCTGAGA AACAGCTGGT GGACTGGGTT
GTGCAAGCTG TGGATATGAT AGACGCCTAC GCCGTGGCCG TCCACCTAAA CGCAGCGCAG
GAAGTGGTAC AACCGGAGGG GGAGCCCAGC TTCAGGGGCG TGTTGGAAAA ATTAAAAATT
GTCAAAAGGG CGGCAGGACG GCCACTCATA GTCAAAGAGG TGGGCAACGG CATATCGAAA
GAGGTGGCGG CGAAGCTGGC GGAGGTGGCA GACGCAATAG ATGTGGGGGG GCTTGGAGGG
ACTTCTTTTG TGGCGATTGA GGGCGCCAGG GCGGCCGATG CCTGGCTCCA AAGGCGGGTT
GCCGAGACGT TTAAGTATTG GGGAATACCC ACAGCTGCCT CTATCTGCGA GGTTAAATCC
GTGTATCGAG GCTTCGTAAT AGCGTCTGGC GGGATCAGAA GCGGACTAGA CGGAGCTAGG
GCTTTGGCGC TTGGCGCGCA CTTCTTCACA ATGTCCCAGC CACTCCTAAA AGCCACGCTT
GAGGGAAGAC TCCGTGAGGA GATAGAGGCG GTAATTACAG AGGTTAAAAT CGCCATGTTC
CTCACAGGAG TGCGCAGGCC GCAAGAACTG GCCCAAGTAC CCCGTGTCTA CGGCCCCAGG
CTGAGGGCTT GGCTAGAACA ACGAGGCACA ACTTGTTGA
 
Protein sequence
MGIDKRKNDH IYLASSDLSQ VGTALFEEVV LIHNALPEID FSDIDLSTNF LGAPVKAPFG 
IGAMTGGTEL AGKINAELAK AAEEFGIPMY VGSQRIALVK PEVRWTFEVV KQNAPSIPKI
ANLGAPQLAQ LSEKQLVDWV VQAVDMIDAY AVAVHLNAAQ EVVQPEGEPS FRGVLEKLKI
VKRAAGRPLI VKEVGNGISK EVAAKLAEVA DAIDVGGLGG TSFVAIEGAR AADAWLQRRV
AETFKYWGIP TAASICEVKS VYRGFVIASG GIRSGLDGAR ALALGAHFFT MSQPLLKATL
EGRLREEIEA VITEVKIAMF LTGVRRPQEL AQVPRVYGPR LRAWLEQRGT TC