Gene Pars_0554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0554 
Symbol 
ID5054568 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp496036 
End bp497466 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content50% 
IMG OID640468116 
Productaldehyde dehydrogenase 
Protein accessionYP_001152801 
Protein GI145590799 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000239131 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGAGGG AGGGGGGCGT TAAGGAGGTA AAGTCGCCTA TAGACATGTC AATATTGGCG 
AAAGTCGCTA TGCCTAGCTC AGAGGAGGTG GAAGAGGTTG TAGCTACTGT GTATGTCAAG
GGCAGATGGG CAGCACGGGA TTTGCCGGGT GAGAGGAGGG TGAGAATCCT GCGGAGAGCC
TCAGAACTTT TGGAGAAAAA CGCAGAGCTG TTTGAAGAAG TCCTTGTCAT AAATGCAGGC
AAGACGCGGC CGCAAGCCAA GGGTGAAGTA AAGGCCTCAA TTGATAGGCT TAAGCTCGCT
GATTTAGATT TAAAGAAGGT TTCGGGAGAG TATGTCCCGG GGGATTGGAC TGAGGACACC
TTAGAAACGG AGGCTGTGGT GAGAAGAGAG CCGCTCGGCG TAGTTCTTGC AATAACGCCT
TTCAACTACC CCCTTTTCGA TGTGGTCAAC AAGGTGGTAT ATTCCTTCAT ATATGGGAAT
GCCGTATTGG TAAAGCCGGC TTCGGCTACT CCTCTCCCCG CCTTAATGTT TGCAAAGATC
TTAATTGAGG CTGGCTACCC GCCTGAGGCG CTAGGCGTAT TGCCAATATC GGGGACAGAG
GCAGAGAAAT TGGTAGCTGA TGATAGAATA GCAGCGGTTA GCTTCACCGG GAGTTATGAG
TCGGGGGAAA AAGTAGTGCG AGCAGGGGGC GTTAAACAAT ACATCCTTGA GCTGGGCGGC
GGCGACCCAG CAATTGTTCT CAATGACGCA GATTTGGAGC TGGCTGTGGA TAGAATAGCT
AGGGGGATAT ACAGCTATGC TGGCCAGCGG TGTGACGCGA TAAAGCTGAT TTTAGCAGAA
GGCGATATTT ATGAGAGCTT AAAACACGGA CTTGCGAAAA GGCTTAGGGA GGTAAAGGTG
GGGGATCCAA GAGATCCGGA GGTTGAGATG GGCCCCTTAA TATCCTCTGA GGCCGTTGAG
GAGATGTTCA ATGCCATAGA CGACGCTGTG AAAAAAGGCG GATCCGTAGT GGTAGGCGGC
GAGAGGTTAG GGCCTAATTA CGTCAAACCA ACGCTGATTG AAGCGTCGGC TGATAAGGTA
AGGGATATGG AGCTTTACAG AAGGGAGATA TTTGCCCCCA TAGCGCTGAT AGTAAGGGTT
AAGGACTTAG ACGAGGCTGT GGAGCTGGCC AATGGAAGGC CTTTTGGCCT TGATGCCAGT
ATATTCGGGA AGGATATTAC GACAATCCGT AAGGCTATTC GGCTACTTGA AGTAGGCGCT
GTTTATGTAA ACGATATGCC TAGACATGGC ATTGGATACT ACCCATTCGG CGGCAGGAAG
AAAAGCGGCG TATATAGAGA GGGGATAGGA TATAGCGTAG AGGCAGTGAC TGCATATAAG
ACGATAGTGT TCAACTATAG AGGCAGAGGC GTGTGGAGAT ACACCACATA A
 
Protein sequence
MGREGGVKEV KSPIDMSILA KVAMPSSEEV EEVVATVYVK GRWAARDLPG ERRVRILRRA 
SELLEKNAEL FEEVLVINAG KTRPQAKGEV KASIDRLKLA DLDLKKVSGE YVPGDWTEDT
LETEAVVRRE PLGVVLAITP FNYPLFDVVN KVVYSFIYGN AVLVKPASAT PLPALMFAKI
LIEAGYPPEA LGVLPISGTE AEKLVADDRI AAVSFTGSYE SGEKVVRAGG VKQYILELGG
GDPAIVLNDA DLELAVDRIA RGIYSYAGQR CDAIKLILAE GDIYESLKHG LAKRLREVKV
GDPRDPEVEM GPLISSEAVE EMFNAIDDAV KKGGSVVVGG ERLGPNYVKP TLIEASADKV
RDMELYRREI FAPIALIVRV KDLDEAVELA NGRPFGLDAS IFGKDITTIR KAIRLLEVGA
VYVNDMPRHG IGYYPFGGRK KSGVYREGIG YSVEAVTAYK TIVFNYRGRG VWRYTT