Gene Information |
Locus tag | Pars_0554 |
Symbol | |
ID | 5054568 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 496036 |
End bp | 497466 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640468116 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001152801 |
Protein GI | 145590799 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
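The coordinates and lengths in this record can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Pars_0554 record.
length_bp = gene_length(496036, 497466)
print(length_bp)                  # 1431
print(protein_length(length_bp))  # 476
```

Under these assumptions both results agree with the Gene Length (1431 bp) and Protein Length (476 aa) fields.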
| |
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.000000239131 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGAGGG AGGGGGGCGT TAAGGAGGTA AAGTCGCCTA TAGACATGTC AATATTGGCG AAAGTCGCTA TGCCTAGCTC AGAGGAGGTG GAAGAGGTTG TAGCTACTGT GTATGTCAAG GGCAGATGGG CAGCACGGGA TTTGCCGGGT GAGAGGAGGG TGAGAATCCT GCGGAGAGCC TCAGAACTTT TGGAGAAAAA CGCAGAGCTG TTTGAAGAAG TCCTTGTCAT AAATGCAGGC AAGACGCGGC CGCAAGCCAA GGGTGAAGTA AAGGCCTCAA TTGATAGGCT TAAGCTCGCT GATTTAGATT TAAAGAAGGT TTCGGGAGAG TATGTCCCGG GGGATTGGAC TGAGGACACC TTAGAAACGG AGGCTGTGGT GAGAAGAGAG CCGCTCGGCG TAGTTCTTGC AATAACGCCT TTCAACTACC CCCTTTTCGA TGTGGTCAAC AAGGTGGTAT ATTCCTTCAT ATATGGGAAT GCCGTATTGG TAAAGCCGGC TTCGGCTACT CCTCTCCCCG CCTTAATGTT TGCAAAGATC TTAATTGAGG CTGGCTACCC GCCTGAGGCG CTAGGCGTAT TGCCAATATC GGGGACAGAG GCAGAGAAAT TGGTAGCTGA TGATAGAATA GCAGCGGTTA GCTTCACCGG GAGTTATGAG TCGGGGGAAA AAGTAGTGCG AGCAGGGGGC GTTAAACAAT ACATCCTTGA GCTGGGCGGC GGCGACCCAG CAATTGTTCT CAATGACGCA GATTTGGAGC TGGCTGTGGA TAGAATAGCT AGGGGGATAT ACAGCTATGC TGGCCAGCGG TGTGACGCGA TAAAGCTGAT TTTAGCAGAA GGCGATATTT ATGAGAGCTT AAAACACGGA CTTGCGAAAA GGCTTAGGGA GGTAAAGGTG GGGGATCCAA GAGATCCGGA GGTTGAGATG GGCCCCTTAA TATCCTCTGA GGCCGTTGAG GAGATGTTCA ATGCCATAGA CGACGCTGTG AAAAAAGGCG GATCCGTAGT GGTAGGCGGC GAGAGGTTAG GGCCTAATTA CGTCAAACCA ACGCTGATTG AAGCGTCGGC TGATAAGGTA AGGGATATGG AGCTTTACAG AAGGGAGATA TTTGCCCCCA TAGCGCTGAT AGTAAGGGTT AAGGACTTAG ACGAGGCTGT GGAGCTGGCC AATGGAAGGC CTTTTGGCCT TGATGCCAGT ATATTCGGGA AGGATATTAC GACAATCCGT AAGGCTATTC GGCTACTTGA AGTAGGCGCT GTTTATGTAA ACGATATGCC TAGACATGGC ATTGGATACT ACCCATTCGG CGGCAGGAAG AAAAGCGGCG TATATAGAGA GGGGATAGGA TATAGCGTAG AGGCAGTGAC TGCATATAAG ACGATAGTGT TCAACTATAG AGGCAGAGGC GTGTGGAGAT ACACCACATA A
|
Protein sequence | MGREGGVKEV KSPIDMSILA KVAMPSSEEV EEVVATVYVK GRWAARDLPG ERRVRILRRA SELLEKNAEL FEEVLVINAG KTRPQAKGEV KASIDRLKLA DLDLKKVSGE YVPGDWTEDT LETEAVVRRE PLGVVLAITP FNYPLFDVVN KVVYSFIYGN AVLVKPASAT PLPALMFAKI LIEAGYPPEA LGVLPISGTE AEKLVADDRI AAVSFTGSYE SGEKVVRAGG VKQYILELGG GDPAIVLNDA DLELAVDRIA RGIYSYAGQR CDAIKLILAE GDIYESLKHG LAKRLREVKV GDPRDPEVEM GPLISSEAVE EMFNAIDDAV KKGGSVVVGG ERLGPNYVKP TLIEASADKV RDMELYRREI FAPIALIVRV KDLDEAVELA NGRPFGLDAS IFGKDITTIR KAIRLLEVGA VYVNDMPRHG IGYYPFGGRK KSGVYREGIG YSVEAVTAYK TIVFNYRGRG VWRYTT
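The relationship between the two sequences can be illustrated by translating the start of the gene. This is a minimal sketch, not the pipeline that produced the record: it hard-codes the codon-to-amino-acid assignment, which is the same for the standard code and NCBI translation table 11, and it does not model the alternative start codons of table 11 (not needed here, since the gene starts with ATG). The helper names `translate` and `gc_fraction` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; this assignment is shared by
# the standard genetic code and NCBI translation table 11.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces between 10-base groups and upper-case the sequence."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 nucleotides of the gene sequence above.
gene_start = "ATGGGGAGGG AGGGGGGCGT TAAGGAGGTA"
print(translate(gene_start))  # MGREGGVKEV
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.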