Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0535 |
Symbol | |
ID | 5055756 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 481914 |
End bp | 482828 |
Gene Length | 915 bp |
Protein Length | 304 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640468097 |
Product | 5-oxopent-3-ene-1,2,5-tricarboxylate decarboxylase |
Protein accession | YP_001152782 |
Protein GI | 145590780 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.47505 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.339338 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCTAC TAACCTACAC GAAAAACGGC GTAAGTAAGG TTGGCCTCTT CAAAAACGGC GAAATCATAG ACCTACCCCA AGCGTACCTC CTGACATACG ACGCGGCTGA GGCGCCGGAC TTCCTTTTCG ACATGAGGAG GCTCATCGCC CTGGGAGGGC CCGCCCTGGA GGTGGCGAGG TATCTTGAAA GACACGCGCC AGGTGAGGCT TACTTGAAGC CTTCCGAGAT CAAGTGGGAG CCGCCGGTGC CGAACCCGGA GAAGATTTTT GCCGTGGCTG TCAACTACAA GGCGCATGGC CAGGAGGCTG GGGTTAAGCC GCCCGAGAGG CCCTACTTCT TCCCCAAGTT TCCCAATGCC CTGGTGGGGC ATGAGGGGCC TGTCGTCAAG CATAAGGTGG TGCAGAAGCT GGATTGGGAG GTGGAGCTGG TGGTGGTCAT GGGCCGCCCC GGCAAGTACA TACAACCGGA GAAGGCTCTT GACCACGTCT TCGGCTACGC GGTGGGTAAC GACATCTCTA TTAGAGATTG GCAGTTTCCG CCTGGCTGGC CTCAGCAACT AAACCCCTAC GGCCAGTACT GGATCTGGGG CAAGTCTATG GACACGGCGG CGCCTGTGGG GCCCTACATT GTGACTAGAG ACGAGGTGCC GGACCCCAAC AAGCTTGGGC TTAGGTTGTG GGTAAACAGC CAGCTGGAGC AGGAGGGCAA CACCTCCGAG CTGATCTTCA ACGTACAGCA ACTGATCCAC TGGGCCTCCC AAGGCATAAC TCTTAAGCCA GGCGACCTCA TATTCACCGG CACCCCACCC GGCGTAGGAT TCCCCAAAGG CAAATTCCTC AAAGGAGGCG ACGTGGTAGA AGCAGAAGTG GAGGGCATCG GGCGGCTTAG GAACTACGTG GTTGAGGAGA AATGA
|
Protein sequence | MRLLTYTKNG VSKVGLFKNG EIIDLPQAYL LTYDAAEAPD FLFDMRRLIA LGGPALEVAR YLERHAPGEA YLKPSEIKWE PPVPNPEKIF AVAVNYKAHG QEAGVKPPER PYFFPKFPNA LVGHEGPVVK HKVVQKLDWE VELVVVMGRP GKYIQPEKAL DHVFGYAVGN DISIRDWQFP PGWPQQLNPY GQYWIWGKSM DTAAPVGPYI VTRDEVPDPN KLGLRLWVNS QLEQEGNTSE LIFNVQQLIH WASQGITLKP GDLIFTGTPP GVGFPKGKFL KGGDVVEAEV EGIGRLRNYV VEEK
|
| |