Gene Information |
Locus tag | Pars_0705 |
Symbol | |
ID | 5055225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 627173 |
End bp | 628603 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640468262 |
Product | UbiD family decarboxylase |
Protein accession | YP_001152943 |
Protein GI | 145590941 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases |
TIGRFAM ID | [TIGR00148] UbiD family decarboxylases |
|
|
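The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 627173
END_BP = 628603
GENE_LENGTH_BP = 1431
PROTEIN_LENGTH_AA = 476


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 628603 - 627173 + 1 gives 1431 bp, and 1431 / 3 - 1 gives 476 aa, which matches the listed gene and protein lengths.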
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.400833 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTCTCGG ATCTCCGCGC CTTTTTGGAC GCCTTGGAGG AGAGGGGGTG GCTGAAGAGG GTTTCTGAGC CCCTTTCGCC TGAGCTGGAG ATTCCGGAGG TGCTGCGCCG GGTGATGTAC GCGAGGGGCC CGGCCCTCCT CTTTGAGTCG GTGAAGGGCT TTCCCAAGTG GCGCGTCGTC GGCAACCTCT TCGGCTCTCT GGAGAGGATC CGCTTGGCGC TGAGCGCTGA GCGGCTTGAA GACGTGGGGA GGCGCATCTT GGCGCCGATG GCCTCGCCGC CGCCTATCAC CCTTATGGAT AAGTTCAGGG CCGCGGCTGA CCTCTTCGAG CTTGGGCGCT ACGCCCCTAG GGCTGTCCGT TCGGCGCCTG TTAAGGAGGT GGAGGAGGCC CCGAACCTCC TCTCTATCCC GGCTTTTAAG AGCTGGCCGG GGGACGCTGG CCGCTACATA ACCTTCGGCC CTCTCGTGAC GCGGACTGCC TCGGGGATAT ATAACGTGGG GCTTTACCGG ATCCAGATCC TCAACGAGGC AGAGGCTATT GTCCACGCGC AGATTCACAA GAGGGCGGCC GACCTCTTCG CATCGTCGCG GGGGTGCGTG GACGCTGCCA TCGTCATCGG GGGGGACCCG GCCTTCCTTC TCAGCGCGAT GATGCCCACG CCCTACCCGC TGGACGAGTA CCTCTTCGCG GGGGTGTTGA GGGGCTCTGG GCTCGAGGTG ACGAGGGGCT CCGCCACGGA CCTCTACATC CCGGCGCGGG CTGAGGCCGT CGTGGAGGGC TGTGTCGACG TCTCTGACCT GAGGAGGGAG GGGCCTTTCG GCGACCACTA CGGCGTGTAC GACCCCGGGG GGCTCTACCC CGTATTTAAG GCTAAGCTTG TCCTGCGGCG GGAGGACCCC ATCTACTACG GCACTGTCGT GGGGAAGCCG CCTCTGGAGG ATGCCTATAT GGGGAAGGCG GTGGAGCGGG TATTTCTCCC GGTTCTCCAG TTCCTCCTGC CTGAAGTGGT TGATATAAAC CTGCCGGAGT TCGGCCTCTT CCAGGGGGTT GCCATCGTTT CTGTTAAAAA GCGCTTCCCG GGGCATGGGA AGAAGGTGAT GATGGCGCTG TGGGGGCTGG GCCACATGAT GTCCCTCACC AAGGTCGTCA TCGTGGTGGA CCACGACGTC AATGTGCACG ACCTCAACGA GGTGCTCTTC GCCATAGCCC AGCGGGTCGA CCCGCAACGG GACGTGGTGG TGGTCCCGGG GGCACACGTA GATGTCTTGG ACACCGGGTC CCCTACGCCG GGGTACGGAA GCAAGCTTGG GATCGATGCC ACCCGGAAGC TGCCGGAGGA GTACGGGGGC CGGTCGTGGC CAGCAGAGGT GGAGCCCGAC CCTGAGGTGG CGGAGAGGGT TAGGGGGGTG GTGGAGCGGG TTTTGGGGTG A
|
Protein sequence | MFSDLRAFLD ALEERGWLKR VSEPLSPELE IPEVLRRVMY ARGPALLFES VKGFPKWRVV GNLFGSLERI RLALSAERLE DVGRRILAPM ASPPPITLMD KFRAAADLFE LGRYAPRAVR SAPVKEVEEA PNLLSIPAFK SWPGDAGRYI TFGPLVTRTA SGIYNVGLYR IQILNEAEAI VHAQIHKRAA DLFASSRGCV DAAIVIGGDP AFLLSAMMPT PYPLDEYLFA GVLRGSGLEV TRGSATDLYI PARAEAVVEG CVDVSDLRRE GPFGDHYGVY DPGGLYPVFK AKLVLRREDP IYYGTVVGKP PLEDAYMGKA VERVFLPVLQ FLLPEVVDIN LPEFGLFQGV AIVSVKKRFP GHGKKVMMAL WGLGHMMSLT KVVIVVDHDV NVHDLNEVLF AIAQRVDPQR DVVVVPGAHV DVLDTGSPTP GYGSKLGIDA TRKLPEEYGG RSWPAEVEPD PEVAERVRGV VERVLG
|
| |
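The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup plus a GC-percentage calculation that could be compared against the listed GC content field. The helper names are hypothetical, and the sketch assumes the sequence contains only unambiguous A, C, G, and T bases.

```python
def clean_sequence(display_text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(display_text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


if __name__ == "__main__":
    # First three display blocks of the gene sequence, used only as a short demo;
    # paste the full "Gene sequence" field here to check the whole gene.
    demo = clean_sequence("GTGTTCTCGG ATCTCCGCGC CTTTTTGGAC")
    print(len(demo), round(gc_percent(demo), 1))
```

Running `gc_percent` on the full cleaned gene sequence gives a value to compare with the GC content listed in the record, which is presumably rounded to a whole percent.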