Gene Information |
Locus tag | Pars_1921 |
Symbol | |
ID | 5055152 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1726271 |
End bp | 1727548 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640469467 |
Product | UbiD family decarboxylase |
Protein accession | YP_001154120 |
Protein GI | 145592118 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases |
TIGRFAM ID | [TIGR00148] UbiD family decarboxylases |
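The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAG). The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1726271
END_BP = 1727548


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1278 425
```

Both results agree with the Gene Length (1278 bp) and Protein Length (425 aa) fields, which supports the inclusive-coordinate assumption.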
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCTCT CAAACGCCGT CGCTAGTCTT TCTGAAATAA GGTTCTACGA TCCTCCATAC GGAGAATTCG GCATAGCCAG AGTTCTGAAG GAGTCTGAGG GGAGGTGGAC TCCCCTATTC CGGGGAGTTG GGAAGGGGTT CTCCGGAGTG GGTAATGTTA TTGACACTAG GCCTAAGCTA TATCGCTTTC TAGGCGCAAG TAGCGATGAG GAGGCTTATT CGAAGTTGCT TTCGGCTCTG GAGAGTCCCG CGTCTCTGGA CTTCGTCTCC ACGTGGCAGG ATCTATATCG GGAAGTGGGT TCTCTTTACG ATTTGCCAAT GGTGCGCTAC TACGAGCGGG AGGCACGTCC TTACATAACC TCCGGCGTGG TGGTTGGGGC AGGCCCCGGC GGCGTGTACA ATGCATCTAT TCATCGTTTT TCTCCTATCG GCGCCAGGAA GGCTGTAATT AGGCTTGTGC CGCGCCACCT ATATCACATG TACAAGATGA GCGTAAAGCA GGGACGGGAG GTCCCCATAG CAGTGGCGTG GGGGGTACAC CCCCTGGTTC TCCTCGCCGC GGCCTCTTCG CCGCCCTATG GCGTATTTGA GCTGGGGGTA GCAGCCCGTC TTCTGGGAGG GCTCAAGGCA GTCTCTCTGG AGAATGGCGC GGTGGCGCCT TTTCCAGCTT CGGTGATTAT AGAGGGGTTC ATCACTGCCG AGCAAGCGGA GGAGGGGCCC TTCGTGGATA TTGTGGGGGT TTATGACCGG GTGAGGCTCC AGCCCGTGGT GAGAGTAGAG CGGATTTACG TGCTCCGTTC CGAGGCTTTA CTCCACTATC TGCTCCCGGC GGGGTTGGAA CACCAACTCC TTATGGGGTT TGAGCGTGAG GCACGGATTT GGAAAGCTGT GCGCTCGGTA GTGCCTGGGG TTAAAAAAGT GCGCCTCACC AGAGGCGGCT TTGGGTGGAT GGTGGCTGTT ATCTCATTAG AAAAGGCTGT GGAGGGAGAC GCGAAAAACG CGTTATTGGC CGCCTTTGCG GCACATCCCA GTCTCAAGAT CGCAATAGCG GTTGACGGGG ATGTGGATCC CGACGACCCA GTGGCGGTGG AGTGGGCCCT GGCGACACGG CTGAGGGCCG ACATGGGGCT TTTCGTAATC CCCTACGTGA GGGGGTCCAC CCTTGATCCC GTGGCGCTTA ACGAGGAGGG GCTCACGCAC AAGATAGGGA TAGACGCCAC TAGGCCTCTT GATGCGGATC CTGTCCTTTT TGAGCGGGCG CGGATCCCAG AAACTTAG
|
Protein sequence | MSLSNAVASL SEIRFYDPPY GEFGIARVLK ESEGRWTPLF RGVGKGFSGV GNVIDTRPKL YRFLGASSDE EAYSKLLSAL ESPASLDFVS TWQDLYREVG SLYDLPMVRY YEREARPYIT SGVVVGAGPG GVYNASIHRF SPIGARKAVI RLVPRHLYHM YKMSVKQGRE VPIAVAWGVH PLVLLAAASS PPYGVFELGV AARLLGGLKA VSLENGAVAP FPASVIIEGF ITAEQAEEGP FVDIVGVYDR VRLQPVVRVE RIYVLRSEAL LHYLLPAGLE HQLLMGFERE ARIWKAVRSV VPGVKKVRLT RGGFGWMVAV ISLEKAVEGD AKNALLAAFA AHPSLKIAIA VDGDVDPDDP VAVEWALATR LRADMGLFVI PYVRGSTLDP VALNEEGLTH KIGIDATRPL DADPVLFERA RIPET
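The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. Translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs from it only in the set of permitted start codons, so a plain standard-code lookup is sufficient here because this gene begins with ATG. The names `CODON_TABLE`, `translate`, `head`, and `tail` are illustrative. The sequence is reported in coding orientation even though the gene lies on the minus strand, so no reverse complement is needed.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    Ambiguous bases such as N are not handled and raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 18 bases of the gene sequence above (both in frame).
head = "ATGTCTCTCT CAAACGCCGT CGCTAGTCTT"
tail = "CGGATCCCAG AAACTTAG"

print(translate(head))  # prints: MSLSNAVASL
print(translate(tail))  # prints: RIPET
```

The two fragments reproduce the first ten and last five residues of the protein sequence in this record. Passing the full gene sequence to `translate` works the same way, since the function strips the spacing between blocks.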