Gene Pars_0705 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0705 
Symbol 
ID5055225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp627173 
End bp628603 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content64% 
IMG OID640468262 
ProductUbiD family decarboxylase 
Protein accessionYP_001152943 
Protein GI145590941 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.400833 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTCTCGG ATCTCCGCGC CTTTTTGGAC GCCTTGGAGG AGAGGGGGTG GCTGAAGAGG 
GTTTCTGAGC CCCTTTCGCC TGAGCTGGAG ATTCCGGAGG TGCTGCGCCG GGTGATGTAC
GCGAGGGGCC CGGCCCTCCT CTTTGAGTCG GTGAAGGGCT TTCCCAAGTG GCGCGTCGTC
GGCAACCTCT TCGGCTCTCT GGAGAGGATC CGCTTGGCGC TGAGCGCTGA GCGGCTTGAA
GACGTGGGGA GGCGCATCTT GGCGCCGATG GCCTCGCCGC CGCCTATCAC CCTTATGGAT
AAGTTCAGGG CCGCGGCTGA CCTCTTCGAG CTTGGGCGCT ACGCCCCTAG GGCTGTCCGT
TCGGCGCCTG TTAAGGAGGT GGAGGAGGCC CCGAACCTCC TCTCTATCCC GGCTTTTAAG
AGCTGGCCGG GGGACGCTGG CCGCTACATA ACCTTCGGCC CTCTCGTGAC GCGGACTGCC
TCGGGGATAT ATAACGTGGG GCTTTACCGG ATCCAGATCC TCAACGAGGC AGAGGCTATT
GTCCACGCGC AGATTCACAA GAGGGCGGCC GACCTCTTCG CATCGTCGCG GGGGTGCGTG
GACGCTGCCA TCGTCATCGG GGGGGACCCG GCCTTCCTTC TCAGCGCGAT GATGCCCACG
CCCTACCCGC TGGACGAGTA CCTCTTCGCG GGGGTGTTGA GGGGCTCTGG GCTCGAGGTG
ACGAGGGGCT CCGCCACGGA CCTCTACATC CCGGCGCGGG CTGAGGCCGT CGTGGAGGGC
TGTGTCGACG TCTCTGACCT GAGGAGGGAG GGGCCTTTCG GCGACCACTA CGGCGTGTAC
GACCCCGGGG GGCTCTACCC CGTATTTAAG GCTAAGCTTG TCCTGCGGCG GGAGGACCCC
ATCTACTACG GCACTGTCGT GGGGAAGCCG CCTCTGGAGG ATGCCTATAT GGGGAAGGCG
GTGGAGCGGG TATTTCTCCC GGTTCTCCAG TTCCTCCTGC CTGAAGTGGT TGATATAAAC
CTGCCGGAGT TCGGCCTCTT CCAGGGGGTT GCCATCGTTT CTGTTAAAAA GCGCTTCCCG
GGGCATGGGA AGAAGGTGAT GATGGCGCTG TGGGGGCTGG GCCACATGAT GTCCCTCACC
AAGGTCGTCA TCGTGGTGGA CCACGACGTC AATGTGCACG ACCTCAACGA GGTGCTCTTC
GCCATAGCCC AGCGGGTCGA CCCGCAACGG GACGTGGTGG TGGTCCCGGG GGCACACGTA
GATGTCTTGG ACACCGGGTC CCCTACGCCG GGGTACGGAA GCAAGCTTGG GATCGATGCC
ACCCGGAAGC TGCCGGAGGA GTACGGGGGC CGGTCGTGGC CAGCAGAGGT GGAGCCCGAC
CCTGAGGTGG CGGAGAGGGT TAGGGGGGTG GTGGAGCGGG TTTTGGGGTG A
 
Protein sequence
MFSDLRAFLD ALEERGWLKR VSEPLSPELE IPEVLRRVMY ARGPALLFES VKGFPKWRVV 
GNLFGSLERI RLALSAERLE DVGRRILAPM ASPPPITLMD KFRAAADLFE LGRYAPRAVR
SAPVKEVEEA PNLLSIPAFK SWPGDAGRYI TFGPLVTRTA SGIYNVGLYR IQILNEAEAI
VHAQIHKRAA DLFASSRGCV DAAIVIGGDP AFLLSAMMPT PYPLDEYLFA GVLRGSGLEV
TRGSATDLYI PARAEAVVEG CVDVSDLRRE GPFGDHYGVY DPGGLYPVFK AKLVLRREDP
IYYGTVVGKP PLEDAYMGKA VERVFLPVLQ FLLPEVVDIN LPEFGLFQGV AIVSVKKRFP
GHGKKVMMAL WGLGHMMSLT KVVIVVDHDV NVHDLNEVLF AIAQRVDPQR DVVVVPGAHV
DVLDTGSPTP GYGSKLGIDA TRKLPEEYGG RSWPAEVEPD PEVAERVRGV VERVLG