Gene Sare_4514 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4514 
Symbol 
ID5707035 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp5101704 
End bp5103164 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content68% 
IMG OID641273928 
ProductUbiD family decarboxylase 
Protein accessionYP_001539277 
Protein GI159040024 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00843241 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGGCTC GTGGCTTTCC GTACTCCGAT CTGAAGGACT TCCTGGCGGC GCTGGAGCGC 
GCGGGTGAGC TGCGGCGGGT GGACGTCCCG GTGGATCCGA CGCTGGAGTT GGCCGAGGTC
GTCACCCGAA CGGTCCGCGC CGGCGGCCCG GCACTGGTCT TCGAGCGGCC CACCCGCGGC
GAGATGCCGG TGGCGATCAA CCTGTTCGGC ACGGAGAAGC GGATGGCGAT GGCGCTCGGC
GTCGAGTCGC TGGACGAGAT CGGCGCGCGG ATCGGTGCGT TGATCCGGCC GGAGTTGCCG
GTCGGCTGGT CCGGCATCCG CGAGGGCCTC GGCAAGGTCA TGCAGCTCAA GTCGGTGCCG
CCACGCAAGG TGAAGACCGC GCCCTGCCAG CAGGTGGTGT ACCGGGGCGA CGACGTCGAC
CTGACCCGGC TGCCCGGCCT GCAGGTGTGG CCCGGTGACG GCGGCGTCTT CCACAACTAC
GGGTTGACCC ACACCAAGCA TCCCGAGACC GGCGCACGCA ACCTCGGCCT CTACCGGCTT
CAGCAGCACA GTCGGAACAC GCTGGGCATG CACTGGCAGA TCCACAAGGA CTCCACCGCC
CATCACGCGG TCGCCGAGCG GCTCGGCCAG CGGCTGCCGG TGGCCATCGC GATCGGCTGC
GACCCGGTGA TCTCGTACGC CGCGAGCGCC CCGCTTCCCG GTGACATCGA CGAATACCTG
TTCGCGGGCT TCCTGCGCGG TGAACGGGTC GAGATGGTCG ACTGCCTGAC CGTTCCGCTC
CAGGTGCCGG CGCATGCCCA GGTGGTGCTC GAGGGGTACC TCGAGCCCGG CGAGCGGCTG
CCCGAGGGGC CGTTCGGTGA TCACACCGGC TACTACACGC CGATCGAGCC GTTCCCGGTC
CTGCACGTCG AGACGATGAC CATGCAGCGC AATCCGGTCT ACCACTCGAT CATCACCTCG
AAGCCGCCGC AGGAGGACCA TGGCCTGGGC AAGGCCACCG AGCGGATTTT CCAGCCGCTG
CTGAAGCTGC TCATCCCGGA CATCGTCGAC TACGACCTGC CGGCCGCCGG GGTCTTCCAC
AACTGCGCGA TCGTGGCGAT TCGCAAGCGC TACCCGAAGC ACGCGCAGAA GGTCATGAGT
GCGATCTGGG GCGCGCACCT GATGTCGATG ACCAAGCTGA TCGTGATCGT GGACGAGGAC
TGCGACGTGC ACGACTACAA CGAGGTTGCC TTCCGGGCGT TCGGCAACGT CGACTACGCC
CGGGACCTGC TGCTCACCGA AGGGCCGGTG GACCATCTGG ACCACGCCTC GTACCAGCAG
TTCTGGGGCG GTAAGGCCGG CGTCGACGCC ACCCGCAAGC TCCCGGGGGA GGGCTACACC
CGGGGCTGGC CCGAGGAGTT GACCATGACG CCCGAGGTGG TGTCGTTGGT CGACAAGCGC
TGGAAGGAGT ACGGCATCTG A
 
Protein sequence
MAARGFPYSD LKDFLAALER AGELRRVDVP VDPTLELAEV VTRTVRAGGP ALVFERPTRG 
EMPVAINLFG TEKRMAMALG VESLDEIGAR IGALIRPELP VGWSGIREGL GKVMQLKSVP
PRKVKTAPCQ QVVYRGDDVD LTRLPGLQVW PGDGGVFHNY GLTHTKHPET GARNLGLYRL
QQHSRNTLGM HWQIHKDSTA HHAVAERLGQ RLPVAIAIGC DPVISYAASA PLPGDIDEYL
FAGFLRGERV EMVDCLTVPL QVPAHAQVVL EGYLEPGERL PEGPFGDHTG YYTPIEPFPV
LHVETMTMQR NPVYHSIITS KPPQEDHGLG KATERIFQPL LKLLIPDIVD YDLPAAGVFH
NCAIVAIRKR YPKHAQKVMS AIWGAHLMSM TKLIVIVDED CDVHDYNEVA FRAFGNVDYA
RDLLLTEGPV DHLDHASYQQ FWGGKAGVDA TRKLPGEGYT RGWPEELTMT PEVVSLVDKR
WKEYGI