Gene Information |
Locus tag | Sare_4514 |
Symbol | |
ID | 5707035 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 5101704 |
End bp | 5103164 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641273928 |
Product | UbiD family decarboxylase |
Protein accession | YP_001539277 |
Protein GI | 159040024 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases |
TIGRFAM ID | [TIGR00148] UbiD family decarboxylases |
|
|
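The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon (the gene sequence in this record ends in TGA); the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5101704
end_bp = 5103164


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1461 486
```

Both results match the Gene Length (1461 bp) and Protein Length (486 aa) fields.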
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00843241 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGGCTC GTGGCTTTCC GTACTCCGAT CTGAAGGACT TCCTGGCGGC GCTGGAGCGC GCGGGTGAGC TGCGGCGGGT GGACGTCCCG GTGGATCCGA CGCTGGAGTT GGCCGAGGTC GTCACCCGAA CGGTCCGCGC CGGCGGCCCG GCACTGGTCT TCGAGCGGCC CACCCGCGGC GAGATGCCGG TGGCGATCAA CCTGTTCGGC ACGGAGAAGC GGATGGCGAT GGCGCTCGGC GTCGAGTCGC TGGACGAGAT CGGCGCGCGG ATCGGTGCGT TGATCCGGCC GGAGTTGCCG GTCGGCTGGT CCGGCATCCG CGAGGGCCTC GGCAAGGTCA TGCAGCTCAA GTCGGTGCCG CCACGCAAGG TGAAGACCGC GCCCTGCCAG CAGGTGGTGT ACCGGGGCGA CGACGTCGAC CTGACCCGGC TGCCCGGCCT GCAGGTGTGG CCCGGTGACG GCGGCGTCTT CCACAACTAC GGGTTGACCC ACACCAAGCA TCCCGAGACC GGCGCACGCA ACCTCGGCCT CTACCGGCTT CAGCAGCACA GTCGGAACAC GCTGGGCATG CACTGGCAGA TCCACAAGGA CTCCACCGCC CATCACGCGG TCGCCGAGCG GCTCGGCCAG CGGCTGCCGG TGGCCATCGC GATCGGCTGC GACCCGGTGA TCTCGTACGC CGCGAGCGCC CCGCTTCCCG GTGACATCGA CGAATACCTG TTCGCGGGCT TCCTGCGCGG TGAACGGGTC GAGATGGTCG ACTGCCTGAC CGTTCCGCTC CAGGTGCCGG CGCATGCCCA GGTGGTGCTC GAGGGGTACC TCGAGCCCGG CGAGCGGCTG CCCGAGGGGC CGTTCGGTGA TCACACCGGC TACTACACGC CGATCGAGCC GTTCCCGGTC CTGCACGTCG AGACGATGAC CATGCAGCGC AATCCGGTCT ACCACTCGAT CATCACCTCG AAGCCGCCGC AGGAGGACCA TGGCCTGGGC AAGGCCACCG AGCGGATTTT CCAGCCGCTG CTGAAGCTGC TCATCCCGGA CATCGTCGAC TACGACCTGC CGGCCGCCGG GGTCTTCCAC AACTGCGCGA TCGTGGCGAT TCGCAAGCGC TACCCGAAGC ACGCGCAGAA GGTCATGAGT GCGATCTGGG GCGCGCACCT GATGTCGATG ACCAAGCTGA TCGTGATCGT GGACGAGGAC TGCGACGTGC ACGACTACAA CGAGGTTGCC TTCCGGGCGT TCGGCAACGT CGACTACGCC CGGGACCTGC TGCTCACCGA AGGGCCGGTG GACCATCTGG ACCACGCCTC GTACCAGCAG TTCTGGGGCG GTAAGGCCGG CGTCGACGCC ACCCGCAAGC TCCCGGGGGA GGGCTACACC CGGGGCTGGC CCGAGGAGTT GACCATGACG CCCGAGGTGG TGTCGTTGGT CGACAAGCGC TGGAAGGAGT ACGGCATCTG A
|
Protein sequence | MAARGFPYSD LKDFLAALER AGELRRVDVP VDPTLELAEV VTRTVRAGGP ALVFERPTRG EMPVAINLFG TEKRMAMALG VESLDEIGAR IGALIRPELP VGWSGIREGL GKVMQLKSVP PRKVKTAPCQ QVVYRGDDVD LTRLPGLQVW PGDGGVFHNY GLTHTKHPET GARNLGLYRL QQHSRNTLGM HWQIHKDSTA HHAVAERLGQ RLPVAIAIGC DPVISYAASA PLPGDIDEYL FAGFLRGERV EMVDCLTVPL QVPAHAQVVL EGYLEPGERL PEGPFGDHTG YYTPIEPFPV LHVETMTMQR NPVYHSIITS KPPQEDHGLG KATERIFQPL LKLLIPDIVD YDLPAAGVFH NCAIVAIRKR YPKHAQKVMS AIWGAHLMSM TKLIVIVDED CDVHDYNEVA FRAFGNVDYA RDLLLTEGPV DHLDHASYQQ FWGGKAGVDA TRKLPGEGYT RGWPEELTMT PEVVSLVDKR WKEYGI
|
| |
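The sequences in this record are printed in space-separated blocks of ten characters. A minimal sketch for turning such a blocked string back into a contiguous sequence and computing its GC fraction follows; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the example input is only the first two blocks of the gene sequence, not the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence, used here only as a small example.
example = clean_sequence("ATGGCGGCTC GTGGCTTTCC")
print(len(example), round(gc_fraction(example), 2))  # 20 0.65
```

Applied to the full gene sequence, the same two calls give a value that can be compared with the GC content field of the record.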