Gene Strop_4094 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_4094 
Symbol 
ID5060576 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4653843 
End bp4655303 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content68% 
IMG OID640476355 
ProductUbiD family decarboxylase 
Protein accessionYP_001160902 
Protein GI145596605 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.576442 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.345928 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGCTC GTGGCTTTCC GTACTCTGAT CTGAAGGACT TCCTGGCGGC GCTGGAGCGC 
GCGGGTGAGC TGCGGCGGGT GGACGTCCCG GTGGATCCGA CGCTGGAGTT GGCCGAGGTC
GTCACCCGAA CGGTCCGCGC CGGCGGCCCG GCGCTGGTCT TCGAGCGGCC CACCCGCGGC
GAGATGCCGG TGGCGATCAA CCTGTTTGGC ACCGAGAAGC GGATGGCGAT GGCGCTCGGC
GTCGAGTCGC TGGACGAGAT CGGCGCGCGG ATCGGCGCGT TGATCCGGCC GGAGTTGCCG
GTCGGCTGGT CCGGCATCCG CGAGGGCCTC GGCAAGGTCA TGCAGCTCAA GTCGGTGCCG
CCACGCAAGG TGAAGACCGC GCCCTGCCAG CAGGTGGTGT ACCGGGGCGA CGACGTCGAC
CTGACCCGCC TGCCCGGCCT GCAGGTGTGG CCCGGTGACG GCGGCGTCTT CCACAACTAC
GGGTTGACCC ACACCAAGCA TCCCGAGACC GGCGCGCGCA ACCTCGGCCT CTACCGGCTT
CAGCAGCACA GTCGGAACAC GCTGGGCATG CACTGGCAGA TCCACAAGGA CTCCACCGCC
CATCACGCGG TCGCCGAGCG GCTCGGCCAG CGGCTGCCGG TGGCCATCGC GATCGGCTGC
GACCCGGTGA TCTCGTACGC CGCGAGCGCC CCACTTCCCG GCGACATCGA CGAATACCTG
TTCGCGGGTT TCCTGCGCGG TGAACGGGTC GAGATGGTCG ACTGCCTGAC CGTTCCGCTC
CAGGTGCCGG CGCACGCCCA GGTGGTGCTC GAGGGGTACC TCGAGCCCGG CGAGCGGTTG
CCCGAGGGGC CGTTCGGTGA TCACACCGGC TACTACACGC CGATCGAGCC GTTCCCGGTT
CTGCACGTCG AGACGATGAC CATGCAGCGC AACCCGGTCT ACCACTCGAT CATCACCTCG
AAGCCGCCGC AGGAGGACCA TGGCCTGGGT AAGGCCACCG AGCGGATCTT CCAGCCGCTG
CTGAAGCTAC TCATCCCGGA CATCGTCGAC TACGACCTGC CGGCCGCCGG GGTCTTCCAC
AACTGCGCGA TCGTGTCGAT TCGCAAGCGC TACCCAAAGC ACGCGCAGAA GGTCATGAGT
GCGATCTGGG GCGCGCACCT GATGTCGATG ACCAAGCTGA TCGTGATCGT GGACGAGGAC
TGCGACGTGC ACGACTACAA CGCGGTTGCC TTCCGGGCGT TCGGCAACGT GGACTACGCC
CGGGACCTGC TGCTCACCGA AGGGCCGGTG GACCACCTGG ACCACGCCTC GTACCAGCAG
TTCTGGGGCG GTAAGGCCGG CGTCGACGCC ACCCGCAAGC TCCCGGGGGA GGGCTACACC
CGGGGCTGGC CCGAGGAGTT GACCATGGCG CCCGAGGTGG TGTCGTTGGT CGACAAGCGC
TGGAAGGAGT ACGGCATCTG A
 
Protein sequence
MAARGFPYSD LKDFLAALER AGELRRVDVP VDPTLELAEV VTRTVRAGGP ALVFERPTRG 
EMPVAINLFG TEKRMAMALG VESLDEIGAR IGALIRPELP VGWSGIREGL GKVMQLKSVP
PRKVKTAPCQ QVVYRGDDVD LTRLPGLQVW PGDGGVFHNY GLTHTKHPET GARNLGLYRL
QQHSRNTLGM HWQIHKDSTA HHAVAERLGQ RLPVAIAIGC DPVISYAASA PLPGDIDEYL
FAGFLRGERV EMVDCLTVPL QVPAHAQVVL EGYLEPGERL PEGPFGDHTG YYTPIEPFPV
LHVETMTMQR NPVYHSIITS KPPQEDHGLG KATERIFQPL LKLLIPDIVD YDLPAAGVFH
NCAIVSIRKR YPKHAQKVMS AIWGAHLMSM TKLIVIVDED CDVHDYNAVA FRAFGNVDYA
RDLLLTEGPV DHLDHASYQQ FWGGKAGVDA TRKLPGEGYT RGWPEELTMA PEVVSLVDKR
WKEYGI