Gene Acid345_1967 details


Gene Information

Locus tag: Acid345_1967
Symbol:
ID: 4073228
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2360271
End bp: 2361833
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 59%
IMG OID: 637983980
Product: 3-octaprenyl-4hydroxybenzoate decarboxylase
Protein accession: YP_591042
Protein GI: 94968994
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases
TIGRFAM ID: [TIGR00148] UbiD family decarboxylases; [TIGR03701] menaquinone biosynthesis decarboxylase, SCO4490 family
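
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Amino acid count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2360271, 2361833)
print(length)                  # 1563
print(protein_length(length))  # 520
```

Both values agree with the Gene Length and Protein Length fields of the record.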


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCTACG ACGATATTCG CGAATGGATC GCTGCCCTCG ACCGCGCCGG GGAACTGAAG 
CGCGTGAAAG CCGAAGTCGA TCCCAAGCTG GAGATTTGCG AGATCACCGA CCGCGTTTCC
AAGTGGCCGG CACGCAATGG CAAAGGTCAA GGCGGCCCGG CCCTCTTATT CGAGAAGGTC
AAAGGCTATC CCGGAGCGAA GGTCTTGATG AACCAGTTCG GCTCCGAACG CCGGATGAAG
ATGGCCCTCG ATGTCGATTC GCTCGATGAA ATTTCCGGTC GCATCCGCGA GTTCATGGAT
GTGAAGTCGC CACAGGGCCT GCTCGACAAA ATCAAAATGC TGCCCAAGCT CGCTGAGATG
GGCAAGTTCT TTCCGAAGTC CGTGAGCACG GGCCCGTGCA AAGAAGTCAT CAAGAAGGGC
GACTTCTCGC TGCTCGACTT TCCCATTCTC ACCACCTGGC CGAAAGACGG CGGTCCGTTC
ATCACGCTCC CGTGCGTCAT CACGCGCGAT CCGAGAACGG GGAAGCGCAA CGTAGGCGCC
TACCGCATGC AGGTCTATGA CGCGACTTCC ACGGGTATGC ATTGGCAGCG CCACAAAGGG
GGCGCTGAGC ACTATCGCGA GCGTATGCGG GCCGCGCACG TTGGCGGCGA TCCGGCAGGG
AAGTCAGACG CAATTGATTT GATGGCGCGC AGCGGTGGAG GCTCACGCAC AACCGAAGGT
CCGAAAGGGA AGATGGAAGT TGCAGTTGCG CTCGGCACCG ATCCCGCACT GATGTTTTCG
GCGATCGTTC CAGCGCCGCC GGAGGTCGAA GAATTCATGA TCGCGGGATT CCTGCGCCAG
AAGCCGGTGG AACTGGTGAA GTGCGAGACT GTGGACTTGG ATGTTCCCGC GAACGCCGAG
ATTATTCTCG AAGGACATGT GAACCTCGAC GAACTAAAAG TCGAAGGACC GTTCGGCGAT
CATACGGGCT TCTACTCGCT CGAAGACCTC TACCCGGTTT TCCACGTTAC CTGCATCACA
CACCGCAAGG ATCCGGTGTA CTCCGCGACG ATCGTGGGCA AGCCTCCGGT GGAAGATGCG
TACATGGGCA AGGCCGTCGA ACGCATTTTC CTGCCGCTCA TGCGGGTCAC CATCCCGGAA
ATTGTGGATC TGAACATGCC GATCGAAGGG GTCTTCCACA ACCTCATGAT CGTCTCCATC
CGGAAGAGTT ACGCCGGCCA TGCGCGCAAG GTGATGAGCG CTATCTGGGG ACTTGGCGGC
GCCATGTTCA CCAAGTGCAT CATCGTCGTG GACGAAGACG TGAATGTGCA GAACCCTGCG
GAGGTCGTCT GGAAGGTCTG TAACAACATC GACCCGGAGC GCGATATTCA GTTCACGCTG
GGGCCGGTGG ACTCGTTGGA TCATGCGTCC CGGCTGCCGG ATTTCGGGTC GAAGATGGGG
ATCGACGCGA CCCGGAAGTG GCCGAGCGAA GGCTTCGAGC GTCCGTGGCC GGATGAGGTG
ACGATGGACG ACGCGACGCG GGAATTGGTC AGTAAGAGGT GGAAGGAGTA TGGGATCGAG
TAG
 
Protein sequence
MAYDDIREWI AALDRAGELK RVKAEVDPKL EICEITDRVS KWPARNGKGQ GGPALLFEKV 
KGYPGAKVLM NQFGSERRMK MALDVDSLDE ISGRIREFMD VKSPQGLLDK IKMLPKLAEM
GKFFPKSVST GPCKEVIKKG DFSLLDFPIL TTWPKDGGPF ITLPCVITRD PRTGKRNVGA
YRMQVYDATS TGMHWQRHKG GAEHYRERMR AAHVGGDPAG KSDAIDLMAR SGGGSRTTEG
PKGKMEVAVA LGTDPALMFS AIVPAPPEVE EFMIAGFLRQ KPVELVKCET VDLDVPANAE
IILEGHVNLD ELKVEGPFGD HTGFYSLEDL YPVFHVTCIT HRKDPVYSAT IVGKPPVEDA
YMGKAVERIF LPLMRVTIPE IVDLNMPIEG VFHNLMIVSI RKSYAGHARK VMSAIWGLGG
AMFTKCIIVV DEDVNVQNPA EVVWKVCNNI DPERDIQFTL GPVDSLDHAS RLPDFGSKMG
IDATRKWPSE GFERPWPDEV TMDDATRELV SKRWKEYGIE
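
The gene sequence begins with GTG while the protein begins with M, which is expected under translation table 11: alternative start codons are read as methionine at the initiation position. The sketch below illustrates this on the first 30 bases of the gene. It is an illustration under stated assumptions, not the pipeline that produced this record: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, the codon table is the standard one (which table 11 shares for elongation), and only ATG, GTG and TTG are treated as start codons here for simplicity.

```python
from itertools import product

# Standard genetic code in NCBI TCAG order; table 11 uses the same
# codon-to-amino-acid assignments for elongation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a simplified subset of the table 11 start codons.
ALT_STARTS = {"ATG", "GTG", "TTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a CDS, reading the first codon as M if it is a start codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in ALT_STARTS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


first_30 = "GTGGCCTACG ACGATATTCG CGAATGGATC"
print(translate(first_30))  # MAYDDIREWI
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` is one way to cross-check the protein sequence and the GC content field.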