Gene GSU2004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2004 
Symbol 
ID2688094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp2196283 
End bp2198121 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content64% 
IMG OID637126695 
Product3-octaprenyl-4-hydroxybenzoate carboxy-lyase family protein 
Protein accessionNP_953053 
Protein GI39997102 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.266757 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCTATC GCAATCTGCA GGAATGTGTG AAGGAGCTGG AACGGACCGG CCAGTTGATC 
CGGATCGACG TGGAAACCGA TCCGTACCTC GAAATCGGCG CCATCCAGCG CCGCGTCTAC
CATGCGGAGG GGCCGGCCCT TCTGTTCACC CGGGCGAAGG GGTGCCGCTT TCCCCTGTTG
GGGAACCTGT TCGGCACCAT GGACCGCACT ACCTTCATCT TCCGAGACAC CCTGCGCGTT
ATTGAACGTC TCGTGGCTCT CAAGATCAAT CCCACGGCGT TTTTGAGGGA CCCCCTTGGC
CATCTCGGCA TTCCCCGGGC GGCACTCCAC CTTCTGCCGC GCCACGTGTC GGACGGGCCG
ATTCTCGCCA ATCACACAAC CCTTGACCAA CTCCCCTCCC TGGTTTCCTG GCCCCGGGAC
GGTGGTCCCT TTGTGACGCT GCCCCAGGTT TATTCCGAAA GCCCGTCCCG TCCCGGCTTC
CGCTTCTCCA ACCTGGGAAT GTACCGCGTT CAACTTTCGG GGAACGAGTA TGCGCCGAAC
CAGGAGGTCG GAATTCATTA CCAGATCCAT CGCGGCATCG GTGTCCACCA TGCTGAGGCG
ATCGCCCGGG GTGAGCCCCT GAAGGTGAGC GTCTTCGTGG GCGGGGCACC TTCAATGGCC
GTGGCGGCAG TCATGCCGCT CCCCGAGGGG CTGCCGGAAC TTTCCTTTGC CGGACTCCTG
GCCGGCCGCC GCATCGACAT GATCTGCCGC CCCGACCGTC TTCCTCTTCC GGCCGAGGCG
GATTTCGTCA TCACCGGCAC CATCGACCCG AACCGGACGC TCCCCGAGGG CCCCTTCGGC
GATCACCTGG GGTATTACAG CCTCGCCCAT CCGTTCCCGG TTCTCACGGT CGAGAACGTG
TACCATCGTG CCGACGCCAT CTGGCCCTAT ACCACCGTCG GCCGCCCTCC CCAGGAGGAC
ACCACCTTCG GCGCCTTTAT CCATGAGCTG ACCGGCGCCC TGATCCCCGA AGTGCTTCCC
GGCGTGAAGG CGGTTCATGC CGTGGACGCG GCCGGGGTCC ATCCCCTGCT GCTGGCCGTG
GGCTCAGAAC GCTACGTTCC CTATGAAGAG GAACGCACAC CCCGGGAACT TCTCACCATT
GCCAGCGCCA TCCTGGGCAA TGGTCAACTC TCTCTGGCAA AGTACCTCTT CATTGCTCCC
CATGAGGATG AGCCCCCCGA CATCCACGAC ATTGACGGTT TTATCCGCTT TACCCTGGAG
CGGGCCGACT GGCGCCGTGA CCTTCACTTC CATACCCGCA CCACCATCGA CACACTCGAT
TATTCGGGTA CAGGCCTGAA CGAGGGGTCA AAGGTCATCG TGGCGGCGGC AGGTTCTCCC
AGGCGCGAGC TGCCCGGCGA GCTGCCCGTC GGGCTGCGCC TTCCCGACGG GTTTAGCGCG
CCCCGGGTCT GCTTCCCCGG CGTTCTGGCA GTGCAAGGCC CCGCCTTCCC CGGCTACCGC
GACTGCGTTA CCCCGGACAT GGAGCGCTTC TGCGCCGGCA TTTCCGTCGA CGATCCTCTG
AACCGCTTTC CCCTGGTGGT CATCGTGGAC GACAGCGATT TTGCCGCCCG GACCCTCAAT
AATTTCCTCT GGGTTACCTT TACCCGCTCC AATCCTGCAG CGGATATCCA CGGCATCGGC
GCCTCGGTCC GCTGCAAACA CTGGGGGTGC GACGGCGCAT TGGTCATCGA TGCCCGCATC
AAGCCGCACC ATGCGCCGCC CCTGGAGGAT GTCCCCGAGA TTGAACGACG GGTCGATGAG
CTTGGTGCTC CCGGAGGTCC GTTGCACGGC ATCATCTGA
 
Protein sequence
MGYRNLQECV KELERTGQLI RIDVETDPYL EIGAIQRRVY HAEGPALLFT RAKGCRFPLL 
GNLFGTMDRT TFIFRDTLRV IERLVALKIN PTAFLRDPLG HLGIPRAALH LLPRHVSDGP
ILANHTTLDQ LPSLVSWPRD GGPFVTLPQV YSESPSRPGF RFSNLGMYRV QLSGNEYAPN
QEVGIHYQIH RGIGVHHAEA IARGEPLKVS VFVGGAPSMA VAAVMPLPEG LPELSFAGLL
AGRRIDMICR PDRLPLPAEA DFVITGTIDP NRTLPEGPFG DHLGYYSLAH PFPVLTVENV
YHRADAIWPY TTVGRPPQED TTFGAFIHEL TGALIPEVLP GVKAVHAVDA AGVHPLLLAV
GSERYVPYEE ERTPRELLTI ASAILGNGQL SLAKYLFIAP HEDEPPDIHD IDGFIRFTLE
RADWRRDLHF HTRTTIDTLD YSGTGLNEGS KVIVAAAGSP RRELPGELPV GLRLPDGFSA
PRVCFPGVLA VQGPAFPGYR DCVTPDMERF CAGISVDDPL NRFPLVVIVD DSDFAARTLN
NFLWVTFTRS NPAADIHGIG ASVRCKHWGC DGALVIDARI KPHHAPPLED VPEIERRVDE
LGAPGGPLHG II