Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GSU2004 |
Symbol | |
ID | 2688094 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sulfurreducens PCA |
Kingdom | Bacteria |
Replicon accession | NC_002939 |
Strand | - |
Start bp | 2196283 |
End bp | 2198121 |
Gene Length | 1839 bp |
Protein Length | 612 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637126695 |
Product | 3-octaprenyl-4-hydroxybenzoate carboxy-lyase family protein |
Protein accession | NP_953053 |
Protein GI | 39997102 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases |
TIGRFAM ID | [TIGR00148] UbiD family decarboxylases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.266757 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCTATC GCAATCTGCA GGAATGTGTG AAGGAGCTGG AACGGACCGG CCAGTTGATC CGGATCGACG TGGAAACCGA TCCGTACCTC GAAATCGGCG CCATCCAGCG CCGCGTCTAC CATGCGGAGG GGCCGGCCCT TCTGTTCACC CGGGCGAAGG GGTGCCGCTT TCCCCTGTTG GGGAACCTGT TCGGCACCAT GGACCGCACT ACCTTCATCT TCCGAGACAC CCTGCGCGTT ATTGAACGTC TCGTGGCTCT CAAGATCAAT CCCACGGCGT TTTTGAGGGA CCCCCTTGGC CATCTCGGCA TTCCCCGGGC GGCACTCCAC CTTCTGCCGC GCCACGTGTC GGACGGGCCG ATTCTCGCCA ATCACACAAC CCTTGACCAA CTCCCCTCCC TGGTTTCCTG GCCCCGGGAC GGTGGTCCCT TTGTGACGCT GCCCCAGGTT TATTCCGAAA GCCCGTCCCG TCCCGGCTTC CGCTTCTCCA ACCTGGGAAT GTACCGCGTT CAACTTTCGG GGAACGAGTA TGCGCCGAAC CAGGAGGTCG GAATTCATTA CCAGATCCAT CGCGGCATCG GTGTCCACCA TGCTGAGGCG ATCGCCCGGG GTGAGCCCCT GAAGGTGAGC GTCTTCGTGG GCGGGGCACC TTCAATGGCC GTGGCGGCAG TCATGCCGCT CCCCGAGGGG CTGCCGGAAC TTTCCTTTGC CGGACTCCTG GCCGGCCGCC GCATCGACAT GATCTGCCGC CCCGACCGTC TTCCTCTTCC GGCCGAGGCG GATTTCGTCA TCACCGGCAC CATCGACCCG AACCGGACGC TCCCCGAGGG CCCCTTCGGC GATCACCTGG GGTATTACAG CCTCGCCCAT CCGTTCCCGG TTCTCACGGT CGAGAACGTG TACCATCGTG CCGACGCCAT CTGGCCCTAT ACCACCGTCG GCCGCCCTCC CCAGGAGGAC ACCACCTTCG GCGCCTTTAT CCATGAGCTG ACCGGCGCCC TGATCCCCGA AGTGCTTCCC GGCGTGAAGG CGGTTCATGC CGTGGACGCG GCCGGGGTCC ATCCCCTGCT GCTGGCCGTG GGCTCAGAAC GCTACGTTCC CTATGAAGAG GAACGCACAC CCCGGGAACT TCTCACCATT GCCAGCGCCA TCCTGGGCAA TGGTCAACTC TCTCTGGCAA AGTACCTCTT CATTGCTCCC CATGAGGATG AGCCCCCCGA CATCCACGAC ATTGACGGTT TTATCCGCTT TACCCTGGAG CGGGCCGACT GGCGCCGTGA CCTTCACTTC CATACCCGCA CCACCATCGA CACACTCGAT TATTCGGGTA CAGGCCTGAA CGAGGGGTCA AAGGTCATCG TGGCGGCGGC AGGTTCTCCC AGGCGCGAGC TGCCCGGCGA GCTGCCCGTC GGGCTGCGCC TTCCCGACGG GTTTAGCGCG CCCCGGGTCT GCTTCCCCGG CGTTCTGGCA GTGCAAGGCC CCGCCTTCCC CGGCTACCGC GACTGCGTTA CCCCGGACAT GGAGCGCTTC TGCGCCGGCA TTTCCGTCGA CGATCCTCTG AACCGCTTTC CCCTGGTGGT CATCGTGGAC GACAGCGATT TTGCCGCCCG GACCCTCAAT AATTTCCTCT GGGTTACCTT TACCCGCTCC AATCCTGCAG CGGATATCCA CGGCATCGGC GCCTCGGTCC GCTGCAAACA CTGGGGGTGC GACGGCGCAT TGGTCATCGA TGCCCGCATC AAGCCGCACC ATGCGCCGCC CCTGGAGGAT GTCCCCGAGA TTGAACGACG GGTCGATGAG CTTGGTGCTC CCGGAGGTCC GTTGCACGGC ATCATCTGA
|
Protein sequence | MGYRNLQECV KELERTGQLI RIDVETDPYL EIGAIQRRVY HAEGPALLFT RAKGCRFPLL GNLFGTMDRT TFIFRDTLRV IERLVALKIN PTAFLRDPLG HLGIPRAALH LLPRHVSDGP ILANHTTLDQ LPSLVSWPRD GGPFVTLPQV YSESPSRPGF RFSNLGMYRV QLSGNEYAPN QEVGIHYQIH RGIGVHHAEA IARGEPLKVS VFVGGAPSMA VAAVMPLPEG LPELSFAGLL AGRRIDMICR PDRLPLPAEA DFVITGTIDP NRTLPEGPFG DHLGYYSLAH PFPVLTVENV YHRADAIWPY TTVGRPPQED TTFGAFIHEL TGALIPEVLP GVKAVHAVDA AGVHPLLLAV GSERYVPYEE ERTPRELLTI ASAILGNGQL SLAKYLFIAP HEDEPPDIHD IDGFIRFTLE RADWRRDLHF HTRTTIDTLD YSGTGLNEGS KVIVAAAGSP RRELPGELPV GLRLPDGFSA PRVCFPGVLA VQGPAFPGYR DCVTPDMERF CAGISVDDPL NRFPLVVIVD DSDFAARTLN NFLWVTFTRS NPAADIHGIG ASVRCKHWGC DGALVIDARI KPHHAPPLED VPEIERRVDE LGAPGGPLHG II
|
| |