Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3938 |
Symbol | |
ID | 3911745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 4493600 |
End bp | 4495222 |
Gene Length | 1623 bp |
Protein Length | 540 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637885842 |
Product | benzoylformate decarboxylase |
Protein accession | YP_487542 |
Protein GI | 86751046 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.23105 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCGAAGA AGCCGAAGCC GAGCGCTGCC GTCAGCACCG TCAAATCCGC CACGCTCGAT CTGCTCCGCG CGTTCAAGAT CGACAAGGTG TTCGGCAATC CCGGCTCCAC CGAGCTGCCG TTCCTCAGCG ACTGGCCGGA CGACATCGAC TACGTACTGG CGCTGCAGGA GGCGAGCGCG ATGGCGATGG CCGACGGCTA CGCGCAGGCG ACGCGCAACG CCGGCTTCGT CAATCTGCAT TCAGCCGCCG GCGTCGGCAA CGCGCTCGGC AACATCTACA CCGCGTTCAA GAACCAGACG CCGCTGCTGA TCACCGCCGG CCAGCAGGCG CGCAGCCTGC TGCCGTTACA GGCGTTTCTC GGCGCCGAAC GCGCATCCGA ATTTCCGCGG CCTTACGTGA AATACAGCAT CGAGCCGGCG CGCGCCGAGG ACGTGCCGGC GGCGATCGCC CGCGCCTATT ATGTGGCGAT GCAGCCGCCG TGCGGGCCGA CCTTCGTGTC GGTGCCGATC GACGACTGGG CGCGGCCCGC GCAGCCGGTT CCGCTGCGCA ACGTGACGCG CGAACTCGGC CCGGAGCGCG CGGCGATGCA GGCGCTGGCG GAGGCGCTGG CGGAGGCGAA GAAGCCCGCC CTGGTGGTCG GCCCCGCGAT CGATCGCGCC GCCGCGGTCG ATTTGATGGT GCGCCTCGCC GAGCGCGCCA ACGCGCCGGT GTTCGTCAGT CCGTTCTCGG CGCGCTGCAG TTTCCCGGAG CGGCATCCGC TGTTCGCCGG CTTCCTGCCC GCCTCGCCGG GGCAACTCTC CGAAGCCATC GGCGCCTACG ACGTCGTGGT GGTGATCGGC GCACCGGTGT TCACCTTCCA TGTCGAAGGC CGCGCGTCGA TCTTCGACGG CGCAACGTCG CTGTTCCAGA TCACCGACGA CGCCGAGGCC GCGTCGGTGA CGCCGCTCGG CACCAGCATC ATCGCCACCA TGAAGCCGGC ATTATCGCTG CTGCTGGAGT TGTTACCAGA GACCCAATGC GCGGCGCCGC CGGCACGGGC GCTGCCGCCA GCGCCTGCCG CGGCCGATCC GATGCCGGCC GAATTTCTGC TCGATGCGTT GAGCAAGGCG ATGCCGGCCG GCACGATGCT GGTCGAGGAA GCGCCGTCGC ATCGGCTGGC GATGCAGAAA TTCATGCCGA TGCGCGGCCA GGACAGTTTC GCCACGATGG CGAGCGGCGG CCTCGGCTGG TCGCTGCCGG CCGCGGTCGG CTTCGCGCTG GCGCATCCGG AGCGCCGCAC CGTGTGCCTG ATCGGCGACG GCTCGGCGAT GTATTCGATC CAGGCGCTGT GGACTGCGGC AGAGCGCAAG CTGCCGCTGA CCGTGGTGGT GCTGAACAAT GGCGGCTACG GCGCGATGCG CTCGTTCAGC CAGGTGATGC AGGTCCGCGA CGTGCCCGGG CTGGAGCTGC CCGGGATCGA CTACGTCCAG CTCGCGCAGT CGATGGGCTG TGTCGCCGAA CGCGTGTCAC GCTGTGAGGA CCTCGCGCCG GTGCTCGCCC GCGCGCTGGC GCATGACGGC GTGTTCGTGG TCGAGGCGAC GCTGGATAGC GCGGTGCCGC TGCTGTACGC GAAGAACGGG TAG
|
Protein sequence | MPKKPKPSAA VSTVKSATLD LLRAFKIDKV FGNPGSTELP FLSDWPDDID YVLALQEASA MAMADGYAQA TRNAGFVNLH SAAGVGNALG NIYTAFKNQT PLLITAGQQA RSLLPLQAFL GAERASEFPR PYVKYSIEPA RAEDVPAAIA RAYYVAMQPP CGPTFVSVPI DDWARPAQPV PLRNVTRELG PERAAMQALA EALAEAKKPA LVVGPAIDRA AAVDLMVRLA ERANAPVFVS PFSARCSFPE RHPLFAGFLP ASPGQLSEAI GAYDVVVVIG APVFTFHVEG RASIFDGATS LFQITDDAEA ASVTPLGTSI IATMKPALSL LLELLPETQC AAPPARALPP APAAADPMPA EFLLDALSKA MPAGTMLVEE APSHRLAMQK FMPMRGQDSF ATMASGGLGW SLPAAVGFAL AHPERRTVCL IGDGSAMYSI QALWTAAERK LPLTVVVLNN GGYGAMRSFS QVMQVRDVPG LELPGIDYVQ LAQSMGCVAE RVSRCEDLAP VLARALAHDG VFVVEATLDS AVPLLYAKNG
|
| |