Gene Information |
Locus tag | Rpal_1799 |
Symbol | |
ID | 6409456 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 1931883 |
End bp | 1933511 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 642711686 |
Product | benzoylformate decarboxylase |
Protein accession | YP_001990801 |
Protein GI | 192290196 |
COG category | [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
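The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon; neither convention is stated in the record. The function names are hypothetical and used only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(1931883, 1933511)
print(length)                     # 1629
print(protein_length_aa(length))  # 542
```

Under these assumptions the computed values agree with the Gene Length (1629 bp) and Protein Length (542 aa) fields.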
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.42222 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCCCGCGA AGAAGTCCAA ACAGCCGAGC GCTGCTGTCA GCACCGTCAA ATCCGCCACC CTCGATCTGC TCCGCGCCTT CAAGATCGAC AAGGTGTTCG GCAATCCCGG CTCGACCGAG CTGCCGTTCC TCAGTGACTG GCCGGACGAT ATCGACTACG TGCTGGCGCT GCAGGAAGCT TCGGCGGTGG CGATGGCCGA CGGCTACGCG CAGGCGACGC GCAACGCCGG CTTCGTCAAT CTGCATTCGG CCGCCGGCGT CGGCAATGCG CTCGGCAACA TTTACAGCGC GTTCAAGAAC CAGACGCCGC TGGTAATCAC CGCCGGCCAG CAAGCGCGCA GCCTGCTGCC GCTGCAGGCA TTCCTGGGCG CCGAGCGCGC CTCCGAGTTT CCGCGGCCTT ATGTGAAGTA CAGCGTCGAG CCGGCCCGCG CCGAGGACGT GCCCACCGCG ATTGCCCGTG CCTACTACGT CGCGATGCAG CCGCCGTGCG GGCCGACCTT CGTGTCGGTA CCGATTGACG ATTGGGCGCG GCCGGCGGCG CCGGTGCCAC CGCGCACGAT CACCCGCGAG ATCGGCCCGG ATCGATCAGC AATGCAGGTG CTGGCCGATA CGCTCGCCAA CGCCAAGAAG CCGGCACTGG TGGTCGGGCC AGCGATCGAC CGCGCCGCGG CGGTCGATCT GATGGTGCAG CTCGCCGAGC GCACGCGAGC GCCAGTGTGG GTGTCGCCGT TCTCCGCACG CTGCAGCTTC CCGGAACAGC ATCTGCTGTT CGCCGGCTTC CTGCCCGCCT CGCCGGGACA ACTCTCCGAA ACGCTCGGCG CCTACGACGT GATCGTGGTG ATCGGCGCGC CGGTGTTCAC CTTTCATGTC GAAGGCCACG CCGCGATCTT CGATGGAAGC TCGAAGCTGT TCCAGATCAC CGATGATGCC GAAGCCGCCT CGGTGACGCC GCTCGGCGCC AGCATCATCG CGACGATGAC CCCGGCCCTG ACGCTGCTGC TGGAGTTGCT GCCGGAGACC AAGCGCGCCG CACCGCCGGC CCGCGCGGTG CCGCCTGCAC CTCAGCCGGC CGAGCCGATG CCGGTGGAGT ATCTGCTGCA CACCCTGCGC GCCGCGATGC CCCAGAGCGC GATGCTGGTC GAGGAAGCGC CGTCGCACCG CCCGGCGATG CAGACATACA TGCCGATGCC GGGCCAGGAC AGTTTCGCCA CGATGGCGAG CGGCGGCTTG GGCTGGTCGC TGCCGGCGTC GGTCGGTTTT GCGCTGGCGC ATCCGAACCG CCGCACCGTC TGCCTGATCG GCGACGGCTC GGCGATGTAC TCGATCCAGG CGCTGTGGAC GGCGGCGCAG CGCAAGCTGC CGCTGACCGT GGTGGTGCTG AACAACGGCG GCTACGGCGC GATGCGCTCG TTCAGCCAGG TGATGCAGGT GCGGAACGTG CCCGGGCTGG AGCTGCCCGG GATCGACTTC ACCGCTCTGG CGCAATCGCT CGGCTGCGAT GCTGTGCGGG TGACGCGCAG CGAGGAACTG GCGCCGGCGC TGACGCGCGC CCTTGCATGG GACGGCGTCA GCCTGGTCGA AGCGATGCTC GATACGTCGG TGCCGATGCT CTACGCGCGC AACGGCTGA
|
Protein sequence | MPAKKSKQPS AAVSTVKSAT LDLLRAFKID KVFGNPGSTE LPFLSDWPDD IDYVLALQEA SAVAMADGYA QATRNAGFVN LHSAAGVGNA LGNIYSAFKN QTPLVITAGQ QARSLLPLQA FLGAERASEF PRPYVKYSVE PARAEDVPTA IARAYYVAMQ PPCGPTFVSV PIDDWARPAA PVPPRTITRE IGPDRSAMQV LADTLANAKK PALVVGPAID RAAAVDLMVQ LAERTRAPVW VSPFSARCSF PEQHLLFAGF LPASPGQLSE TLGAYDVIVV IGAPVFTFHV EGHAAIFDGS SKLFQITDDA EAASVTPLGA SIIATMTPAL TLLLELLPET KRAAPPARAV PPAPQPAEPM PVEYLLHTLR AAMPQSAMLV EEAPSHRPAM QTYMPMPGQD SFATMASGGL GWSLPASVGF ALAHPNRRTV CLIGDGSAMY SIQALWTAAQ RKLPLTVVVL NNGGYGAMRS FSQVMQVRNV PGLELPGIDF TALAQSLGCD AVRVTRSEEL APALTRALAW DGVSLVEAML DTSVPMLYAR NG
|
| |
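The relationship between the gene sequence and the protein sequence can be sketched as a translation under NCBI translation table 11, as listed in the record. This is a minimal illustration, not the database's own pipeline: it assumes the gene sequence is given 5' to 3' in the coding orientation, that an alternative start codon such as TTG is read as methionine at the first position, and that translation stops at the first stop codon. The names `translate_cds` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); the amino-acid assignments
# of table 11 are identical to the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence; spaces between 10-base blocks are ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon is read as Met
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = "".join(dna.split()).upper()
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGCCCGCGA AGAAGTCCAA ACAGCCGAGC"))  # MPAKKSKQPS
```

Applied to the first 30 bases, the sketch reproduces the first ten residues of the listed protein sequence; the same functions can be run on the full gene sequence to compare against the Protein sequence and GC content fields.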