Gene Information |
Locus tag | RPB_0998 |
Symbol | |
ID | 3909295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1141411 |
End bp | 1143090 |
Gene Length | 1680 bp |
Protein Length | 559 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637882891 |
Product | thiamine pyrophosphate protein |
Protein accession | YP_484619 |
Protein GI | 86748123 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | |
|
|
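The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends with one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return length_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(1141411, 1143090)
print(length, protein_length_aa(length))  # 1680 559
```

Under these assumptions the result agrees with the Gene Length (1680 bp) and Protein Length (559 aa) rows.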
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.79453 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.343483 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATCACA TGCTCGAGAG CCGCACCGCC GCCGAGGCGC TGGTCGATCA ACTGGCGATC AACGGCGTCA AGCACGTGTT CTGCGTGCCG GGAGAGAGCT TCCTGCCGGT GCTGGACGCG CTGCGCGATC GCGACATCAC CATCACCGTC TGCCGCCATG AGGGCGGCGC GGCGATGATG GCGGAGGCGA TCGGCAAGGC GACCGGCCAG CCCGGCGTCT GTTTCGTCAC CCGCGGGCCG GGGGCGACCA ATGCGTCGGC GGGCATTCAC GTCGCGCGGC AGGACTCGAC GCCGATGATC CTGTTCGTCG GCCAGGTCGA ACGCGCGGTC AAGGAGCGCG AGGCGTTTCA GGAGCTCGAC TATCGCGCCG TGTTCGGCAG CATGACGAAA TGGACCACCG AGATCGACGA TGCCGATCGC GTCACCGAGC TGGTGTCGCG CGCGTTCTAC ACCGCGACCA GCGGCCGGCC GGGGCCGGTG GTGATCGCGC TGCCGAAGGA CGTGCTGAGC GAACGCGTCA CCGTCGGCCA TGCCCCGGCG TTCAGACCGG TCGAAACCTC GCCCGGCGAC GAGGAGATGG ACGAACTCGC GGCGCTGCTC GCCGGCGCCG AGCGACCGCT GATCGTGCTC GGCGGCAGCC GCTGGAGCCT GAAGGCGCGC GAGCAGATCG AACAGATCGC GACTCGCACC GGGCTGCCGG TCGCGACCAG CTATCGCCGC GGCACGCTGT TCGACGTGAT GCATCCGCGC TATGCGGGTG ATCTCGGCCT CGGACCCAAT CCGAAGCTGG TGGCGCGCGC CAAGGCAGCC GATCTCGTGG TGCTGATCGG CGGACGGCTG GGCGAAATTC CGTCGCAGGG CTACACCCTG CTCGACAGTC CCGCGCCGCA GACCAAGCTG GTGCACATCC ATCCCGGCGC CGAGGAACTC GGCCGGGTGT ATCGTCCGCA ACTCGCGATC CACGCCTCGC CGGCGCGCTT CGTCGCGGCG CTGGCGAAAC GCGACATCGT CGCGCGACCG GAGTGGCGGG ACGCCGCCGA CGCGGCGCAT GCCGACTATC TCGCCTGGAC CGAGACCGCG ACGCCGCAGC CCGGCGACGT CAATCTCGGC GAGGTGATGA TTTGGCTGCG CGACAACGTG CCGGCCGACA CCATCCTGTG CAACGGCGCC GGCAACTACG CGTCGTGGAT TCACCGGTTC TATCGTTTCC GTCACTACAT GACCCACATC GCGCCGACCT CGGCCTCGAT GGGCTACGGC ATGCCGGCCG CGATCGCGAT GCAGCGGCTG CATCCGGAGC GGTTGGTGCT GTCGGTCAAT GGCGACGGCG ATTTCCTGAT GAGCGGTCAG GAATTCGCCA TCGCCGTGCA GTATCGGCTG CCGATCGTCG TGGTGGTCTG CGACAACGGC ATGTACGGCA CCATCCGGAT GCATCAGGAG CGCGAGTTTC CGGGACGCGT CGCCGCCACC GAGCTGCACA ATCCGGATTT CGCCGCCTAT GCGCGCGCCT TCGGCGGCTT CGGCGCCAAT GTCGAGAAGA CGGCCGACTT CCCGGCGGCG TTCACCGCGG CGCGCGCCTC CGGCCTGCCG TCGATCATCC ATCTGAAGAT CGACCCCGAC GCGATCCTCC CCGGCGCGAC GCTGTCCGGC ATCCGCGCGG CGGCGCTGGA AAAGGCGTAG
|
Protein sequence | MHHMLESRTA AEALVDQLAI NGVKHVFCVP GESFLPVLDA LRDRDITITV CRHEGGAAMM AEAIGKATGQ PGVCFVTRGP GATNASAGIH VARQDSTPMI LFVGQVERAV KEREAFQELD YRAVFGSMTK WTTEIDDADR VTELVSRAFY TATSGRPGPV VIALPKDVLS ERVTVGHAPA FRPVETSPGD EEMDELAALL AGAERPLIVL GGSRWSLKAR EQIEQIATRT GLPVATSYRR GTLFDVMHPR YAGDLGLGPN PKLVARAKAA DLVVLIGGRL GEIPSQGYTL LDSPAPQTKL VHIHPGAEEL GRVYRPQLAI HASPARFVAA LAKRDIVARP EWRDAADAAH ADYLAWTETA TPQPGDVNLG EVMIWLRDNV PADTILCNGA GNYASWIHRF YRFRHYMTHI APTSASMGYG MPAAIAMQRL HPERLVLSVN GDGDFLMSGQ EFAIAVQYRL PIVVVVCDNG MYGTIRMHQE REFPGRVAAT ELHNPDFAAY ARAFGGFGAN VEKTADFPAA FTAARASGLP SIIHLKIDPD AILPGATLSG IRAAALEKA
|
| |
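The sequence fields are printed in space-separated blocks of ten characters, so they need to be cleaned before reuse. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, computes a GC fraction, and translates a forward-strand CDS. The function names are hypothetical. The codon table is built from the standard NCBI amino-acid string; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons, which does not matter here because this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a forward-strand CDS, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the Gene sequence field above.
prefix = "ATGCATCACA TGCTCGAGAG CCGCACCGCC"
print(translate(prefix))  # MHHMLESRTA
```

The translated prefix matches the first block of the Protein sequence field. Passing the full Gene sequence string to `gc_fraction` and `translate` would allow the same comparison against the GC content and Protein sequence rows.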