Gene RPB_0998 details


Gene Information

Locus tag: RPB_0998
Symbol:
ID: 3909295
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1141411
End bp: 1143090
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 69%
IMG OID: 637882891
Product: thiamine pyrophosphate protein
Protein accession: YP_484619
Protein GI: 86748123
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID:
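
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive (an assumption that reproduces the reported gene length), and the function names are hypothetical, not part of any database API.

```python
def gene_span(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
START_BP = 1141411
END_BP = 1143090

span = gene_span(START_BP, END_BP)
print(span)                  # 1680, matches "Gene Length"
print(protein_length(span))  # 559, matches "Protein Length"
```

Both results agree with the Gene Length and Protein Length fields, which is consistent with the record describing an unspliced CDS that ends in a single stop codon.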


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.79453
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.343483
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATCACA TGCTCGAGAG CCGCACCGCC GCCGAGGCGC TGGTCGATCA ACTGGCGATC 
AACGGCGTCA AGCACGTGTT CTGCGTGCCG GGAGAGAGCT TCCTGCCGGT GCTGGACGCG
CTGCGCGATC GCGACATCAC CATCACCGTC TGCCGCCATG AGGGCGGCGC GGCGATGATG
GCGGAGGCGA TCGGCAAGGC GACCGGCCAG CCCGGCGTCT GTTTCGTCAC CCGCGGGCCG
GGGGCGACCA ATGCGTCGGC GGGCATTCAC GTCGCGCGGC AGGACTCGAC GCCGATGATC
CTGTTCGTCG GCCAGGTCGA ACGCGCGGTC AAGGAGCGCG AGGCGTTTCA GGAGCTCGAC
TATCGCGCCG TGTTCGGCAG CATGACGAAA TGGACCACCG AGATCGACGA TGCCGATCGC
GTCACCGAGC TGGTGTCGCG CGCGTTCTAC ACCGCGACCA GCGGCCGGCC GGGGCCGGTG
GTGATCGCGC TGCCGAAGGA CGTGCTGAGC GAACGCGTCA CCGTCGGCCA TGCCCCGGCG
TTCAGACCGG TCGAAACCTC GCCCGGCGAC GAGGAGATGG ACGAACTCGC GGCGCTGCTC
GCCGGCGCCG AGCGACCGCT GATCGTGCTC GGCGGCAGCC GCTGGAGCCT GAAGGCGCGC
GAGCAGATCG AACAGATCGC GACTCGCACC GGGCTGCCGG TCGCGACCAG CTATCGCCGC
GGCACGCTGT TCGACGTGAT GCATCCGCGC TATGCGGGTG ATCTCGGCCT CGGACCCAAT
CCGAAGCTGG TGGCGCGCGC CAAGGCAGCC GATCTCGTGG TGCTGATCGG CGGACGGCTG
GGCGAAATTC CGTCGCAGGG CTACACCCTG CTCGACAGTC CCGCGCCGCA GACCAAGCTG
GTGCACATCC ATCCCGGCGC CGAGGAACTC GGCCGGGTGT ATCGTCCGCA ACTCGCGATC
CACGCCTCGC CGGCGCGCTT CGTCGCGGCG CTGGCGAAAC GCGACATCGT CGCGCGACCG
GAGTGGCGGG ACGCCGCCGA CGCGGCGCAT GCCGACTATC TCGCCTGGAC CGAGACCGCG
ACGCCGCAGC CCGGCGACGT CAATCTCGGC GAGGTGATGA TTTGGCTGCG CGACAACGTG
CCGGCCGACA CCATCCTGTG CAACGGCGCC GGCAACTACG CGTCGTGGAT TCACCGGTTC
TATCGTTTCC GTCACTACAT GACCCACATC GCGCCGACCT CGGCCTCGAT GGGCTACGGC
ATGCCGGCCG CGATCGCGAT GCAGCGGCTG CATCCGGAGC GGTTGGTGCT GTCGGTCAAT
GGCGACGGCG ATTTCCTGAT GAGCGGTCAG GAATTCGCCA TCGCCGTGCA GTATCGGCTG
CCGATCGTCG TGGTGGTCTG CGACAACGGC ATGTACGGCA CCATCCGGAT GCATCAGGAG
CGCGAGTTTC CGGGACGCGT CGCCGCCACC GAGCTGCACA ATCCGGATTT CGCCGCCTAT
GCGCGCGCCT TCGGCGGCTT CGGCGCCAAT GTCGAGAAGA CGGCCGACTT CCCGGCGGCG
TTCACCGCGG CGCGCGCCTC CGGCCTGCCG TCGATCATCC ATCTGAAGAT CGACCCCGAC
GCGATCCTCC CCGGCGCGAC GCTGTCCGGC ATCCGCGCGG CGGCGCTGGA AAAGGCGTAG
 
Protein sequence
MHHMLESRTA AEALVDQLAI NGVKHVFCVP GESFLPVLDA LRDRDITITV CRHEGGAAMM 
AEAIGKATGQ PGVCFVTRGP GATNASAGIH VARQDSTPMI LFVGQVERAV KEREAFQELD
YRAVFGSMTK WTTEIDDADR VTELVSRAFY TATSGRPGPV VIALPKDVLS ERVTVGHAPA
FRPVETSPGD EEMDELAALL AGAERPLIVL GGSRWSLKAR EQIEQIATRT GLPVATSYRR
GTLFDVMHPR YAGDLGLGPN PKLVARAKAA DLVVLIGGRL GEIPSQGYTL LDSPAPQTKL
VHIHPGAEEL GRVYRPQLAI HASPARFVAA LAKRDIVARP EWRDAADAAH ADYLAWTETA
TPQPGDVNLG EVMIWLRDNV PADTILCNGA GNYASWIHRF YRFRHYMTHI APTSASMGYG
MPAAIAMQRL HPERLVLSVN GDGDFLMSGQ EFAIAVQYRL PIVVVVCDNG MYGTIRMHQE
REFPGRVAAT ELHNPDFAAY ARAFGGFGAN VEKTADFPAA FTAARASGLP SIIHLKIDPD
AILPGATLSG IRAAALEKA
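
As a consistency check between the two sequences, the gene sequence can be translated and compared with the protein sequence. The sketch below is a minimal standard-library example, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, which does not matter here because the gene begins with ATG. The names `CODON_TABLE`, `translate`, `FIRST_ROW`, and `LAST_ROW` are hypothetical; the two rows are copied from the first and last lines of the gene sequence above.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last 60-nt rows of the gene sequence shown above.
FIRST_ROW = "ATGCATCACA TGCTCGAGAG CCGCACCGCC GCCGAGGCGC TGGTCGATCA ACTGGCGATC"
LAST_ROW = "GCGATCCTCC CCGGCGCGAC GCTGTCCGGC ATCCGCGCGG CGGCGCTGGA AAAGGCGTAG"

print(translate(FIRST_ROW))  # MHHMLESRTAAEALVDQLAI
print(translate(LAST_ROW))   # AILPGATLSGIRAAALEKA*
```

The first row yields the first 20 residues of the protein sequence, and the last row yields its final 19 residues followed by the stop codon TAG, which is not part of the 559 aa protein.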