Gene Rpal_1799 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_1799 
Symbol 
ID6409456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp1931883 
End bp1933511 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content69% 
IMG OID642711686 
Productbenzoylformate decarboxylase 
Protein accessionYP_001990801 
Protein GI192290196 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.42222 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCCCGCGA AGAAGTCCAA ACAGCCGAGC GCTGCTGTCA GCACCGTCAA ATCCGCCACC 
CTCGATCTGC TCCGCGCCTT CAAGATCGAC AAGGTGTTCG GCAATCCCGG CTCGACCGAG
CTGCCGTTCC TCAGTGACTG GCCGGACGAT ATCGACTACG TGCTGGCGCT GCAGGAAGCT
TCGGCGGTGG CGATGGCCGA CGGCTACGCG CAGGCGACGC GCAACGCCGG CTTCGTCAAT
CTGCATTCGG CCGCCGGCGT CGGCAATGCG CTCGGCAACA TTTACAGCGC GTTCAAGAAC
CAGACGCCGC TGGTAATCAC CGCCGGCCAG CAAGCGCGCA GCCTGCTGCC GCTGCAGGCA
TTCCTGGGCG CCGAGCGCGC CTCCGAGTTT CCGCGGCCTT ATGTGAAGTA CAGCGTCGAG
CCGGCCCGCG CCGAGGACGT GCCCACCGCG ATTGCCCGTG CCTACTACGT CGCGATGCAG
CCGCCGTGCG GGCCGACCTT CGTGTCGGTA CCGATTGACG ATTGGGCGCG GCCGGCGGCG
CCGGTGCCAC CGCGCACGAT CACCCGCGAG ATCGGCCCGG ATCGATCAGC AATGCAGGTG
CTGGCCGATA CGCTCGCCAA CGCCAAGAAG CCGGCACTGG TGGTCGGGCC AGCGATCGAC
CGCGCCGCGG CGGTCGATCT GATGGTGCAG CTCGCCGAGC GCACGCGAGC GCCAGTGTGG
GTGTCGCCGT TCTCCGCACG CTGCAGCTTC CCGGAACAGC ATCTGCTGTT CGCCGGCTTC
CTGCCCGCCT CGCCGGGACA ACTCTCCGAA ACGCTCGGCG CCTACGACGT GATCGTGGTG
ATCGGCGCGC CGGTGTTCAC CTTTCATGTC GAAGGCCACG CCGCGATCTT CGATGGAAGC
TCGAAGCTGT TCCAGATCAC CGATGATGCC GAAGCCGCCT CGGTGACGCC GCTCGGCGCC
AGCATCATCG CGACGATGAC CCCGGCCCTG ACGCTGCTGC TGGAGTTGCT GCCGGAGACC
AAGCGCGCCG CACCGCCGGC CCGCGCGGTG CCGCCTGCAC CTCAGCCGGC CGAGCCGATG
CCGGTGGAGT ATCTGCTGCA CACCCTGCGC GCCGCGATGC CCCAGAGCGC GATGCTGGTC
GAGGAAGCGC CGTCGCACCG CCCGGCGATG CAGACATACA TGCCGATGCC GGGCCAGGAC
AGTTTCGCCA CGATGGCGAG CGGCGGCTTG GGCTGGTCGC TGCCGGCGTC GGTCGGTTTT
GCGCTGGCGC ATCCGAACCG CCGCACCGTC TGCCTGATCG GCGACGGCTC GGCGATGTAC
TCGATCCAGG CGCTGTGGAC GGCGGCGCAG CGCAAGCTGC CGCTGACCGT GGTGGTGCTG
AACAACGGCG GCTACGGCGC GATGCGCTCG TTCAGCCAGG TGATGCAGGT GCGGAACGTG
CCCGGGCTGG AGCTGCCCGG GATCGACTTC ACCGCTCTGG CGCAATCGCT CGGCTGCGAT
GCTGTGCGGG TGACGCGCAG CGAGGAACTG GCGCCGGCGC TGACGCGCGC CCTTGCATGG
GACGGCGTCA GCCTGGTCGA AGCGATGCTC GATACGTCGG TGCCGATGCT CTACGCGCGC
AACGGCTGA
 
Protein sequence
MPAKKSKQPS AAVSTVKSAT LDLLRAFKID KVFGNPGSTE LPFLSDWPDD IDYVLALQEA 
SAVAMADGYA QATRNAGFVN LHSAAGVGNA LGNIYSAFKN QTPLVITAGQ QARSLLPLQA
FLGAERASEF PRPYVKYSVE PARAEDVPTA IARAYYVAMQ PPCGPTFVSV PIDDWARPAA
PVPPRTITRE IGPDRSAMQV LADTLANAKK PALVVGPAID RAAAVDLMVQ LAERTRAPVW
VSPFSARCSF PEQHLLFAGF LPASPGQLSE TLGAYDVIVV IGAPVFTFHV EGHAAIFDGS
SKLFQITDDA EAASVTPLGA SIIATMTPAL TLLLELLPET KRAAPPARAV PPAPQPAEPM
PVEYLLHTLR AAMPQSAMLV EEAPSHRPAM QTYMPMPGQD SFATMASGGL GWSLPASVGF
ALAHPNRRTV CLIGDGSAMY SIQALWTAAQ RKLPLTVVVL NNGGYGAMRS FSQVMQVRNV
PGLELPGIDF TALAQSLGCD AVRVTRSEEL APALTRALAW DGVSLVEAML DTSVPMLYAR
NG