Gene RPB_3938 details


Gene Information

Locus tag: RPB_3938
Symbol:
ID: 3911745
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 4493600
End bp: 4495222
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 69%
IMG OID: 637885842
Product: benzoylformate decarboxylase
Protein accession: YP_487542
Protein GI: 86751046
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID:
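
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
start_bp = 4493600
end_bp = 4495222
gene_length_bp = 1623
protein_length_aa = 540


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks compare the computed value with the value stated in the record.
coordinates_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
```

Under these assumptions both checks hold: 4495222 - 4493600 + 1 = 1623 bp, and 1623 / 3 = 541 codons, of which 540 encode amino acids.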


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.23105
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
TTGCCGAAGA AGCCGAAGCC GAGCGCTGCC GTCAGCACCG TCAAATCCGC CACGCTCGAT 
CTGCTCCGCG CGTTCAAGAT CGACAAGGTG TTCGGCAATC CCGGCTCCAC CGAGCTGCCG
TTCCTCAGCG ACTGGCCGGA CGACATCGAC TACGTACTGG CGCTGCAGGA GGCGAGCGCG
ATGGCGATGG CCGACGGCTA CGCGCAGGCG ACGCGCAACG CCGGCTTCGT CAATCTGCAT
TCAGCCGCCG GCGTCGGCAA CGCGCTCGGC AACATCTACA CCGCGTTCAA GAACCAGACG
CCGCTGCTGA TCACCGCCGG CCAGCAGGCG CGCAGCCTGC TGCCGTTACA GGCGTTTCTC
GGCGCCGAAC GCGCATCCGA ATTTCCGCGG CCTTACGTGA AATACAGCAT CGAGCCGGCG
CGCGCCGAGG ACGTGCCGGC GGCGATCGCC CGCGCCTATT ATGTGGCGAT GCAGCCGCCG
TGCGGGCCGA CCTTCGTGTC GGTGCCGATC GACGACTGGG CGCGGCCCGC GCAGCCGGTT
CCGCTGCGCA ACGTGACGCG CGAACTCGGC CCGGAGCGCG CGGCGATGCA GGCGCTGGCG
GAGGCGCTGG CGGAGGCGAA GAAGCCCGCC CTGGTGGTCG GCCCCGCGAT CGATCGCGCC
GCCGCGGTCG ATTTGATGGT GCGCCTCGCC GAGCGCGCCA ACGCGCCGGT GTTCGTCAGT
CCGTTCTCGG CGCGCTGCAG TTTCCCGGAG CGGCATCCGC TGTTCGCCGG CTTCCTGCCC
GCCTCGCCGG GGCAACTCTC CGAAGCCATC GGCGCCTACG ACGTCGTGGT GGTGATCGGC
GCACCGGTGT TCACCTTCCA TGTCGAAGGC CGCGCGTCGA TCTTCGACGG CGCAACGTCG
CTGTTCCAGA TCACCGACGA CGCCGAGGCC GCGTCGGTGA CGCCGCTCGG CACCAGCATC
ATCGCCACCA TGAAGCCGGC ATTATCGCTG CTGCTGGAGT TGTTACCAGA GACCCAATGC
GCGGCGCCGC CGGCACGGGC GCTGCCGCCA GCGCCTGCCG CGGCCGATCC GATGCCGGCC
GAATTTCTGC TCGATGCGTT GAGCAAGGCG ATGCCGGCCG GCACGATGCT GGTCGAGGAA
GCGCCGTCGC ATCGGCTGGC GATGCAGAAA TTCATGCCGA TGCGCGGCCA GGACAGTTTC
GCCACGATGG CGAGCGGCGG CCTCGGCTGG TCGCTGCCGG CCGCGGTCGG CTTCGCGCTG
GCGCATCCGG AGCGCCGCAC CGTGTGCCTG ATCGGCGACG GCTCGGCGAT GTATTCGATC
CAGGCGCTGT GGACTGCGGC AGAGCGCAAG CTGCCGCTGA CCGTGGTGGT GCTGAACAAT
GGCGGCTACG GCGCGATGCG CTCGTTCAGC CAGGTGATGC AGGTCCGCGA CGTGCCCGGG
CTGGAGCTGC CCGGGATCGA CTACGTCCAG CTCGCGCAGT CGATGGGCTG TGTCGCCGAA
CGCGTGTCAC GCTGTGAGGA CCTCGCGCCG GTGCTCGCCC GCGCGCTGGC GCATGACGGC
GTGTTCGTGG TCGAGGCGAC GCTGGATAGC GCGGTGCCGC TGCTGTACGC GAAGAACGGG
TAG
 
Protein sequence
MPKKPKPSAA VSTVKSATLD LLRAFKIDKV FGNPGSTELP FLSDWPDDID YVLALQEASA 
MAMADGYAQA TRNAGFVNLH SAAGVGNALG NIYTAFKNQT PLLITAGQQA RSLLPLQAFL
GAERASEFPR PYVKYSIEPA RAEDVPAAIA RAYYVAMQPP CGPTFVSVPI DDWARPAQPV
PLRNVTRELG PERAAMQALA EALAEAKKPA LVVGPAIDRA AAVDLMVRLA ERANAPVFVS
PFSARCSFPE RHPLFAGFLP ASPGQLSEAI GAYDVVVVIG APVFTFHVEG RASIFDGATS
LFQITDDAEA ASVTPLGTSI IATMKPALSL LLELLPETQC AAPPARALPP APAAADPMPA
EFLLDALSKA MPAGTMLVEE APSHRLAMQK FMPMRGQDSF ATMASGGLGW SLPAAVGFAL
AHPERRTVCL IGDGSAMYSI QALWTAAERK LPLTVVVLNN GGYGAMRSFS QVMQVRDVPG
LELPGIDYVQ LAQSMGCVAE RVSRCEDLAP VLARALAHDG VFVVEATLDS AVPLLYAKNG
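
The sequences above are printed in 10-character blocks, so they need to be joined before any length or composition check. The following is a minimal sketch of that clean-up together with a GC-fraction calculation. The helper names are hypothetical, and the demo input is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove block spacing and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence above, used as a small demo.
demo = clean_sequence("TTGCCGAAGA AGCCGAAGCC")
demo_gc = gc_fraction(demo)  # 12 of the 20 bases are G or C

# Last block and final line of the gene sequence: check for a stop codon.
ends_with_stop = clean_sequence("GAAGAACGGG\nTAG").endswith(("TAA", "TAG", "TGA"))
```

Applied to the full gene sequence, the cleaned length and GC fraction can be compared with the Gene Length and GC content fields of this record.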