Gene RPB_0724 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0724 
Symbol 
ID3910020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp812186 
End bp815032 
Gene Length2847 bp 
Protein Length948 aa 
Translation table11 
GC content66% 
IMG OID637882616 
Productformate dehydrogenase, alpha subunit 
Protein accessionYP_484346 
Protein GI86747850 
COG category[R] General function prediction only 
COG ID[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID[TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.696908 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.792249 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCTGG TTCACGAAAC CGATTTCGGC ACGCCGCGCT CGCCGTCCGA GACGATGGTG 
ACGCTGACGA TCGACGGCCG CAACGTCAGC GTGCCCGAGG GCACCTCGAT CATGCGCGCC
GCGATGGAGA TCGGCACCGC GATTCCGAAA CTCTGCGCCA CCGACATGGT CGACGCGTTC
GGCTCGTGCC GGCTGTGCCT GGTCGAGATC GACGGCCGCA GCGGCACGCC GGCGTCCTGC
ACCACGCCGG TCGCCGACGG GCTGGTGGTG AAGACCCAGA CCGCGCGGCT GAAGCAGATC
CGCAAAGGCG TGATGGAGCT GTATATCTCC GACCACCCGC TCGACTGCCT GACCTGCTCG
GCCAATGGCG ATTGCGAATT GCAGGACATG GCCGGCGCGG TCGGCCTGCG CGACGTGCGC
TATGGCTATA GCGGCGAGAA GCACCCGAAT CCGGGGCTCG ATGAGTCCAA CCCTTATTTC
ACCTACGATC CGTCGAAATG CATCGTCTGC TCGCGCTGCG TCCGCGCGTG TGAAGAAGTG
CAAGGCACCT TCGCGCTGAC CATCGCCGGT CGCGGCTTCG ACTCGGTCGT CTCACCGGGC
ATGCAGGAGA GCTTCCTCGG CTCGGAATGC GTCTCCTGCG GCGCCTGCGT GCAGGCCTGC
CCGACCGCGA CGCTGAACGA GAAGAGCGTG ATCGAGATCG GCACGCCGGA GCGCTCGGTG
GTGACGACCT GCGCGTATTG CGGCGTCGGC TGCACCTTCA AGGCCGAGAT GCGCGGCGAG
GAAGTCGTCC GCATGGTGCC CTTCAAAGAC GGCAAGGCCA ATCGCGGCCA TTCCTGCGTC
AAGGGCCGCT TTGCCTGGGG CTACGCCAAC CACAAGGAAC GCATCCTCAA CCCGATGATC
CGCGCCAGCA TCAGCGAGCC GTGGCGCGAA GTGAGCTGGG ACGAAGCGTT TGCTTACGCA
GCTTCGGAGT TGAAGCGGAT CCAGGCCACA TACGGCCGCG ATTCGATCGG CGGCATCACC
TCGTCGCGCT GCACCAACGA AGAAACCTTC CTGGTGCAGA AGCTGATCCG CGCCGGCTTC
GGCAACAACA ATGTCGACAC CTGTGCCCGG GTCTGCCATT CGCCGACCGG CTACGGCCTC
TCCACCGCAT TCGGCACCTC GGCCGGCACC CAGGATTTCG ACTCGGTCGA GCACACCGAC
GTGGTGATGC TGATCGGCGC CAATCCGACC GACGGCCACC CGGTGTTCGC CTCGCGGCTG
AAAAAGCGGC TGCGCGCCGG CGCCAAGCTG ATCGTGGTCG ATCCGCGCCG GATCGATCTG
GTGCGCTCGG CGCATGTCGA GGCGGCGCAG CATCTGCCGC TGAAGCCCGG CACCAACGTC
GCGGTGCTGA CGTCGATCGC CCATGTGATC GTCACCGAGG GACTCACCCA CGAGGCCTTC
GTGCGCGAGC GCTGCGACTG GAGCGAATAC GAGCACTGGG CGGCGTTCGT GGCGCAGCCG
GCCAATAGTC CGGAAGCGAC CAGCGCGATG ACCGGCGTCG ATCCGCAGGC GTTGCGCGAG
GCGGCAAGGC TGTACGCCAC CGGCGGCAAC GGCGCGATCT ATTACGGCCT CGGCGTCACC
GAGCACAGCC AGGGCTCGAC CACCGTGATG GCGATCGCCA ACCTCGCAAT GGCCACCGGC
AATCTCGGCC GGCCCGGCGT CGGGGTGAAC CCGCTGCGCG GCCAGAACAA TGTGCAGGGC
GCCTGCGACA TGGGCTCGTT CCCGCACGAA CTGCCGGGCT ATCGCCACAT CTCCAGCGAC
GCGGTGCGCG AGAGCTTCGA GGCGCTGTGG GGCGTGACGC TGAACAGCGA GCCGGGGCTG
CGCATCCCCA ACATGCTCGA CGCCGCGGTC GATGGTTCGT TCAAGGCGCT CTACGTGCAG
GGCGAGGACA TCCTGCAATC CGACCCCAAC ACCAGACACG TCGCCGCCGG GCTCGAAGCG
ATGGAATGCG TCATCGTGCA CGATCTGTTT CTCAACGAGA CCGCGAACTA CGCGCATATT
TTCCTGCCCG GCTCGACCTT CCTGGAGAAG AACGGCACCT TCACCAATGC GGAGCGCCGC
ATCCAGCGCG TCCGCAAGGT GATGACGCCG CGCAATGGGC AGGAGGACTG GGAAGTCACG
CAGCGGCTCG CCAATGCGAT GGGCTTTTCG ATGAGCTACG AGCATCCGTC GCAGATCATG
GACGAGATCG CGGCGCTGAC GCCGACCTTT GCGGGCGTGT CCTACGACCG GCTCGAGCAG
CTCGGCTCGA TCCAATGGCC GTGCAACGAG CGCGCGCCGG ACGGCACGCC GGTGATGCAC
ATCGACGCTT TCGTCCGCGG CAAGGGCAAG TTCGTCATCA CCGAATATGT CGCCACCGAC
GAGCGCACCG GGCCACGCTT CCCGCTGCTG CTGACCACCG GCCGCATCCT GTCGCAGTAC
AATGTCGGCG CGCAGACGCG ACGGACCGCC AATACGGTGT GGCACGACGA GGACCGTCTC
GAAATCCATC CGCACGATGC CGAGCAGCGC GGCGTCAGGG ATGGCGATTG GGTGCGACTC
GCCAGCCGCG CCGGCGAGAC CACGCTGCGC GCGCTGATCA CCGATCGGGT CGCGCCGGGC
GTGGTCTACA CCACGTTCCA CCACCCGGAC ACGCAGGCCA ACGTCGTGAC GACCGAGTAT
TCCGACTGGG CCACCAACTG CCCGGAATAC AAAGTGACGG CGGTGCAGGT GACGCCGTCC
AATGGTCCAT CGGAATGGCA GCGCGACTAT GCCCAGCAGG CGGCCGCCGC TCGCCGGATC
GCGACGCCTG CAGAGGCCGC AGAATGA
 
Protein sequence
MALVHETDFG TPRSPSETMV TLTIDGRNVS VPEGTSIMRA AMEIGTAIPK LCATDMVDAF 
GSCRLCLVEI DGRSGTPASC TTPVADGLVV KTQTARLKQI RKGVMELYIS DHPLDCLTCS
ANGDCELQDM AGAVGLRDVR YGYSGEKHPN PGLDESNPYF TYDPSKCIVC SRCVRACEEV
QGTFALTIAG RGFDSVVSPG MQESFLGSEC VSCGACVQAC PTATLNEKSV IEIGTPERSV
VTTCAYCGVG CTFKAEMRGE EVVRMVPFKD GKANRGHSCV KGRFAWGYAN HKERILNPMI
RASISEPWRE VSWDEAFAYA ASELKRIQAT YGRDSIGGIT SSRCTNEETF LVQKLIRAGF
GNNNVDTCAR VCHSPTGYGL STAFGTSAGT QDFDSVEHTD VVMLIGANPT DGHPVFASRL
KKRLRAGAKL IVVDPRRIDL VRSAHVEAAQ HLPLKPGTNV AVLTSIAHVI VTEGLTHEAF
VRERCDWSEY EHWAAFVAQP ANSPEATSAM TGVDPQALRE AARLYATGGN GAIYYGLGVT
EHSQGSTTVM AIANLAMATG NLGRPGVGVN PLRGQNNVQG ACDMGSFPHE LPGYRHISSD
AVRESFEALW GVTLNSEPGL RIPNMLDAAV DGSFKALYVQ GEDILQSDPN TRHVAAGLEA
MECVIVHDLF LNETANYAHI FLPGSTFLEK NGTFTNAERR IQRVRKVMTP RNGQEDWEVT
QRLANAMGFS MSYEHPSQIM DEIAALTPTF AGVSYDRLEQ LGSIQWPCNE RAPDGTPVMH
IDAFVRGKGK FVITEYVATD ERTGPRFPLL LTTGRILSQY NVGAQTRRTA NTVWHDEDRL
EIHPHDAEQR GVRDGDWVRL ASRAGETTLR ALITDRVAPG VVYTTFHHPD TQANVVTTEY
SDWATNCPEY KVTAVQVTPS NGPSEWQRDY AQQAAAARRI ATPAEAAE