Gene RPD_0623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0623 
Symbol 
ID4021092 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp705647 
End bp708493 
Gene Length2847 bp 
Protein Length948 aa 
Translation table11 
GC content66% 
IMG OID637960811 
Productformate dehydrogenase, alpha subunit 
Protein accessionYP_567762 
Protein GI91975103 
COG category[R] General function prediction only 
COG ID[COG3383] Uncharacterized anaerobic dehydrogenase 
TIGRFAM ID[TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.816724 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTCTGG TCCACGAAAT CGATTTCGGC ACGCCGCGTT CGCCATCCGA AACGATGGTG 
ACGCTGACGA TCGACGGCCG CAGCGTGTCG GTGCCCGAAG GCACCTCGAT CATGCGTGCG
GCGATGGAGA TCGGCACCGC GATCCCGAAA CTGTGCGCCA CCGACATGGT CGACGCGTTC
GGCTCCTGCC GGCTGTGCCT GGTCGAGATC GACGGCCGCA GCGGCACGCC AGCGTCCTGC
ACCACGCCGG TCGCCGACGG CCTCGTGGTG AAGACCCAAA CCGAGCGGCT GAAGCAGATC
CGCAAGGGCG TGATGGAGCT GTATATCTCC GACCATCCGC TCGACTGCCT GACCTGTTCG
GCCAACGGCG ATTGCGAACT GCAGGACATG GCCGGCGCGG TCGGCCTGCG CGACGTGCGC
TACGGTTACT CCGGCAACAA GCATCCCAAT CCGGGGCTGG ATGAGTCCAA CCCGTATTTC
ACTTACGATG CGTCGAAGTG CATCGTCTGC TCGCGCTGCG TGCGCGCCTG CGAGGAGGTG
CAAGGCACCT TCGCACTGAC CATCGCCGGT CGCGGCTTCG GCTCGGTGGT GTCGCCGGGG
ATGCAGGAAT CCTTCCTCGG CTCGGAATGC GTTTCCTGCG GCGCCTGCGT GCAGGCCTGC
CCGACCGCGA CGCTGAACGA GAAATCGGTG ATCGAGATCG GCACGCCGGA GCGATCGGTG
GTGACGACCT GCGCCTATTG CGGCGTCGGC TGCACCTTCA AGGCGGAGAT GCGTGGCGAG
GAAGTGGTCC GCATGGTGCC GTACAAGGAC GGCAAGGCCA ATCGCGGCCA TTCCTGCGTC
AAGGGCCGGT TCGCCTGGGG CTACACCAAC CACAAGGAAC GCATCCTCAA GCCGATGATC
CGCGCCAGGA TCACCGACCC GTGGCGCGAG GTAAGCTGGG ACGAGGCGTT CTCCCATGCG
GCCTCGGAGC TGAAGCGCAT CCAGGCCAAA TACGGTCGCG ACTCGATCGG TGGCATCACC
TCGTCGCGCT GCACCAATGA AGAGACCTTC CTGGTGCAGA AGCTGATCCG CGCCGGCTTC
GGTAACAACA ATGTCGACAC CTGCGCGCGG GTCTGCCACT CGCCGACCGG CTACGGCCTC
TCCACCGCCT TCGGCACCTC GGCCGGCACC CAGGATTTCG ACTCGGTCGA GCACACCGAC
GTGGTGATGA TCATCGGCGC CAATCCGACC GACGGCCATC CGGTGTTCGC CTCACGGTTG
AAGAAGCGGC TGCGCGCCGG CGCCAAACTG ATCGTGGTCG ACCCGCGCCG GATCGACCTG
GTGCGCTCGG CCCATGTCGA GGCGGCGCAG CATCTGCCGC TGAAGCCCGG CACAAACGTC
GCGGTGCTGA CCGCGCTGGC GCATGTGATC GTCACCGAGG GGCTCGCCAA CGAAGCCTTC
GTGCGCGAAC GCTGCGACTG GAGCGAATAC GAGCACTGGG CGTCGTTCGT GGCGCAGCCG
AACAACAGCC CGGAAGCGAC CGCAGCGATG ACCGGCGTCG ATCCGCAAGC GCTGCGCGAA
GCGGCGCGGC TCTACGCCAC CGGCGGCAAC GGCGCGATCT ATTACGGCCT CGGCGTCACC
GAACACAGCC AGGGCTCGAC CACCGTGATG GCGATCGCCA ACCTGGCGAT GGTCACCGGC
AATCTCGGCC GCCAAGGTGT CGGCGTGAAC CCGCTGCGCG GCCAGAACAA CGTTCAGGGC
GCCTGCGACA TGGGCTCGTT CCCGCACGAG CTGCCCGGCT ATCGCCACAT CTCCACCGAC
GCGGTGCGCG ACAGCTTCGA GGCGCTGTGG GGCGTGACGC TGAACAGCGA ACCGGGCCTA
CGCATTCCCA ACATGCTCGA CGCCGCGGTC GATGGCTCGT TCAAGGCGCT CTATGTCCAG
GGCGAGGACA TTCTGCAGTC CGATCCCAAC ACCAAGCACG TCGCCGCCGG CCTCGAAGCG
ATGGAATGCG TGATCGTGCA CGATCTCTTC CTCAACGAGA CCGCGAACTA CGCGCATATC
TTCCTGCCGG GTTCGACCTT CCTGGAGAAG AACGGCACCT TCACCAATGC CGAGCGCCGC
ATCCAGCGGG TCCGCAAGGT GATGACGCCG AAGAACGGGC TGGAGGACTG GGAAGTCACG
CTTCGCCTCG CCGAGGCGAT GGGTTTCAAG ATGAGCTACG ATCACCCGTC GCAGATCATG
GACGAGATCG CCACGCTGAC GCCGACCTTC ACCGGCGTCT CCTACGCAAG ACTCGACGAG
CTCGGCTCGA TCCAGTGGCC GTGCAACGCC AACGCGCCGG AGGGCACGCC GGTGATGCAC
ATCGATCACT TCGTCCGCGG CAAGGGCAAG TTCGTCATCA CCGAATATGT CGCGACCGAC
GAGCGCACCG GGCCGCGCTT CCCGCTGCTG CTGACCACCG GCCGCATCCT GTCGCAGTAC
AATGTCGGCG CGCAGACGCG GCGCACCGCC AATACGGTAT GGCACGACGA GGACCGGCTC
GAAATCCATC CGCACGACGC CGAGCAGCGC GGCGTCAGGG ACGGCGATTG GGTGCGGCTG
GCGAGCCGCG CCGGCGAGAC GACGTTGCGC GCGCTGATCA CCGATCGGGT CGCGCCGGGC
GTGGTCTACA CCACGTTCCA CCATCCCGAC ACGCAAGCCA ATGTCGTCAC CACCGAATAT
TCCGACTGGG CGACCAATTG CCCGGAATAC AAGGTGACCG CGGTACAGGT GTCGCCGTCC
AATGGTCCGT CGGAGTGGCA GCGCGCCTAT GAGGAGCAGG CGACCGCGGC GCGCAGGATC
GCCTCGGCCG CCGAAGCCGC GGAGTGA
 
Protein sequence
MALVHEIDFG TPRSPSETMV TLTIDGRSVS VPEGTSIMRA AMEIGTAIPK LCATDMVDAF 
GSCRLCLVEI DGRSGTPASC TTPVADGLVV KTQTERLKQI RKGVMELYIS DHPLDCLTCS
ANGDCELQDM AGAVGLRDVR YGYSGNKHPN PGLDESNPYF TYDASKCIVC SRCVRACEEV
QGTFALTIAG RGFGSVVSPG MQESFLGSEC VSCGACVQAC PTATLNEKSV IEIGTPERSV
VTTCAYCGVG CTFKAEMRGE EVVRMVPYKD GKANRGHSCV KGRFAWGYTN HKERILKPMI
RARITDPWRE VSWDEAFSHA ASELKRIQAK YGRDSIGGIT SSRCTNEETF LVQKLIRAGF
GNNNVDTCAR VCHSPTGYGL STAFGTSAGT QDFDSVEHTD VVMIIGANPT DGHPVFASRL
KKRLRAGAKL IVVDPRRIDL VRSAHVEAAQ HLPLKPGTNV AVLTALAHVI VTEGLANEAF
VRERCDWSEY EHWASFVAQP NNSPEATAAM TGVDPQALRE AARLYATGGN GAIYYGLGVT
EHSQGSTTVM AIANLAMVTG NLGRQGVGVN PLRGQNNVQG ACDMGSFPHE LPGYRHISTD
AVRDSFEALW GVTLNSEPGL RIPNMLDAAV DGSFKALYVQ GEDILQSDPN TKHVAAGLEA
MECVIVHDLF LNETANYAHI FLPGSTFLEK NGTFTNAERR IQRVRKVMTP KNGLEDWEVT
LRLAEAMGFK MSYDHPSQIM DEIATLTPTF TGVSYARLDE LGSIQWPCNA NAPEGTPVMH
IDHFVRGKGK FVITEYVATD ERTGPRFPLL LTTGRILSQY NVGAQTRRTA NTVWHDEDRL
EIHPHDAEQR GVRDGDWVRL ASRAGETTLR ALITDRVAPG VVYTTFHHPD TQANVVTTEY
SDWATNCPEY KVTAVQVSPS NGPSEWQRAY EEQATAARRI ASAAEAAE