Gene RPD_0914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_0914 
Symbol 
ID4021389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1030347 
End bp1032638 
Gene Length2292 bp 
Protein Length763 aa 
Translation table11 
GC content66% 
IMG OID637961105 
Productaldehyde oxidase and xanthine dehydrogenase, molybdopterin binding 
Protein accessionYP_568053 
Protein GI91975394 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGACAG CCGCGCCCGA TCCGAAGGCC AATATGGGCC AGCCCGCCCC GCGCTACGAC 
GCCGTCGCCA AGGTGACCGG CCGCGCCACC TATGGCGCCG ACCAGAAGCT GCCACGCCCG
GCCTATGCCT GGCTGGTGAC CAGCGCTATC GCCAAGGGCC GCATCAACGG CTTCGATCTC
GCCGACGCCA ATCATGTGCG CGGCGTGATC GAGATCATCA CCCACGAGAA CGCCGAGAAG
CTGAAGACGG CCAAGCTGTT CTCCGACGGC GGCTACGCCT CGACGACGAT CCAGCCGCTG
ACCTCGCCGG CGATTTCGCA CGATGGCCAG ATCATCGCGG TGGTGCTTGC CGAAAACCTG
GAAGCGGCGC GCGAGGCCGC CTACCGCGTC AAGGTCGATA CCACGGCCGA GACGCCGACC
GCGAGTTTCG ACTCGCCCGG CGCCGAAACC GAAGCCGCCA AGGGCAAGAG CGCGCAATTC
ACCGAGGACC CGAAGGTCGG CGACTTCGCC AAGGCATTCG AACAAGCCGC GGTGAAGATC
AGCGCCGACT ACGAGACGCC GACGCAGCAC CACAATGCGA TCGAGCTGTT CGCCACCAGT
TGCGTCTGGA ACGGCGACAT GCTGACGATC TACGAGCCCT CGCAATTCGT CTATGGCCTG
AAGAACGGCG TCGCCGAGCA GCTCGGCATC GAGCCCGACA AGGTGCGGGT GATCAGCCCC
TATATCGGCG GCGCGTTCGG CTCCAAGGCC TCGATGAATG CCCGGACGGC GATCATCGCG
TCGATCGCCC GACGGCTCGG CCGCGCGGTC AAGCTCGTGG TGCCGCGCGA CCAGGGCTTC
ACCACCGCGA CCTATCGCGC CGAAACCCGG CACAGCGTCA GCCTCGCCGC GTCGCAAGAC
GGCAGGCTGA CGGCGCTGCG CCATGAGGGC TGGGAAGTCA CCTCGCGGCC GGACAATTAT
CTGGTCGGCG GCACCACGAC GACGACGCGG CTGTATGCCT GCCCGAATAT CGAGAGCAAG
GTCTCGATCG TTCATGCCGA CCGCAACACC CCGGGCTTCA TGCGCTCGCC GCCGGAGGTG
CCATATCTGT TCGCGCTGGA AAGCGCGATG GACGAACTCG CCGTGGCGCT GAAGATCGAT
CCGGTCGAGC TGCGCCGGAT CAACGACACC ATGGTCGAAC CGATCGACGG CAAGAGCTAC
ACGTCGCGCT CGCTGATGGC CTGTTTCGAC GAAGCCGCGG CGGCGTTCGG CTGGGCCAAG
CGCAATCCGC AGCCGAAATC GATGAGCGAT GGCGACTGGC TGATCGGCTA TGGCTGCGCG
GCGACCTGCT ATCCGACGAT GATGGCGCCC GCGGCGGCGC GCGTCCGGCT GCATCGCGAC
GGCGCGGTGC GGGTCGAGAT CGCCGGGCAC GAAATCGGCA CCGGCGCCTA CACGGTGATT
GCACAAGCCG CCGCGCGTCG GCTCGGCGTG CAGCTCGAGC GGGTGTCGGT CGAGATGGGC
GACAGCAATT TGCCGCCGGC TCCAGTGGCG GGCGGCTCGA ACTCGACTGC CTCAACCTGT
TCGGCGGTGG CGATGGTGTG CGACCAGATC CGCGAGCGTC TGCTGAAGGC GACGATGCCG
GCGGACTCGC TGGTCGACAA GGCGAAGTCG ACCGTCGGCC TCGGCCAGAC CCCGACCGAG
CAGGCCGCCA AGAGCGACCG GCCGATCGAC ATCGCCGCCG CGTTCGACCG GCTCGGCGTC
AACGTCATCG AGGAGTTCGG CGAGTGGAAG CCGGACGGCG CACCGCTCGA CTCGTTCAGG
GCGATGTACA AGGGCCAGAC CCGGATGGTC GGCGGCGAAA AGGTCAAGGC CGGCATCGCC
TATGCCTTCG GCGCGGAGTT CGTCGAGCTA CGCGTCAACA AATACACCGG CGAGATCCGG
GTGCCGCGAA TGGTCGGCGC TTTCGCCGCC GGTCACATCA TGAACCCCCG CACCGCCCGC
AGCCAGCTTC TGGGCGGGCT GATCTGGGGA CTATCGTCGG CGCTGCACGA GGCGACCGAG
ATCGACGAGC GCACCGCACG CTACGTCAAC GACAACCTCG CCGACTACCT GATCCCGGTG
AACGCCGATG TGCCGAGCGT CGATGTCATC CTGCTGTCGG AGCAGGACGA CAAGATCAAT
CCGCTGGGCA TCAAGGGCGT GGGCGAGCTC GGCAATGTCG GGACCAACGC TGCGGTCTGC
AACGCCCTGT ATCACGCTAC CGGTCAACGT ATCCGGAAGC TGCCGGTGCG GCTCGAAAAA
ATTGAACTTT AG
 
Protein sequence
MSTAAPDPKA NMGQPAPRYD AVAKVTGRAT YGADQKLPRP AYAWLVTSAI AKGRINGFDL 
ADANHVRGVI EIITHENAEK LKTAKLFSDG GYASTTIQPL TSPAISHDGQ IIAVVLAENL
EAAREAAYRV KVDTTAETPT ASFDSPGAET EAAKGKSAQF TEDPKVGDFA KAFEQAAVKI
SADYETPTQH HNAIELFATS CVWNGDMLTI YEPSQFVYGL KNGVAEQLGI EPDKVRVISP
YIGGAFGSKA SMNARTAIIA SIARRLGRAV KLVVPRDQGF TTATYRAETR HSVSLAASQD
GRLTALRHEG WEVTSRPDNY LVGGTTTTTR LYACPNIESK VSIVHADRNT PGFMRSPPEV
PYLFALESAM DELAVALKID PVELRRINDT MVEPIDGKSY TSRSLMACFD EAAAAFGWAK
RNPQPKSMSD GDWLIGYGCA ATCYPTMMAP AAARVRLHRD GAVRVEIAGH EIGTGAYTVI
AQAAARRLGV QLERVSVEMG DSNLPPAPVA GGSNSTASTC SAVAMVCDQI RERLLKATMP
ADSLVDKAKS TVGLGQTPTE QAAKSDRPID IAAAFDRLGV NVIEEFGEWK PDGAPLDSFR
AMYKGQTRMV GGEKVKAGIA YAFGAEFVEL RVNKYTGEIR VPRMVGAFAA GHIMNPRTAR
SQLLGGLIWG LSSALHEATE IDERTARYVN DNLADYLIPV NADVPSVDVI LLSEQDDKIN
PLGIKGVGEL GNVGTNAAVC NALYHATGQR IRKLPVRLEK IEL