Gene RPB_1613 details


Gene Information

Locus tag: RPB_1613
Symbol:
ID: 3910084
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 1818988
End bp: 1821291
Gene Length: 2304 bp
Protein Length: 767 aa
Translation table: 11
GC content: 66%
IMG OID: 637883508
Product: carbon-monoxide dehydrogenase
Protein accession: YP_485233
Protein GI: 86748737
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
TIGRFAM ID: [TIGR02416] carbon-monoxide dehydrogenase, large subunit
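
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions not stated in the record: that Start bp and End bp are 1-based inclusive positions, and that the final codon of the gene is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1818988
END_BP = 1821291
GENE_LENGTH_BP = 2304
PROTEIN_LENGTH_AA = 767


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for the values listed in this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 1821291 - 1818988 + 1 gives 2304 bp, and 2304 / 3 - 1 gives 767 aa, which agrees with the listed Gene Length and Protein Length.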


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.33112
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTCCTA CCAAATTCGG CGTCGGCCAA AGCGTGATGC GCAAGGAGGA TGATCCCCTG 
ATCCGCGGCA AGGGCCGCTA TACCGACGAC TACGCGCCGC AATCCTCGGC GCACGCGCTG
GTGCTGCGAT CACCGCATGC GCATGCGAAA TTCAGGCTCG ACGCGACGGC GGCGCGCGGC
CTGCACGGCG TGCTCGCGAT CCTCACCGCC GAAGACGTCA GCGATCTCGG CGGGCTGCCG
TGCCTGTTCA ACCTACCCGA CAATCCGTTC AAGGGGCCGG ACTACGCGAT CCTCGCCGGC
GGCGAGGTGC GTCATGTCGG CGACGCAGTG GCCTTCGTGG TCGCGGATAC GGTGGCGCAT
GCGCGCGACG CGCTGGAGGC GATCGCGGTC GAATGGACGC CGCTGCCGGC GGCGATCGGC
GCGGCCAATG CGATCAAGCC CGGCGCGCCG CAGGTGTGGC CGGACCACGC CGGCAATCTG
CTGTTCGACA CCGCGATCGG CGACAAGGCC GCAACAGAGG CCGCGTTCGC GAAGGCTCAT
GCCGTCGCCG AGATCGCCAT CGTCAATCCG CGGATCATCA CCAACTACAT GGAGACCCGC
GCCGCGGTCT GCGAATATGA CGCCAAGCGC GATCATTTCA CGCTGACGAT CGGCAGCCAG
GGCAGCCACC GGCTGCGTGA TATCCTGTGC CAGAACGTGC TGAAAATTCC GGTCGAGAAG
ATGCGGGTGA TCTGCCCCGA TGTCGGCGGC GGCTTCGGCA CCAAGCTGTT TCCGTATCGC
GAATACGCGC TGCTGGCGGT CGCGGCTAAG AAGCTCGGCA AGACCATCCG CTGGGCGGCC
GACCGCTCCG ATCATTTCGT CGGCGACTCG CAGGGCCGCG ACAACATCAC CACGGCGCGG
ATGGCGCTCG CGGCGGATGG CAAGTTTCTC GGCATGGACG TCGACCTGAT CGGCGATCTC
GGCGCCTATC TGTCGACCTT CGGGCCTTAC ATCCCGTATG GCGGCGCCGG AATGTTGCCG
GGGCTCTACG ACATTCAGGC GTTTTACTGC CGCATCCGGA CGGTGTTCAC CCACACTGTT
CCAGTCGATG CCTATCGCGG CGCCGGTCGG CCCGAAGCGG CCTATGTCAT CGAACGTCTG
GTCGATGCCT GCGCGCGCAA GCTCGGGATG TCGCCGGATG CGATCCGGCG CAAGAATTTC
ATCGCGCCGC GCGCGATGCC CTACAAGACT GCGACCGGCA AGGTCTACGA CTCCGGTGAT
TTCGCCGCGC ATCTGAAACG CGCGATGGAC ATCGGCGAGT GGAAGGAATT TCCGAAGCGC
GCCAAGGCAG CGACGAAACT CGGGCTGGTG CGCGGCATCG GCCTGGCCTC CTATGTCGAA
GTCTGCGGCA CGATGGGCGA GGAGACCGCC AAGGTCGTGC TCGATCCCGA CGGCGACATC
ACCGTTCTGA TCGGCACCCA GTCGAGCGGG CAGGGCCATC AGACCGCCTA TGCGCAGATC
GTCGCCGAAC AGTTCGGCGT GCCGCCGGAG CGCGTCCGCG TGGTTCAGGG CGACACCGAC
AGGATTGCGA CCGGGCTCGG CACCGGCGGC TCGGCATCGA TCCCCTCGGG CGGCGTCAGC
GTTCAACGCG CGACGCACCA AATCGGCGAG CAGATTCGCG AGTTGGCCGC GGACGCGCTG
GAAGCCGGCG CTGCGGATCT CGAAATCAGC GACGGCATCG TCCGCATCGC CGGCACCGAC
CGCTCGATCT CGTTCGCCGA TCTCGCCAAG CGCCCCGGCC TCGATCCGGC CAAGCTGAAT
GCCAGCGCGA CGTTCTCCAG CGCCGACGGC ACGTTCCCAA ACGGCACGCA TTTGGTCGAA
GTCGAGATCG ATCCGGCGAC CGGCAAAATC CGGATCGTCA ACTACGTCAT CGTCGACGAT
TTCGGCGTGA CGCTGAACCC GCTGCTGCTC GCCGGCCAGG TTCATGGCGG CACCATCCAG
GGCATCGGGC AGGCGCTGAT GGAGCGGGCG GTGTACGATC AGGACGGCCA GCTCGTCACC
GGCACCTTCA TGGATTACGC GATGCCGCGC GCGGAGGATG CCGCGCCGAT CATCTTCGAG
ACTCACAACG TGCCGTGCAC CACCAACCCG ATGGGTGTGA AGGGGGCCGG CGAGGCCGGC
GCGATCGGCT CGTGCCCGGC CGTGGTCAAT GCGATCATCG ATGCCCTGTG GCGCGAGTAC
AAGATCGACC ACATCGACAT GCCGGCCACG CCGGAGCGGG TTTGGATGGC AATCCGCGAG
CACCATCGGC AGCACAGTCT CTAG
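
The gene sequence above is printed in space-separated blocks of 10 bases, so it needs to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction. It uses only the first row of the sequence as sample input, so the number it prints describes that row, not the whole gene; the function names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence in this record, copied verbatim.
FIRST_ROW = "ATGGCTCCTA CCAAATTCGG CGTCGGCCAA AGCGTGATGC GCAAGGAGGA TGATCCCCTG"

seq = clean_sequence(FIRST_ROW)
print(len(seq), f"{gc_fraction(seq):.1%}")
```

Passing the full gene sequence to `clean_sequence` and `gc_fraction` in the same way would give a whole-gene value to compare against the GC content field.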
 
Protein sequence
MAPTKFGVGQ SVMRKEDDPL IRGKGRYTDD YAPQSSAHAL VLRSPHAHAK FRLDATAARG 
LHGVLAILTA EDVSDLGGLP CLFNLPDNPF KGPDYAILAG GEVRHVGDAV AFVVADTVAH
ARDALEAIAV EWTPLPAAIG AANAIKPGAP QVWPDHAGNL LFDTAIGDKA ATEAAFAKAH
AVAEIAIVNP RIITNYMETR AAVCEYDAKR DHFTLTIGSQ GSHRLRDILC QNVLKIPVEK
MRVICPDVGG GFGTKLFPYR EYALLAVAAK KLGKTIRWAA DRSDHFVGDS QGRDNITTAR
MALAADGKFL GMDVDLIGDL GAYLSTFGPY IPYGGAGMLP GLYDIQAFYC RIRTVFTHTV
PVDAYRGAGR PEAAYVIERL VDACARKLGM SPDAIRRKNF IAPRAMPYKT ATGKVYDSGD
FAAHLKRAMD IGEWKEFPKR AKAATKLGLV RGIGLASYVE VCGTMGEETA KVVLDPDGDI
TVLIGTQSSG QGHQTAYAQI VAEQFGVPPE RVRVVQGDTD RIATGLGTGG SASIPSGGVS
VQRATHQIGE QIRELAADAL EAGAADLEIS DGIVRIAGTD RSISFADLAK RPGLDPAKLN
ASATFSSADG TFPNGTHLVE VEIDPATGKI RIVNYVIVDD FGVTLNPLLL AGQVHGGTIQ
GIGQALMERA VYDQDGQLVT GTFMDYAMPR AEDAAPIIFE THNVPCTTNP MGVKGAGEAG
AIGSCPAVVN AIIDALWREY KIDHIDMPAT PERVWMAIRE HHRQHSL
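
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is a minimal illustration, not the method used to produce this record. It assumes the amino acid assignments of the standard genetic code, which, to my understanding, translation table 11 shares (the two differ in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the amino acid string
# of the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace is ignored; any character other than A, C, G, T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence in this record.
first_row = "ATGGCTCCTA CCAAATTCGG CGTCGGCCAA AGCGTGATGC GCAAGGAGGA TGATCCCCTG"
print(translate(first_row))  # MAPTKFGVGQSVMRKEDDPL
```

The printed residues match the first 20 residues of the protein sequence above, and translating the last row of the gene sequence (`CACCATCGGC AGCACAGTCT CTAG`) gives `HHRQHSL` followed by a TAG stop, matching the end of the protein.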