Gene Rpal_4497 details


Gene Information

Locus tag: Rpal_4497
Symbol:
ID: 6412181
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4838535
End bp: 4840856
Gene Length: 2322 bp
Protein Length: 773 aa
Translation table: 11
GC content: 65%
IMG OID: 642714379
Product: Carbon-monoxide dehydrogenase (acceptor)
Protein accession: YP_001993468
Protein GI: 192292863
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
TIGRFAM ID: [TIGR02416] carbon-monoxide dehydrogenase, large subunit
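
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that contributes no residue. The function name `cds_lengths` is hypothetical and not part of any database tooling.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon.
    """
    span = end_bp - start_bp + 1  # inclusive coordinates, hence the +1
    if span % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = span // 3
    return span, codons - 1  # the stop codon encodes no amino acid


# Values copied from the Gene Information table.
gene_bp, protein_aa = cds_lengths(4838535, 4840856)
print(gene_bp, protein_aa)  # 2322 773
```

Under those assumptions the result agrees with the listed gene length (2322 bp) and protein length (773 aa).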


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0476148
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTCCAA CCAAGTTGGC TCCCGCCAAG TTCGGTGTCG GTCAGAGTGT GCTGCGCAAA 
GAGGACGATC CTCTGATTCG CGGCAAAGGC CGCTACACCG ATGACGTCGC TCCGGCCTCG
ACCGCTTATG CGCTGATGCT GCGCTCGCCG CATGCCCATG CCACCTTCAA GCTGGACGCG
ACGGCGGCGC GCGCGCTGCC CGGTGTGCTG ACGATTTTGA CTGCGGCCGA CGTTGCCGAT
CTCGGCGGCC TGCCGTGCCT GTTCAATCTG CCGGACACCC CGTTCAAGGG GCCAGACTAT
CCAATCCTGG CGCGCGACGA GGTGCGCCAT GTCGGCGATG CGATCGCGTT CGTGGTCGCC
GACACGATTG CGCATGCGCG CGATGCACTC GAGGCGATTG CGGTGGAGTG GTCGCCGCTG
CCGGCGGTGA TCGGGGCTGT GCGTGCCGTC GAGCCGGGAG CGCCGCAGGT GTGGCCCGAC
CATGCCGGCA ACGTGCTGTT CGACGCCGGC ATCGGCAACA AGAAGGCAAC CGAAGAGGCG
TTTGCCAAAG CCCATGCGGT CGCGGAAATC CGCATCGTCA ATCCGCGTAT CGTCACGAAC
TACATGGAAA CCCGCGCGGC GGTCTGTGAG TACGATGCCA AGCGCGATCA CTTCACGCTG
ACGGTCGGCA GCCAGGGTAG CCATCGGCTG CGCGATATCC TGTGCCAGAC CGTGCTGAAG
ATCCCGGTCG AAAAGATGCG GGTGATCTGT CCGGATGTCG GCGGCGGGTT CGGCACCAAG
CTGTTTCCCT ATCGCGAATA CGCGCTGCTC GCGGTGGCCG CCAAGAAGCT GCGCAAGACG
GTGAAGTGGA CCGCCGATCG CGGCGATCAC TTCGTCGGCG ACTCGCAAGG TCGCGACAAC
GTCACGACGG CACGGATGGC GCTGGCCGAA GATGGCAAGT TTCTCGGCAT GGACGTCGAT
CTGATCGGCG ACGTCGGCGC CTATCTGTCG ACCTTCGGTC CGTACATTCC TTACGGCGGC
GCCGGCATGT TGCCGGGGCT CTATGACATC CAGGCGTTCT ACTGTCGCAT CCGCACCGTG
TTCACCCACA CCGTGCCGGT TGATGCCTAT CGCGGCGCCG GGCGTCCCGA AGCGGCTTAC
GTGGTTGAAC GTCTCGTCGA TGCCTGCGCG CGGAAGCTGG CGATGTCGCC GGATGCGATC
CGGCGCAAGA ACTTCATTCC ACCGCGCAAG CTCCCCTACA AGACCGCGAC CGGCAAGGTG
TACGACTCCG GCGACTTCAC TGCGCATCTG AAGCGCGCGA TGGAGATCGG CGACTGGAAG
GACTTCGGCA AGCGCGCCAA GGCCGCCAAG AAGCACGGGC TGGTGCGCGG CATCGGGCTC
GCCTCCTACG TCGAGGTCTG CGGTACCATG GGCGAGGAGA CCGCCAAGGT GGTGCTTGAT
CCGGATGGCG ACATCACTGT GCTGATTGGC ACGCAGTCGA GCGGGCAAGG CCACCAGACC
GCTTATGCGC AGATCGTCGC CGAGCAGTTC GGCGTGCCGC AGGAGCGGGT GCGGGTGGTG
CAGGGCGACA CCGACAGGAT CGCCACCGGC CTCGGCACCG GCGGCTCGGC ATCGATCCCG
TCCGGCGGCG TCAGCGTGCA GCGCGCCACC CATCAGCTCG GCGAGCAGCT TCGCGACATC
GCGGCCGACG CGCTGGAGAC CAGCACCGCC GACCTCGAAA TCAGCGATGG CACGATCCGC
ATCGCCGGCA CCGACCGCTC GATCAGCTTC GCCGATCTCG CCAAGCGGTC CGGCGTCGAT
CCGGCCAAGC TCAACGCCAG CGCGGCGTTC TCCAGTGCCG ACGGCACCTT TCCCAACGGC
ACGCATTTGG TCGAGATTGA ACTCGATCCG GCGACTGGCA AGATCAGGAT CGTCAACTAT
GTCATCGTCG ATGATTTCGG CGTCACGTTG AACCCGCTGC TACTCGCCGG GCAGGTCCAT
GGCGGCACCA TTCAGGGCAT CGGCCAGGCG CTGATGGAGC AGGCGGTGTA CAATGCCGAG
GATGGCCAGC TCATCACCGG TTCGTTCATG GACTACGCGC TGCCGCGCGC GGCCGATGGC
GCGCCGATCA CTTTCGAAAC CCACAACGTG CCCTGCGCCA CCAACCCGAT GGGCGTCAAA
GGTGCAGGCG AGGCGGGGGC GATCGGCTCC TGCCCGGCGG TGATGAATGC GATCATCGAC
GCGCTCTGGC GTGAATATCG GATCGACCAT ATCGACATGC CGGCGACGCC GGAGCGGGTG
TGGATGGCGA TCCGCGAGCA TGAGCGCCGG CACCGTCTGT GA
 
Protein sequence
MVPTKLAPAK FGVGQSVLRK EDDPLIRGKG RYTDDVAPAS TAYALMLRSP HAHATFKLDA 
TAARALPGVL TILTAADVAD LGGLPCLFNL PDTPFKGPDY PILARDEVRH VGDAIAFVVA
DTIAHARDAL EAIAVEWSPL PAVIGAVRAV EPGAPQVWPD HAGNVLFDAG IGNKKATEEA
FAKAHAVAEI RIVNPRIVTN YMETRAAVCE YDAKRDHFTL TVGSQGSHRL RDILCQTVLK
IPVEKMRVIC PDVGGGFGTK LFPYREYALL AVAAKKLRKT VKWTADRGDH FVGDSQGRDN
VTTARMALAE DGKFLGMDVD LIGDVGAYLS TFGPYIPYGG AGMLPGLYDI QAFYCRIRTV
FTHTVPVDAY RGAGRPEAAY VVERLVDACA RKLAMSPDAI RRKNFIPPRK LPYKTATGKV
YDSGDFTAHL KRAMEIGDWK DFGKRAKAAK KHGLVRGIGL ASYVEVCGTM GEETAKVVLD
PDGDITVLIG TQSSGQGHQT AYAQIVAEQF GVPQERVRVV QGDTDRIATG LGTGGSASIP
SGGVSVQRAT HQLGEQLRDI AADALETSTA DLEISDGTIR IAGTDRSISF ADLAKRSGVD
PAKLNASAAF SSADGTFPNG THLVEIELDP ATGKIRIVNY VIVDDFGVTL NPLLLAGQVH
GGTIQGIGQA LMEQAVYNAE DGQLITGSFM DYALPRAADG APITFETHNV PCATNPMGVK
GAGEAGAIGS CPAVMNAIID ALWREYRIDH IDMPATPERV WMAIREHERR HRL
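
The gene and protein sequences above are printed in blocks of 10 characters. As a minimal sketch, the blocks can be joined, translated, and checked for GC content in plain Python. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares for internal codons. Table 11 differs in the start codons it allows, which does not matter here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGGTTCCAA CCAAGTTGGC TCCCGCCAAG"))  # MVPTKLAPAK
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence, and `gc_percent` can be compared against the GC content field in the Gene Information table.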