Gene RPB_0912 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0912 
Symbol 
ID3909765 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1050325 
End bp1052667 
Gene Length2343 bp 
Protein Length780 aa 
Translation table11 
GC content67% 
IMG OID637882805 
Productcarbon-monoxide dehydrogenase large subunit 
Protein accessionYP_484534 
Protein GI86748038 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID[TIGR02416] carbon-monoxide dehydrogenase, large subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCATCG AGGGCATCGG CGCGAGCGTC GTGCGCAAGG AAGACCGTCG CTTCATCACC 
GGCAAGGGCC GCTATGTCGA CGACATCAAG ATCATCGGCA TGACGCATGC GTATTTCCTG
CGCAGCCCGC ATGCCCACGC CAGGATCAAG CGCATCGACG TCGAGGCCGC CAAGGCGATG
CCTGGGGTGG TCGACGTGCT GACCGGGCAG CAGATCGTCG ACGACAAGGT CGGCAATCTG
ATCTGCGGCT GGGCGATCCA CTCCAAGGAC GGCTCGGCGA TGAAGATGGG CGCGTGGCCG
GCGATGGCGC CGGAGACGGT GCGCTTCGTC GGCCAGGCGG TCGCGGTGGT GATCGCCGAG
ACCCGAAACC TCGCCCGCGA CGCCGCCGAG GCGATCGTGG TGGAGTACGA AGAGCTGCCT
GCCGTCGCCG ATGTCAAGGC CGCGATCGCG CCCGGCGCGC CGCAGCTTCA TCCCGAGGCC
CCCGGCAACA TCGTCTATGA CTGGGAGATC GGCGACCAGA AGGCGGTCGA CGAGGCCTTC
GGCAAGGCCG CCAATGTGGT GTCGTTCGAA CTGACCAACA ACCGGCTGGT GCCGAACGCG
ATGGAGCCGC GCGCCGCACT CGCCGACTAC AACGAGGCGG AAGAGCACTT CACGCTGTAC
ACCACGTCGC AAAATCCGCA CGTCGCCCGG CTGGTGCTGT CGGCGTTCTA CAACATCGCG
CCCGAGCACA AGCTGCGGGT GGTGGCGCCC GACGTCGGCG GCGGCTTCGG CTCCAAGATC
TTCATCTATC CGGAAGAGAT GGTGGCGCTG TGGGCCTCCA AGAAGGTCGG CCGCCCGGTG
AAATGGACCG GCGACCGCAA CGAGGCGTTC CTGACCGACG CCCACGGCCG CGACCACGTC
TCCAAGGCCG AGATGGCGTT CGACAAGGAC AACAAGATGC TGGCGCTGCG GGTGACGACC
CACGCCAATT TCGGCGCCTA CATGTCGCTG TTCTCGTCCT CGGTACCGAC TTATCTCTAC
GCCACGCTGC TGTCGGGCCA GTACAACATC CCGGCGATCT ACACCGAGGT GATCGGCGTC
TACACCAACA CCACGCCGGT CGACGCCTAT CGCGGCGCCG GGCGGCCCGA GGCGTGCTAC
CTGCTGGAGC GGCTGGTCGA GACCGCGGCG CGGCAGCTCA AGGTCGACCC GGCCGAGCTG
CGGCGCAAGA ACTTCATCAC CCAGTTCCCG CATCAGACCC CGGTGATCAT GGCCTACGAC
ATCGGCGACT TCAACGCGTC GCTCGACGCC GCGCTGAAGG CCGCCGACTA TGCCGGCTTC
CCGGCGCGCA AGGCCAAGGC CAAGGCCGAG GGCAAGCTGC GCGGGCTCGG CTTCTCCTGC
TACATCGAGG CCTGCGGCAT CGCGCCCTCG AAAGCGGTGG GCTCGCTCGG CGCCGGCGTC
GGTCTGTGGG AATCCGCGGA GGTGCGCGTC AACCCGGTCG GCACCATCGA GATCCTGACC
GGTTCGCACA GCCACGGCCA GGGCCACGAG ACCACCTTCG CCCAATTGGT CGCCGATCGC
CTCGGCATTC CGATCAACCA GGTGTCGATC GTGCACGGCG ACACCGACAA GGTGCAGTTC
GGCATGGGCA CCTACGGCTC GCGCTCCGCC GCGGTCGGCA TGTCGGCGAT CTTCAAGGCG
ATGGAGAAGG TCGAGGCCAA GGCCAAGAAG ATCGCGGCGC ATCAGCTCGA GGCCAGCGAG
GGCGACATCG TCATCGAGAA CGGCGAGTTC AAGGTCACCG GCACCGACAA GTCGATCGCG
CTGCCGATGG TGGCGCTGGC CGCCTACACG GCGCACAATC TGCCCGACGG CATGGAGCCC
GGCCTGAAGG AGAGCGCGTT CTACGATCCG ACCAACTTCA CGTTCCCGGC CGGCTCCTAC
ATCTGCGAGC TCGAAGTCGA TCCCGGCACC GGCAAGACCT CGTTCGTCAA TTTCGTCGCG
GTCGATGATT TCGGCCGGCT GATCAATCCG ATGATCGTCG AAGGCCAGGT CCATGGCGGC
CTCGTCCAGG GCATCGGCCA GGCGCTGCTG GAGCAGGCGA TCTACGACGA CACCGGCCAG
CTCGTCACCG CGTCGTTCAT GGACTACGCG ATGCCGCGCG CCGACGACGT GCCGTCGTTC
AAGGTGTCGC ACACCGAGAC GCTGTGTCCG GGCAATCCGC TCGGCGTCAA GGGCTGCGGC
GAGGCCGGCG CGATCGGCGC CTCGGCGGCG GTGATCAACG CCATCACCGA CGCGATCGGC
AACAACAAGC TCGAAATGCC GGCGACGCCC GACCGGGTGT GGCACGCGAT CCAAGCGGCC
TGA
 
Protein sequence
MGIEGIGASV VRKEDRRFIT GKGRYVDDIK IIGMTHAYFL RSPHAHARIK RIDVEAAKAM 
PGVVDVLTGQ QIVDDKVGNL ICGWAIHSKD GSAMKMGAWP AMAPETVRFV GQAVAVVIAE
TRNLARDAAE AIVVEYEELP AVADVKAAIA PGAPQLHPEA PGNIVYDWEI GDQKAVDEAF
GKAANVVSFE LTNNRLVPNA MEPRAALADY NEAEEHFTLY TTSQNPHVAR LVLSAFYNIA
PEHKLRVVAP DVGGGFGSKI FIYPEEMVAL WASKKVGRPV KWTGDRNEAF LTDAHGRDHV
SKAEMAFDKD NKMLALRVTT HANFGAYMSL FSSSVPTYLY ATLLSGQYNI PAIYTEVIGV
YTNTTPVDAY RGAGRPEACY LLERLVETAA RQLKVDPAEL RRKNFITQFP HQTPVIMAYD
IGDFNASLDA ALKAADYAGF PARKAKAKAE GKLRGLGFSC YIEACGIAPS KAVGSLGAGV
GLWESAEVRV NPVGTIEILT GSHSHGQGHE TTFAQLVADR LGIPINQVSI VHGDTDKVQF
GMGTYGSRSA AVGMSAIFKA MEKVEAKAKK IAAHQLEASE GDIVIENGEF KVTGTDKSIA
LPMVALAAYT AHNLPDGMEP GLKESAFYDP TNFTFPAGSY ICELEVDPGT GKTSFVNFVA
VDDFGRLINP MIVEGQVHGG LVQGIGQALL EQAIYDDTGQ LVTASFMDYA MPRADDVPSF
KVSHTETLCP GNPLGVKGCG EAGAIGASAA VINAITDAIG NNKLEMPATP DRVWHAIQAA