Gene RPB_3096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3096 
Symbol 
ID3910897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3527565 
End bp3529913 
Gene Length2349 bp 
Protein Length782 aa 
Translation table11 
GC content64% 
IMG OID637885000 
Productcarbon-monoxide dehydrogenase 
Protein accessionYP_486705 
Protein GI86750209 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID[TIGR02416] carbon-monoxide dehydrogenase, large subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAGA CGGGCGGATT CATTGGAAAG TCGGTTCCGC GGCGCGAAGA CAAGCGGCTT 
CTGACCGGCA AGGGCGAGTT CGTCGCGGAC CTCAAGCTGC CGTCGATGCT GCACGCGGCG
TTCGTTCGCA GCCAGGTCGC CCATGGCCGG ATCAAATCGG TCGATCTGTC GCGCGCCCTG
CGGTCACCGG GCGTCGTCTA TGCGATCTCC GGGCCGGATC TCGCCAAGCT GCTGCCACCC
GTGCCCGACA CGCAACTGTC GCTGCCGAAG AAATGGACGA CGCGGGTCCA GCACACCTTC
CTCAATCCGC AGCAACCGCT GCTCGCTTAC GACAAGGTCC GGCATGTCGG CGAGGCCGTC
GCGGTCATTC TCGCAGAGAG CCGCTACCTC GCTGAAGACG CCGCCGAGCT GGTGACCATG
GAGATCGAGC CGCTGCCGGC CGTGGTAGAT CCGGAAGCCG GGCTCACCCG CGACAGCGCC
GTGTTGCACG AACAATACGA CACCAATCTG ATCGGCGATT TCGCCATTGC CAAAGGCGAC
GTCGAGACAG CGCTGGCGAA CGCGCCGCAC CGCATGAAGC GCCGCTTCTA TCATCATCGC
TACGCCGCGA TCCCGATGGA GGGCCGCGGC GTCGCCGCCA ACTACGACGC ACGCACGGAT
TCGATCAACA TCTGGTCGGC GTGCCAGGTC ATTCACTGGC TCCGCCGCGA GGCGTCGACG
GTGCTCGGCA TGCCCGAAGC TCGCATTCGC TGCGTCGCGC TCGACGTCGG CGGCGGCTTC
GGCGTCAAGG GCCACGTCTA TCCCGAAGAA TTGCTGATTC CTTATCTTGC GCGCGAAGTC
GGCCGCCCGG TGAAATGGAT CGAGGATCGG CACGAGCATT TCATGAGCGC CTGCCATTCC
CGCGACCAGA CCCACGATGT AGAGTTCGGG TTCGACGACG ACGGTCGTCT GCTAGCGTTC
CAGGACGAAT TCCTCGTCGA TTGCGGCGCC TGGAATCCGA TCGGCTCCGG CATCGCCTAC
AACACTGCGG TGCATTTGCC CGGCCCCTAC AAATTCGAGC ACTTCGCGGT GCGATCGAAG
ATCGTCGCCA CCAACAAAGT GCCCAACGCG CCCTATCGCG GTGCCGGCCG CCCCGAAGCG
ACGTTCGCGA TGGAGCGGGT GATCGATCTG ATCGCGGCCG AACTCGGCCT CGATCCGGCC
GACGTCCGCA TGCGCAACAT GATTCCGGCT TCCGAGATGC CGTATCGTCT CGGCCTTCCC
TACCGGGACG GCGAGCCGAT CGTCTACGAC AGCGGCGACT ATCCGGAATC GCTGCGCCAG
GCATTGGCGG CACTAGGAGG TGTCGACGCC TTCCGCGATC GGCAGCGGGC CGCACGGGCG
CAAGGTCGAT ATTTCGGCCT CGGACTCGGA TGCTACGTCG AAGGCACCGG CGTCGGACCG
TTCGAAAGCG CTACCGTCCG CGTAGACCCG ACCGGCAAGA TCTATCTCGC AGGCGGCGCC
TGCCCGCAGG GACAAGGCAT GGAAACGATC TTCTCGCAGA TCGTCGCGGA TGCCTGGCAG
GTTCAACCCG ATGACGTCGT CGTAGCATTG GCGGACACGA GCGTGATCTC GATCGGCTTC
GGCACCATCG CGAGCCGCAG CACCGTGAAC TTGTCGGGCG CAATCCACAC CGCGAGCCAA
TCGCTGCAGA AGAAGGTCTT CGCGATCGCA GCAGACATGC TCGAATGCTC GCCGGCCGAC
CTCGAGCTGC GCAACGGAAC CGTCGGCCTC GTCGGCGTAC CGGGCAGAGA GATTCCCCTC
GCCCGCATCG CCAAAGCGGC GATGCCCGGC TGGGACAACA AGCGACCCGC CGGTGTTTCA
GCGGGGCTGG AGGAAACCGC CTACTTCGAG CCGCCGACCG TGACCTGGGC TTACGCGACG
CACGCCGCGA TCGTCGAGCT CGATGTCGAA CTCGGCCGCG TCGAGATCGA GAAATACGTC
ATCGTGCATG ATTGCGGCGT GGTGGTGAAT CCGATGCTGG TCGACGGGCA GATTAACGGC
GGCGCCGTGC AGGGCCTCGG CGGCGCGCTG CTGGAGGAAC TAAGCTACGA TTCGGAAGGC
CAGTTGCTGG TTGGATCGTT CATGGACTAT CTAGTACCGG GCGCGAGCGA CGTGCCGCAT
TTCGAGCTGA AGCACATGCA CTTCCCCTCG CCGCTGAATC CCTATGGCGT GAAAGGCGTC
GGAGAAGGTA GCGCGATCGC GCCGCCGGTG GTCATCGCCA ATGCGGTCTC CGATGCACTC
TCTCACCTCA AGGTCGAATT CAATTCGACG CCGATCAGGC CTGAGCACAT CGTCACAGCG
TTCGGATAA
 
Protein sequence
MAETGGFIGK SVPRREDKRL LTGKGEFVAD LKLPSMLHAA FVRSQVAHGR IKSVDLSRAL 
RSPGVVYAIS GPDLAKLLPP VPDTQLSLPK KWTTRVQHTF LNPQQPLLAY DKVRHVGEAV
AVILAESRYL AEDAAELVTM EIEPLPAVVD PEAGLTRDSA VLHEQYDTNL IGDFAIAKGD
VETALANAPH RMKRRFYHHR YAAIPMEGRG VAANYDARTD SINIWSACQV IHWLRREAST
VLGMPEARIR CVALDVGGGF GVKGHVYPEE LLIPYLAREV GRPVKWIEDR HEHFMSACHS
RDQTHDVEFG FDDDGRLLAF QDEFLVDCGA WNPIGSGIAY NTAVHLPGPY KFEHFAVRSK
IVATNKVPNA PYRGAGRPEA TFAMERVIDL IAAELGLDPA DVRMRNMIPA SEMPYRLGLP
YRDGEPIVYD SGDYPESLRQ ALAALGGVDA FRDRQRAARA QGRYFGLGLG CYVEGTGVGP
FESATVRVDP TGKIYLAGGA CPQGQGMETI FSQIVADAWQ VQPDDVVVAL ADTSVISIGF
GTIASRSTVN LSGAIHTASQ SLQKKVFAIA ADMLECSPAD LELRNGTVGL VGVPGREIPL
ARIAKAAMPG WDNKRPAGVS AGLEETAYFE PPTVTWAYAT HAAIVELDVE LGRVEIEKYV
IVHDCGVVVN PMLVDGQING GAVQGLGGAL LEELSYDSEG QLLVGSFMDY LVPGASDVPH
FELKHMHFPS PLNPYGVKGV GEGSAIAPPV VIANAVSDAL SHLKVEFNST PIRPEHIVTA
FG