Gene RPC_3083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_3083 
Symbol 
ID3974034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp3421242 
End bp3423620 
Gene Length2379 bp 
Protein Length792 aa 
Translation table11 
GC content66% 
IMG OID637926191 
Productcarbon-monoxide dehydrogenase 
Protein accessionYP_532944 
Protein GI90424574 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0650295 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTGCG CTATGCAAGA TCACGCGTTC CCGCGCGACA ACGCGCTCGC GGTGCAACGA 
TTTGGCGTCG GCCAGCCGGT GCGGCGCAAG GAAGACGATA CCCTGGTCCG CGGCCATGGC
TGCTACACCG ACGACCTGCA ACTCGCCGGC CAGGCCTATG CCTGGATGCT GCGCAGCAAC
CACGCCCACG GCGTGATCCG CAATATCGAC GCCCGCGCGG CCAGAGCGAT GCCCGGCGTG
CTCGGGATCT GGACCGGCGC CGATCTTGCC GCCGCCGACT ATGCCCCGTT CAGCTGCGCG
CTGCCGTTGC AGAATCGCGA CGGCTCGCCG CTCTTGCAGA CCCCGCGCCT GGCGCTGATG
ACCGACAAGG TGCGCTATGT CGGCGACCCG GTGGCGATCG TGGTCGCCGA GACGCTGCTG
CAGGCCCGCG ACGCCGCCGA GGCGATCGAA CTCGACATCG CGCCGCTGCC CGCGGTGACC
AGCGCAGAGG ACGCCGCCAA GCCCGGCGCG CCGCGGCTCT ACGATCACAT CCCCGACAAC
GTCGCGCTCG ACTATCACTA TGGCGACACC GCAGCGGTCG AGGCCGCGTT CGCCGCCGCC
GCCCACGTCA CCAGGCTCGA CATCACCAAC ACCCGCATCG CCGTGGTGGC GATGGAGCCA
CGCAGCGCGC TCGCCGCCTA TGACAGGACG AGCCGACGCT ACACCATCGA GGTGCCGACC
CAGGGCGTCT CCGGCAACCG CGCCGCGCTG GCGAAACTGC TGAAAGTGCC GAACGACAAG
GTGCACCTGT TGACCGGCAA TGTCGGCGGC TCGTTCGGCA TGAAGAACAT CAACTACCCG
GAGTACATCT GCATCCTGCA CGCCGCCAAG GCGCTGGGCC GGCCGGTGAA ATGGACCGAC
GAGCGCTCCA GCAGTTTCTT GTCCGACAGC CACGGCCGTG CGCAGCAGAT CCACGCCGAA
CTGGCGCTCG ACGCCGAGGG ACATTTTCTC GCGGTGCGGA TTTCCGGCTA TGGCAATCTC
GGCGCCTACA TCACCGGGGT GTCGCCCTCG CCGCTGTCGC TCAACACCGG CAAGAACCTC
TCCAGCGTCT ATCGCACGCC GCTGCACAGC GTTGACATCA AATGCGTGCT GACCAACACC
ACGCTGATGG GCGCCTATCG CGGCGCCGGT CGGCCCGAGG CCAACTACTT CATGGAGCGA
TTGATCGACC GCGCCGCCGA CGAAATGCGC ATCGATCGTC TCGCCTTGCG CAAGCGCAAC
TTCATCAAGC CGTCGCAGCT GCCGTTCAAG GCCTCCTCCG GCATGACCTA TGACAGCGGC
GACTTCCTCG GCGTGTTCAA CAAGGCGTTG ACGTTGTCGG ACTATGCGGG CTTTGCCAAG
CGCAAGCGCG ACAGCAAGAA GCGCGGCAAA TTGCGCGGCA TCGCGGTCGG CAGCTATCTG
GAAGTCACCG CGCCGCCCAG CGTCGAGCTC GGCAAGATCG TGTTCGAGCA AGACGGCGGC
GTCACCCTGA TCACCGGCAC GCTGGATTAC GGCCAGGGCC ACGCCACCGC GTTCGCCCAG
GTGCTGGCGG CGCAACTCGG CGTGCCGTTC GAGCGGATCG CGCTGCAGCA GAACGACAGC
GACCTGGTGC ACGCCGGCAG CGGCACCGGC GGCTCGCGCT CGATCACCGC CTCCGGCATG
GCGATCGTGG AAGCGTCGAA GCTGGTGATC GCCAAGGGCA AGATCGCGGC GGCGCATCTG
TTGGAAACCG CGGAAGCCGA CATCGAATTC GACGCCGGCC GCTTCACTGT GGTCGGCACC
GACCGCGGCA TCGACATTCT CGAACTGGCG CAGCGGCTGC GCGAGAGCAA CCTCCCCGAC
GGCATACCGT CGTCGATCGA CGTCGACCAC ACCGTGAACG ACATCCCTTC GACCTTTCCC
AACGGCTGCC ACGTCGCCGA GGTCGAGATC GATCCCGACA CCGGGGCTAC CAGCGTGGTC
GGCTATACCG GGGTCAACGA TTTCGGCACC ATCGTCAATC CGATGATCGT TGCGGGACAA
TTGCACGGCG GCGTCGCGCA AGGCATCGGC CAGGCCTTGA TGGAAAAGGT CAGCTACGAC
GACAGCGGCC AACCGATCAC CGGCTCGCTG ATGGATTACG CGTTGCCGCG CGCCGAGGAC
GTTCCGATGA TGACGATCGG CGATCACCCG ACATTCGCCA CATCCAATCC GCTCGGCACC
AAGGGCTGCG GCGAAGCCGG CTGTGCCGGC AGCCTCGCCA CCCTGGTCAA CGCAGTGCTC
GACGCGCTGT CCGACTACGG CATCGATCAT CTCGACATGC CGCTGACCTC GGAGCGGGTC
TGGCGGGCGA TCGAAGCGAC GAAGAACAAG GCGGCGTGA
 
Protein sequence
MDCAMQDHAF PRDNALAVQR FGVGQPVRRK EDDTLVRGHG CYTDDLQLAG QAYAWMLRSN 
HAHGVIRNID ARAARAMPGV LGIWTGADLA AADYAPFSCA LPLQNRDGSP LLQTPRLALM
TDKVRYVGDP VAIVVAETLL QARDAAEAIE LDIAPLPAVT SAEDAAKPGA PRLYDHIPDN
VALDYHYGDT AAVEAAFAAA AHVTRLDITN TRIAVVAMEP RSALAAYDRT SRRYTIEVPT
QGVSGNRAAL AKLLKVPNDK VHLLTGNVGG SFGMKNINYP EYICILHAAK ALGRPVKWTD
ERSSSFLSDS HGRAQQIHAE LALDAEGHFL AVRISGYGNL GAYITGVSPS PLSLNTGKNL
SSVYRTPLHS VDIKCVLTNT TLMGAYRGAG RPEANYFMER LIDRAADEMR IDRLALRKRN
FIKPSQLPFK ASSGMTYDSG DFLGVFNKAL TLSDYAGFAK RKRDSKKRGK LRGIAVGSYL
EVTAPPSVEL GKIVFEQDGG VTLITGTLDY GQGHATAFAQ VLAAQLGVPF ERIALQQNDS
DLVHAGSGTG GSRSITASGM AIVEASKLVI AKGKIAAAHL LETAEADIEF DAGRFTVVGT
DRGIDILELA QRLRESNLPD GIPSSIDVDH TVNDIPSTFP NGCHVAEVEI DPDTGATSVV
GYTGVNDFGT IVNPMIVAGQ LHGGVAQGIG QALMEKVSYD DSGQPITGSL MDYALPRAED
VPMMTIGDHP TFATSNPLGT KGCGEAGCAG SLATLVNAVL DALSDYGIDH LDMPLTSERV
WRAIEATKNK AA