Gene RPB_3957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3957 
Symbol 
ID3911764 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4515405 
End bp4517735 
Gene Length2331 bp 
Protein Length776 aa 
Translation table11 
GC content69% 
IMG OID637885861 
Productcarbon-monoxide dehydrogenase 
Protein accessionYP_487561 
Protein GI86751065 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID[TIGR02416] carbon-monoxide dehydrogenase, large subunit 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.92859 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATTC TTCCCGGCTC CATGCGGTTC GGCGCGGGGC AGCCCGTCAA GCGTCTCGAG 
GATCAGCGGC TCGTCACCGG ACACGGGCAC TATCTCGACG ACAAGCCCGC CGACGGCGCG
TTGTGGCTGG TGGTGCTGCG CTCACCACAC GCGCACGCCA AGATCGTCTC GATCGATGCC
GAGGCGGCGC GCGCGATGCC GGGAGTCGAA AGCGTTCTGA CCGGCGCGGA CCTCGTCGCC
GACGAGATCG GCACGATCCC GACCCTGCCG ATCTTCAAGC GGCCGGACGG TTCGCCGATG
CTGCTGCCGC CGCGCCGGCT CTTGGCGCAC GAGATCGTCC GCTTCGTCGG CGAGCCGGTC
GCCGCGGTGA TCGCGGCGTC GCAGGCCGCG GCGCAGGCTG CGGCCGAGGC GGTCGTCGTC
GAGTATGAAG AATTGCCGGC GGTGACCGAT CCGGTCGCGG CGATCCAGCC CGGCGCGCCG
GTGGTGGTCG AGACCGCGCC CGACAACATC GTCGCGGCGA TGAGCTATGG CGATGCCGCC
AAGGTCGATG AGGCTTTCGC CAGCGCCGCG CACACCGTGT CGCTCGACAT CGTCAGCCAG
CGCCTGATCC CCTCGGCGAT GGAGCCGCGC GCCACTATCG CGGAAATCGA GAAGAAGACC
GGCCGGCTGA TCCTGCACGT GCAGTCGCAG ACGCCGGCGC AGACCCGCGA CGCGCTCGCC
GACGCGATCC TGAAGCGGCC GAAGGAGTCG ATCCAGGTGC TGGTCGGCGA CATCGGCGGC
GGTTTCGGCC AGAAGACCGG CGTCTATCCC GAGGACGCGC TGGTGGCCTA TGCGGCGGTG
AAGCTCAACA AGAAGATCCG CTGGCGCGGG GACCGCACCG ACGAATTCGT CGGTGGCACC
CATGGCCGCG ACCTGACCTC GACCGCGTCG ATCGCGCTCG ACGCCAAGGG CCGCGTGCTG
GCCTATCGGG TGTCGTCGAT CGGCGGCACC GGCGCGTATC TCGCCGGCGC CGGCGTGATC
ATTCCGCTGG TGCTCGGCCC GTTCGTGCAG ACCGGCGTCT ATGATCTGCC GCTGGTGCAT
TTCGACATCA AGGCGGTGAT GACCCACACC GCGCCCGTCG GCGCCTATCG CGGCGCAGGC
CGCCCGGAAG CCGTGTACAT CATCGAGCGC CTGATGGACG CCGCGGCGCG CCAGCTGAAC
ATGGACCCGC GCGCGATCCG CAAGGTCAAC TACATCAAGC CGACGCAACT GCCCTACACC
AACGCGGTCG GGCAGGTGTA CGATTCGGGC GCCTTCGCGC ATCTGATGCA GCGCGCGACC
GAGCTGTCCG ACTGGGACGG CTTCAAGGCG CGCAAGAAGG AAGCGCAGAA GAAGGGCCTG
CTCTACGGCC GCGGCGTCAC CAGCTACATC GAATGGACCG GCGGCCGCGC CCACACCGAG
AAGGTCAGCC TGCACGCCAC CGCGGAAGGC CGCATCGTGC TGCATTCCGG CACGCAGGCG
ATGGGGCAGG GGCTGGAGAC CACCTACTCG CAGATGATCG CGCAGGCGCT CGACATCCCG
ATCGAGAGCA TCGACGTCGT GCAGGGCAAC ACCGATCTGG CGCAGGGCTT CGGCAGCGTC
GGCTCGCGCT CGCTGTTCGT CGGCGGCACC GCGGTCGCGG TGTCGACCGT CGATATGATC
GCCAAAGCGC GCGAGAAGGC CGCGAACATT CTCGAAGCCT CGATCGAGGA CATCGAGTAT
TCCGGCGGCA TGCTGACGAT CGCCGGCACC GATCGCAAGA TCAGCCTGTT CGAAATCGCC
GCCAAGGAAA AAGGTACCAA GCTCAGCGTC GATTCGACCG GCGAAGTCGA CGGTCCGAGC
TGGCCGAACG GCGCGCATAT CTGCGAGGTC GAGGTCGATC CCGAAACCGG CGTCAGCCGT
GTGGTGCGCT ACACCACGGT CGACGACGTC GGCAATGCGG TCAATCCGAT GCTGGTCGCG
GGGCAGATCC ATGGCGGCGT CGCGCAGGGC GTCGGCCAGG CGCTGTACGA AGGCGCGGCC
TATAACGACG ACGGCCAGCT GCTGACCGCG AGCTATCAGG ACTACTGCAT CCCGCGCGCC
GACAATCTGC CGCCGATCAA CGTCACGCTC GATCCGTCGG CGCCGTGCCG GACCAATCCG
CTCGGCGCCA AGGGCTGCGG CGAATCCGGC GCGATCGGTG GGCCGCCCTG CGTCGTCCAC
GGCGTGCTCG ACGCGCTGGC GCCGCTCGGC GTCACCACGC TGAACACGCC GCTGACCCCG
GAAAAGGTGT GGCGGGCGAT CCAGGACGCC AAGGCCGCGC AGGCGGCCTG A
 
Protein sequence
MNILPGSMRF GAGQPVKRLE DQRLVTGHGH YLDDKPADGA LWLVVLRSPH AHAKIVSIDA 
EAARAMPGVE SVLTGADLVA DEIGTIPTLP IFKRPDGSPM LLPPRRLLAH EIVRFVGEPV
AAVIAASQAA AQAAAEAVVV EYEELPAVTD PVAAIQPGAP VVVETAPDNI VAAMSYGDAA
KVDEAFASAA HTVSLDIVSQ RLIPSAMEPR ATIAEIEKKT GRLILHVQSQ TPAQTRDALA
DAILKRPKES IQVLVGDIGG GFGQKTGVYP EDALVAYAAV KLNKKIRWRG DRTDEFVGGT
HGRDLTSTAS IALDAKGRVL AYRVSSIGGT GAYLAGAGVI IPLVLGPFVQ TGVYDLPLVH
FDIKAVMTHT APVGAYRGAG RPEAVYIIER LMDAAARQLN MDPRAIRKVN YIKPTQLPYT
NAVGQVYDSG AFAHLMQRAT ELSDWDGFKA RKKEAQKKGL LYGRGVTSYI EWTGGRAHTE
KVSLHATAEG RIVLHSGTQA MGQGLETTYS QMIAQALDIP IESIDVVQGN TDLAQGFGSV
GSRSLFVGGT AVAVSTVDMI AKAREKAANI LEASIEDIEY SGGMLTIAGT DRKISLFEIA
AKEKGTKLSV DSTGEVDGPS WPNGAHICEV EVDPETGVSR VVRYTTVDDV GNAVNPMLVA
GQIHGGVAQG VGQALYEGAA YNDDGQLLTA SYQDYCIPRA DNLPPINVTL DPSAPCRTNP
LGAKGCGESG AIGGPPCVVH GVLDALAPLG VTTLNTPLTP EKVWRAIQDA KAAQAA