Gene RPD_3715 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3715 
Symbol 
ID4024231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4146931 
End bp4149261 
Gene Length2331 bp 
Protein Length776 aa 
Translation table11 
GC content68% 
IMG OID637963919 
Productcarbon-monoxide dehydrogenase 
Protein accessionYP_570837 
Protein GI91978178 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID[TIGR02416] carbon-monoxide dehydrogenase, large subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.264219 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATTC TCCCCGGCTC CATGCGGTTC GGTGCGGGTC AGCCCGTCAA GCGTCTCGAG 
GATCAGCGCC TGCTCACCGG GCACGGCCTC TATCTCGACG ACAAGCCCGC CGACGGCGCG
CTGTGGCTGG TGGTGCTGCG CTCGCCTCAT GCGCACGCGA AGATCGTTGC GATCGACGGT
GAGGCGGCGC GGGCGATGCC GGGCGTCGAG TCCGTGCTGA CCGGAGCCGA TCTGGTCGCC
GACGCGGTCG GCACGATCCC GACTCTGCCG ATCTTCAAGC GGCCGGACGG CTCGCCAATG
ACGCTGCCGC CACGGCGTCT GCTCGCGCAT GAGATCGTCC GTTTCGTCGG CGAGCCCGTC
GCCGCCGTCA TCGCGTCGTC GCAAGCCGCC GCGCAAGCGG CCGCCGAGGC TGTCGTCGTC
GAGTACGAAG AGCTTCCCGC CGTGACCGAT CCGACCGCGG CAATCCAGCC CGGCGCGCCG
GTCGTGTACG ACACCGCTCC CGACAACATC GTCGCGGCGA TGAGCTATGG CGATGCCGCC
AAGGTCGATG AGGCCTTCGC CAAAGCCGCG CACACGGTCT CGCTCGATAT CGTCAGCCAG
CGGCTGATCC CTTCCGCGAT GGAGCCGCGC GCGACCATCG CCGAGATCGA GAAGAAGACC
GGCCGGCTGA TCCTGCACGT GCAGTCGCAG ACGCCGGCGA CGACGCGCGA CACGCTCGCC
GACGCCATCC TGAAGCGGCC GAAGGACAGC ATTCAGGTTC TGGTCGGCGA CATCGGCGGC
GGTTTCGGCC AGAAGACCGG CCTCTATCCG GAGGATGGTC TCGTCGCCTA CGCGGCGGTC
AAGCTCAACC GCAAGGTGCG ATGGCGCGGC GACCGGACCG ACGAATTCGT CGGCGGCACC
CATGGCCGCG ACCTGACCTC GACGGCGTCG ATTGCGCTCG ACGCCAAGGG CCGCGTGCTG
GCCTATCGCG TGTCGTCGAT CGGCGGCACC GGCGCCTATC TCGCTGGCGC CGGCGTGATT
ATTCCGCTGG TGCTCGGCCC GTTCGTGCAG ACCGGCGTCT ACGATCTGCC GCTGGTGCAT
TTCGACATCA AGGCGGTGTT GACCCACACC GCGCCGGTCG GAGCCTATCG CGGGGCCGGT
CGCCCCGAGG CGGTGTACAT CATCGAGCGG CTGATGGACG CCGCTGCGCG ACAGCTCGGC
ATGGACCCGC GCGCGATCCG CAAGGTCAAT TACATCAAGC CGTCGCAGCT GCCTTACACC
AACGCGGTCG GGCAGGTGTA CGATAGCGGC GCCTTCGCCC ATATGATGCA GCGCGCCGCC
GAACTGTCCG ACTGGGTCGG CTTCAAGGCG CGCAAGAAGG AAGCCGCGAA GAAGGGCCTG
CTCTACGGCC GCGGCGTCAC AAGCTACATC GAATGGACCG GCGGCCGCGC GCATACCGAA
AAGGTGAGCC TGCACGCCAC GGCGGAAGGC CGCATCGTGC TGCATTCCGG CACGCAGGCG
ATGGGGCAGG GGCTCGAGAC CACCTACACC CAGATGATCG CGCAGGCGCT CGACATTCCG
ATGGATCAGA TCGACGTGGT GCAGGGCAAC ACCGATCTTG CGCAGGGTTT CGGCAGCGTC
GGCTCGCGCT CGCTGTTCGT CGGCGGCACT GCGGTCGCGG TGTCGACCGT CGACATGATC
GCCAAGGCGC GCGAGAAGGC GGCGAACATC CTCGAAGCCT CGGTCGAGGA CATCGAGTAT
TCCGGCGGCA CGCTGACGAT CGCCGGCACC GATCGCAAGA TCAGCCTGTT CGAGATCGCC
GCCAAGGAGA ACGGCGCCAA GCTCAGCGTC GACAGCACCG GCGAAGTGGA CGGCCCGAGC
TGGCCGAACG GCGCGCATAT CTGCGAGGTC GAGGTCGATC CAGAGACGGG TGTGTCGCGC
GTGGTGCGCT ACACCACGGT CGATGACGTC GGCAATGCGG TCAATCCGAT GTTGGTGGCC
GGCCAGATCC ACGGTGGCGT CGCGCAGGGG GTTGGACAAG CGCTGTACGA AGGCGCGTCC
TACAATGACG ACGGCCAGTT GGTGACCGCG AGCTATCAGG ACTACTGCAT CCCGCGGGCC
GACAATCTGC CGCCGATCTC GGTGACGCTC GATCCTTCGG CGCCGTGCCG GACCAATCCG
CTCGGCGCCA AGGGCTGCGG CGAATCCGGC GCGATCGGCG GTCCCCCTTG CGTGGTCCAT
GGCGTGCTCG ATGCGCTGGC ACCGCTCGGC GTCACCGCGC TGAACACGCC GCTGACGCCG
GAAAAGGTGT GGCGCGCAAT TCAGGAAGCC AAAGCTGCGC AGGCGGCCTG A
 
Protein sequence
MNILPGSMRF GAGQPVKRLE DQRLLTGHGL YLDDKPADGA LWLVVLRSPH AHAKIVAIDG 
EAARAMPGVE SVLTGADLVA DAVGTIPTLP IFKRPDGSPM TLPPRRLLAH EIVRFVGEPV
AAVIASSQAA AQAAAEAVVV EYEELPAVTD PTAAIQPGAP VVYDTAPDNI VAAMSYGDAA
KVDEAFAKAA HTVSLDIVSQ RLIPSAMEPR ATIAEIEKKT GRLILHVQSQ TPATTRDTLA
DAILKRPKDS IQVLVGDIGG GFGQKTGLYP EDGLVAYAAV KLNRKVRWRG DRTDEFVGGT
HGRDLTSTAS IALDAKGRVL AYRVSSIGGT GAYLAGAGVI IPLVLGPFVQ TGVYDLPLVH
FDIKAVLTHT APVGAYRGAG RPEAVYIIER LMDAAARQLG MDPRAIRKVN YIKPSQLPYT
NAVGQVYDSG AFAHMMQRAA ELSDWVGFKA RKKEAAKKGL LYGRGVTSYI EWTGGRAHTE
KVSLHATAEG RIVLHSGTQA MGQGLETTYT QMIAQALDIP MDQIDVVQGN TDLAQGFGSV
GSRSLFVGGT AVAVSTVDMI AKAREKAANI LEASVEDIEY SGGTLTIAGT DRKISLFEIA
AKENGAKLSV DSTGEVDGPS WPNGAHICEV EVDPETGVSR VVRYTTVDDV GNAVNPMLVA
GQIHGGVAQG VGQALYEGAS YNDDGQLVTA SYQDYCIPRA DNLPPISVTL DPSAPCRTNP
LGAKGCGESG AIGGPPCVVH GVLDALAPLG VTALNTPLTP EKVWRAIQEA KAAQAA