Gene RPC_1014 details


Gene Information

Locus tag: RPC_1014
Symbol:
ID: 3969663
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 1112770
End bp: 1115106
Gene Length: 2337 bp
Protein Length: 778 aa
Translation table: 11
GC content: 66%
IMG OID: 637924127
Product: aldehyde oxidase and xanthine dehydrogenase, molybdopterin binding
Protein accession: YP_530899
Protein GI: 90422529
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
TIGRFAM ID: [TIGR03194] 4-hydroxybenzoyl-CoA reductase, alpha subunit
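
The start, end, gene length and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1112770, 1115106)
print(length_bp, protein_length(length_bp))  # 2337 778
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of this record.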


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGCGG GGCCGATGCC ACAGGGCGGC CTCGACCGGA CCGGATTCCC GCTGATCGAC 
GGCATCGAGA AGGTCACCGG GCGGGCCCGC TACACTGCGG ACCTCAACCA TAGTGAAGCC
CTCGTCGCGC GCATCCTGCG CAGTCCGATC AGCCACGGTG ACATCGTCAG GCTCGACATC
AGCAAGGCGC TCGCGTTGGA GGGCGTCTTG GCGGTGGTCA CCGGCGAGGA TTGCCCGTTT
ACCTACGGCG TGCTGCCGAT CGCCATGAAC GAACATCCGA TGGCGCGCGG GCGGCTCCGC
TATCGCGGCG AGCCGATCGC CGCGGTGGCG GCGGTTGATG CCGCAACCGC GCAGCGCGCG
ATCGACTTGA TTGAACTCGA ATTGCGCGAG CTCCCGGCCT ACTATGATTC AGCAAGCGCC
CGCGCTCCGG ACGCGCTGCT GTTGCACGAC CACAAGCCCG GCAATATCGA GCGGGAGGTC
CACAACGAGT TCGGCGATCT CGCTGCCGGC TTCGAGGCGG CAGATCTGGT TCGGGAGCAC
AATTTCAACT GCGCCGAGGT CAATCACGCC CAGATCGAGC CCCATGCCTG CCTGATGGAC
TACGATCCGA TCAACGGCCG GCTGACCGCG CAGAGCGTCT CCCAGGTCGG CTATTATCTG
CATCTGATGC TGGCGCGGTG CCTCGACATC GATCAGTCGC GCATTCGGGT GATCAAACCG
TTCGTCGGCG GCGGCTTCGG GGCACGCGTT GAGGTCCTGA ACTTCGAGAT CATCACTGCG
CTGCTGGCGC GCAAGGCCAG CGCCAAGGTG TCGATGCGCC TCACACGCGA GGAGACTTTC
GTCACCCATC GGGCGCGGCC GCAGACCGAT GTCAGACTAA AGATCGGCAT GAAGCGCGAC
GGTCGGCTCA CCGCCTGCGC CTGTGAAGTC GTGCAGCGCG GCGGCGCCTA TGCGGGATAC
GGTATCGTCA CCATCCTCTA TGCCGGCGCA TTGCTGCAGG GCCTGTACGA TATTCCAGCG
ATCCGATACG ACGGCTATCG GGTGTATACC AATCTGCCTC CCTGCGGCGC GATGCGGGGG
CACGGTTCGG TCGACGTCCG TCACGCGTTC GAAACCCTGA TTGATCGGAT GGCGCGCGAA
CTCGGCCTCG ATCCGTTCGC GGTACGCCGC GCCAATCTGT TGACGGCGCC GACGCGGACA
CTGAACGACC TGATGGTGAA CAGCTATGGG CTCGCCGATT GCCTCGATAA GGTCGAGCGC
GCCAGCGGAT GGCACGAGCG AATCGGGCGG ATGCCGCCGG GCAAAGGGCT CGGCATGGCC
TGCTCGCACT ATGTCAGCGG ATCGGCCAAA CCGATACACT TCACCGGCGA GCCCCACGCG
GTGGTCGCAC TCCGACTCGA TTTCGACGGC GGCGTAACGG CGCTGACCGG CGCGGCCGAT
ATCGGCCAGG GCTCCTCCAC CGTGGTGGCC ATCACGGTCG CGGAGACATT GGGCATCGCG
CTGAACCGGG TGCGGGTGAT TTCCGGCGAC TCCGCGGTCA CACCGAAGGA CAATGGCGCC
TACTCGTCGC GCATCACCTT CATGGTCGGC AACGCCGCGA TCGATGCGGC CAAACAACTG
AAGGACATCC TCATCGCCGC GGCGGCGCGC AAGCTCGAAG CCAGTCCCGA TCAAGTGCAA
TGCGGGGGCG AGACGTTCTA TGTCGGCAGC GGCGCGCAGG CCGCGCTGTC GTTCGCGGAG
GTCGTGGCGG CCGCGCTGGT CGCCGAGGGC GCGATCACCG TCAAGGGCTC TTTCACCTGC
CCGCCCGAAT CCCAGGGTGG CAAACATCGC GGCGGCGCGG TCGGCTCCAC CATGGGCTTC
AGCTACGCCG CCCAAGTGGT CGAGGTCAGC GTCGATGACG CGACCGGCTT GATCGCGATC
GAAAAGGTGT GGACCGCGCT CGACTGTGGA CGCGCCATCA ATCCGCTGGC GGTGGTCGGT
CAGGTGCAGG GCGCGGTGTG GATGGGAATG GGACAGGCGC TGAGCGAGGA AACCCGGTAT
CTCGACGGTT TGCCCGCCCA TGCCAGCTTC CTCGAATATC GCATGCCGAC GATGGCGGAA
TCCCCGCCGA TCGAGGTGCA AATCGTCGAA AGCCACGATC CGTTCGGCCC GTTCGGCGCC
AAAGAAGCTA GCGAGGGAGC GCTGGCCGGA TTTCCGCCGG CGATGGTCAA TGCCGTCGCC
AATGCGATCG GCGTCGATCT CGATGATTTG CCGGTGACGC CGGATCGCGT CGTCGATGCG
CTTGTCCGGC GACGGCGCGA GGCAAGACGG ACCAATCCCG CGAGGGCCAC ATCATGA
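
A GC content figure like the one in the Gene Information section can be recomputed from a sequence in this 10-base block layout. The sketch below is a hypothetical helper, not the database's own method. It assumes GC content means the percentage of G and C bases over the whole sequence, and it is applied here only to the first line of the gene sequence as a demonstration.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the bases of seq; whitespace is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above (60 of the 2337 bases).
first_line = "ATGACGGCGG GGCCGATGCC ACAGGGCGGC CTCGACCGGA CCGGATTCCC GCTGATCGAC"
print(round(gc_content(first_line), 1))
```

A 60-base prefix is not representative of the whole gene, so its value need not match the reported figure. Passing the full sequence to the same function would give the gene-wide percentage.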
 
Protein sequence
MTAGPMPQGG LDRTGFPLID GIEKVTGRAR YTADLNHSEA LVARILRSPI SHGDIVRLDI 
SKALALEGVL AVVTGEDCPF TYGVLPIAMN EHPMARGRLR YRGEPIAAVA AVDAATAQRA
IDLIELELRE LPAYYDSASA RAPDALLLHD HKPGNIEREV HNEFGDLAAG FEAADLVREH
NFNCAEVNHA QIEPHACLMD YDPINGRLTA QSVSQVGYYL HLMLARCLDI DQSRIRVIKP
FVGGGFGARV EVLNFEIITA LLARKASAKV SMRLTREETF VTHRARPQTD VRLKIGMKRD
GRLTACACEV VQRGGAYAGY GIVTILYAGA LLQGLYDIPA IRYDGYRVYT NLPPCGAMRG
HGSVDVRHAF ETLIDRMARE LGLDPFAVRR ANLLTAPTRT LNDLMVNSYG LADCLDKVER
ASGWHERIGR MPPGKGLGMA CSHYVSGSAK PIHFTGEPHA VVALRLDFDG GVTALTGAAD
IGQGSSTVVA ITVAETLGIA LNRVRVISGD SAVTPKDNGA YSSRITFMVG NAAIDAAKQL
KDILIAAAAR KLEASPDQVQ CGGETFYVGS GAQAALSFAE VVAAALVAEG AITVKGSFTC
PPESQGGKHR GGAVGSTMGF SYAAQVVEVS VDDATGLIAI EKVWTALDCG RAINPLAVVG
QVQGAVWMGM GQALSEETRY LDGLPAHASF LEYRMPTMAE SPPIEVQIVE SHDPFGPFGA
KEASEGALAG FPPAMVNAVA NAIGVDLDDL PVTPDRVVDA LVRRRREARR TNPARATS
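
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below is a minimal illustration, not the pipeline that produced this record. It assumes the input is in frame and uses the amino acid assignments of NCBI table 11, which match the standard code (the two tables differ in permitted start codons). `translate` is a hypothetical helper name.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; table 11 assigns the same
# amino acids as the standard code (it differs only in start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGACGGCGG GGCCGATGCC ACAGGGCGGC"))  # MTAGPMPQGG
```

The output matches the first ten residues of the protein sequence. The final codon of the gene, TGA, is a stop codon and is not represented in the 778-residue protein.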