Gene RPB_1669 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1669 
Symbol 
ID3908656 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1898155 
End bp1901706 
Gene Length3552 bp 
Protein Length1183 aa 
Translation table11 
GC content70% 
IMG OID637883563 
Productpyrroloquinoline-quinone aldehyde dehydrogenase 
Protein accessionYP_485288 
Protein GI86748792 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
[COG2010] Cytochrome c, mono- and diheme variants 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.224955 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAAC CGACGCCGGC GCCGAAGGAC CTGCTTGCTC GCACGGGCAC GCTTGCCGTG 
TTGCGGCCGT CGCAGCACGT CAAGGGCCTG GTGCCGACGG CGCCGCCGCC GGAAGGCGCG
CTCGACCTGT TCCTGTTTCT CGACGATTCG GGTCGGGTGC TCGCCTTCAA CGGCCATGTC
GATCTCGGCA CCGGCATCCG CACCGCGCTG GCGCAGATCG TGGCGGAAGA ACTCGACGTG
TCGTTCGCCG CCGTCACCAT GGTGCTCGGC CACACCTCCG GCACGCCGAA CCAGGGCGCA
ACCATCGCCA GCGACAGCAT CCAGGTCTCG GCGCTGCCGC TGCGCAACGC TGCGGCGCAG
GCGCGGCATC ATCTGATCGC CCTGGCCGCG GCAGAACTCG AACTGCCGCC GAACGATCTG
GCCGTCACCG ACGGCGTCGT GCATCCGCGC GGCGGCGCCA ATATCGGCGT GTCCTATGCG
TCGTTGCTGC AGGGCCGCAC CGACCGGCTG TTGCTCGCCG AGGGTGTGGC GGTGAAGCCG
GTGGCCGAGC ATCGCATCGT CGGTCAGCGC ATCGCGCGCT CGGATATTCC GGCCAAGGCG
ACCGGCGACT TCGTCTATGT GCACGACGTG CGCGTCCCCG GCATGTTGCA CGGCCGCGTG
GTGCGGCCGC CTTATGCCGG AATCGATGCC GGCGACTTCA TCGGCGCCAG CCTGATCGGC
GTCGACGAAA CCTCCGTCGC GCATATTCCG GGTGTGGTCG CGGTCATCAG CATGGGCGAC
TTCGTCGGCG TCGTCGCCGA GCGCGAGGAG CAGGCGGCCG AAGCCGCGCG CGTGTTGAAG
GTGGAATGGA AGCCGCCGCC GCCGCTGCCG GATCTCGATG ATCTTGCCAC GGCGCTGCGC
GCCAACCCGG CGACCACGCG GACGCTGCAC GACAAAGGCG ACGTCGATCG CGCCCGCGCC
GACGCCGCGG TGCCGATGGA CCGCAGCTAT GTCTGGCCGT ATCAGATGCA CGGCTCGATC
GGCCCGTCCT GCGCGGTGGC GGACGTCCGC GACGGCGCCG CCACGATCTG GTCCGGCACG
CAGAATCCCT ATCCGCTGCG GCTCGATCTC TCGGTGCTGC TCGGCATCCC CGAGTCCGAT
ATCGAGGTGC TGCGGTTTGA AGCTGCCGGC TGCTACGGCC GCAACTGCGC CGACGACGTT
TCTGCGGATG CGGCGCTGCT GTCGCGCGCG GTGGGGCGGC CTGTCCGCGT CCAACTGACC
CGCGAGCAGG AGCACGCCTG GGAGCCGAAG GGCGCGGCGC AACTGATGGA GATTTCCGGC
GGGCTGAACG CCGACGGCAG CCCGGCCGCG TATGATTTCG CCACGCGCTA TCCGTCGAAT
GCGGCGACGA CGCTGGCGCT GCTGCTGACC GGGCGCGTGC CGGCGAACAA TCCGGTGTTC
GAGATGGGCG ATCGCACCGC GATCCCGCCT TACGCCTACG ACAACATCCG CGTTAAGGTG
CACGACATGG CGCCGATCGT GCGCGCGGCG TGGCTGCGTG GCGTCTCGGC GCTGCCGAAT
TCGTTCGCGC ATGAGAGCTA TATCGACGAA CTCGCCGCCG CGGCCGGGGT CGATCCGGTT
GAGTATCGGC TGCGCTATCT GCACGACCCG CGCGCGGTCG ATCTGGTCAA GGAAGTCGCC
GCGCGCGCCG GCTGGGCGCA TCGCACCGGC CCGCGCCAGG AAGTCGTCGA TGGCGACGTC
GTGCGCGGGC AGGGCATCGC CTATGCGCTG TACGTCCACT CCAAATTCCC CGGCTACGGC
GCGGCGTGGT CGGCGTGGGT CGCCGATGTC GAGGTCAACA AGGCGACCGG CGACGTCGCG
GTGAAGCGCG TCGTGGTCGG TCAGGATTCC GGGCTGATGA TCAATCCCGC CGGGATCGAG
CATCAGATCC ACGGCAACGT CATTCAATCG ACCAGCCGCG TGCTGAAGGA ACAGGTGAGC
TTCACCGGCA CCGCGGTGGC CGACAAGGAA TGGGGCGCGT ACCCGATCCT CACCTTCCCG
GACGTGCCGG TCATCAACGT CGTGCTGATG CCGCGCCCGA ACGATCCGCC GCTCGGCAGC
GGCGAATCCG CCTCGGTACC GTCGGCGGCG GCGATCGCCA ATGCGATCTT CGATGCCACC
GGGGTGCGGC TGCGCGAGCC GCCATTCACG CCGGAGCGGG TGCGGGCCGC GCTGGGCCAT
CCGATGCTGC CGCCGCCGCC CGCGCCTGCG CCGAAGAAGC GGTCGTGGTT GGCGCTGGCC
GGCGCGGCGC TGGTCGGCGC GCTCGGCATG GCGACGGTGG CGCTGCCGAT CCGCGGTGCC
ATTGCGCCGA TCGCGCCGCC CGATCCGGCG AGCTTCTCCG CCGAGATGAT CTCGCGCGGA
CGGCAGCTCG CCGCGCTCGG CGGCTGCGCG GTGTGCCACA CCGAGATCGG CGGCGCGACC
AATGCGGGCG GCCGCCCGGT CGAGACTCCG TTCGGCACGG TGTACTCCAC CAACCTCACG
CCAGATCCGG AAACCGGCAT CGGCCGCTGG TCCTACGCGG CGTTCGAGCG CGCGATGCGC
GAGGGCATCG CCCGCGACGG GCGCCATCTC TATCCGGCGT TTCCCTACGC CTCGTTCACA
CGCACGAGCG ATATCGATCT GCAGGCGCTG TACGCCTATC TGATGACGCA GACGCCGGTG
AAGGCGCCGA CGCCCGAGGC GCGGATGGCG TTTCCGTTCA ATCTCCGGCC GCTGATGGCC
GGCTGGAATG CGCTGTTTCT GCGCACCGGC CAGATGCAGG CCGACCCCGT CAAAGCGCCG
CAATGGAATC GCGGCGCGTA TCTGGTCGAG AGCCTCGGCC ATTGCGGCGC CTGCCACACG
CCGCGCAATG CACTCGGTGC CGAACAGGCG CGGACCGCGT ATCTCAGTGG CGGCGTCGTC
GACGGCTGGC ACGCCCCGGC GCTGAACGCG CTCTCCAGTG CGCCGATTCC GTGGACGGAG
GCCGAGCTGT TCTCTTATCT GCGCACCGGG TTCTCGCAGT TCCACGGCAC GGCGGCCGGG
CCGATGGCGC CGGTGGTGGA GCAACTCGCC GCGCTGCCGG ATGCCGACAT CCGCGCCATG
GCGACCTATC TGGCGTCGTT CGCGCCGCCG GCCGACGAGG CGCCCGCCGC CCGCGCCGCC
GCGCTGCAGG CCGCGGCGAC GGCGACGCTG CGGCCGCTGG ATTCGCTCGG CGGCAGGCTC
TACGAGGGCG CCTGCGCCTC CTGCCACAGC GACGACGGGC CGACCCTGTT CGGCGTGCGC
CCGGCGCTGG CGCTCAACAC CAACGTTCAC GCCGCGGACC CCGACAATCT GATTAGGGTC
ATTCTAGACG GAATACCTAG TCCGGCGGCG GCCGAACTGG GGGATATGCC CGGGTTCCGC
CACAGTTTCG ACGATAACCA GATCGCGGCG CTGGTGACTT ATCTCCGCGC CTCCTTCGCC
CCCCAGGCGC CCGCCTGGGG CGGCGTGGAA CAAACGGTGG CGCGCCTGCG CGCCCACCGA
GGCTCCCACT GA
 
Protein sequence
MSEPTPAPKD LLARTGTLAV LRPSQHVKGL VPTAPPPEGA LDLFLFLDDS GRVLAFNGHV 
DLGTGIRTAL AQIVAEELDV SFAAVTMVLG HTSGTPNQGA TIASDSIQVS ALPLRNAAAQ
ARHHLIALAA AELELPPNDL AVTDGVVHPR GGANIGVSYA SLLQGRTDRL LLAEGVAVKP
VAEHRIVGQR IARSDIPAKA TGDFVYVHDV RVPGMLHGRV VRPPYAGIDA GDFIGASLIG
VDETSVAHIP GVVAVISMGD FVGVVAEREE QAAEAARVLK VEWKPPPPLP DLDDLATALR
ANPATTRTLH DKGDVDRARA DAAVPMDRSY VWPYQMHGSI GPSCAVADVR DGAATIWSGT
QNPYPLRLDL SVLLGIPESD IEVLRFEAAG CYGRNCADDV SADAALLSRA VGRPVRVQLT
REQEHAWEPK GAAQLMEISG GLNADGSPAA YDFATRYPSN AATTLALLLT GRVPANNPVF
EMGDRTAIPP YAYDNIRVKV HDMAPIVRAA WLRGVSALPN SFAHESYIDE LAAAAGVDPV
EYRLRYLHDP RAVDLVKEVA ARAGWAHRTG PRQEVVDGDV VRGQGIAYAL YVHSKFPGYG
AAWSAWVADV EVNKATGDVA VKRVVVGQDS GLMINPAGIE HQIHGNVIQS TSRVLKEQVS
FTGTAVADKE WGAYPILTFP DVPVINVVLM PRPNDPPLGS GESASVPSAA AIANAIFDAT
GVRLREPPFT PERVRAALGH PMLPPPPAPA PKKRSWLALA GAALVGALGM ATVALPIRGA
IAPIAPPDPA SFSAEMISRG RQLAALGGCA VCHTEIGGAT NAGGRPVETP FGTVYSTNLT
PDPETGIGRW SYAAFERAMR EGIARDGRHL YPAFPYASFT RTSDIDLQAL YAYLMTQTPV
KAPTPEARMA FPFNLRPLMA GWNALFLRTG QMQADPVKAP QWNRGAYLVE SLGHCGACHT
PRNALGAEQA RTAYLSGGVV DGWHAPALNA LSSAPIPWTE AELFSYLRTG FSQFHGTAAG
PMAPVVEQLA ALPDADIRAM ATYLASFAPP ADEAPAARAA ALQAAATATL RPLDSLGGRL
YEGACASCHS DDGPTLFGVR PALALNTNVH AADPDNLIRV ILDGIPSPAA AELGDMPGFR
HSFDDNQIAA LVTYLRASFA PQAPAWGGVE QTVARLRAHR GSH