Gene Information |
Locus tag | RPB_1669 |
Symbol | |
ID | 3908656 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1898155 |
End bp | 1901706 |
Gene Length | 3552 bp |
Protein Length | 1183 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 637883563 |
Product | pyrroloquinoline-quinone aldehyde dehydrogenase |
Protein accession | YP_485288 |
Protein GI | 86748792 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs; [COG2010] Cytochrome c, mono- and diheme variants |
TIGRFAM ID | |
|
|
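The coordinate and length fields above are mutually consistent, and the relationship can be checked with a few lines of Python. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon (both assumptions match the listed values, but the record does not state them explicitly). The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(1898155, 1901706)
print(length)                  # 3552
print(protein_length(length))  # 1183
```

Both results agree with the Gene Length (3552 bp) and Protein Length (1183 aa) fields.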
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.224955 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGAAC CGACGCCGGC GCCGAAGGAC CTGCTTGCTC GCACGGGCAC GCTTGCCGTG TTGCGGCCGT CGCAGCACGT CAAGGGCCTG GTGCCGACGG CGCCGCCGCC GGAAGGCGCG CTCGACCTGT TCCTGTTTCT CGACGATTCG GGTCGGGTGC TCGCCTTCAA CGGCCATGTC GATCTCGGCA CCGGCATCCG CACCGCGCTG GCGCAGATCG TGGCGGAAGA ACTCGACGTG TCGTTCGCCG CCGTCACCAT GGTGCTCGGC CACACCTCCG GCACGCCGAA CCAGGGCGCA ACCATCGCCA GCGACAGCAT CCAGGTCTCG GCGCTGCCGC TGCGCAACGC TGCGGCGCAG GCGCGGCATC ATCTGATCGC CCTGGCCGCG GCAGAACTCG AACTGCCGCC GAACGATCTG GCCGTCACCG ACGGCGTCGT GCATCCGCGC GGCGGCGCCA ATATCGGCGT GTCCTATGCG TCGTTGCTGC AGGGCCGCAC CGACCGGCTG TTGCTCGCCG AGGGTGTGGC GGTGAAGCCG GTGGCCGAGC ATCGCATCGT CGGTCAGCGC ATCGCGCGCT CGGATATTCC GGCCAAGGCG ACCGGCGACT TCGTCTATGT GCACGACGTG CGCGTCCCCG GCATGTTGCA CGGCCGCGTG GTGCGGCCGC CTTATGCCGG AATCGATGCC GGCGACTTCA TCGGCGCCAG CCTGATCGGC GTCGACGAAA CCTCCGTCGC GCATATTCCG GGTGTGGTCG CGGTCATCAG CATGGGCGAC TTCGTCGGCG TCGTCGCCGA GCGCGAGGAG CAGGCGGCCG AAGCCGCGCG CGTGTTGAAG GTGGAATGGA AGCCGCCGCC GCCGCTGCCG GATCTCGATG ATCTTGCCAC GGCGCTGCGC GCCAACCCGG CGACCACGCG GACGCTGCAC GACAAAGGCG ACGTCGATCG CGCCCGCGCC GACGCCGCGG TGCCGATGGA CCGCAGCTAT GTCTGGCCGT ATCAGATGCA CGGCTCGATC GGCCCGTCCT GCGCGGTGGC GGACGTCCGC GACGGCGCCG CCACGATCTG GTCCGGCACG CAGAATCCCT ATCCGCTGCG GCTCGATCTC TCGGTGCTGC TCGGCATCCC CGAGTCCGAT ATCGAGGTGC TGCGGTTTGA AGCTGCCGGC TGCTACGGCC GCAACTGCGC CGACGACGTT TCTGCGGATG CGGCGCTGCT GTCGCGCGCG GTGGGGCGGC CTGTCCGCGT CCAACTGACC CGCGAGCAGG AGCACGCCTG GGAGCCGAAG GGCGCGGCGC AACTGATGGA GATTTCCGGC GGGCTGAACG CCGACGGCAG CCCGGCCGCG TATGATTTCG CCACGCGCTA TCCGTCGAAT GCGGCGACGA CGCTGGCGCT GCTGCTGACC GGGCGCGTGC CGGCGAACAA TCCGGTGTTC GAGATGGGCG ATCGCACCGC GATCCCGCCT TACGCCTACG ACAACATCCG CGTTAAGGTG CACGACATGG CGCCGATCGT GCGCGCGGCG TGGCTGCGTG GCGTCTCGGC GCTGCCGAAT TCGTTCGCGC ATGAGAGCTA TATCGACGAA CTCGCCGCCG CGGCCGGGGT CGATCCGGTT GAGTATCGGC TGCGCTATCT GCACGACCCG CGCGCGGTCG ATCTGGTCAA GGAAGTCGCC GCGCGCGCCG GCTGGGCGCA TCGCACCGGC CCGCGCCAGG AAGTCGTCGA TGGCGACGTC GTGCGCGGGC AGGGCATCGC CTATGCGCTG TACGTCCACT CCAAATTCCC CGGCTACGGC 
GCGGCGTGGT CGGCGTGGGT CGCCGATGTC GAGGTCAACA AGGCGACCGG CGACGTCGCG GTGAAGCGCG TCGTGGTCGG TCAGGATTCC GGGCTGATGA TCAATCCCGC CGGGATCGAG CATCAGATCC ACGGCAACGT CATTCAATCG ACCAGCCGCG TGCTGAAGGA ACAGGTGAGC TTCACCGGCA CCGCGGTGGC CGACAAGGAA TGGGGCGCGT ACCCGATCCT CACCTTCCCG GACGTGCCGG TCATCAACGT CGTGCTGATG CCGCGCCCGA ACGATCCGCC GCTCGGCAGC GGCGAATCCG CCTCGGTACC GTCGGCGGCG GCGATCGCCA ATGCGATCTT CGATGCCACC GGGGTGCGGC TGCGCGAGCC GCCATTCACG CCGGAGCGGG TGCGGGCCGC GCTGGGCCAT CCGATGCTGC CGCCGCCGCC CGCGCCTGCG CCGAAGAAGC GGTCGTGGTT GGCGCTGGCC GGCGCGGCGC TGGTCGGCGC GCTCGGCATG GCGACGGTGG CGCTGCCGAT CCGCGGTGCC ATTGCGCCGA TCGCGCCGCC CGATCCGGCG AGCTTCTCCG CCGAGATGAT CTCGCGCGGA CGGCAGCTCG CCGCGCTCGG CGGCTGCGCG GTGTGCCACA CCGAGATCGG CGGCGCGACC AATGCGGGCG GCCGCCCGGT CGAGACTCCG TTCGGCACGG TGTACTCCAC CAACCTCACG CCAGATCCGG AAACCGGCAT CGGCCGCTGG TCCTACGCGG CGTTCGAGCG CGCGATGCGC GAGGGCATCG CCCGCGACGG GCGCCATCTC TATCCGGCGT TTCCCTACGC CTCGTTCACA CGCACGAGCG ATATCGATCT GCAGGCGCTG TACGCCTATC TGATGACGCA GACGCCGGTG AAGGCGCCGA CGCCCGAGGC GCGGATGGCG TTTCCGTTCA ATCTCCGGCC GCTGATGGCC GGCTGGAATG CGCTGTTTCT GCGCACCGGC CAGATGCAGG CCGACCCCGT CAAAGCGCCG CAATGGAATC GCGGCGCGTA TCTGGTCGAG AGCCTCGGCC ATTGCGGCGC CTGCCACACG CCGCGCAATG CACTCGGTGC CGAACAGGCG CGGACCGCGT ATCTCAGTGG CGGCGTCGTC GACGGCTGGC ACGCCCCGGC GCTGAACGCG CTCTCCAGTG CGCCGATTCC GTGGACGGAG GCCGAGCTGT TCTCTTATCT GCGCACCGGG TTCTCGCAGT TCCACGGCAC GGCGGCCGGG CCGATGGCGC CGGTGGTGGA GCAACTCGCC GCGCTGCCGG ATGCCGACAT CCGCGCCATG GCGACCTATC TGGCGTCGTT CGCGCCGCCG GCCGACGAGG CGCCCGCCGC CCGCGCCGCC GCGCTGCAGG CCGCGGCGAC GGCGACGCTG CGGCCGCTGG ATTCGCTCGG CGGCAGGCTC TACGAGGGCG CCTGCGCCTC CTGCCACAGC GACGACGGGC CGACCCTGTT CGGCGTGCGC CCGGCGCTGG CGCTCAACAC CAACGTTCAC GCCGCGGACC CCGACAATCT GATTAGGGTC ATTCTAGACG GAATACCTAG TCCGGCGGCG GCCGAACTGG GGGATATGCC CGGGTTCCGC CACAGTTTCG ACGATAACCA GATCGCGGCG CTGGTGACTT ATCTCCGCGC CTCCTTCGCC CCCCAGGCGC CCGCCTGGGG CGGCGTGGAA CAAACGGTGG CGCGCCTGCG CGCCCACCGA GGCTCCCACT GA
|
Protein sequence | MSEPTPAPKD LLARTGTLAV LRPSQHVKGL VPTAPPPEGA LDLFLFLDDS GRVLAFNGHV DLGTGIRTAL AQIVAEELDV SFAAVTMVLG HTSGTPNQGA TIASDSIQVS ALPLRNAAAQ ARHHLIALAA AELELPPNDL AVTDGVVHPR GGANIGVSYA SLLQGRTDRL LLAEGVAVKP VAEHRIVGQR IARSDIPAKA TGDFVYVHDV RVPGMLHGRV VRPPYAGIDA GDFIGASLIG VDETSVAHIP GVVAVISMGD FVGVVAEREE QAAEAARVLK VEWKPPPPLP DLDDLATALR ANPATTRTLH DKGDVDRARA DAAVPMDRSY VWPYQMHGSI GPSCAVADVR DGAATIWSGT QNPYPLRLDL SVLLGIPESD IEVLRFEAAG CYGRNCADDV SADAALLSRA VGRPVRVQLT REQEHAWEPK GAAQLMEISG GLNADGSPAA YDFATRYPSN AATTLALLLT GRVPANNPVF EMGDRTAIPP YAYDNIRVKV HDMAPIVRAA WLRGVSALPN SFAHESYIDE LAAAAGVDPV EYRLRYLHDP RAVDLVKEVA ARAGWAHRTG PRQEVVDGDV VRGQGIAYAL YVHSKFPGYG AAWSAWVADV EVNKATGDVA VKRVVVGQDS GLMINPAGIE HQIHGNVIQS TSRVLKEQVS FTGTAVADKE WGAYPILTFP DVPVINVVLM PRPNDPPLGS GESASVPSAA AIANAIFDAT GVRLREPPFT PERVRAALGH PMLPPPPAPA PKKRSWLALA GAALVGALGM ATVALPIRGA IAPIAPPDPA SFSAEMISRG RQLAALGGCA VCHTEIGGAT NAGGRPVETP FGTVYSTNLT PDPETGIGRW SYAAFERAMR EGIARDGRHL YPAFPYASFT RTSDIDLQAL YAYLMTQTPV KAPTPEARMA FPFNLRPLMA GWNALFLRTG QMQADPVKAP QWNRGAYLVE SLGHCGACHT PRNALGAEQA RTAYLSGGVV DGWHAPALNA LSSAPIPWTE AELFSYLRTG FSQFHGTAAG PMAPVVEQLA ALPDADIRAM ATYLASFAPP ADEAPAARAA ALQAAATATL RPLDSLGGRL YEGACASCHS DDGPTLFGVR PALALNTNVH AADPDNLIRV ILDGIPSPAA AELGDMPGFR HSFDDNQIAA LVTYLRASFA PQAPAWGGVE QTVARLRAHR GSH
|
| |
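The gene and protein sequences above can be cross-checked against each other and against the GC content field. The following is a minimal sketch in plain Python, not the method used to produce this record. It assumes the sequence is pasted with its space-separated 10-base blocks, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in their permitted start codons, which is not relevant here because this gene starts with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino-acid assignments and differs only in its alternative start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
prefix = "ATGAGTGAAC CGACGCCGGC GCCGAAGGAC"
print(translate(prefix))  # MSEPTPAPKD
```

The ten residues printed match the start of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.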