Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3980 |
Symbol | |
ID | 3911787 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 4545964 |
End bp | 4547589 |
Gene Length | 1626 bp |
Protein Length | 541 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637885884 |
Product | light-independent protochlorophyllide reductase subunit B |
Protein accession | YP_487584 |
Protein GI | 86751088 |
COG category | [C] Energy production and conversion |
COG ID | [COG2710] Nitrogenase molybdenum-iron protein, alpha and beta chains |
TIGRFAM ID | [TIGR01278] light-independent protochlorophyllide reductase, B subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.129659 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCTCA CCGTCTGGAC CTATGAAGGA CCTCCCCATG TCGGCGCGAT GCGAATCGCC ACCGGGATGG AAGGGCTGCA CTACGTCCTG CACGCGCCGC AGGGCGACAC CTACGCCGAT CTGTTGTTCA CCATGATCGA GCGCCGCAAC AAGCGGCCGC CGGTGACCTA TACGACGTTC GCCGCACGCG ACCTCGGTCG CGACACCGCC GAACTGTTCA TGACCGCCGC GCGCGACGCC TATGCGCGGT TCCAGCCGCA GGCGATGATC GTCGGCGCGT CCTGCACCGG CTCGCTGATC CAGGACGATC CGGGCGGGCT GGCGAAATCG CTCGGCTTCC CGATCCCGGT GATCGCGATC GATCTGCCTG CCTATCAGCG CAAGGAAAAC TGGGGCGCCG CCGAAACCTT CTACCAGCTT GTCCGCGCGC TCGCCGGTCC CAACGCGCCG GCGCCCGGCA CCAAGCGTCC GGAGCGCGCG GCGGGAGTCC GCCCGAGCTG CAATCTGCTG GGGCCGACGG CGCTCGGCTT CCGCCATCGC GACGACATCA CCGAAATCAC CGGGCTGCTC GGCAAGCTCG GCATCGATGT CAACGTTGTC GCCCCGATGG GGTCGACGCC GGCGGATATC GCGCGGCTCG GCGATGCCGA TTTCAACGTG GTGATGTATC CGGAAATCGC CGGGCAGGCG GCGTCCTGGC TGCACCGCAT CTTCCATCAG CCGTTCACCA AGACGGTACC GATCGGCGTG TCCGCGACGC GCGACTTCAT TCAGGAAGTG ACCGCGCTCG CCGGCATCGA TCCCGCGCCG ATGCTGCAGG CCTCGTCGAG CCGGTTGCCG TGGTACTCGC ATTCGGTCGA CAGCACTTAC CTGACCAACA AGCGCGTCTT CATCTTCGGC GATGCCACCC ACGCGATCGC CGCGGCGCGG ATCGCGTCGG AAGAGCTCGG CTTCAAGGTG GTTGGGCTCG GCAGCTACAG CCGTGAGTTC GGCCGCGAGC TGCGCGAAGC CGCGAAGCGT TACGATGTCG AACCGCTGAT CACCGACGAC TATCTCGAGG TCGAGGCCAA GGTGGCCGAG CTCCACCCCG AGCTGGTGCT CGGCACCCAG ATGGAGCGCC ACATCGCCAA GCGGCTCGGC GTTCCCTGCG CGGTGATCTC GGCGCCGGTG CACGTTCAGG ATTTCCCGGC GCGCTACGCG CCGCAGATGG GCTTCGAAGG CGCCAATGTG ATCTTCGACA CCTGGGTGCA TCCGCTGATG ATGGGCCTGG AAGAGCATCT GCTGACGATG TTCAAGGACG ATTTCGAATT CAAGGACGGG GCGATGCCGT CGCATCTCGG GACCGGCCAC GCCGCGCCGG TCGCGGAAGC CGTCGCCGCT CCGGCTGCCG CTGTCGCGAC CGAATCCGTC GCGACCGGAG TCGCCGCGCC CGACATCGCG TCGGCAACCG CTGTGGCCGC AGCCGCAGCC GTGTGGGCGC CCGAAGCCGA GAAGGAACTG CAGAAGATAC CGTTCTTCGT TCGCGGGAAA GCCCGCCGGA ATACCGAGCG ATTCGCCAAC GAGAATGGCG TCGCAACCAT CACTGTCGAG ACCTTGTACG ATGCAAAAGC GCACTTCGCA CGCTGA
|
Protein sequence | MQLTVWTYEG PPHVGAMRIA TGMEGLHYVL HAPQGDTYAD LLFTMIERRN KRPPVTYTTF AARDLGRDTA ELFMTAARDA YARFQPQAMI VGASCTGSLI QDDPGGLAKS LGFPIPVIAI DLPAYQRKEN WGAAETFYQL VRALAGPNAP APGTKRPERA AGVRPSCNLL GPTALGFRHR DDITEITGLL GKLGIDVNVV APMGSTPADI ARLGDADFNV VMYPEIAGQA ASWLHRIFHQ PFTKTVPIGV SATRDFIQEV TALAGIDPAP MLQASSSRLP WYSHSVDSTY LTNKRVFIFG DATHAIAAAR IASEELGFKV VGLGSYSREF GRELREAAKR YDVEPLITDD YLEVEAKVAE LHPELVLGTQ MERHIAKRLG VPCAVISAPV HVQDFPARYA PQMGFEGANV IFDTWVHPLM MGLEEHLLTM FKDDFEFKDG AMPSHLGTGH AAPVAEAVAA PAAAVATESV ATGVAAPDIA SATAVAAAAA VWAPEAEKEL QKIPFFVRGK ARRNTERFAN ENGVATITVE TLYDAKAHFA R
|
| |