Gene Information |
Locus tag | RPB_3422 |
Symbol | |
ID | 3911224 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 3910485 |
End bp | 3911408 |
Gene Length | 924 bp |
Protein Length | 307 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637885325 |
Product | pyrroloquinoline quinone biosynthesis protein PqqB |
Protein accession | YP_487029 |
Protein GI | 86750533 |
COG category | [R] General function prediction only |
COG ID | [COG1235] Metal-dependent hydrolases of the beta-lactamase superfamily I |
TIGRFAM ID | [TIGR02108] coenzyme PQQ biosynthesis protein B |
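The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length counts the stop codon; the record does not state either convention, but both assumptions reproduce its values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length, assuming the CDS length includes one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record for RPB_3422.
print(gene_length_bp(3910485, 3911408))  # 924
print(protein_length_aa(924))            # 307
```

Both results agree with the Gene Length (924 bp) and Protein Length (307 aa) fields.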
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.492214 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.457991 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGAGTCG TCGTTCTCGG GGCGGCGGCC GGAGGCGGAG TCCCGCAATG GAACTGCGGC TGCGCGATCT GCCGCGCGGC ATTTGCCGAT CCGGCGCTCA GACGCACGCA GGCGTCGATC GCCGTCAGCG CTGATAGCGT GCACTGGTTC CTGGTCAATG CCTCGCCGGA TCTGCGCCAG CAGATCATCG CGACGCCGCA GCTTCACCCA GCGACCGGCG CGCTGCGCCA CAGCCCGATC GCGGGCGTGA TCCTCAGCAA TGGCGAGGTC GATGCGGTGG CGGGCCTGCT GTCGATGCGC GAAGGTTCAC CGTTCGCGAT CTACGCCCAT CCCAAGGTGC TGGCGATCCT GAAGGCGAAC AGCATCTTCA ACGTGCTCGA CGAGAAACTG GTGCCGCGCC GATCGATCGA GATCGATCGG CCGTTCGAGC CGATGCTGCC GGATGCGACG CCATCGGGCT TCGAGGTGGT GGCGTTCGCC GTGCCCGGCA AGGGCGCGTG GTATCTCGAA GGGCGGGCGC ATCCGGGCGG CGAGGCGGCG GACGGCGACA CGCTCGGGCT CACCATCACC GACAAGGCCA CGCGACAATC CATCCACGTT CTCACCGCCT GTGCGCGGGT GACCGATGAC TTGAAGGCGC GGCTCGCCGG CTCGCCGCTG CTGCTGTTCG ACGGCACGGT GTGGCGCGAC GACGAATTGA TCGCCGCCGG GCTCGGCGCC AAGACCGGGC AGGCGATGGG GCACATCGCG ATGTCGGGCG CGGACGGCGC GATCGCCGCG CTCGACGGCC TAGGCGTTGC GCAAAAGCTG TTCGTGCATA TCAACAATTC CAACCCGGCG CTGCTTTGCG GGTCCGCCGA GCGCAGCGCG GTGGAGCGCG CGGGTTGGCA AATTCCGGCC GACGGCACGG AGGTGACGCT GTGA
Protein sequence | MRVVVLGAAA GGGVPQWNCG CAICRAAFAD PALRRTQASI AVSADSVHWF LVNASPDLRQ QIIATPQLHP ATGALRHSPI AGVILSNGEV DAVAGLLSMR EGSPFAIYAH PKVLAILKAN SIFNVLDEKL VPRRSIEIDR PFEPMLPDAT PSGFEVVAFA VPGKGAWYLE GRAHPGGEAA DGDTLGLTIT DKATRQSIHV LTACARVTDD LKARLAGSPL LLFDGTVWRD DELIAAGLGA KTGQAMGHIA MSGADGAIAA LDGLGVAQKL FVHINNSNPA LLCGSAERSA VERAGWQIPA DGTEVTL
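The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG and ends with TGA, which is consistent with that) and uses the standard codon assignments, which translation table 11 shares for internal codons. Alternative start codons, which table 11 permits, are not handled. The helper names are hypothetical, and `gc_fraction` is only one plausible way to compute a GC percentage; how the record's GC content value was derived is not stated.

```python
# Codon table in NCBI order: bases T, C, A, G for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq: str) -> str:
    """Remove the spacing used in the record's 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon.

    Raises ValueError on any base other than A, C, G, or T.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence above.
prefix = "ATGCGAGTCG TCGTTCTCGG GGCGGCGGCC"
print(translate(prefix))  # MRVVVLGAAA
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.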