Gene Information |
Locus tag | RPB_1766 |
Symbol | |
ID | 3909753 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2024046 |
End bp | 2025062 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637883660 |
Product | luciferase-like |
Protein accession | YP_485385 |
Protein GI | 86748889 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03558] luciferase family oxidoreductase, group 1 |
|
|
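The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal Python sketch of that relationship. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the terminal stop codon. The function names `gene_length` and `protein_length` are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(2024046, 2025062)
print(length_bp, protein_length(length_bp))  # prints "1017 338"
```

Under these assumptions the computed values agree with the Gene Length (1017 bp) and Protein Length (338 aa) fields listed above.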
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0136597 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTACCGC TCTCGATCCT CGACCTGTCC GTTGTCACCA CCGGCACCCC GCCGGCGGCG GCGTTGCGCA ATTCGATCGA CCTCGCCCGG CATGCCGACA AGCTCGGCTA TGTCCGCTAC TGGCTGGCCG AACATCACAA TCTGCCGTCC GTCGCCAGTC CCGCGCCCGA AATCATGATC GGCCAGATCG CGGCGGTGAC CGAGCGCATC CGTGTCGGCT CCGGCGGCGT GATGCTGCCG AACCACGCGC CGCTGATGGT CGCCGAGCGC TTCAAGATGC TGGAGGCGCT GTTCCCCGGC CGCATCGATC TCGGCATCGG CCGCGCGCCC GGCACCGATC AGGCGACGAT GCATGCGCTG CGCCGCAGGC TCGATGTCCG CGAGGGCGAC GATTTCCTCG AGCGGCTGCA GGAACTGATG CTGTGGGAGA CCCGCGGCTT TCCGCCCGGC CATCCCTACA ACAACGTCAT GGCGATGCCG AACGACGCGC CGCTGCCGCC GGTCTGGCTG CTCGGCTCCA GCGACTACAG TTCGGAACTG TCGGCCCAGG TCGGCATGGG CTTCGCGTTC GCGCATCATT TCGCGTCCTA CGACGCGGCC GAGGCGCTGA CGCATTATCG CGCCGCTTTT CGGCCGACAC GCTGGCGCAG CACGCCGCAC GGCATCCTCG CGGTGGCGGC GGTGATCGCC GAGACCGACG AGGAAGCCGA GCGTCTGGCG ATCTCGATGG ACATCAACCG GCTGCGCCGC GACCGCGGCC AATACGTGCC GCTGCCGAGC GTCGAGGAGG CGCAGGCCTA TCCCTTGACC GACGCCGACC GCGCCTCGAT CGCGCGCAAT CGCTCGCGGC TGTTCGTCGG CAGCCCGTCG ACGGTGCTGC AGGCGCTGCA GCCGCTGATC CGCGCCAGTC AGGCCGACGA ACTGATGGTG ATCACCGCCA CCTACGATCA CGACGCGCGC AAGCGCAGCT ACACGCTGCT CGCGGATGCG TTCGAGCGAC ACAAGGCAGC GGCGTAG
|
Protein sequence | MLPLSILDLS VVTTGTPPAA ALRNSIDLAR HADKLGYVRY WLAEHHNLPS VASPAPEIMI GQIAAVTERI RVGSGGVMLP NHAPLMVAER FKMLEALFPG RIDLGIGRAP GTDQATMHAL RRRLDVREGD DFLERLQELM LWETRGFPPG HPYNNVMAMP NDAPLPPVWL LGSSDYSSEL SAQVGMGFAF AHHFASYDAA EALTHYRAAF RPTRWRSTPH GILAVAAVIA ETDEEAERLA ISMDINRLRR DRGQYVPLPS VEEAQAYPLT DADRASIARN RSRLFVGSPS TVLQALQPLI RASQADELMV ITATYDHDAR KRSYTLLADA FERHKAAA
|
| |
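The gene and protein sequences in this record can be related through translation table 11, and the GC content field can be recomputed from the gene sequence. The following is a minimal Python sketch under stated assumptions: the sequence is given on the coding strand, the space-separated 10-base blocks are a display convention only, and the first codon is ATG (table 11 also allows alternative start codons, which this sketch does not handle). The helper names `clean`, `translate`, and `gc_percent` are illustrative, not part of any database API.

```python
from itertools import product

# Codon-to-amino-acid mapping shared by the standard code and table 11
# (the two tables differ in permitted start codons, not in these assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (e.g. the 10-base block spacing) and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: not part of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence in this record.
print(translate("ATGTTACCGC TCTCGATCCT CGACCTGTCC"))  # prints "MLPLSILDLS"
```

Passing the full Gene sequence value to `translate` and `gc_percent` allows a comparison with the Protein sequence and GC content fields of this record.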