Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rpal_4222 |
Symbol | |
ID | 6411906 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 4527827 |
End bp | 4528843 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642714104 |
Product | Luciferase-like monooxygenase |
Protein accession | YP_001993193 |
Protein GI | 192292588 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03558] luciferase family oxidoreductase, group 1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.504496 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGCCAC TCTCGATCCT CGATCTGTCC GTCGTCACCA CCGGCACACC GCCGGCGGCG TCATTGCGCA ACTCGATCGA CCTTGCGCGC CACGCGGATT CGCTGGGCTA TGTGCGCTAC TGGCTGGCCG AGCATCACAA TCTTCCCTCG GTCGCCAGTC CCGCGCCCGA AATCATGATT GGGCAGATCG CGGCGGTGAC CGAACGCATT CGCGTCGGCT CCGGTGGCGT GATGCTGCCG AACCACGCGC CGCTGGTCGT GGCGGAACGC TTCAAGATGC TTGAAGCGCT GTTCCCCGGT CGGATCGATC TCGGTATCGG CCGCGCGCCG GGCACCGACC AGGCGACGAT GCACGCGTTG CGCCGCCGGC TCGATGGCCG TGAGGGCGAC GATTTTCTGG AGCGGCTGCA AGAGCTGACG CTGTGGGAGA CGCGCGGCTT TCCGCCGGGC CATCCGTACA ACAACGTCGT CGCGATGCCC GACGATACGC CGCTGCCGCC GATCTGGCTG CTCGGCTCCA GCGACTACAG TTCCGAGCTC GCCGCGCAGG TCGGTATGGG CTTCGCGTTC GCGCATCACT TCGCTTCGCA CGATGCGGTC GAGGCACTGA CGCATTATCG CAGCAACTTC CGTCCGACGC GCTGGCGATC GACGCCACAC GGTATTTTGG CCGTCGCCGC CGTTGTCGCC GACACCGATG AGGAAGCGGA GCGGCTAGCG AGTTCGATGG ATCTCAGTCG CCTGTTACGC GACCGTGGCC GCTACGTGCC GCTGCCAAGT GTCGAGGAAG CTCTGTCTTA TTCTTACACT GAAGCCGACC GTGCCTCGAT CGCGCGCAAT CGCTCGCGGC TGTTCGTCGG CAGTCCGGCG ACGGTTCGGC AGGCGCTGCA GCCGCTGATT ACCGCAAGCC GCGCCGACGA GCTGATGGTG ATCACCGCGG TGTATGACCA CGATGCGCGC AAGCGGTCCT ACAGCTTGCT GGCCGACGCG TTCGAGCTGC AGAAGGTGGC AGCTTAG
|
Protein sequence | MLPLSILDLS VVTTGTPPAA SLRNSIDLAR HADSLGYVRY WLAEHHNLPS VASPAPEIMI GQIAAVTERI RVGSGGVMLP NHAPLVVAER FKMLEALFPG RIDLGIGRAP GTDQATMHAL RRRLDGREGD DFLERLQELT LWETRGFPPG HPYNNVVAMP DDTPLPPIWL LGSSDYSSEL AAQVGMGFAF AHHFASHDAV EALTHYRSNF RPTRWRSTPH GILAVAAVVA DTDEEAERLA SSMDLSRLLR DRGRYVPLPS VEEALSYSYT EADRASIARN RSRLFVGSPA TVRQALQPLI TASRADELMV ITAVYDHDAR KRSYSLLADA FELQKVAA
|
| |