Gene Rpal_4222 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4222 
Symbol 
ID6411906 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4527827 
End bp4528843 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content65% 
IMG OID642714104 
ProductLuciferase-like monooxygenase 
Protein accessionYP_001993193 
Protein GI192292588 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR03558] luciferase family oxidoreductase, group 1 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.504496 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGCCAC TCTCGATCCT CGATCTGTCC GTCGTCACCA CCGGCACACC GCCGGCGGCG 
TCATTGCGCA ACTCGATCGA CCTTGCGCGC CACGCGGATT CGCTGGGCTA TGTGCGCTAC
TGGCTGGCCG AGCATCACAA TCTTCCCTCG GTCGCCAGTC CCGCGCCCGA AATCATGATT
GGGCAGATCG CGGCGGTGAC CGAACGCATT CGCGTCGGCT CCGGTGGCGT GATGCTGCCG
AACCACGCGC CGCTGGTCGT GGCGGAACGC TTCAAGATGC TTGAAGCGCT GTTCCCCGGT
CGGATCGATC TCGGTATCGG CCGCGCGCCG GGCACCGACC AGGCGACGAT GCACGCGTTG
CGCCGCCGGC TCGATGGCCG TGAGGGCGAC GATTTTCTGG AGCGGCTGCA AGAGCTGACG
CTGTGGGAGA CGCGCGGCTT TCCGCCGGGC CATCCGTACA ACAACGTCGT CGCGATGCCC
GACGATACGC CGCTGCCGCC GATCTGGCTG CTCGGCTCCA GCGACTACAG TTCCGAGCTC
GCCGCGCAGG TCGGTATGGG CTTCGCGTTC GCGCATCACT TCGCTTCGCA CGATGCGGTC
GAGGCACTGA CGCATTATCG CAGCAACTTC CGTCCGACGC GCTGGCGATC GACGCCACAC
GGTATTTTGG CCGTCGCCGC CGTTGTCGCC GACACCGATG AGGAAGCGGA GCGGCTAGCG
AGTTCGATGG ATCTCAGTCG CCTGTTACGC GACCGTGGCC GCTACGTGCC GCTGCCAAGT
GTCGAGGAAG CTCTGTCTTA TTCTTACACT GAAGCCGACC GTGCCTCGAT CGCGCGCAAT
CGCTCGCGGC TGTTCGTCGG CAGTCCGGCG ACGGTTCGGC AGGCGCTGCA GCCGCTGATT
ACCGCAAGCC GCGCCGACGA GCTGATGGTG ATCACCGCGG TGTATGACCA CGATGCGCGC
AAGCGGTCCT ACAGCTTGCT GGCCGACGCG TTCGAGCTGC AGAAGGTGGC AGCTTAG
 
Protein sequence
MLPLSILDLS VVTTGTPPAA SLRNSIDLAR HADSLGYVRY WLAEHHNLPS VASPAPEIMI 
GQIAAVTERI RVGSGGVMLP NHAPLVVAER FKMLEALFPG RIDLGIGRAP GTDQATMHAL
RRRLDGREGD DFLERLQELT LWETRGFPPG HPYNNVVAMP DDTPLPPIWL LGSSDYSSEL
AAQVGMGFAF AHHFASHDAV EALTHYRSNF RPTRWRSTPH GILAVAAVVA DTDEEAERLA
SSMDLSRLLR DRGRYVPLPS VEEALSYSYT EADRASIARN RSRLFVGSPA TVRQALQPLI
TASRADELMV ITAVYDHDAR KRSYSLLADA FELQKVAA