Gene Rsph17029_0214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_0214 
Symbol 
ID4895538 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009049 
Strand
Start bp232953 
End bp233936 
Gene Length984 bp 
Protein Length327 aa 
Translation table11 
GC content72% 
IMG OID640110797 
Productluciferase family protein 
Protein accessionYP_001042105 
Protein GI126460991 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR03558] luciferase family oxidoreductase, group 1 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.527708 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCCCT ATTCCGTTCT CGACCTCGCT CCGGTGCCGG AAGGCGCCAC CACGTCCGAG 
GCTCTCGCCC AGACCGTCAC CCTCGCCCGC CATGCCGAAG CCCTGGGCTT TCACCGCTAC
TGGCTGGCCG AGCATCATGC CATGCCGGGC ATCGCCTCGG CGGCGACGGC GGTTCTCATC
GGCCATGTCG CGGCCCACAC CCAGCGCATC CGCGTGGGGT CGGGCGGGAT CATGCTGCCC
AACCACGCCC CGCTGATGGT GGCCGAAGCG TTCGGCACTC TGGCCGAGCT GCATCCGGGC
CGGATCGATC TGGGCCTCGG CCGGGCCCCC GGCACCGACG GGCGCACCGC GCAGGCGCTG
CGGCGCAACC TCGATGTCAC CGACAGTTTC CCCGCCGACG TGCTGGAGCT TCTGCGCTAC
TTCGGCGAGC CGGAGCCGGG CGCGATCCAG GCCATCCCCG GCCAGGGCAC CCGCGTGCCG
CTGTGGATCC TCGGCTCGAG CCTCTATGGC GCGCAGCTCG CGGCCCATTT CGGCCTGCCC
TACGCCTTCG CCTCGCATTT CGCCCCGCCC GCTCTCGAAG CGGCACTGGC CGCCTACCGG
CAGGGCTTCC GCCCGTCGAC GCAACTCGAC AGGCCCCGCG CCATGGTGGC GATCAACGTC
TTTGCCGGGG CCGACGATGC CGAGGGCCGC TACCTCCGCT CCTCCGCCCA GCTGGCCTTC
GCCAATCTGC GCCTGGGCCG CCCCGGAAAG CTGCCGCGCC CGGTCGAGGA TATCTCGGCC
CATGTCGATC CCGGCATGCT CCGGACGGTC GATCAGGCGC TGTCGGTCTC GGCCACCGGC
GGGCCCGAGA CCGTGCGGCG CGAGCTGGCG GCCCTTCTCG AGCGGCACCG GCCCGACGAG
GTGATCCTCA CCGGGCAGAT CCACGATCCC GCGGCGCGGC TGCGCTCCTT CACCATTGCG
GCGGAGGCTC TGGCCAGCCT CTGA
 
Protein sequence
MIPYSVLDLA PVPEGATTSE ALAQTVTLAR HAEALGFHRY WLAEHHAMPG IASAATAVLI 
GHVAAHTQRI RVGSGGIMLP NHAPLMVAEA FGTLAELHPG RIDLGLGRAP GTDGRTAQAL
RRNLDVTDSF PADVLELLRY FGEPEPGAIQ AIPGQGTRVP LWILGSSLYG AQLAAHFGLP
YAFASHFAPP ALEAALAAYR QGFRPSTQLD RPRAMVAINV FAGADDAEGR YLRSSAQLAF
ANLRLGRPGK LPRPVEDISA HVDPGMLRTV DQALSVSATG GPETVRRELA ALLERHRPDE
VILTGQIHDP AARLRSFTIA AEALASL