Gene Rsph17029_4089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_4089 
Symbol 
ID4895000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009040 
Strand
Start bp29991 
End bp31349 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content73% 
IMG OID640110491 
Producthypothetical protein 
Protein accessionYP_001041803 
Protein GI126464827 
COG category[S] Function unknown 
COG ID[COG3551] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones96 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones84 
Fosmid unclonability p-value0.180938 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTCCG ACCAGACCCC TGCCCGCCCA GCCCTTCTCG TCCTCGGAAT GCACCGCTCC 
GGCACCTCGG CCCTCGCGGG CGTGCTCGGC CGGGCGGGCT TCGCGCTGCC GCAGGAACTG
ATGCCTCCGA CCGAGCACAA TCCGCGGGGC TATTTCGAAT CCACCCGGAT CTTCCGGCTG
AACGATGCGC TTCTGGCCGC GGCGGGCTCC TCCTGGGACG ACTGGCGGGT CTTCGACGCG
GACTGGCACC TCTCGCCCGC GGCCGAGCCG TTCCATGCCG AGGCGCAGGA GGCGCTCGCG
GCGGAATTTC CCGGCACGGC GCCGATCGTG CTCAAGGATC CGCGGATCTG CCGGCTGCTG
CCCTTCTGGA CCCGCGCGCT GACCGAGGCG GGCTTCCGGC CGCTGGCCGT CTGCACCCAC
CGCCCCGCGC GCGAGGTGGG CGCCTCGCTC GCGCGCCGCA ACGGCTGGCC CGAGGCGCGC
GGCCTCCTGC TCTGGCTGCG CCATGTGCTC GATGCCGAGG CCCAGACCCG CGGCAGGCCC
CGGGTCTTCG TCTCCTACGA CGGGCTGCTC GCGGACTGGC GGGGAACGCT CGGGCGCATC
GCGGAGGCCT TCGATCTGGC GCTCCCGCGC CCGCTCGACG AGGCCGCGCC CGAGATCGAG
GCCTTCCTCT CGGCCGACCT GCGCCATGCG CCGGAGACGC CCGCGGCCGC GGCGGGCCTG
TCCGACTGGA TCGCCCGCCC CGAGGAGATC CTCGACCGGA AGGCCGCCGG AGAGGACCGC
CCCGGAGACC GCGAGACGCT CGACCGGATC GCGGCCGAAG TCGCCGCCGC GGCCCCCCTG
CTGGCAGACC TCTCCGGAGC GGTGGAGGAA CAGGGCGCCC GGCTGGAGCG CGAGGCGGCC
CTGCGCCACG AGGCCCAGAC CATCCTGCAG CAGGAGAGGC AGCGGCTCGA CGACCTGACG
GCCGAGCTGC AGCTCCAGCT CCATCACAGG ACCCTCCATG TCGCGGAACT GGAGCGTCAT
GCGGGGGAGC TGGCCCAGCA GCTCAGGCAG AAGACGCAGC ACGAGGCCGA ACTGGAGCGC
CATGCGGAAG AGCTCGCCCA GCAGCTCCGG CAGCAGAGGT CGCACGCGGC CGAGCTCGAG
CGCCATGCCG GAGAGCTCTC GGCCCTGACC CACGAGCTGC GCCAGCAGGT GCATCACAAG
GGCCGGCATG TCCAGGAACT GGAAGCCCAC TCGGGAGACC TCGAAGCGCG GCTTGTCGCT
CTCGAGGCCG AGCATGCGGC TCTTCTGGGC AGCACCTCCT GGAAGGTCAC GCACCCCCTG
CGCCGCATGT CGCTGGCCTT GCGTCGTCCG AAGACGTGA
 
Protein sequence
MTSDQTPARP ALLVLGMHRS GTSALAGVLG RAGFALPQEL MPPTEHNPRG YFESTRIFRL 
NDALLAAAGS SWDDWRVFDA DWHLSPAAEP FHAEAQEALA AEFPGTAPIV LKDPRICRLL
PFWTRALTEA GFRPLAVCTH RPAREVGASL ARRNGWPEAR GLLLWLRHVL DAEAQTRGRP
RVFVSYDGLL ADWRGTLGRI AEAFDLALPR PLDEAAPEIE AFLSADLRHA PETPAAAAGL
SDWIARPEEI LDRKAAGEDR PGDRETLDRI AAEVAAAAPL LADLSGAVEE QGARLEREAA
LRHEAQTILQ QERQRLDDLT AELQLQLHHR TLHVAELERH AGELAQQLRQ KTQHEAELER
HAEELAQQLR QQRSHAAELE RHAGELSALT HELRQQVHHK GRHVQELEAH SGDLEARLVA
LEAEHAALLG STSWKVTHPL RRMSLALRRP KT