Gene Rsph17029_3692 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_3692 
Symbol 
ID4898338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009050 
Strand
Start bp801005 
End bp803215 
Gene Length2211 bp 
Protein Length736 aa 
Translation table11 
GC content71% 
IMG OID640114300 
Productlipopolysaccharide biosynthesis 
Protein accessionYP_001045554 
Protein GI126464441 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR01007] capsular exopolysaccharide family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.306041 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTCAGG CTCAGAAAGT CGCGTTGGCG CTTCGGTCGC AGCCGACGCC GATCCCGCTG 
CCGGTGCCCG CGCCGGCCCA GGAGGAATGG CTGGACGTTC AGGCGCTGTT CCGCATCATC
CGCCGCAGGC TGCCGGTGGC GGCCCTCGTC TTCGTGGGGC TGATGGCCCT GCTGACGGGA
CCGATCCTCG ACATGAAGCG CAGCTTCACC GCGCAGGCGC GGGTGCTGAT GCGCGATCCG
CCGAGCGCGG GCCTCGGGGC CGTCGACGGG GCCGAGCAGA AGCCGCTGAA CCTCAGCACC
GAGATCGAGC GCTTCGTGTC GCGCGACATC AGCGCCGAGG TCATCCGCGA GGTCGGCCTC
GACAAGCTGC CGGAATTCAA CGCGGCCCTG CGCGAGCCCT CGCTGTCCCG GCAGGCGATC
AATGCCGTGC GGAGCTGGTT CGACGCCGAT CCGCCCGTTC GTGCGACCCG GCGCGACGAC
CTGCGGCTCG TCATCCCGGC CTACATGGCC CGGCTCACGG TGTTCCAGAA GGGCAACTCC
GACGTCGTCA ACATCGGCTT CAGCTCCGAG GATGCGTCGA TTGCCGCGGC GGTGCCGAAC
GCGGTCATCC GCACCTACCT GAAGGCGCGC GAGCGCCAGC ATCAGGGCGA GCTGGAGGCC
AACCTGCGCT GGCTCGCGGT GCGGATCGAA GAGCAGAGCC AGCGGCTCAA TGCGGCGCTC
GAGGCCGTGG CCACCCGGCG GAACCAGCCC GACCTCTCCT CGCCGAGCGC CCTCGGCATC
GACACCGCCA TCGCGAGCCT CAGCGAACGG CGCATCGCCA TCCGCCACGA CATCGGAGCG
GCAGAGCGCA GCCTCGAGGA TCTGAAGGCC GACGGCGTGG TGGTCGGCCA ATCCGGCGAC
AGCGGGCCCG AGGCCAAGCC GCAGCTCGGC ATCCAGCTCG AGGCCGCGCG GGCCGAGCTG
GAGCGGCTCC AGACCCAGTT CGGCGAGAAC CATTCGAAGG TCCGCGACGT CCGCGAGCGG
ATCGCCGAGA TCGAGAGCCA GATGCGCTTC GAGGTCTCGT CCGAGATCCT GGCCCTCACG
CGGCGCATCT CGTCGCTGAA GGCGGAGGAG GCCGCGAACC TCGAAGAGCT CGAGAGGGCG
CGCGACACGC TCGCCCGGCA GAAGGAGGCG CAGGCCGAGC TCGCCCGCCT CGAGAACGAG
GCGAGCCAGG AACAGCTTGC GCTGGCGGCG GCGCTGCAGC AGCAGCGGCT CCTCCTCTCC
AGCTCGCGGC AGAACGTGAC GGAGGTGTCG GTGCTGACGC CCGCCTCGGT GCCGCTGAAC
GCGGACGGGC GCGGCAAAGC CTTCTACCTC GTCGCCGCCA TGATCGGCAG CGCCATCGCG
GCGGTGACGG CCGTATTCGC GCTCGAAATC CTCGACACCA AGGTCCGCAG CGCCGAGCAT
CTGCGCCGCA TCCGCCGCGT GGTGCCGACG GGGATCGTGC CGCAGCTGCC CCGCAGCCGC
GGCGCGCCGA CCGGCCCGCT CGGGTGGTGG CAGCCCGAAG GCGTGTTTGC CGATGCGGTC
CGGGCGGTGG TCATCAGCCT CAGTCACGCC CGGCGGCAGC ATCTGGGCAA CATCCTCGTG
AGTTCCGCCC TGCCGGGCGA GGGCAAGACC ACCGTCGCCG CCGCCCTCGC GGCCGAGATG
GCGGCCTCGG GCCAGAAGGT CCTGCTGGTG GATGCGGATC TGCGGCAGGG CAACATGCAT
CGGCTCTTCG GGCTGGAGCC GGGGTTCGGC CTCTCGGATT ATCTCCGGGG CGCGCAGCCG
CTCTCCGAGG TGATCCGCCA CGAGGTGGCC CCCGGCATCG ACCTCCTGCC CTGCGGCAGC
CAGCTCGGCG CGGCCCGCCT TGACCGGCAG AAGATGATGG CGCTGCTGCA GATGGCCCGC
GACGCCGGCC AGATCGTGAT CCTCGACACG CCGCCCGCGC TCGCCACCGT CGATACGGCG
AGCCTTGCCG ATCTGGTCGA GACGGCGCTC CTCGTCGTCG AATGGGGCCG GACGGATCCC
GATGCGGTCG AGGCGGCGGT CCAGCGGCTG ACGCTCGGGC GCGAGGGCGA TGTCTTCGCG
GTCATCAACC GGGTGAACCT GCAACGGCAG GCCCTCTATG GCTTCCGCGA CGGCGGGCCC
CTCGCCCGGA CCCTGAGCAG CTTCCACCGC GGCGCAGGCC GCGCGCGCTG A
 
Protein sequence
MSQAQKVALA LRSQPTPIPL PVPAPAQEEW LDVQALFRII RRRLPVAALV FVGLMALLTG 
PILDMKRSFT AQARVLMRDP PSAGLGAVDG AEQKPLNLST EIERFVSRDI SAEVIREVGL
DKLPEFNAAL REPSLSRQAI NAVRSWFDAD PPVRATRRDD LRLVIPAYMA RLTVFQKGNS
DVVNIGFSSE DASIAAAVPN AVIRTYLKAR ERQHQGELEA NLRWLAVRIE EQSQRLNAAL
EAVATRRNQP DLSSPSALGI DTAIASLSER RIAIRHDIGA AERSLEDLKA DGVVVGQSGD
SGPEAKPQLG IQLEAARAEL ERLQTQFGEN HSKVRDVRER IAEIESQMRF EVSSEILALT
RRISSLKAEE AANLEELERA RDTLARQKEA QAELARLENE ASQEQLALAA ALQQQRLLLS
SSRQNVTEVS VLTPASVPLN ADGRGKAFYL VAAMIGSAIA AVTAVFALEI LDTKVRSAEH
LRRIRRVVPT GIVPQLPRSR GAPTGPLGWW QPEGVFADAV RAVVISLSHA RRQHLGNILV
SSALPGEGKT TVAAALAAEM AASGQKVLLV DADLRQGNMH RLFGLEPGFG LSDYLRGAQP
LSEVIRHEVA PGIDLLPCGS QLGAARLDRQ KMMALLQMAR DAGQIVILDT PPALATVDTA
SLADLVETAL LVVEWGRTDP DAVEAAVQRL TLGREGDVFA VINRVNLQRQ ALYGFRDGGP
LARTLSSFHR GAGRAR