Gene Rsph17029_4087 details

Gene Information

Locus tag: Rsph17029_4087
Symbol:
ID: 4895017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides ATCC 17029
Kingdom: Bacteria
Replicon accession: NC_009040
Strand:
Start bp: 26585
End bp: 28840
Gene Length: 2256 bp
Protein Length: 751 aa
Translation table: 11
GC content: 69%
IMG OID: 640110489
Product: lipopolysaccharide biosynthesis protein-like
Protein accession: YP_001041801
Protein GI: 126464825
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3754] Lipopolysaccharide biosynthesis protein
TIGRFAM ID:
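
The Start bp, End bp, Gene Length and Protein Length values listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information entries above.
START_BP = 26585
END_BP = 28840
GENE_LENGTH_BP = 2256
PROTEIN_LENGTH_AA = 751


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 28840 - 26585 + 1 = 2256 bp and 2256 / 3 - 1 = 751 aa, which agrees with the reported lengths.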


Plasmid Coverage information

Num covering plasmid clones: 60
Plasmid unclonability p-value: 0.885758
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 86
Fosmid unclonability p-value: 0.378221
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGAAAT TTCACCGTCT GCGCCGGTTT GCCCGGATTC TGGTCCTGAC GGGCGAGGAG 
CGGCGCTATC TCCGCGCCAT CCGGAAAAGC GGGCTTTTCG ACCGGACCTA TTATCGGGGG
GCCTACCCCG GGTTGAACCC GATCTACCTG AAATATCCCG AGAAACACTA CATCGCCTAT
GGCGAGCGGC TGGGCTACCG GCCGAACCCG GACTTCTCGC CCCAGGCCTA TCTGCGCTAT
CACCCCGACG TGGCGGAGGC CGGCGTGCCG CCCTTCCTGC ATTATGTCCG CGTGGGCCAT
GCCGAGCAGC GCCTGACCAA GGAGCTGCCC GAGGTCGTGG CCCTGCCCGC CCGCGGCATG
CCGCAGGTCC GTTTCGAGCA CGGGCGCCAG ACCGCGCCCT ATGCGGTGGC GGTGCATGTC
TATTATCCCG ATCTCTGGCC CGAGTTCGCC GCCCGTCTGC GGCGGCTCCG CATCCCGTTC
GATCTCTATG TCACGCTGAC CTATCGCGGC GAGGAGACCG ATGCGCTGGC CGAGGAGATC
CGCGCCGACT TCCCCGGCGC CTTCGTGACC CCGATGCCGA ACCGCGGCCG CGACATCCTG
CCCTTCGTCA CCCTGCTCAA TGCGGGCGCC TTCGACGGCT ACCGGGCGGT CTGCAAGTTC
CACACGAAGA AATCGCCCCA CCGGCAGGAC GGCGATCTCT GGCGGAAGCA TCTGATCGAG
GGGATCCTGC CCGAGACCGG GCTCGAGGAG AAGCTCGAGG CCTTCGTCGA GGCGCCCGAG
GCGGGCTTCT GGGTGGCCGA CGGCCAGCAT TACACCGGCA CCCAATGGTG GGGCTCGAAC
GTCGAGGCCA CGCGCCACCT GCTCCAGCGC ATCGAGATCC CGCTCGACCG CGAGGCGCTC
TCCTTCCCGG CGGGCTCGAT CTACTGGGTG AAGCCCCTGG TGCTGGGGCT TCTGCGCAGC
CTGCAGCTCC GGCTCGAGGA TTTCGACATC GAGGAGGGTC AGGTCGACGG CACCCTCGCC
CATGCGATCG AGCGGGTGCT GGGCTATCTG ACCGCGCGGG CGGGCCAGAA GGTCCTGCAG
ACGAGCGAGC TGCGCCCGGC CGCGGCGGCG GCGCCCGCGA AGCCCGCCTT CGTCAGCGCC
TTCTACCTGC CCCAGTTCCA CCCCGTGCCC GAGAACGACG CCTGGTGGGG CAAGGGCTTC
ACCGAATGGC GCTCGGTGGT GAAGGCGCCC TCGATGTTCG AGGGCCATCT TCAGCCGATG
CTGCCCGCCG ATCTGGGCTT TTACGACCTG CGCGCCACCG AGGTGATGGG CGAGCAGGCG
GCGATGGCCC GCGAGGCCGG GATCGACGCC TTCTGCGTCT ATCACTACTG GTTCGACGGC
CGCCGCATCC TCGAGGCGCC GATCGACCGG CTGATGGCGC GGCCCGAGAT CGACTTCCCC
TTCTATCTCT GCTGGGCCAA CGAGAGCTGG CGGCGCAACT GGGACGGGCT GTCGGGCACG
GTGCTGCTCG AGCAGACCTA TGGCGCGGGC TTCGAGGAGA AGCTCGCCGC CGATACCGCC
CCCTATCTGC GCGATCCGCG CTATGCCCGC CCCGACGGCC GCCGTCCGCG CTTCGTGATC
TACCGTCCCG AGGACATGCC CGATCCGCAG GCCAGCGTGG CGCGGCTGCG CGAGGGCTGG
CGGCGGGCGG GGATCGGCGA GGTCGAGCTC GGCGCGGTGC GGTTCCATGT CGAGGGCGCC
CATCCGGTGC CCGAGGGGCT CTTCGACTTC TGGGTCGAGA TGCCGCCGCA CGGGCTGGTG
AAGGGGCCGG ACTATCTCTT CGGCGGTCCC GACGGCAACC GGATGCCCGC CGCGATGAAC
CCCGCCTTCT CGGGGCTGAT CTACGATTAC GCCGCCGTCG CGCGCCGGGC CCTGTCGGAG
ACCTATGTCC GCACCCTGCC CAAGGCCACG ATCGCAGGCG TCATGCCGGG CTGGGACAAT
ACGGCCCGGC GCGGGGCGGC GGGCCATGTG GCCTACGGCG CCAATCCCGC CACCTTCAAC
GTCTGGCTCG CGGGCGCGCT CGAGCGCCGC GTGCCCGCCT CCTATCGCCG CGAGCTTTTC
GTCAATGCCT GGAACGAATG GGCCGAGAAG GCCGTCCTCG AGCCGAGCCT GACCTTCGGC
GATCTCAATC TCCAGGTGAT GCGGCAGCAT CTGGGAGCGG CGGAGCCCGC CACCCATCTT
GCGGAGCCGC CCGCGCACGG CATGAGGTCA CACTGA
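
The GC content reported for this gene can be recomputed from the sequence block above. This is a minimal sketch, assuming the sequence is pasted as text in 10-base groups separated by whitespace, as shown here; `gc_percent` is a hypothetical helper name.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # First line of the gene sequence only (60 bases); the record's
    # GC content value refers to the full 2256 bp gene, not this line.
    first_line = (
        "ATGCTGAAAT TTCACCGTCT GCGCCGGTTT GCCCGGATTC TGGTCCTGAC GGGCGAGGAG"
    )
    print(gc_percent(first_line))
```

Passing the complete gene sequence to `gc_percent` gives a figure that can be compared with the rounded GC content listed in the gene information.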
 
Protein sequence
MLKFHRLRRF ARILVLTGEE RRYLRAIRKS GLFDRTYYRG AYPGLNPIYL KYPEKHYIAY 
GERLGYRPNP DFSPQAYLRY HPDVAEAGVP PFLHYVRVGH AEQRLTKELP EVVALPARGM
PQVRFEHGRQ TAPYAVAVHV YYPDLWPEFA ARLRRLRIPF DLYVTLTYRG EETDALAEEI
RADFPGAFVT PMPNRGRDIL PFVTLLNAGA FDGYRAVCKF HTKKSPHRQD GDLWRKHLIE
GILPETGLEE KLEAFVEAPE AGFWVADGQH YTGTQWWGSN VEATRHLLQR IEIPLDREAL
SFPAGSIYWV KPLVLGLLRS LQLRLEDFDI EEGQVDGTLA HAIERVLGYL TARAGQKVLQ
TSELRPAAAA APAKPAFVSA FYLPQFHPVP ENDAWWGKGF TEWRSVVKAP SMFEGHLQPM
LPADLGFYDL RATEVMGEQA AMAREAGIDA FCVYHYWFDG RRILEAPIDR LMARPEIDFP
FYLCWANESW RRNWDGLSGT VLLEQTYGAG FEEKLAADTA PYLRDPRYAR PDGRRPRFVI
YRPEDMPDPQ ASVARLREGW RRAGIGEVEL GAVRFHVEGA HPVPEGLFDF WVEMPPHGLV
KGPDYLFGGP DGNRMPAAMN PAFSGLIYDY AAVARRALSE TYVRTLPKAT IAGVMPGWDN
TARRGAAGHV AYGANPATFN VWLAGALERR VPASYRRELF VNAWNEWAEK AVLEPSLTFG
DLNLQVMRQH LGAAEPATHL AEPPAHGMRS H
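
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own method: it builds the standard codon assignments, which translation table 11 shares for elongation codons, and it does not model the alternative start codons that table 11 permits (this gene starts with ATG, so that does not matter here). The `translate` function is a hypothetical helper.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino acid assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE.get(bases[i:i + 3], "X")  # X marks unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases and last 36 bases of the gene sequence shown above.
    print(translate("ATGCTGAAAT TTCACCGTCT GCGCCGGTTT"))
    print(translate("GCGGAGCCGC CCGCGCACGG CATGAGGTCA CACTGA"))
```

The two fragments translate to `MLKFHRLRRF` and `AEPPAHGMRSH`, matching the first and last residues of the protein sequence listed above.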