Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_4087 |
Symbol | |
ID | 4895017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009040 |
Strand | + |
Start bp | 26585 |
End bp | 28840 |
Gene Length | 2256 bp |
Protein Length | 751 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640110489 |
Product | lipopolysaccharide biosynthesis protein-like |
Protein accession | YP_001041801 |
Protein GI | 126464825 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3754] Lipopolysaccharide biosynthesis protein |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 60 |
Plasmid unclonability p-value | 0.885758 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 86 |
Fosmid unclonability p-value | 0.378221 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAAAT TTCACCGTCT GCGCCGGTTT GCCCGGATTC TGGTCCTGAC GGGCGAGGAG CGGCGCTATC TCCGCGCCAT CCGGAAAAGC GGGCTTTTCG ACCGGACCTA TTATCGGGGG GCCTACCCCG GGTTGAACCC GATCTACCTG AAATATCCCG AGAAACACTA CATCGCCTAT GGCGAGCGGC TGGGCTACCG GCCGAACCCG GACTTCTCGC CCCAGGCCTA TCTGCGCTAT CACCCCGACG TGGCGGAGGC CGGCGTGCCG CCCTTCCTGC ATTATGTCCG CGTGGGCCAT GCCGAGCAGC GCCTGACCAA GGAGCTGCCC GAGGTCGTGG CCCTGCCCGC CCGCGGCATG CCGCAGGTCC GTTTCGAGCA CGGGCGCCAG ACCGCGCCCT ATGCGGTGGC GGTGCATGTC TATTATCCCG ATCTCTGGCC CGAGTTCGCC GCCCGTCTGC GGCGGCTCCG CATCCCGTTC GATCTCTATG TCACGCTGAC CTATCGCGGC GAGGAGACCG ATGCGCTGGC CGAGGAGATC CGCGCCGACT TCCCCGGCGC CTTCGTGACC CCGATGCCGA ACCGCGGCCG CGACATCCTG CCCTTCGTCA CCCTGCTCAA TGCGGGCGCC TTCGACGGCT ACCGGGCGGT CTGCAAGTTC CACACGAAGA AATCGCCCCA CCGGCAGGAC GGCGATCTCT GGCGGAAGCA TCTGATCGAG GGGATCCTGC CCGAGACCGG GCTCGAGGAG AAGCTCGAGG CCTTCGTCGA GGCGCCCGAG GCGGGCTTCT GGGTGGCCGA CGGCCAGCAT TACACCGGCA CCCAATGGTG GGGCTCGAAC GTCGAGGCCA CGCGCCACCT GCTCCAGCGC ATCGAGATCC CGCTCGACCG CGAGGCGCTC TCCTTCCCGG CGGGCTCGAT CTACTGGGTG AAGCCCCTGG TGCTGGGGCT TCTGCGCAGC CTGCAGCTCC GGCTCGAGGA TTTCGACATC GAGGAGGGTC AGGTCGACGG CACCCTCGCC CATGCGATCG AGCGGGTGCT GGGCTATCTG ACCGCGCGGG CGGGCCAGAA GGTCCTGCAG ACGAGCGAGC TGCGCCCGGC CGCGGCGGCG GCGCCCGCGA AGCCCGCCTT CGTCAGCGCC TTCTACCTGC CCCAGTTCCA CCCCGTGCCC GAGAACGACG CCTGGTGGGG CAAGGGCTTC ACCGAATGGC GCTCGGTGGT GAAGGCGCCC TCGATGTTCG AGGGCCATCT TCAGCCGATG CTGCCCGCCG ATCTGGGCTT TTACGACCTG CGCGCCACCG AGGTGATGGG CGAGCAGGCG GCGATGGCCC GCGAGGCCGG GATCGACGCC TTCTGCGTCT ATCACTACTG GTTCGACGGC CGCCGCATCC TCGAGGCGCC GATCGACCGG CTGATGGCGC GGCCCGAGAT CGACTTCCCC TTCTATCTCT GCTGGGCCAA CGAGAGCTGG CGGCGCAACT GGGACGGGCT GTCGGGCACG GTGCTGCTCG AGCAGACCTA TGGCGCGGGC TTCGAGGAGA AGCTCGCCGC CGATACCGCC CCCTATCTGC GCGATCCGCG CTATGCCCGC CCCGACGGCC GCCGTCCGCG CTTCGTGATC TACCGTCCCG AGGACATGCC CGATCCGCAG GCCAGCGTGG CGCGGCTGCG CGAGGGCTGG CGGCGGGCGG GGATCGGCGA GGTCGAGCTC GGCGCGGTGC GGTTCCATGT CGAGGGCGCC CATCCGGTGC CCGAGGGGCT CTTCGACTTC TGGGTCGAGA TGCCGCCGCA CGGGCTGGTG AAGGGGCCGG ACTATCTCTT CGGCGGTCCC GACGGCAACC GGATGCCCGC CGCGATGAAC CCCGCCTTCT CGGGGCTGAT CTACGATTAC GCCGCCGTCG CGCGCCGGGC CCTGTCGGAG ACCTATGTCC GCACCCTGCC CAAGGCCACG ATCGCAGGCG TCATGCCGGG CTGGGACAAT ACGGCCCGGC GCGGGGCGGC GGGCCATGTG GCCTACGGCG CCAATCCCGC CACCTTCAAC GTCTGGCTCG CGGGCGCGCT CGAGCGCCGC GTGCCCGCCT CCTATCGCCG CGAGCTTTTC GTCAATGCCT GGAACGAATG GGCCGAGAAG GCCGTCCTCG AGCCGAGCCT GACCTTCGGC GATCTCAATC TCCAGGTGAT GCGGCAGCAT CTGGGAGCGG CGGAGCCCGC CACCCATCTT GCGGAGCCGC CCGCGCACGG CATGAGGTCA CACTGA
|
Protein sequence | MLKFHRLRRF ARILVLTGEE RRYLRAIRKS GLFDRTYYRG AYPGLNPIYL KYPEKHYIAY GERLGYRPNP DFSPQAYLRY HPDVAEAGVP PFLHYVRVGH AEQRLTKELP EVVALPARGM PQVRFEHGRQ TAPYAVAVHV YYPDLWPEFA ARLRRLRIPF DLYVTLTYRG EETDALAEEI RADFPGAFVT PMPNRGRDIL PFVTLLNAGA FDGYRAVCKF HTKKSPHRQD GDLWRKHLIE GILPETGLEE KLEAFVEAPE AGFWVADGQH YTGTQWWGSN VEATRHLLQR IEIPLDREAL SFPAGSIYWV KPLVLGLLRS LQLRLEDFDI EEGQVDGTLA HAIERVLGYL TARAGQKVLQ TSELRPAAAA APAKPAFVSA FYLPQFHPVP ENDAWWGKGF TEWRSVVKAP SMFEGHLQPM LPADLGFYDL RATEVMGEQA AMAREAGIDA FCVYHYWFDG RRILEAPIDR LMARPEIDFP FYLCWANESW RRNWDGLSGT VLLEQTYGAG FEEKLAADTA PYLRDPRYAR PDGRRPRFVI YRPEDMPDPQ ASVARLREGW RRAGIGEVEL GAVRFHVEGA HPVPEGLFDF WVEMPPHGLV KGPDYLFGGP DGNRMPAAMN PAFSGLIYDY AAVARRALSE TYVRTLPKAT IAGVMPGWDN TARRGAAGHV AYGANPATFN VWLAGALERR VPASYRRELF VNAWNEWAEK AVLEPSLTFG DLNLQVMRQH LGAAEPATHL AEPPAHGMRS H
|
| |