Gene Information |
Locus tag | Rsph17029_3692 |
Symbol | |
ID | 4898338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009050 |
Strand | - |
Start bp | 801005 |
End bp | 803215 |
Gene Length | 2211 bp |
Protein Length | 736 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640114300 |
Product | lipopolysaccharide biosynthesis |
Protein accession | YP_001045554 |
Protein GI | 126464441 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR01007] capsular exopolysaccharide family |
|
|
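The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below is illustrative only: the variable names are hypothetical, and it assumes that Start bp and End bp are inclusive 1-based positions and that the last codon of the gene is a stop codon that is not counted in Protein Length.

```python
# Values copied from the Gene Information table above.
start_bp = 801005
end_bp = 803215

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codons = gene_length_bp // 3
protein_length_aa = codons - 1

print(gene_length_bp)      # 2211, matching "Gene Length"
print(protein_length_aa)   # 736, matching "Protein Length"
```

Because the gene lies on the minus strand, the listed gene sequence would be the reverse complement of the replicon between these coordinates; the length arithmetic is the same either way.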
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.306041 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTCAGG CTCAGAAAGT CGCGTTGGCG CTTCGGTCGC AGCCGACGCC GATCCCGCTG CCGGTGCCCG CGCCGGCCCA GGAGGAATGG CTGGACGTTC AGGCGCTGTT CCGCATCATC CGCCGCAGGC TGCCGGTGGC GGCCCTCGTC TTCGTGGGGC TGATGGCCCT GCTGACGGGA CCGATCCTCG ACATGAAGCG CAGCTTCACC GCGCAGGCGC GGGTGCTGAT GCGCGATCCG CCGAGCGCGG GCCTCGGGGC CGTCGACGGG GCCGAGCAGA AGCCGCTGAA CCTCAGCACC GAGATCGAGC GCTTCGTGTC GCGCGACATC AGCGCCGAGG TCATCCGCGA GGTCGGCCTC GACAAGCTGC CGGAATTCAA CGCGGCCCTG CGCGAGCCCT CGCTGTCCCG GCAGGCGATC AATGCCGTGC GGAGCTGGTT CGACGCCGAT CCGCCCGTTC GTGCGACCCG GCGCGACGAC CTGCGGCTCG TCATCCCGGC CTACATGGCC CGGCTCACGG TGTTCCAGAA GGGCAACTCC GACGTCGTCA ACATCGGCTT CAGCTCCGAG GATGCGTCGA TTGCCGCGGC GGTGCCGAAC GCGGTCATCC GCACCTACCT GAAGGCGCGC GAGCGCCAGC ATCAGGGCGA GCTGGAGGCC AACCTGCGCT GGCTCGCGGT GCGGATCGAA GAGCAGAGCC AGCGGCTCAA TGCGGCGCTC GAGGCCGTGG CCACCCGGCG GAACCAGCCC GACCTCTCCT CGCCGAGCGC CCTCGGCATC GACACCGCCA TCGCGAGCCT CAGCGAACGG CGCATCGCCA TCCGCCACGA CATCGGAGCG GCAGAGCGCA GCCTCGAGGA TCTGAAGGCC GACGGCGTGG TGGTCGGCCA ATCCGGCGAC AGCGGGCCCG AGGCCAAGCC GCAGCTCGGC ATCCAGCTCG AGGCCGCGCG GGCCGAGCTG GAGCGGCTCC AGACCCAGTT CGGCGAGAAC CATTCGAAGG TCCGCGACGT CCGCGAGCGG ATCGCCGAGA TCGAGAGCCA GATGCGCTTC GAGGTCTCGT CCGAGATCCT GGCCCTCACG CGGCGCATCT CGTCGCTGAA GGCGGAGGAG GCCGCGAACC TCGAAGAGCT CGAGAGGGCG CGCGACACGC TCGCCCGGCA GAAGGAGGCG CAGGCCGAGC TCGCCCGCCT CGAGAACGAG GCGAGCCAGG AACAGCTTGC GCTGGCGGCG GCGCTGCAGC AGCAGCGGCT CCTCCTCTCC AGCTCGCGGC AGAACGTGAC GGAGGTGTCG GTGCTGACGC CCGCCTCGGT GCCGCTGAAC GCGGACGGGC GCGGCAAAGC CTTCTACCTC GTCGCCGCCA TGATCGGCAG CGCCATCGCG GCGGTGACGG CCGTATTCGC GCTCGAAATC CTCGACACCA AGGTCCGCAG CGCCGAGCAT CTGCGCCGCA TCCGCCGCGT GGTGCCGACG GGGATCGTGC CGCAGCTGCC CCGCAGCCGC GGCGCGCCGA CCGGCCCGCT CGGGTGGTGG CAGCCCGAAG GCGTGTTTGC CGATGCGGTC CGGGCGGTGG TCATCAGCCT CAGTCACGCC CGGCGGCAGC ATCTGGGCAA CATCCTCGTG AGTTCCGCCC TGCCGGGCGA GGGCAAGACC ACCGTCGCCG CCGCCCTCGC GGCCGAGATG GCGGCCTCGG GCCAGAAGGT CCTGCTGGTG GATGCGGATC TGCGGCAGGG CAACATGCAT CGGCTCTTCG GGCTGGAGCC GGGGTTCGGC CTCTCGGATT ATCTCCGGGG CGCGCAGCCG 
CTCTCCGAGG TGATCCGCCA CGAGGTGGCC CCCGGCATCG ACCTCCTGCC CTGCGGCAGC CAGCTCGGCG CGGCCCGCCT TGACCGGCAG AAGATGATGG CGCTGCTGCA GATGGCCCGC GACGCCGGCC AGATCGTGAT CCTCGACACG CCGCCCGCGC TCGCCACCGT CGATACGGCG AGCCTTGCCG ATCTGGTCGA GACGGCGCTC CTCGTCGTCG AATGGGGCCG GACGGATCCC GATGCGGTCG AGGCGGCGGT CCAGCGGCTG ACGCTCGGGC GCGAGGGCGA TGTCTTCGCG GTCATCAACC GGGTGAACCT GCAACGGCAG GCCCTCTATG GCTTCCGCGA CGGCGGGCCC CTCGCCCGGA CCCTGAGCAG CTTCCACCGC GGCGCAGGCC GCGCGCGCTG A
|
Protein sequence | MSQAQKVALA LRSQPTPIPL PVPAPAQEEW LDVQALFRII RRRLPVAALV FVGLMALLTG PILDMKRSFT AQARVLMRDP PSAGLGAVDG AEQKPLNLST EIERFVSRDI SAEVIREVGL DKLPEFNAAL REPSLSRQAI NAVRSWFDAD PPVRATRRDD LRLVIPAYMA RLTVFQKGNS DVVNIGFSSE DASIAAAVPN AVIRTYLKAR ERQHQGELEA NLRWLAVRIE EQSQRLNAAL EAVATRRNQP DLSSPSALGI DTAIASLSER RIAIRHDIGA AERSLEDLKA DGVVVGQSGD SGPEAKPQLG IQLEAARAEL ERLQTQFGEN HSKVRDVRER IAEIESQMRF EVSSEILALT RRISSLKAEE AANLEELERA RDTLARQKEA QAELARLENE ASQEQLALAA ALQQQRLLLS SSRQNVTEVS VLTPASVPLN ADGRGKAFYL VAAMIGSAIA AVTAVFALEI LDTKVRSAEH LRRIRRVVPT GIVPQLPRSR GAPTGPLGWW QPEGVFADAV RAVVISLSHA RRQHLGNILV SSALPGEGKT TVAAALAAEM AASGQKVLLV DADLRQGNMH RLFGLEPGFG LSDYLRGAQP LSEVIRHEVA PGIDLLPCGS QLGAARLDRQ KMMALLQMAR DAGQIVILDT PPALATVDTA SLADLVETAL LVVEWGRTDP DAVEAAVQRL TLGREGDVFA VINRVNLQRQ ALYGFRDGGP LARTLSSFHR GAGRAR
|
| |
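The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. As a minimal sketch, the helpers below (hypothetical names, not part of the source database) strip the whitespace and compute the GC fraction of a nucleotide string; applied to the full gene sequence, the result should be comparable to the "GC content" field reported in the Gene Information table.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# The first two ten-base blocks of the gene sequence, as printed above.
first_blocks = "GTGAGTCAGG CTCAGAAAGT"
print(clean_sequence(first_blocks))  # GTGAGTCAGGCTCAGAAAGT
print(gc_fraction(first_blocks))     # 0.5
```

To check the whole gene, paste the complete "Gene sequence" value into `gc_fraction`; the line break in the middle of the sequence is handled by the whitespace stripping.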