Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rru_A3715 |
Symbol | |
ID | 3837172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodospirillum rubrum ATCC 11170 |
Kingdom | Bacteria |
Replicon accession | NC_007643 |
Strand | + |
Start bp | 4264520 |
End bp | 4266658 |
Gene Length | 2139 bp |
Protein Length | 712 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637827840 |
Product | lipopolysaccharide biosynthesis |
Protein accession | YP_428796 |
Protein GI | 83595044 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACAAAA GCAAAGCCGC GATCACCAGT CAAAACCAGA CGTTTACCCT GCGTGATGGG CTGCTCTATG TTTTTTTCCA TAAGAAAATC ATCCTCATAG GATTTTTTAT TCCTATTGTT TTGATGTCCA TTTTCGCCAC GCAGTTTCCC TTGTTCTATA AAGCCGAAGC CCGGCTGATG GTTTTGTTCA GCCGCGAACA ATCCGGCGCC CAGGATCTGA TGGGGGCGCC GACGGTGGTG TCCGTCGATG GCCTGCGCGC CACCGCCACC GAGGTTGGCA TCCTGCGCTC AAGCGAGGTT CTGCAAAAGG TGATCGCCGA TATCGGCGAG GAGGCGTTGT TTCCCGAACT GCTGCGTCCG CGCCTTTTCG GCCTGCTGCC TGCCTATTCC CCGGAAGAAC GCACGCAACG GGCCATCGAT CTGGCGCAAG AACGCATCCA TATCGATATT CCGACCGATT CCAATATCGT TACGGTGTCC TTTGATCACG AAAACCGGGA AATCGCCCTT AGCGTTGTTC AGGTTCTTCT TGACGTTTAT CTTGATCACC GCGCCAAGGT CTTTGAAAAC CCGCGCTCGC CTTTTTTGCT GAAAGAGGCG GAGCGCGATC TTCAGCTTTT GCAAGAGACC GAGCGCGAGC TGACCCAGAC CAAGGCCCGC TATAAGATCA TCGATATCGA CCAGGATATC CTGCTCGCGG TCAATCAGGT CGATAGCATC GTCCAGCGCA GCCGGCAGAT GGACGAGCGT CAGGCCGCCG TCCGCGCCGA GATCGAAGAG GCGGCCAAGT CCTTGGACGC CCTGCCGATC AGCGTTCCCA GTTTCCAGGA AACCACCAAT CACACCGATA ACGACGCGGT GCGCAACGAG CGCCTTGCCC TTCAGCTTGA ACGACGCCAG CTTTCCGAGC GCTATCAGGC CGATTATCCG CGCATCCAGG ATATCGACAA GCAGCTCGAC GCCATCACGA ATTTTCTGAA GAAAAACCAG CCCATCTATA AGACCGATCG TCAGGTTCGC AATCCGACCA TCGAATTCCT GACCAATCAC TATCTGACCC TGAAGATCGA AGGCGAGGCG GTCACTCATC AGGTCGCCCA ATTGGCCGCC CAAAAGGCCA TCGCCGAAAA GCGCGTGGCC GAACTGACCC TTGTCGCCGA AAAGATCAAC GACCTGGACC GCCGGCGGTC GATCCAGGAA GAGACCTATC GCGAATACAA TCGCCGGGCC GAGGCGGCGC GCATCGAGGA GGCCACCGCC CGCCTGCGCA GCGCCAATGT CCGCGTCATC GCCTCGGCCT ATGCCTCGCC GACCGGCCAT AGCATGGTTC CCAGCCTGCT GGCCGGCGGC GTGTTCCTCG GCCTGTTGCT GGGCGCCGTA TCGGGCATCA TCGCCGCCTA TGGTCGTCAG GTCATGCTGA CGCCGGTCGA GGTCGAGAAG CGTTTGGGCC TGCCCGTGGT CAGCTCCTTC TCCGACGAGC ATCAGCCGCG CGCCAAGGGC AAGAACGCCG CCGAGATGAT CTATCTGGTG GCCCGCCTGC GCGAAGCCGG GCCCGAGAAC ACGACGCTGA AGACCCTGCA GGTCGTATCG TCGTCGAAGC TCGAAAACCA ATCGAAATTC GTTCGGCAAC TGGCCGTCGA GGTTGTTCGC GGCTATGGCG AAAAGACCCT GATCATCGAT CTCAATGAAA AGATGTCGGC GCACAGGAAA ACCCTGGTCC GGCCCGATGC CGAGGCGGCG CCTTTGCTCA CCGATGACGG CGAGGGGCTG GCCGACGAGA TCACGCCCGA CGTTCTGCCC ACCGTTCTGC CCAATCTGTT CCTGACCGAA AACGCCAGCG TCTCGGTGCT GGGCGATCCC CGGGTCAATC GGCAGAGGCT GGCCGAAATC CTCGATAGCC TGGGTGCGAG CTATGATGTC ATCATCCTCG ATCTGCCGCC CTTCGATGTC CATCGCATCG GCCTGCGCTA TGCGCCGCTG ACCGATGGCA GCCTGACCAT TGTCCGGGCC GCGGCGACGC GTCTGCCCGC CGCCCTCAAC CTCAAGGAAA CCATCCTGTC GGCGGGCGGC GACATTCTGG GGGCGGTTCT CACCGAACGC CGCTACTACA TTCCAAAGGG AATTCTGCGA TGGCTCTGA
|
Protein sequence | MNKSKAAITS QNQTFTLRDG LLYVFFHKKI ILIGFFIPIV LMSIFATQFP LFYKAEARLM VLFSREQSGA QDLMGAPTVV SVDGLRATAT EVGILRSSEV LQKVIADIGE EALFPELLRP RLFGLLPAYS PEERTQRAID LAQERIHIDI PTDSNIVTVS FDHENREIAL SVVQVLLDVY LDHRAKVFEN PRSPFLLKEA ERDLQLLQET ERELTQTKAR YKIIDIDQDI LLAVNQVDSI VQRSRQMDER QAAVRAEIEE AAKSLDALPI SVPSFQETTN HTDNDAVRNE RLALQLERRQ LSERYQADYP RIQDIDKQLD AITNFLKKNQ PIYKTDRQVR NPTIEFLTNH YLTLKIEGEA VTHQVAQLAA QKAIAEKRVA ELTLVAEKIN DLDRRRSIQE ETYREYNRRA EAARIEEATA RLRSANVRVI ASAYASPTGH SMVPSLLAGG VFLGLLLGAV SGIIAAYGRQ VMLTPVEVEK RLGLPVVSSF SDEHQPRAKG KNAAEMIYLV ARLREAGPEN TTLKTLQVVS SSKLENQSKF VRQLAVEVVR GYGEKTLIID LNEKMSAHRK TLVRPDAEAA PLLTDDGEGL ADEITPDVLP TVLPNLFLTE NASVSVLGDP RVNRQRLAEI LDSLGASYDV IILDLPPFDV HRIGLRYAPL TDGSLTIVRA AATRLPAALN LKETILSAGG DILGAVLTER RYYIPKGILR WL
|
| |