Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RSP_4084 |
Symbol | |
ID | 3711959 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides 2.4.1 |
Kingdom | Bacteria |
Replicon accession | NC_007489 |
Strand | - |
Start bp | 42792 |
End bp | 44993 |
Gene Length | 2202 bp |
Protein Length | 733 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640069436 |
Product | acetyltransferase |
Protein accession | YP_345303 |
Protein GI | 77404730 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis |
TIGRFAM ID | [TIGR01007] capsular exopolysaccharide family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.182153 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGCAG AGAGACCGCA GCCGGTCGCG GCTCCCGAAG AGGACGAGAT CGACCTCGGC CAGCTCCTGG GCCAGATCTG GCACGGCAAG CTCTGGATCG GGAGCGCCAC GCTGGCGGCG GGGATGCTGG GGCTGGCGTG GCTGGCGAAC ACGGCGCCGA CCTACCGGGC GGATACGCTG CTGCAGCTCG AGGAGAAGGC CTCTCCGCTT GCACTGCCGA CGGCACTTTC GGAGTTGGCC GGAGGCGAGG CGCGCAGCGC GACGGAGGTG GAGATCCTGC GTTCTCGCCT TGTGCTGGGG GAGGCGGTGG CGGCGTTTCA CCTGGACTGG GAGGCGAAGC CGCTACGGGC GCCGCTGGTG GGGCAGGCGG TGGCCTCAGG CGTGCTCCCA CTGCCGGAGG TGGAGGGCTT GATCCGCTAC GACCGGGGCG ACAGCCGGAT CCGGCTCGAC CTGCTCGAGG TGCCACCCGA ATGGGTCGAA GAGCCGATCC TCCTGACGGC GATGGGGGAG GGGCAGTTCG CGCTCCTGCT GCCCGACGGG CGCGAGGAGA CGGGGGAGGT CGGACGCCCG CTTGCCGTTC CGGACCGGGG CTTCGGGCTC AGGATCGGGG CGCTGGAGGG CGTACGGGGC CGCCAGTTCG TGATCCGGCA GCTCGACGAG ACCGGGGCGA TCGGGGCTCT GCGCGACCGG CTCACGGTGG CCGAACGTGG CAAGCAATCC TCCATCCTCG AGGTGGGGCT GACGGGCCGG GACCCGGCGG AAGTGCAGCG GACGCTCGGC GGCATCGCCG AGGCCTATCT GCGCCAGAAC ATGACCCGGA GCGCGGCCGA GGCGGAAAGC AGTCTCGAGT TCATCGAGGG CCAGCTGCCC GAGGCGCAGA AGGCGGTCCG GGCGGCGGAG GACCGGCTGA ACGCCTACCG CCAGGCCCAG CAGGCGATCG ACCTCGGCTT CGAGGGCCAG AGCCTGCTGA CCCAGATCAG CGCGATCGAG ACCGAGCTGC GCCAGCTGGC CGATCAGGAG GAGGAGATCG CCAACCGCTA CACGTCGAAC CACCCGACCT ACCAGCGGCT TCTGGCGATG CGGGGGCGGC TCGAGGAGCG GCTGGCGGCA CTGCGCAAGG AGGTGTCGAA TCTGCCGGAG ACCCAGCGCG AGGTCTTCAA CTTCACCCGC GATCTGGAGG TGGCGCAGGA GGTCTATCTG CAGCTGCTGA ACCGCGCCCA GGAACTGCGC GTGGTGAAGG CCAGCACTAT CGGCAATGTC CGCATCGTCG ACGGGGCCCG CACGGCGCCT GAGCCGGTTG CGCCCCGGCG CGGTCGCACG CTCGCGTTGG CGCTGCTTCT GGGGGCGCTC TCGGGCACCG GCCTCGTGCT GGGGCGGGCC TGGCTGCGCC GCAGCGTCCG GGGCCCGGAG GAGCTCGACC GGCTCGGCTT GCCGGTATTT GCCACGGTCC TGTTTGCGCC GGCGGCGGTG GGGAACCGGA AGGGCCGGGG GCTGCTGCCG ATCCTCGCGC TGTCGGATCC GAACTCCGCG ACGGTGGAGG GGATCCGGTC GCTGCGCACG AGCCTGCATT TCGGGATGCT CGACGCGGGC AGCCGGTCGA TCGCGCTCAC CTCCTCGGCG CCTGCCGCCG GCAAGTCCTT CACCTCCGTC AATCTCGCGG TGGTGGCGGC ACAGGCCGGC CAGAGCGTCT GCCTGATCGA TGCCGACCTG CGGCGGGGCC ATCTGCGGCG CTATTTCGCG GTGGCGAAGG GCACGCCCGG CCTTGCCGAA TATCTGGCGG GCGAGGCGGA GCTCGATGAG CTGCTGCGGC CGGGGCCGGT GGAGGGGCTG GCTTTCCTCT CGACGGGCCA GCTTCCGCCC AACCCCTCGG AGCTGCTGAT GCGCCCGCGC CTTGCAGAGC TCGTGGCCGA GCTCGACCGC CGCTTCGATC TGAGCATCTT CGACGCGCCT CCCGTGCTGG CCGTCACCGA CCCTGTGGTG ATCGGCCGGG CGGTCGGCGC CACCATCGCC GTCGTCCGTC ACGACGTCAC CGGCCTGGGC GAGGTGGAGG CCCTCATCCG CCAGCTGCAG GGGGCCGGCG TGAAGCCGGC CGGTGCGGTG CTCAACGCCT ATCGCCCCCA ACGCGGGTCG GGCCGGTACG GCTATGGATA CGGCTATCGC TACCGCTACG ACACCGGATA CCGCCCGCAG TCGGGAGACT AG
|
Protein sequence | MMAERPQPVA APEEDEIDLG QLLGQIWHGK LWIGSATLAA GMLGLAWLAN TAPTYRADTL LQLEEKASPL ALPTALSELA GGEARSATEV EILRSRLVLG EAVAAFHLDW EAKPLRAPLV GQAVASGVLP LPEVEGLIRY DRGDSRIRLD LLEVPPEWVE EPILLTAMGE GQFALLLPDG REETGEVGRP LAVPDRGFGL RIGALEGVRG RQFVIRQLDE TGAIGALRDR LTVAERGKQS SILEVGLTGR DPAEVQRTLG GIAEAYLRQN MTRSAAEAES SLEFIEGQLP EAQKAVRAAE DRLNAYRQAQ QAIDLGFEGQ SLLTQISAIE TELRQLADQE EEIANRYTSN HPTYQRLLAM RGRLEERLAA LRKEVSNLPE TQREVFNFTR DLEVAQEVYL QLLNRAQELR VVKASTIGNV RIVDGARTAP EPVAPRRGRT LALALLLGAL SGTGLVLGRA WLRRSVRGPE ELDRLGLPVF ATVLFAPAAV GNRKGRGLLP ILALSDPNSA TVEGIRSLRT SLHFGMLDAG SRSIALTSSA PAAGKSFTSV NLAVVAAQAG QSVCLIDADL RRGHLRRYFA VAKGTPGLAE YLAGEAELDE LLRPGPVEGL AFLSTGQLPP NPSELLMRPR LAELVAELDR RFDLSIFDAP PVLAVTDPVV IGRAVGATIA VVRHDVTGLG EVEALIRQLQ GAGVKPAGAV LNAYRPQRGS GRYGYGYGYR YRYDTGYRPQ SGD
|
| |