Gene RSP_4084 details

Gene Information

Locus tag: RSP_4084
Symbol:
ID: 3711959
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodobacter sphaeroides 2.4.1
Kingdom: Bacteria
Replicon accession: NC_007489
Strand:
Start bp: 42792
End bp: 44993
Gene Length: 2202 bp
Protein Length: 733 aa
Translation table: 11
GC content: 71%
IMG OID: 640069436
Product: acetyltransferase
Protein accession: YP_345303
Protein GI: 77404730
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR01007] capsular exopolysaccharide family
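
The Start bp, End bp, Gene Length and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function and constant names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 42792
END_BP = 44993
GENE_LENGTH_BP = 2202
PROTEIN_LENGTH_AA = 733


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1
```

Under these assumptions, span_length(START_BP, END_BP) gives 2202 and expected_protein_length(GENE_LENGTH_BP) gives 733, which agrees with the listed Gene Length and Protein Length.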


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.182153
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGGCAG AGAGACCGCA GCCGGTCGCG GCTCCCGAAG AGGACGAGAT CGACCTCGGC 
CAGCTCCTGG GCCAGATCTG GCACGGCAAG CTCTGGATCG GGAGCGCCAC GCTGGCGGCG
GGGATGCTGG GGCTGGCGTG GCTGGCGAAC ACGGCGCCGA CCTACCGGGC GGATACGCTG
CTGCAGCTCG AGGAGAAGGC CTCTCCGCTT GCACTGCCGA CGGCACTTTC GGAGTTGGCC
GGAGGCGAGG CGCGCAGCGC GACGGAGGTG GAGATCCTGC GTTCTCGCCT TGTGCTGGGG
GAGGCGGTGG CGGCGTTTCA CCTGGACTGG GAGGCGAAGC CGCTACGGGC GCCGCTGGTG
GGGCAGGCGG TGGCCTCAGG CGTGCTCCCA CTGCCGGAGG TGGAGGGCTT GATCCGCTAC
GACCGGGGCG ACAGCCGGAT CCGGCTCGAC CTGCTCGAGG TGCCACCCGA ATGGGTCGAA
GAGCCGATCC TCCTGACGGC GATGGGGGAG GGGCAGTTCG CGCTCCTGCT GCCCGACGGG
CGCGAGGAGA CGGGGGAGGT CGGACGCCCG CTTGCCGTTC CGGACCGGGG CTTCGGGCTC
AGGATCGGGG CGCTGGAGGG CGTACGGGGC CGCCAGTTCG TGATCCGGCA GCTCGACGAG
ACCGGGGCGA TCGGGGCTCT GCGCGACCGG CTCACGGTGG CCGAACGTGG CAAGCAATCC
TCCATCCTCG AGGTGGGGCT GACGGGCCGG GACCCGGCGG AAGTGCAGCG GACGCTCGGC
GGCATCGCCG AGGCCTATCT GCGCCAGAAC ATGACCCGGA GCGCGGCCGA GGCGGAAAGC
AGTCTCGAGT TCATCGAGGG CCAGCTGCCC GAGGCGCAGA AGGCGGTCCG GGCGGCGGAG
GACCGGCTGA ACGCCTACCG CCAGGCCCAG CAGGCGATCG ACCTCGGCTT CGAGGGCCAG
AGCCTGCTGA CCCAGATCAG CGCGATCGAG ACCGAGCTGC GCCAGCTGGC CGATCAGGAG
GAGGAGATCG CCAACCGCTA CACGTCGAAC CACCCGACCT ACCAGCGGCT TCTGGCGATG
CGGGGGCGGC TCGAGGAGCG GCTGGCGGCA CTGCGCAAGG AGGTGTCGAA TCTGCCGGAG
ACCCAGCGCG AGGTCTTCAA CTTCACCCGC GATCTGGAGG TGGCGCAGGA GGTCTATCTG
CAGCTGCTGA ACCGCGCCCA GGAACTGCGC GTGGTGAAGG CCAGCACTAT CGGCAATGTC
CGCATCGTCG ACGGGGCCCG CACGGCGCCT GAGCCGGTTG CGCCCCGGCG CGGTCGCACG
CTCGCGTTGG CGCTGCTTCT GGGGGCGCTC TCGGGCACCG GCCTCGTGCT GGGGCGGGCC
TGGCTGCGCC GCAGCGTCCG GGGCCCGGAG GAGCTCGACC GGCTCGGCTT GCCGGTATTT
GCCACGGTCC TGTTTGCGCC GGCGGCGGTG GGGAACCGGA AGGGCCGGGG GCTGCTGCCG
ATCCTCGCGC TGTCGGATCC GAACTCCGCG ACGGTGGAGG GGATCCGGTC GCTGCGCACG
AGCCTGCATT TCGGGATGCT CGACGCGGGC AGCCGGTCGA TCGCGCTCAC CTCCTCGGCG
CCTGCCGCCG GCAAGTCCTT CACCTCCGTC AATCTCGCGG TGGTGGCGGC ACAGGCCGGC
CAGAGCGTCT GCCTGATCGA TGCCGACCTG CGGCGGGGCC ATCTGCGGCG CTATTTCGCG
GTGGCGAAGG GCACGCCCGG CCTTGCCGAA TATCTGGCGG GCGAGGCGGA GCTCGATGAG
CTGCTGCGGC CGGGGCCGGT GGAGGGGCTG GCTTTCCTCT CGACGGGCCA GCTTCCGCCC
AACCCCTCGG AGCTGCTGAT GCGCCCGCGC CTTGCAGAGC TCGTGGCCGA GCTCGACCGC
CGCTTCGATC TGAGCATCTT CGACGCGCCT CCCGTGCTGG CCGTCACCGA CCCTGTGGTG
ATCGGCCGGG CGGTCGGCGC CACCATCGCC GTCGTCCGTC ACGACGTCAC CGGCCTGGGC
GAGGTGGAGG CCCTCATCCG CCAGCTGCAG GGGGCCGGCG TGAAGCCGGC CGGTGCGGTG
CTCAACGCCT ATCGCCCCCA ACGCGGGTCG GGCCGGTACG GCTATGGATA CGGCTATCGC
TACCGCTACG ACACCGGATA CCGCCCGCAG TCGGGAGACT AG
 
Protein sequence
MMAERPQPVA APEEDEIDLG QLLGQIWHGK LWIGSATLAA GMLGLAWLAN TAPTYRADTL 
LQLEEKASPL ALPTALSELA GGEARSATEV EILRSRLVLG EAVAAFHLDW EAKPLRAPLV
GQAVASGVLP LPEVEGLIRY DRGDSRIRLD LLEVPPEWVE EPILLTAMGE GQFALLLPDG
REETGEVGRP LAVPDRGFGL RIGALEGVRG RQFVIRQLDE TGAIGALRDR LTVAERGKQS
SILEVGLTGR DPAEVQRTLG GIAEAYLRQN MTRSAAEAES SLEFIEGQLP EAQKAVRAAE
DRLNAYRQAQ QAIDLGFEGQ SLLTQISAIE TELRQLADQE EEIANRYTSN HPTYQRLLAM
RGRLEERLAA LRKEVSNLPE TQREVFNFTR DLEVAQEVYL QLLNRAQELR VVKASTIGNV
RIVDGARTAP EPVAPRRGRT LALALLLGAL SGTGLVLGRA WLRRSVRGPE ELDRLGLPVF
ATVLFAPAAV GNRKGRGLLP ILALSDPNSA TVEGIRSLRT SLHFGMLDAG SRSIALTSSA
PAAGKSFTSV NLAVVAAQAG QSVCLIDADL RRGHLRRYFA VAKGTPGLAE YLAGEAELDE
LLRPGPVEGL AFLSTGQLPP NPSELLMRPR LAELVAELDR RFDLSIFDAP PVLAVTDPVV
IGRAVGATIA VVRHDVTGLG EVEALIRQLQ GAGVKPAGAV LNAYRPQRGS GRYGYGYGYR
YRYDTGYRPQ SGD
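
The sequences above are laid out in space-separated blocks of 10 characters, so the spacing has to be removed before lengths or base composition are computed. The following is a minimal sketch of that clean-up together with a GC-fraction calculation; the helper names are hypothetical, and only the first line of the gene sequence is included as an excerpt.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to lay out a sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    seq = clean_sequence(dna)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence in this record, used only as a short excerpt.
FIRST_LINE = "ATGATGGCAG AGAGACCGCA GCCGGTCGCG GCTCCCGAAG AGGACGAGAT CGACCTCGGC"
```

Passing the full gene sequence text to gc_fraction and multiplying by 100 gives a percentage that can be compared with the GC content field; len(clean_sequence(...)) can be compared with the Gene Length and Protein Length fields in the same way.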