Gene Rsph17025_3079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17025_3079 
Symbol 
ID5083166 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17025 
KingdomBacteria 
Replicon accessionNC_009428 
Strand
Start bp3148452 
End bp3150611 
Gene Length2160 bp 
Protein Length719 aa 
Translation table11 
GC content69% 
IMG OID640484651 
Productmalate synthase G 
Protein accessionYP_001169268 
Protein GI146279109 
COG category[C] Energy production and conversion 
COG ID[COG2225] Malate synthase 
TIGRFAM ID[TIGR01345] malate synthase G 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0370164 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGACCGACC GGATCGAGAA GCACGGGTTG CAGGTGGACA CGCGCCTTGC CGAGTTCGTG 
GCGCGCGAGG CCCTGCCGGG CACGGGTGTC GGCGAGGACG CCTTCTGGGG GGGGCTCGCC
CGGATGGTGC GCGAACTGGG GCCCGGAAAC CGCGCGCTTC TTGACCGTCG CGCCGATCTG
CAGGCGCGGA TCGATGCCTG GCATCGGGAT CGGCGCGGCC AGCGCACGAG CCTCGAGGAC
TACACGGCCT TCCTGCGCGA CATCGGCTAT CTGCTGCCGG AGGGACCGGA TTTCACGATC
GAGACCGGCA ACGTCGATCC CGAGATCGCC GAGGTTGCAG GTCCGCAGCT GGTCGTGCCG
GTGATGAACG CCCGCTACGC GCTGAATGCG GCCAACGCGC GCTGGGGCTC GCTCTATGAT
GCGCTCTACG GTACGGACGC GCTTGGGGAT GCGCCGCCGG CCGGTGAGTT CGATGCCGAG
CAGGGCGCGC GCGTGATCGC CTGGGGGCGG CAGTTCCTGG ACGAGGTGGC GCCGCTCGCA
GAGGGCAGTC ATGCCGAGGT CGAGAGCTAC CGGATCGCGA ACGGGGCTCT CGTGCCCGCG
CTGAAGGAGC CTGCGCAATT CGCCGGCTAC GCGGGGCCGG CCGGGGCACC GAGCGCGATC
CTCCTCGCGA ACAACGGGCT GCATCTGATC CTCGACATCG ACCGGGGGCA CCGGATCGGG
GCGACGGACC GCGCGGGTGT GGCCGACATC CGGATGGAGT CCGCGCTGTC GGCCATCATG
GATTGCGAGG ATTCCGTCGC GGCCGTCGAT GGCGAGGACA AGGCACTGGC CTACGGCAAC
TGGCTGGGCC TGATGCGCGG CGACCTGCGC GAGGCGATCT CGAAGGCCGG GCGCATGTTC
GTGCGGGAGC TGGCGCCGGA CCTGTCCTTC ACGGCGCCCG ATGGCGGCAC GATCACGCTC
AAGGGCCGGG CGCTGATGCT GGTGCGGAAC GTGGGCCATC TGATGACCAC GCCCGCCGTG
CTGGACGAGG CGGGCGAGGA GATCTTCGAG GGGATGCTCG ACGCCTTCGC CACGACCCTC
TGCGCCATCC ATGACCTGCG CAAGACCGCA GGGCCGCGAA ATTCGGTGAC TGGCTCTGTC
TATGTGGTGA AGCCCAAGAT GCACGGGCCT GAGGAAGTGG CCTTCGCCGA CGAGATCTTC
ACCCGGGTCG AGGAGGCGCT CGGCCTGCCG CGCTACAGCG TCAAGCTCGG CATCATGGAT
GAGGAACGCC GGACGTCGGT CAACCTCAAG GAATGCATCC GCGCGGCGAG GCACCGGGTG
GCCTTCATCA ACACGGGCTT CCTCGACCGG ACCGGGGACG AGATCGTGAC CGGCATGGAG
GCGGGCCCGA TGGTGAAGAA GGGCGACATG AAGGCCTCGC GCTGGATCGC GTCCTACGAG
GACCGGAACG TGGACATCGG CCTTGCCTGC GGGCTGCGCG GCCGGGCGCA GATCGGCAAG
GGCATGTGGG CCATGCCCGA CCGGATGGCC GAGATGCTCG CGAACAAGAT CGCGCATCCC
CGTGCGGGCG CCAATTGCGC CTGGGTGCCC TCGCCCACGG CGGCGACGCT CCACGCCACC
CACTATCACC GCGTGGATGT GAAGGCGCGG CAAGAGGAGA TCGCGGCCGG TGGCCCCCGC
GGCAGCCTCG CCGACCTGCT CACCCTGCCC GTGGCCGAGG GCGTGAACTG GTCGGATGCC
GAGCTGCGGC AGGAGATCGA GAACAACGCC CAGGGCATCC TGGGCTATGT CGTCCGCTGG
GTCGATCAGG GCGTGGGCTG CTCGAAAGTG CCCGACATCA ACGACGTGGG CCTGATGGAG
GACCGCGCGA CCTGCCGGAT CTCGAGCCAG GCGCTGGTGA ACTGGCTGCA CCACGGGGTT
GTCTCGGAAG AGCAGGTACT GGCGGCGCTG AAAAAGATGG CGGCCGTCGT GGATGCGCAG
AACGCCGGAG ACCCCGCCTA TCGCCCCATG GCGCCGGATT TCGACGGTGC GGCCTTCCAG
GCGGCCTGCG ATCTGGTCTT CAAGGGCCGC GAGCAGCCTT CGGGCTACAC CGAGCCCGTG
CTTCACGACC GCCGCCTCCA GGTGAAGGCC GAGCGCACGC CGCAGGTGAG CCGCGCCTGA
 
Protein sequence
MTDRIEKHGL QVDTRLAEFV AREALPGTGV GEDAFWGGLA RMVRELGPGN RALLDRRADL 
QARIDAWHRD RRGQRTSLED YTAFLRDIGY LLPEGPDFTI ETGNVDPEIA EVAGPQLVVP
VMNARYALNA ANARWGSLYD ALYGTDALGD APPAGEFDAE QGARVIAWGR QFLDEVAPLA
EGSHAEVESY RIANGALVPA LKEPAQFAGY AGPAGAPSAI LLANNGLHLI LDIDRGHRIG
ATDRAGVADI RMESALSAIM DCEDSVAAVD GEDKALAYGN WLGLMRGDLR EAISKAGRMF
VRELAPDLSF TAPDGGTITL KGRALMLVRN VGHLMTTPAV LDEAGEEIFE GMLDAFATTL
CAIHDLRKTA GPRNSVTGSV YVVKPKMHGP EEVAFADEIF TRVEEALGLP RYSVKLGIMD
EERRTSVNLK ECIRAARHRV AFINTGFLDR TGDEIVTGME AGPMVKKGDM KASRWIASYE
DRNVDIGLAC GLRGRAQIGK GMWAMPDRMA EMLANKIAHP RAGANCAWVP SPTAATLHAT
HYHRVDVKAR QEEIAAGGPR GSLADLLTLP VAEGVNWSDA ELRQEIENNA QGILGYVVRW
VDQGVGCSKV PDINDVGLME DRATCRISSQ ALVNWLHHGV VSEEQVLAAL KKMAAVVDAQ
NAGDPAYRPM APDFDGAAFQ AACDLVFKGR EQPSGYTEPV LHDRRLQVKA ERTPQVSRA