Gene Rsph17029_4156 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRsph17029_4156 
Symbol 
ID4895027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodobacter sphaeroides ATCC 17029 
KingdomBacteria 
Replicon accessionNC_009040 
Strand
Start bp94244 
End bp95419 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content69% 
IMG OID640110547 
Productmandelate racemase/muconate lactonizing protein 
Protein accessionYP_001041859 
Protein GI126464883 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones128 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones95 
Fosmid unclonability p-value0.845536 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATCG ATGCCGTCGA CCTTTTCTAT CTGTCGATGC CCGAGGTGAC CGATGCCGCC 
GATGGCAGTC AGGATGCGCT GCTGGTCCGG GTGGCGGCCG GCGGACATAT CGGCTGGGGC
GAGTGCGAAG CGGCGCCGCT GCCCTCGATC GCCGCCTTCG TCTGTCCGAA ATCGCACGGT
GTCTGCCGTC CGGTCTCGGA CTCCGTGCTC GGTCAGCGGC TGGACGGGCC GGACGATATC
GCCCGCATTG CGGCGCTGGT CGGCTATAAC TCGATGGACC TGCTGCAGGC GCCGCATATG
CTGTCGGGGA TCGAGATGGC GCTGTGGGAT CTGCTGGGCC GCAGGCTCTC GGCCCCGGCA
TGGGCCCTGC TGGGCTACAG CGCCAACCAT GGCAAGAGGC CCTACGCCTC GCTGCTGTTC
GGCGACACGC CGCAGGAGAC ACTCGAGCGG GCCCGCGCCG CGCGGCGCGA TGGCTTTGCA
GCCGTCAAAT TCGGCTGGGG TCCGATCGGG CGCGGCACTG TGGCGGCGGA CGCCGATCAG
ATCATGGCCG CGCGCGAAGG GCTGGGCCCG GACGGGGACC TGATGGTCGA TGTCGGCCAG
ATCTTCGGAG AGGATGTCGA GGCGGCCGCT GCGCGCCTGC CGACGCTCGA TGCCGCCGGG
GTGCTCTGGC TCGAAGAGCC GTTCGACGCG GGTGCGCTGG CCGCGCATGC CGCTCTGGCC
GGGCGGGGGG CCCGTGTCCG CATCGCCGGC GGCGAGGCGG CGCACAACTT CCATATGGCG
CAGCATCTGA TGGATTACGG CCGTATCGGC TTCATCCAGA TCGACTGCGG CCGCATCGGA
GGGCTCGGTC CGGCGAAGCG GGTGGCCGAT GCGGCGCAGG CGCGCGGCAT CACCTATGTC
AACCATACCT TCACCTCGCA TCTGGCGCTC TCCGCCTCGT TGCAGCCCTT TGCCGGGCTC
GAGGCCGACC GCATCTGCGA ATATCCCGCG GCTCCGCAGC AACTGGCGCT CGATATCACC
GGCGATCACA TCCGGCCCGA TGGCGAGGGG TTGATCCGGG CTCCGGAAGC ACCGGGGCTG
GGGCTGCAGG TGGCTGCGTC CGCGCTGCGC CGCTACCTGG TCGAGACCGA GATCCGCATC
GGCGGACAGC TGATCTACCG CACGCCGCAG CTTTGA
 
Protein sequence
MKIDAVDLFY LSMPEVTDAA DGSQDALLVR VAAGGHIGWG ECEAAPLPSI AAFVCPKSHG 
VCRPVSDSVL GQRLDGPDDI ARIAALVGYN SMDLLQAPHM LSGIEMALWD LLGRRLSAPA
WALLGYSANH GKRPYASLLF GDTPQETLER ARAARRDGFA AVKFGWGPIG RGTVAADADQ
IMAAREGLGP DGDLMVDVGQ IFGEDVEAAA ARLPTLDAAG VLWLEEPFDA GALAAHAALA
GRGARVRIAG GEAAHNFHMA QHLMDYGRIG FIQIDCGRIG GLGPAKRVAD AAQARGITYV
NHTFTSHLAL SASLQPFAGL EADRICEYPA APQQLALDIT GDHIRPDGEG LIRAPEAPGL
GLQVAASALR RYLVETEIRI GGQLIYRTPQ L