Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rsph17029_4156 |
Symbol | |
ID | 4895027 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodobacter sphaeroides ATCC 17029 |
Kingdom | Bacteria |
Replicon accession | NC_009040 |
Strand | + |
Start bp | 94244 |
End bp | 95419 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640110547 |
Product | mandelate racemase/muconate lactonizing protein |
Protein accession | YP_001041859 |
Protein GI | 126464883 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 128 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 95 |
Fosmid unclonability p-value | 0.845536 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCG ATGCCGTCGA CCTTTTCTAT CTGTCGATGC CCGAGGTGAC CGATGCCGCC GATGGCAGTC AGGATGCGCT GCTGGTCCGG GTGGCGGCCG GCGGACATAT CGGCTGGGGC GAGTGCGAAG CGGCGCCGCT GCCCTCGATC GCCGCCTTCG TCTGTCCGAA ATCGCACGGT GTCTGCCGTC CGGTCTCGGA CTCCGTGCTC GGTCAGCGGC TGGACGGGCC GGACGATATC GCCCGCATTG CGGCGCTGGT CGGCTATAAC TCGATGGACC TGCTGCAGGC GCCGCATATG CTGTCGGGGA TCGAGATGGC GCTGTGGGAT CTGCTGGGCC GCAGGCTCTC GGCCCCGGCA TGGGCCCTGC TGGGCTACAG CGCCAACCAT GGCAAGAGGC CCTACGCCTC GCTGCTGTTC GGCGACACGC CGCAGGAGAC ACTCGAGCGG GCCCGCGCCG CGCGGCGCGA TGGCTTTGCA GCCGTCAAAT TCGGCTGGGG TCCGATCGGG CGCGGCACTG TGGCGGCGGA CGCCGATCAG ATCATGGCCG CGCGCGAAGG GCTGGGCCCG GACGGGGACC TGATGGTCGA TGTCGGCCAG ATCTTCGGAG AGGATGTCGA GGCGGCCGCT GCGCGCCTGC CGACGCTCGA TGCCGCCGGG GTGCTCTGGC TCGAAGAGCC GTTCGACGCG GGTGCGCTGG CCGCGCATGC CGCTCTGGCC GGGCGGGGGG CCCGTGTCCG CATCGCCGGC GGCGAGGCGG CGCACAACTT CCATATGGCG CAGCATCTGA TGGATTACGG CCGTATCGGC TTCATCCAGA TCGACTGCGG CCGCATCGGA GGGCTCGGTC CGGCGAAGCG GGTGGCCGAT GCGGCGCAGG CGCGCGGCAT CACCTATGTC AACCATACCT TCACCTCGCA TCTGGCGCTC TCCGCCTCGT TGCAGCCCTT TGCCGGGCTC GAGGCCGACC GCATCTGCGA ATATCCCGCG GCTCCGCAGC AACTGGCGCT CGATATCACC GGCGATCACA TCCGGCCCGA TGGCGAGGGG TTGATCCGGG CTCCGGAAGC ACCGGGGCTG GGGCTGCAGG TGGCTGCGTC CGCGCTGCGC CGCTACCTGG TCGAGACCGA GATCCGCATC GGCGGACAGC TGATCTACCG CACGCCGCAG CTTTGA
|
Protein sequence | MKIDAVDLFY LSMPEVTDAA DGSQDALLVR VAAGGHIGWG ECEAAPLPSI AAFVCPKSHG VCRPVSDSVL GQRLDGPDDI ARIAALVGYN SMDLLQAPHM LSGIEMALWD LLGRRLSAPA WALLGYSANH GKRPYASLLF GDTPQETLER ARAARRDGFA AVKFGWGPIG RGTVAADADQ IMAAREGLGP DGDLMVDVGQ IFGEDVEAAA ARLPTLDAAG VLWLEEPFDA GALAAHAALA GRGARVRIAG GEAAHNFHMA QHLMDYGRIG FIQIDCGRIG GLGPAKRVAD AAQARGITYV NHTFTSHLAL SASLQPFAGL EADRICEYPA APQQLALDIT GDHIRPDGEG LIRAPEAPGL GLQVAASALR RYLVETEIRI GGQLIYRTPQ L
|
| |