Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_3855 |
Symbol | |
ID | 5541359 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 5038548 |
End bp | 5039660 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640895965 |
Product | mandelate racemase/muconate lactonizing protein |
Protein accession | YP_001433910 |
Protein GI | 156743781 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.552694 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.795222 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAAATCA CCGATGTCGA AGCCATCTGG TTGCAGCTCC CAACCGTGGA TGATCGCGCC GACGGCACCC AGGATACGCT GGTGGTCAAA GTCCACACCG ACGAAGGGAT CGTCGGAGTT GGCGAGGTCG ATTCGTCGCC AATGGCAGCG AAAGCGATCA TCGAGGCGCC GCTGTCTCAC CGAATCGCAC GCGGTTTACG GCTCTGCGTG ATCGGCGAGG ATCCGCTCGA TATTGCGCGC CTGTGGCACG CCATGTACGA GGGATCGATC TTCTTCGGGC GCGGCGGCGC GGCGCAGCAG GCCATCAGCG GCATCGACAT CGCCCTGTGG GACATTGCCG GCAAGGCGCT GGGGCAGCCG GTCTACCGAC TGCTCGGCGG CGGCTTCCGC AATCGTCTGC GCGCTTATGC GTCGATTCTG TTTCAGGAGA CGCCTGAAGC AACGTATGAC CTTGCCCGCC GCCTCGCCGA CCAGGGATTC ACAGCGGTCA AGTTCGGGTG GGGACCGATG GGAACAAGCG AGAAGACGGA TCTTGCACTG GTGCGGATGG CGCGGCGCGG GCTGGGGGAC CACCTCGACC TGATGATCGA CGCGGGAATC TGCTACGACA CGGCCACGGC AATCCGGCGT GCGCATCAGT TCGCGGAATA CAACCCGTTC TGGCTTGAAG AGCCGCTGCA CCCCGATAAT CTCGAAGGGT ACTCGCGCCT GGCTGCGGTA TCGCCTATCC GCATCGCTGC CGGAGAACAG GAGACCACCC TGGTCGGATT TCAGGCGCTG CTGGACGCCG GTCTGGCAGT GGTGCAGCCG GACGTGGCGC GGGTCGGAGG GATATCGCAG GCAATTCAGA TTGGTCGGAT GGCGATGCAG CGCCACCGTT TGTGCGTCAA TCACTCGTAC AAGACCGGCA TCAGCATCGC CGCTTCGTTG CACTTCCTGG CGGCGCTGCC GAACGCTCCG CTCCTGGAGT ATTGCGTCGA GCAGAGTCCG TTGCGGCAGA CGCTTACCCG CGAGACGTTT CCGGTGGTGG ATGGATGGGT CGCCGTGCCG CAAGAACCAG GGTTGGGAAT CACGCTCGAC GAAGAGGTGA TCGCCCGCTA CCGCGTGGCG TGA
|
Protein sequence | MKITDVEAIW LQLPTVDDRA DGTQDTLVVK VHTDEGIVGV GEVDSSPMAA KAIIEAPLSH RIARGLRLCV IGEDPLDIAR LWHAMYEGSI FFGRGGAAQQ AISGIDIALW DIAGKALGQP VYRLLGGGFR NRLRAYASIL FQETPEATYD LARRLADQGF TAVKFGWGPM GTSEKTDLAL VRMARRGLGD HLDLMIDAGI CYDTATAIRR AHQFAEYNPF WLEEPLHPDN LEGYSRLAAV SPIRIAAGEQ ETTLVGFQAL LDAGLAVVQP DVARVGGISQ AIQIGRMAMQ RHRLCVNHSY KTGISIAASL HFLAALPNAP LLEYCVEQSP LRQTLTRETF PVVDGWVAVP QEPGLGITLD EEVIARYRVA
|
| |