Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3591 |
Symbol | |
ID | 5735452 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4520659 |
End bp | 4521696 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280740 |
Product | mandelate racemase/muconate lactonizing protein |
Protein accession | YP_001546355 |
Protein GI | 159900108 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAATGGG CGATTTATCC GGTTGAACTA CGCTTGCGCA ACACGTTTCG GATTGCCCAC GGAGCTAGCA ATACCCGCCA TAATGTGTTG TTGAATTTAG ATGATGGCTG GGGCGAGGCC GCTGCTGTAG CCTACCACGG CGAAACTGCT GCTAAAATTC AGGCTTGGCT GGAACGCTAT CGCGAAACAA TTACCAGCAG CTATGATCCA GCGGCAATTC ATTGGCTCTT GGCCAAGCTC GATTTTGAGA GCCGCGCCGC CCGCGCTGCT GTCGATTTAG CCTTGCATGA TCGCTTAGGC CAACAACTTG GTGTGTCATT GCGTCATCTT TTGGGCTTGA ACGGCTTAGA ACTCCCACAA ACCTCAGTTA CCTTGCCAAT TGAAGAGCCT GAAGCTTTGC GCCAACAAGC TTTGGCCGTA GCGCATTATC CAATTCTCAA GGTGAAGTTG GGCGGCCCTG CCGATTTAGC CAGCGTGGTT TTGATTCGCG AAGCCGCACC CAACAGCCGC CTGCGGGTTG ATGCCAATGC TGGTTGGAGC CGTGAAACCG CCGCCCAATT GATTCCGGCG CTGGCTGAAT TGGGCGTGGA GTTGATTGAG CAACCGTTGG CAGTTGATGA TTTAGCGGGC TATGCTCAAC TTAAAGCTGC CAACTATGGC GTGCCAATTT TTGCCGACGA GCCGATTAAA ACTGCGGCTG ATGTGGCGCG TTGGGCCAAG GTGGTTGATG GCGTAAACCT CAAGTTGATG AAAACTGGCG GAATTGTCGG GGCGTGCGCA GCGATTGCCA CCGCCAGAGC CCATGATTTA CAAGTGATGC TTGGGTGTAT GATTGAAAGT AGCATCGGGG TTTCAGCGGC CTGTGCTTTG GCTGGCTTGG CCGATTTCGT TGACCTTGAT GGGCCATTAT TGATCGCCAA CGATCTGGCA ACTGGATTGA ACTTTGCAAC CGCTACGATT CAGCCAGCAG CAACGCCAGG TTTAGGCGTG CAAATTGACT GGACAGCACT AAACAGCGCC CGACTTGAAA CACGCTAG
|
Protein sequence | MEWAIYPVEL RLRNTFRIAH GASNTRHNVL LNLDDGWGEA AAVAYHGETA AKIQAWLERY RETITSSYDP AAIHWLLAKL DFESRAARAA VDLALHDRLG QQLGVSLRHL LGLNGLELPQ TSVTLPIEEP EALRQQALAV AHYPILKVKL GGPADLASVV LIREAAPNSR LRVDANAGWS RETAAQLIPA LAELGVELIE QPLAVDDLAG YAQLKAANYG VPIFADEPIK TAADVARWAK VVDGVNLKLM KTGGIVGACA AIATARAHDL QVMLGCMIES SIGVSAACAL AGLADFVDLD GPLLIANDLA TGLNFATATI QPAATPGLGV QIDWTALNSA RLETR
|
| |