Gene Information |
Locus tag | Haur_2026 |
Symbol | |
ID | 5733915 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2518581 |
End bp | 2519606 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279170 |
Product | mandelate racemase/muconate lactonizing protein |
Protein accession | YP_001544797 |
Protein GI | 159898550 |
COG category | [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only |
COG ID | [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily |
TIGRFAM ID | |
|
|
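The length fields above follow from the coordinates. As a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the numbers in this record) and that the CDS ends in a single stop codon, the arithmetic can be checked as below. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of 3 and, by default, that the
    final codon is a stop codon that does not contribute a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Haur_2026.
length_bp = gene_length(2518581, 2519606)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

The strand field does not change this calculation: for a gene on the minus strand the coordinates still refer to positions on the replicon, and only the reading direction differs.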
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTTAA CCACAACAGT GCTCAGCCTG CAACTTGAGC AACCGTTTGT CAGCAATAAG GGCTCGACGA CTACGGTGCA CCAAGTTGTA ATCAAATTAA CTTGGCAGGA GTATGGTGGT TTTGGTACGG TACTTTGCCC CAAAGAAACC CAACTTAGTG TTGAGCAAAT TCAACAGCTG ATTCAGGCTT GTGAACCATT GCTTAGTACC GCCACACCAT GGCAATTTGA ACTGTATCAA GGTCAATTAG CCTCAGTCGT TCGCAATCAG GCTGCAATGA TGGCTGGCAT CGATATGGCA TGGCATGATC TTTTGGGCAA GGTGGTTGCC CAACCCATCC ACGCGCTTTG GGGCTTGGCA GGGTTGAGCA TCCCACCAAC GGCACTCTCG CTTGGCGCAC AATCGGAGCA GGCCTTGGTC GCACAGGCTG CAAAATTGGC GGCATGGCCA ATTCTTAAAC TCAAACTCAC AACCGATAGC AATCTCGATA GCCTGCGCCA ACTACGCGAG GTCTATGCTG GGCGGATTTG GGTTGATGGC AATGGAGCAT GGGATGTTGA TCAAGCGATT GCTGCGGCGC AACAATGCCA TACCTATGGG GTTGAACTGA TCGAACAGCC AATTCCAGCG GGCAACCTCG ACCAACTGCG CACAATTCGC CAACACTCAC CAATTCCCAT AGTTGCCGAT GAAGATTGTC GTGGGCTTGC TGATGTGCTG CGCTTGCATA CATGTGTTGA TGTAATTAAT CTCAAACTCT TCAAATGTGG AGGCTTACGC CAAGCTCGCA CGATGATCGA CGTGGCCAAG CAATTTGGCT TAAAAGTTAT GTTGGGTTGT AAAACTGAAA GCAGCCTTGG AATTAGCGCC ATCGCCCAAC TTGCCGGGCT AGCAGATTAC CTTGATCTTG ATGGGCATCT TGATTTGGTC AATGACCCCT TTCAAGGCCT TGTGATCGAG CAAGGTACGC TGCGTTTACC GCAAACTCCA GGTTTAGGAT TAACCATTCA AGGAGCAATC GAATGA
|
Protein sequence | MNLTTTVLSL QLEQPFVSNK GSTTTVHQVV IKLTWQEYGG FGTVLCPKET QLSVEQIQQL IQACEPLLST ATPWQFELYQ GQLASVVRNQ AAMMAGIDMA WHDLLGKVVA QPIHALWGLA GLSIPPTALS LGAQSEQALV AQAAKLAAWP ILKLKLTTDS NLDSLRQLRE VYAGRIWVDG NGAWDVDQAI AAAQQCHTYG VELIEQPIPA GNLDQLRTIR QHSPIPIVAD EDCRGLADVL RLHTCVDVIN LKLFKCGGLR QARTMIDVAK QFGLKVMLGC KTESSLGISA IAQLAGLADY LDLDGHLDLV NDPFQGLVIE QGTLRLPQTP GLGLTIQGAI E
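The protein sequence corresponds to a codon-by-codon translation of the gene sequence under translation table 11. The sketch below uses only the Python standard library and the standard amino acid assignments, which table 11 shares with table 1 (the tables differ in their permitted start codons, which this sketch does not model). It assumes the gene sequence is already given in coding orientation, as it appears to be here since it begins with ATG and ends with TGA. The name `translate` is a hypothetical helper.

```python
from itertools import product

# Amino acid assignments in NCBI order, with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a coding-orientation DNA string into a protein string.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    If to_stop is True, translation ends at the first stop codon; otherwise
    stop codons are written as '*'. Trailing bases that do not form a full
    codon are ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence from the record.
print(translate("ATGAACTTAA CCACAACAGT GCTCAGCCTG"))
```

Passing the full gene sequence from the record to `translate` should reproduce the listed protein sequence once the spaces in both are removed.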