Gene Haur_2026 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2026 
Symbol 
ID5733915 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2518581 
End bp2519606 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content50% 
IMG OID641279170 
Productmandelate racemase/muconate lactonizing protein 
Protein accessionYP_001544797 
Protein GI159898550 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTTAA CCACAACAGT GCTCAGCCTG CAACTTGAGC AACCGTTTGT CAGCAATAAG 
GGCTCGACGA CTACGGTGCA CCAAGTTGTA ATCAAATTAA CTTGGCAGGA GTATGGTGGT
TTTGGTACGG TACTTTGCCC CAAAGAAACC CAACTTAGTG TTGAGCAAAT TCAACAGCTG
ATTCAGGCTT GTGAACCATT GCTTAGTACC GCCACACCAT GGCAATTTGA ACTGTATCAA
GGTCAATTAG CCTCAGTCGT TCGCAATCAG GCTGCAATGA TGGCTGGCAT CGATATGGCA
TGGCATGATC TTTTGGGCAA GGTGGTTGCC CAACCCATCC ACGCGCTTTG GGGCTTGGCA
GGGTTGAGCA TCCCACCAAC GGCACTCTCG CTTGGCGCAC AATCGGAGCA GGCCTTGGTC
GCACAGGCTG CAAAATTGGC GGCATGGCCA ATTCTTAAAC TCAAACTCAC AACCGATAGC
AATCTCGATA GCCTGCGCCA ACTACGCGAG GTCTATGCTG GGCGGATTTG GGTTGATGGC
AATGGAGCAT GGGATGTTGA TCAAGCGATT GCTGCGGCGC AACAATGCCA TACCTATGGG
GTTGAACTGA TCGAACAGCC AATTCCAGCG GGCAACCTCG ACCAACTGCG CACAATTCGC
CAACACTCAC CAATTCCCAT AGTTGCCGAT GAAGATTGTC GTGGGCTTGC TGATGTGCTG
CGCTTGCATA CATGTGTTGA TGTAATTAAT CTCAAACTCT TCAAATGTGG AGGCTTACGC
CAAGCTCGCA CGATGATCGA CGTGGCCAAG CAATTTGGCT TAAAAGTTAT GTTGGGTTGT
AAAACTGAAA GCAGCCTTGG AATTAGCGCC ATCGCCCAAC TTGCCGGGCT AGCAGATTAC
CTTGATCTTG ATGGGCATCT TGATTTGGTC AATGACCCCT TTCAAGGCCT TGTGATCGAG
CAAGGTACGC TGCGTTTACC GCAAACTCCA GGTTTAGGAT TAACCATTCA AGGAGCAATC
GAATGA
 
Protein sequence
MNLTTTVLSL QLEQPFVSNK GSTTTVHQVV IKLTWQEYGG FGTVLCPKET QLSVEQIQQL 
IQACEPLLST ATPWQFELYQ GQLASVVRNQ AAMMAGIDMA WHDLLGKVVA QPIHALWGLA
GLSIPPTALS LGAQSEQALV AQAAKLAAWP ILKLKLTTDS NLDSLRQLRE VYAGRIWVDG
NGAWDVDQAI AAAQQCHTYG VELIEQPIPA GNLDQLRTIR QHSPIPIVAD EDCRGLADVL
RLHTCVDVIN LKLFKCGGLR QARTMIDVAK QFGLKVMLGC KTESSLGISA IAQLAGLADY
LDLDGHLDLV NDPFQGLVIE QGTLRLPQTP GLGLTIQGAI E