Gene Haur_3591 details


Gene Information

Locus tag: Haur_3591
Symbol:
ID: 5735452
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4520659
End bp: 4521696
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 53%
IMG OID: 641280740
Product: mandelate racemase/muconate lactonizing protein
Protein accession: YP_001546355
Protein GI: 159900108
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
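
The coordinate and length fields in this record agree with one another, which a little arithmetic can confirm. The Python sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(4520659, 4521696)
print(length)                  # 1038
print(protein_length(length))  # 345
```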


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAATGGG CGATTTATCC GGTTGAACTA CGCTTGCGCA ACACGTTTCG GATTGCCCAC 
GGAGCTAGCA ATACCCGCCA TAATGTGTTG TTGAATTTAG ATGATGGCTG GGGCGAGGCC
GCTGCTGTAG CCTACCACGG CGAAACTGCT GCTAAAATTC AGGCTTGGCT GGAACGCTAT
CGCGAAACAA TTACCAGCAG CTATGATCCA GCGGCAATTC ATTGGCTCTT GGCCAAGCTC
GATTTTGAGA GCCGCGCCGC CCGCGCTGCT GTCGATTTAG CCTTGCATGA TCGCTTAGGC
CAACAACTTG GTGTGTCATT GCGTCATCTT TTGGGCTTGA ACGGCTTAGA ACTCCCACAA
ACCTCAGTTA CCTTGCCAAT TGAAGAGCCT GAAGCTTTGC GCCAACAAGC TTTGGCCGTA
GCGCATTATC CAATTCTCAA GGTGAAGTTG GGCGGCCCTG CCGATTTAGC CAGCGTGGTT
TTGATTCGCG AAGCCGCACC CAACAGCCGC CTGCGGGTTG ATGCCAATGC TGGTTGGAGC
CGTGAAACCG CCGCCCAATT GATTCCGGCG CTGGCTGAAT TGGGCGTGGA GTTGATTGAG
CAACCGTTGG CAGTTGATGA TTTAGCGGGC TATGCTCAAC TTAAAGCTGC CAACTATGGC
GTGCCAATTT TTGCCGACGA GCCGATTAAA ACTGCGGCTG ATGTGGCGCG TTGGGCCAAG
GTGGTTGATG GCGTAAACCT CAAGTTGATG AAAACTGGCG GAATTGTCGG GGCGTGCGCA
GCGATTGCCA CCGCCAGAGC CCATGATTTA CAAGTGATGC TTGGGTGTAT GATTGAAAGT
AGCATCGGGG TTTCAGCGGC CTGTGCTTTG GCTGGCTTGG CCGATTTCGT TGACCTTGAT
GGGCCATTAT TGATCGCCAA CGATCTGGCA ACTGGATTGA ACTTTGCAAC CGCTACGATT
CAGCCAGCAG CAACGCCAGG TTTAGGCGTG CAAATTGACT GGACAGCACT AAACAGCGCC
CGACTTGAAA CACGCTAG
 
Protein sequence
MEWAIYPVEL RLRNTFRIAH GASNTRHNVL LNLDDGWGEA AAVAYHGETA AKIQAWLERY 
RETITSSYDP AAIHWLLAKL DFESRAARAA VDLALHDRLG QQLGVSLRHL LGLNGLELPQ
TSVTLPIEEP EALRQQALAV AHYPILKVKL GGPADLASVV LIREAAPNSR LRVDANAGWS
RETAAQLIPA LAELGVELIE QPLAVDDLAG YAQLKAANYG VPIFADEPIK TAADVARWAK
VVDGVNLKLM KTGGIVGACA AIATARAHDL QVMLGCMIES SIGVSAACAL AGLADFVDLD
GPLLIANDLA TGLNFATATI QPAATPGLGV QIDWTALNSA RLETR
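
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a minimal pure-Python illustration, not the database's own pipeline. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted). The names `CODON_TABLE` and `translate` are hypothetical, and the input is assumed to be in frame, starting at the first base of the CDS.

```python
from itertools import product

# Codon table in the conventional TCAG ordering, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the 10-base grouping and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above
print(translate("ATGGAATGGG CGATTTATCC GGTTGAACTA"))  # MEWAIYPVEL
```

Passing the full gene sequence, with its spacing and line breaks left in place, works the same way, since whitespace is removed before translation.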