Gene Hoch_4063 details


Gene Information

Locus tag: Hoch_4063
Symbol:
ID: 8546464
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5581806
End bp: 5583023
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 69%
IMG OID: 646388740
Product: Mandelate racemase/muconate lactonizing protein
Protein accession: YP_003268455
Protein GI: 262197246
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
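
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 5581806
END_BP = 5583023


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS: all codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


print(gene_length(START_BP, END_BP))                  # 1218
print(protein_length(gene_length(START_BP, END_BP)))  # 405
```

Under these assumptions the results match the listed Gene Length (1218 bp) and Protein Length (405 aa).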


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0405823
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.115232
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGAAGATCG TCAAGGTCGA GACCTTTTTG GCAGATGGTG GCTGGCGGGC CTGGGGCTTC 
GTCAAAATCG AGACCGACGC CGGCATCACC GGCTGGGGCG AGTGCACCTG CGAGTTCTCG
CAGTACGCAG TGCTGGGCGC GGTCGCCGAC CTCACGCCGG TGCTCATCGG CCAGGACCCG
CGCGCCTACG AGATGCGCTT CTGGGACATG TACCGGCTCT CGCGCCTGGG CTCGGTGGGC
GGCGCCGTGG GCAAGGCCAT CGGCGCCATC GAGTGCGCGC TGCTCGACAT CAAGGCGCGC
GCGCTGGGCA TCTCGGTGGC CGAGCTGTTC GGCGGGCCGC TGCGCGAAAC CGTGCCCGTG
TACTGGTCGC ACTTCTGCGT GACCCGCGTG TTCGCGGCCG AGCACTGCCG CGTCGAGCCG
GTGCGCAGCC TCAGCGACGT GGCCGCGTGC GCGCGCGAGG TCGTGGAGCG CGGCTTCCGC
GCGCTCAAGA CCAACATCTT CTTCCCCGGC GACCCGGGCG AGGTCTATCA CCCCGGCTTC
GGCGGCGGCC CCGGCACCAC CGATCAAGTC GCCTGGCCCG AGGTGGTCGG CCAGGCCGAC
GCGCTCTTCG GCACCATCCG CGACGCCGTG GGCCCGGAGG TGGGCGTCAT CCTCGACGTC
AACTTCAACT TCAAGCCCGA GAGCTGCATC CGCCTGGCCA GGGAGCTTTC GCCCTACGAC
CTGCTGTGGA TGGAGCTCGA CATGTACGAT CCCGCGGCCC TGCGCGCGAT CAAAGACGCC
ATCGACATCC CGCTGTGCTC GCTCGAGACC CTGTTCTACG CCGAGCAGTA CCGGCCGTAC
TTCGAGCGCC ACGCGGTCGA CGTGGCCATG CTCGACGTGC CCTGGAACGG CTTCGCCCAG
GCCAAGAAGG TCGGCGACAT GGCCCAGGTG TTCCAGACCA ACGTGTGTCC GCACAACTAC
TACAGCCACC TGGCCTCGTT CATCAGCGCC CAGCTCTGCG CCGTGCTGCC CAACGTGCGC
ATGATGGAGA TCGACCTCGA CGACGTGCCC TGGAAGGACG AGATCGTGTC GCGCGCGCCC
GCGTTCACCG ATGGCGCCAT GCGCGTGCCC GAGGGCCCGG GCTGGGGCAC CGAGATGTTA
GAAGACGAGC TGCGCCGGCA TCCGTGGAGA CCGGACGAGC GTCCGCTGAC CGTGCCCACC
GGCTCGTCCG GACGCTGA
 
Protein sequence
MKIVKVETFL ADGGWRAWGF VKIETDAGIT GWGECTCEFS QYAVLGAVAD LTPVLIGQDP 
RAYEMRFWDM YRLSRLGSVG GAVGKAIGAI ECALLDIKAR ALGISVAELF GGPLRETVPV
YWSHFCVTRV FAAEHCRVEP VRSLSDVAAC AREVVERGFR ALKTNIFFPG DPGEVYHPGF
GGGPGTTDQV AWPEVVGQAD ALFGTIRDAV GPEVGVILDV NFNFKPESCI RLARELSPYD
LLWMELDMYD PAALRAIKDA IDIPLCSLET LFYAEQYRPY FERHAVDVAM LDVPWNGFAQ
AKKVGDMAQV FQTNVCPHNY YSHLASFISA QLCAVLPNVR MMEIDLDDVP WKDEIVSRAP
AFTDGAMRVP EGPGWGTEML EDELRRHPWR PDERPLTVPT GSSGR
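
The relationship between the two sequences can be illustrated with a minimal sketch that translates the gene sequence under translation table 11 and computes a GC fraction. The codon table is built from the standard NCBI amino-acid ordering, and the set of start codons is the one NCBI lists for table 11; the helper names (`clean`, `translate`, `gc_content`) are hypothetical and not part of the source database.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Alternative initiation codons listed for table 11; at the first
# position each of them is read as methionine.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    residues = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if index == 0 and codon in START_CODONS:
            aa = "M"
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above.
FIRST_LINE = "GTGAAGATCG TCAAGGTCGA GACCTTTTTG GCAGATGGTG GCTGGCGGGC CTGGGGCTTC"
print(translate(FIRST_LINE))  # MKIVKVETFLADGGWRAWGF
```

The GTG start codon is read as M, which is why the protein sequence begins with MKIV rather than VKIV. Passing the full gene sequence to `translate` and `gc_content` should allow a comparison with the listed protein sequence and GC content.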