Gene Hoch_1849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1849 
Symbol 
ID8544231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2546828 
End bp2547823 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content71% 
IMG OID646386555 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_003266290 
Protein GI262195081 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1087] UDP-glucose 4-epimerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.216355 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATCGA CTCCATCTCA GCCCCCCTTC GTGGCGGTGA CCGGCGGCGC CGGCTTCATC 
GGTTCGCACA CCGTGGACCG CCTGCTGGCC GCCGGCTGTC GCGTGGTCGT GCTCGACAAC
CTCAGCACCG GCAAGCGCGA GAACCTGGCC CAGCACGCGG GCGAGCCCCG CTTCCACCTG
GTCGAGACCG ACATCGCCGA CGGCCTGTTC GCGCCCCTGG CCGCGCTCAC CGACGAGCAC
GGGCCGGTGC AGCGCATCAT CCACCTGGCG GCGCAGACCT CGGTGGTGCG CTCGGTCGAG
CAGCCGCTGC ACGACATCCG CATCAACTAC GCGGGCACCG CCCAGGTGCT CGAGTACGCG
CGCCATCGCG GCGTGGCCAA GGTGGTGCTG GCGTCATCGG CCGCGGTCTA CGGCGACACC
GAGGAGCTGC CGGTGCGCGA GACCCTGCCC ACGCGCCCGC TGTCGCCCTA CGGCGCCAAC
AAGCTCGGCA GCGAGCAGCT TCTCTACTAC TACTCGGCCG TGCACGGCGT CGGCACCACG
GCGCTGCGCT TCTTCAACGT CTACGGCCCG CGCCAGGACC CCAAGAGCCC GTACTCGGGA
GTGATCTCGA TCTTCGCCGA TCGCGCCATG GCCGGCAAGC CGCTCACCAT CTTCGGCGAC
GGCGAGCAGA CCCGCGATTT CGTCTACGTC GGCGATGTGT CGCGGGCCGT GGCTCAGGCC
TGCCTGGGCG ACGAGGGCGA CCGCGCGATC ATCAACATCG GCACCGGCAG CGAGACCACG
GTCAACGAGC TGGCGCGCAC CATCGTCTCG CTGTGCGGCG AGGCCGCGGG CGCGCCCGAG
GTCGCCATCT CTCATTCGGA CGCCCGTCCG GGCGAGATCG CGCGCTCGGT GGCCGCGGTC
GAGCGCATGC GCGATATTCT GGGCCTGCGC GCCGAGACCG AGCTGGCCGC CGGGCTGCGC
GAGACCCTGG CCTGGATCCG CAGCGCGGAC GCCTGA
 
Protein sequence
MSSTPSQPPF VAVTGGAGFI GSHTVDRLLA AGCRVVVLDN LSTGKRENLA QHAGEPRFHL 
VETDIADGLF APLAALTDEH GPVQRIIHLA AQTSVVRSVE QPLHDIRINY AGTAQVLEYA
RHRGVAKVVL ASSAAVYGDT EELPVRETLP TRPLSPYGAN KLGSEQLLYY YSAVHGVGTT
ALRFFNVYGP RQDPKSPYSG VISIFADRAM AGKPLTIFGD GEQTRDFVYV GDVSRAVAQA
CLGDEGDRAI INIGTGSETT VNELARTIVS LCGEAAGAPE VAISHSDARP GEIARSVAAV
ERMRDILGLR AETELAAGLR ETLAWIRSAD A