Gene Hoch_4066 details


Gene Information

Locus tag: Hoch_4066
Symbol:
ID: 8546467
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 5586290
End bp: 5587291
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 73%
IMG OID: 646388743
Product: D-isomer specific 2-hydroxyacid dehydrogenase NAD-binding protein
Protein accession: YP_003268458
Protein GI: 262197249
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0111] Phosphoglycerate dehydrogenase and related dehydrogenases
TIGRFAM ID:
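
The coordinate and length fields above are related by simple arithmetic, which can be used as a consistency check. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length in bp."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon (assumed present) does not encode a residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(5586290, 5587291)
print(length)                     # 1002
print(protein_length_aa(length))  # 333
```

Both values agree with the Gene Length and Protein Length fields of this record.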


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.175631
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.249827
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGCA ACCAGCTACA CATCTGCGTC TACCACGCCA CCATGGCCCA GCCCCTGGCC 
GAGCTGGTGG GCGCGCGCCT CACCGGCGCC CGCGTCAGCG CGGTGGACGA CGTCGCCGCG
GACCCGCCCG ATCTCGACAG CATCGACGTG CTCATCGGGT TTCGCTTCCC CCCGGGTCTG
TTGGCGCGCA TGCCGCGGCT GCGCCTGCTG CAGCTCACCT CGGCCGGTCA CGACCAGGTG
GACGACGAAG ACCTGCCGCC CGGGCTCGTG GTCGCGCACG CCGGCTCGAT TCCGGCGCCC
GCGGTGGCCG AGTACGCGCT CATGGGCATG CTGATGTTCG CGCGCAACGG CCACCAGCTC
GTGCGGCAGC ACCTGCAGCA CCTGTGGTCG CGTCCCGGCG CCCGGCTCAT CGTCGGCACC
ACCGTGGTGA TGCTGGGCTT TGGCCGCATC GGCCGCGAGG TCGCCGAGCG CGCCAGCGCG
CTGGGCATGC AGATCATCGC GGTGACGCGC AGCGGCCACG TCCGCGTCCC CGGCGTGCGC
TGCGTGCCGG TCGAAGAGCT GTCCACGGTG CTGCCGTCGG CCGACTTCCT GGTCGTGTGC
GTGCCCGGCA ACCCGGGGAC GCAGGGCCTG GTCGGCCGCG AAGCCTTTTC GTCCATGCGT
CCGGGTTGCT GCCTCATCGA CGTGAGCCGG CCCGGCGTCG TCGACACCGA GGCCCTGGTC
GAAGCCCTGG GCACCGGCCG CTGCGGCGGC GCCATGCTCG ACGTCGTCGC CGGCGAGCCG
CTCGCCGCCG ATCACCCGCT GTGGCGCGAG CCCGGCGTGT GGATCACGCC GCACTGCGCC
TTCGAACAGG ACCGCGAGGT CGAAGCGCTC AGCGCCCTGC TCATCGAAAA CGTGGAACGC
CTGCGCAGCG CCCGCCCGCT GCGCAATGTC GTCGCCCGCG ACGGCGTCCC CTCGCATCCG
CTTGCCTCCA CGCTCACCAC CGCGCCCACG AGCCTCGGCT AG
 
Protein sequence
MSRNQLHICV YHATMAQPLA ELVGARLTGA RVSAVDDVAA DPPDLDSIDV LIGFRFPPGL 
LARMPRLRLL QLTSAGHDQV DDEDLPPGLV VAHAGSIPAP AVAEYALMGM LMFARNGHQL
VRQHLQHLWS RPGARLIVGT TVVMLGFGRI GREVAERASA LGMQIIAVTR SGHVRVPGVR
CVPVEELSTV LPSADFLVVC VPGNPGTQGL VGREAFSSMR PGCCLIDVSR PGVVDTEALV
EALGTGRCGG AMLDVVAGEP LAADHPLWRE PGVWITPHCA FEQDREVEAL SALLIENVER
LRSARPLRNV VARDGVPSHP LASTLTTAPT SLG
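
The sequence blocks above are printed in groups of 10 characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, its GC content computed, and its translation compared with the protein sequence. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in the start codons they permit, not in the assignments used here); the helper names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumed to apply to translation table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped sequence block."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAGCCGCA ACCAGCTACA CATCTGCGTC"))  # MSRNQLHICV
```

Applying `translate` to the full gene sequence and comparing the result with `clean_sequence` of the protein block is one way to confirm that the two listings correspond.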