Gene Hoch_3943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_3943 
Symbol 
ID8546339 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp5437145 
End bp5438155 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content77% 
IMG OID646388615 
ProductAlcohol dehydrogenase GroES domain protein 
Protein accessionYP_003268335 
Protein GI262197126 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0142481 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00944074 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTGGCCC TGTGGCTCGA AGACCGCGCG CTGCGGCTGC GCGATGACCT GCCGGTGCCC 
GCGCCCGGCC CCGGCGAGGC GCGCGTGCGC GTGCTGCGCG CCGGCATCTG CGCCACGGAT
GTCGAGCTTG TGCGCGGCTA CTATCCGTTC ACCGGCGTGC CCGGCCACGA GTTCGTCGGC
GTGGTCGACG CGCTCGGCCC CGCGACCGAC GACGGTGCAG CGGACCCGGG CGACGACGAC
GACGACGACG GGGGCTGGCT TGGGCGCCGC GTGGTCGGCG AGATCAACGT GGTCTGCGGC
GCCTGCGCGC AGTGCCGGGC GGGCCGTCGC ACGCACTGCA CGCGCCGGCA AGCGCTCGGC
ATCCACGGCC GCCACGGCGC CTTCGCCGAG TACCTGTGCC TGCCGCTCGC CAACCTGCTC
GCGGTCCCCG ACGAGCTGAG CAGCGACGCG GCCGCGTTCA CCGAGCCGCT GGCGGCCGCG
CTCGAGCTCC AGGAGCAGGT CGCGCTGCGC CCCGGCGCGC GCGTGCTCGT GGTCGGCGCC
GGCAAGCTGG GCCAGCTCGT GGCCCAGAGT CTGGCGCTGA GCACGGCCCA GGTGCGCGTG
GTGTGCCGCT CGGCGCAGCG CCGCGCACCG CTCCACGCGC GCGGCATCGC CACCTGCGCG
CCCGACGAGG TCCGCGCCGG CTGCGCCGAC CTGGCCGTGG AGTGCAGCGG CCACCCCGAC
GGCTTCGCGC TGGCCCGGCG CGCGCTGCGG GCGCGCGGCA CCCTGGTGCT CAAGAGCACC
TACGCGGGCG CTCTGACCAT AGACGCCTCG TCGCTCGTGG TCGACGAGCT GACCGTGGTG
GGCTCGCGCT GCGGTCCCTT CGCGCCGGCC CTGCGCCTGC TCGCCAGCGG CCGCATCGAC
CCGATGCCGC TGGTGAGCGC GCGCTTCCCG CTGCGCGAGG CGCTGGCCGC CTTCGACGCG
GCGCGCGCGC CCGGCGCCTT CAAAGTCCTG CTGGCCGCCG ACGCCGCCTG A
 
Protein sequence
MLALWLEDRA LRLRDDLPVP APGPGEARVR VLRAGICATD VELVRGYYPF TGVPGHEFVG 
VVDALGPATD DGAADPGDDD DDDGGWLGRR VVGEINVVCG ACAQCRAGRR THCTRRQALG
IHGRHGAFAE YLCLPLANLL AVPDELSSDA AAFTEPLAAA LELQEQVALR PGARVLVVGA
GKLGQLVAQS LALSTAQVRV VCRSAQRRAP LHARGIATCA PDEVRAGCAD LAVECSGHPD
GFALARRALR ARGTLVLKST YAGALTIDAS SLVVDELTVV GSRCGPFAPA LRLLASGRID
PMPLVSARFP LREALAAFDA ARAPGAFKVL LAADAA