Gene Emin_0146 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0146 
Symbol 
ID6263944 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp154763 
End bp155800 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content45% 
IMG OID642610610 
Productalcohol dehydrogenase 
Protein accessionYP_001875048 
Protein GI187250566 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR00692] L-threonine 3-dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones78 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCGC TTTGCAAAAC AAAACCTGAA AAAGGCGTTG AATATAAAGA CGTGGACCTG 
CCTAAGATAA AAGATGACGA GCTTTTAGTT AAAATTTATA AAACTTCAAT TTGCGGCAGC
GATATACCGG TTTATAATTA TACCGGCTGG GCGCCTAGAA GAATCCCGCT TCCGTTCGTT
TTCGGGCACG AGCTTTGCGG CGAAGTTGTT GAAACAGGCA AAAACACAAA AGGCTTTGAG
AAAGGCGATT TTATATCCGT TGAATCACAC GTCTTTTGCG GGCTTTGCTA CCAATGCCGC
AACGACCAAA GGCATGTTTG CTCCAACATG GTTGTTTTGG GGCTGGATAC ACAGGGCGGA
TTTTCAGAAT ACGCTGCCAT ACCTGCGCGC TGCGGGTGGA AACATTCTGA CAATAAATTA
AAGGAAATAG CTTCTATTAT GGAGCCTTTG GGTAACGCTG TTTTCGCCAC ATTGGTTGAA
GACGTTGCGG GAAAAACGGT TTTTGTTGAA GGTTGCGGTC CGCAGGGTCT TTTTGCCATT
GAAATAGCCA AAGCCTGTGG CGCGCAAAAA GTGATTGCTT TGGAAGGCTC TCCTTACCGC
CAAAAAATGG CTGAACAAAT GGGAGCGGAC GCTATTTTCA GCCCTACGGA AGAAAAGTTA
CTTGAAAAAA TTAAAAAAGC CTCGGGCGAC CCAAGCGGCG TTGACGTTGT TCTTGAAATG
TCAGGCCATC CGGACGCTGT AAGGCTTGGC CTAAAAGCGG TTAAGCCCGC GGGCAGATTT
ACGGCTTTCG GCCTTCCCGG CTCGGAAATG ACGCTTGACT ATTCCAACGA TATTGTTTTT
AAAGGTATTA AAGTTGAAGG TATTACAGGC AGACAAATTT ATAAAAGCTG GCATGTTATG
GAGGGGCTTT TGCGTTCTGG CAAAATAAAC CCCGCTCCCA TCATTACCCA TACTTTTGAG
ATGAAAGATT ATGAAAAAGC CTTTGCCACT ATGATGGACC CCGAAAGAAA ATGCGGCAAA
GTGGTTTTAA TCCCGTAA
 
Protein sequence
MKALCKTKPE KGVEYKDVDL PKIKDDELLV KIYKTSICGS DIPVYNYTGW APRRIPLPFV 
FGHELCGEVV ETGKNTKGFE KGDFISVESH VFCGLCYQCR NDQRHVCSNM VVLGLDTQGG
FSEYAAIPAR CGWKHSDNKL KEIASIMEPL GNAVFATLVE DVAGKTVFVE GCGPQGLFAI
EIAKACGAQK VIALEGSPYR QKMAEQMGAD AIFSPTEEKL LEKIKKASGD PSGVDVVLEM
SGHPDAVRLG LKAVKPAGRF TAFGLPGSEM TLDYSNDIVF KGIKVEGITG RQIYKSWHVM
EGLLRSGKIN PAPIITHTFE MKDYEKAFAT MMDPERKCGK VVLIP