Gene Acid345_1946 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1946 
Symbol 
ID4071422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2339745 
End bp2340800 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content56% 
IMG OID637983958 
Productzinc-binding alcohol dehydrogenase 
Protein accessionYP_591021 
Protein GI94968973 
COG category[R] General function prediction only 
COG ID[COG1064] Zn-dependent alcohol dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.101054 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTCCTG CAAGAGGATA TGCTGCCCCT TCCGCCACAT CACCGTTGTC TCCTTTCACC 
TTTGAACGCC GCGATCCCGG CCCTACCGAT GTCGTGATCG AGATCCTCTA CTGCGGCATC
TGCCACTCCG ACATTCACAC GGTTCGCAAT GAGTGGAGCT CAGTGATGCA GACGACGTAT
CCATGTGTCC CCGGACATGA AATTGTGGGC CGGGTGACCA AGGTCGGCAA GGATGTGAAG
AAGTTCAAGG AAGGCGACAG CGTTGGCGTT GGCTGCCTGG TCGATTCCGA CCGGACCTGC
CCCAACTGCC AGGCTGACGA AGAGCAATTC TGTCCGCACA GCGTATTCAC TTACAACGGG
CCCGACAAAG GCAATCCGGG AAAGCCGACG TATGGTGGCT ACGCGAACAG CATCGTGGTG
ACCGAACATT TTGTTCTGAA GATTTCGCCG AAGCTGGATC CTGCGGGCGC TGCACCGCTG
CTCTGTGCCG GGATCACGAC CTACTCGCCG CTGCGGTACT GGAAAGTAGG CAAGGGAATG
AAAGTCGGTG TCGTGGGTCT TGGCGGCCTC GGGCACATGG GTCTGAAATT CGCTCATGCT
TTTGGCGCGC ACGTGGTGCA GTTCACGACA TCGCCGAGCA AGGAAGCCGA GGCGAAGAGA
CTTGGTGCCG ACGAAGTGGT GCTATCGAAA GACCCGGATG CAATGAAGAA GCAGATGGGG
ACGTTCGATT TCATCCTCGA TACCGTTTCC GCAGAGCACG ACGTTCAGGC GGAGATTGGT
CTTCTCAAGC GGAATGGCGT ATTGACACTA GTAGGCGCTC CTCCGAAGCC CATGTCGATC
CTGGGCTTCA GCCTGCTCTT CGGACGCAAA ACGCTGGCGG GTTCGCTCAT CGGCGGATTG
AAAGAAACGC AGGAGATGCT GGATTTCTGC GCCGAGCATG GCATCGTTAG CGATATCGAG
ATGACGCCAA TTCAGAAGGT CAACGAGGCG TATGAGCGCG TATTGAAGAG CGATGTTCGG
TATCGGTTTG TGATTGATAT GGCGTCGTTA AAGTAA
 
Protein sequence
MIPARGYAAP SATSPLSPFT FERRDPGPTD VVIEILYCGI CHSDIHTVRN EWSSVMQTTY 
PCVPGHEIVG RVTKVGKDVK KFKEGDSVGV GCLVDSDRTC PNCQADEEQF CPHSVFTYNG
PDKGNPGKPT YGGYANSIVV TEHFVLKISP KLDPAGAAPL LCAGITTYSP LRYWKVGKGM
KVGVVGLGGL GHMGLKFAHA FGAHVVQFTT SPSKEAEAKR LGADEVVLSK DPDAMKKQMG
TFDFILDTVS AEHDVQAEIG LLKRNGVLTL VGAPPKPMSI LGFSLLFGRK TLAGSLIGGL
KETQEMLDFC AEHGIVSDIE MTPIQKVNEA YERVLKSDVR YRFVIDMASL K