Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1946 |
Symbol | |
ID | 4071422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2339745 |
End bp | 2340800 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637983958 |
Product | zinc-binding alcohol dehydrogenase |
Protein accession | YP_591021 |
Protein GI | 94968973 |
COG category | [R] General function prediction only |
COG ID | [COG1064] Zn-dependent alcohol dehydrogenases |
TIGRFAM ID | [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.101054 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTCCTG CAAGAGGATA TGCTGCCCCT TCCGCCACAT CACCGTTGTC TCCTTTCACC TTTGAACGCC GCGATCCCGG CCCTACCGAT GTCGTGATCG AGATCCTCTA CTGCGGCATC TGCCACTCCG ACATTCACAC GGTTCGCAAT GAGTGGAGCT CAGTGATGCA GACGACGTAT CCATGTGTCC CCGGACATGA AATTGTGGGC CGGGTGACCA AGGTCGGCAA GGATGTGAAG AAGTTCAAGG AAGGCGACAG CGTTGGCGTT GGCTGCCTGG TCGATTCCGA CCGGACCTGC CCCAACTGCC AGGCTGACGA AGAGCAATTC TGTCCGCACA GCGTATTCAC TTACAACGGG CCCGACAAAG GCAATCCGGG AAAGCCGACG TATGGTGGCT ACGCGAACAG CATCGTGGTG ACCGAACATT TTGTTCTGAA GATTTCGCCG AAGCTGGATC CTGCGGGCGC TGCACCGCTG CTCTGTGCCG GGATCACGAC CTACTCGCCG CTGCGGTACT GGAAAGTAGG CAAGGGAATG AAAGTCGGTG TCGTGGGTCT TGGCGGCCTC GGGCACATGG GTCTGAAATT CGCTCATGCT TTTGGCGCGC ACGTGGTGCA GTTCACGACA TCGCCGAGCA AGGAAGCCGA GGCGAAGAGA CTTGGTGCCG ACGAAGTGGT GCTATCGAAA GACCCGGATG CAATGAAGAA GCAGATGGGG ACGTTCGATT TCATCCTCGA TACCGTTTCC GCAGAGCACG ACGTTCAGGC GGAGATTGGT CTTCTCAAGC GGAATGGCGT ATTGACACTA GTAGGCGCTC CTCCGAAGCC CATGTCGATC CTGGGCTTCA GCCTGCTCTT CGGACGCAAA ACGCTGGCGG GTTCGCTCAT CGGCGGATTG AAAGAAACGC AGGAGATGCT GGATTTCTGC GCCGAGCATG GCATCGTTAG CGATATCGAG ATGACGCCAA TTCAGAAGGT CAACGAGGCG TATGAGCGCG TATTGAAGAG CGATGTTCGG TATCGGTTTG TGATTGATAT GGCGTCGTTA AAGTAA
|
Protein sequence | MIPARGYAAP SATSPLSPFT FERRDPGPTD VVIEILYCGI CHSDIHTVRN EWSSVMQTTY PCVPGHEIVG RVTKVGKDVK KFKEGDSVGV GCLVDSDRTC PNCQADEEQF CPHSVFTYNG PDKGNPGKPT YGGYANSIVV TEHFVLKISP KLDPAGAAPL LCAGITTYSP LRYWKVGKGM KVGVVGLGGL GHMGLKFAHA FGAHVVQFTT SPSKEAEAKR LGADEVVLSK DPDAMKKQMG TFDFILDTVS AEHDVQAEIG LLKRNGVLTL VGAPPKPMSI LGFSLLFGRK TLAGSLIGGL KETQEMLDFC AEHGIVSDIE MTPIQKVNEA YERVLKSDVR YRFVIDMASL K
|
| |