Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3697 |
Symbol | |
ID | 6065936 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 4046007 |
End bp | 4047029 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641603115 |
Product | alcohol dehydrogenase |
Protein accession | YP_001726635 |
Protein GI | 170021681 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00587402 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCTACGA TGAATGTTTT AATTTGCCAG CAGCCGAAAG AATTAGTCTG GAAACAACGC GAGATTCCTA TTCCGGGTGA CAATGAAGCA TTAATAAAAA TTAAGTCTGT CGGGATTTGC GGTACCGATA TTCATGCCTG GGGTGGAAAT CAACCATTTT TTAATTATCC ACGGGTTTTA GGCCATGAAA TATGTGGGGA GGTTGTTGGG CTGGGTAAAA ATATTGCTGA TCTTAAAAAT GGTCAGCAAG TTGCTGTGAT CCCTTATGTT GCCTGTCAGC AATGCCCGGC GTGTAAAAGC GGGCGTACCA ATTGCTGTGA AAAAATTTCA GTCATTGGCG TGCATCAGGA TGGCGGTTTT AGTGAGTATT TGAGCGTGCC GGTGGCGAAC ATTTTGCCCG CAGACGGTAT TGACCCGCAG GCGGCCGCAT TGATTGAACC TTTCGCTATT AGCGCTCATG CGGTGCGTCG CGCAGCCATT GCTCCCGGCG AGCAGGTGCT GGTGGTCGGA GCGGGGCCAA TCGGTCTGGG CGCGGCGGCA ATCGCTAAAG CCGATGGCGC ACAGGTGGTG GTGGCGGATA CCAGTCCGGC GCGCCGTGAA CATGTGGCAA CGCGTCTGGA GTTGCCCGTG CTGGACCCGT CAGCCGAAGA TTTTGACGCG CAGCTGCGGG CGCAGTTTGG TGGTTCGCTG GCGCAGAAAG TGATCGACGC GACAGGTAAT CAACATGCGA TGAATAACAC CGTGAATCTG ATTCGTCACG GCGGCACGGT GGTATTTGTC GGCCTGTTTA AAGGTGAGTT GCAGTTCTCC GATCCGGAAT TCCATAAAAA AGAAACGACG ATGATGGGCA GCCGCAACGC CACGCCGGAA GATTTCGCTA AAGTCGGTCG ACTTATGGCG GAAGGGAAAA TCACTGCCGA CATGATGTTA ACCCATCGCT ATCCGTTCGC CACGCTGGCA GAAACCTACG AGCGCGATGT GATTAACAAT CGTGAGTTAA TTAAAGGCGT AATTACTTTC TGA
|
Protein sequence | MSTMNVLICQ QPKELVWKQR EIPIPGDNEA LIKIKSVGIC GTDIHAWGGN QPFFNYPRVL GHEICGEVVG LGKNIADLKN GQQVAVIPYV ACQQCPACKS GRTNCCEKIS VIGVHQDGGF SEYLSVPVAN ILPADGIDPQ AAALIEPFAI SAHAVRRAAI APGEQVLVVG AGPIGLGAAA IAKADGAQVV VADTSPARRE HVATRLELPV LDPSAEDFDA QLRAQFGGSL AQKVIDATGN QHAMNNTVNL IRHGGTVVFV GLFKGELQFS DPEFHKKETT MMGSRNATPE DFAKVGRLMA EGKITADMML THRYPFATLA ETYERDVINN RELIKGVITF
|
| |