Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4903 |
Symbol | |
ID | 6142776 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 5024323 |
End bp | 5025345 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641619706 |
Product | zinc-binding dehydrogenase family oxidoreductase |
Protein accession | YP_001746813 |
Protein GI | 170683258 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.363717 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTACGA TGAATGTTTT AATTTGCCAG CAGCCGAAAG AATTAGTCTG GAAACAACGC GAGATACCTA TTCCGGGTGA CAATGAAGCA TTAATAAAAA TTAAGTCTGT CGGGATTTGC GGTACCGATA TTCATGCCTG GGGTGGAAAT CAACCATTTT TTAGTTATCC ACGTGTTTTA GGCCATGAAA TATGTGGGGA GATTGTTGGG CTGGGTAAAA ATATTGCTGA TCTTAAAAAT GGTCAGCAAG TTGCTGTGAT CCCTTATGTT GCCTGTCAGC AATGCCCTGC ATGTAAAAGC GGTCGTACCA ATTGCTGTGA AAAAATTTCA GTTATTGGTG TGCATCAGGA TGGTGGTTTT AGTGAGTATT TGAGCGTGCC GGTGGCGAAC ATTTTGCCCG CAGACGGTAT TGACCCGCAG GCGGCAGCAT TGATTGAACC TTTCGCTATT AGCGCACATG CGGTGCGTCG TGCAGCCATT GCTCCCGGCG AGCAGGTGCT GGTGGTCGGG GCGGGGCCAA TCGGTCTGGG CGCGGCGGCA ATCGCTAAAG CCGATGGCGC ACAGGTGGTA GTGGCGGATA CCAGTCCGGC GCGCCGTGAA CATGTGGCAA CGCGTCTGGA ATTACCTGTA CTGGACCCGT CAGCCGAAGA TTTTGACGCG CAGCTGCGGG CGCAGTTTGG TGGTTCGCTG GCGCAGAAAG TGATCGACGC GACAGGTAAT CAACATGCGA TGAATAACAC CGTAAATCTG ATTCGTCACG GCGGCACGGT GGTATTTGTC GGTCTGTTTA AAGGTGAGTT GCAGTTCTCC GATCCGGAAT TCCATAAAAA AGAAACGACG ATGATGGGCA GCCGCAACGC CACGCCGGAA GATTTCGCTA AAGTCGGTCG ATTGATGGCG GAAGGGAAAA TCACTGCCGA CATGATGTTA ACCCATCGCT ACCCGTTCGC CACGCTGGCA GAAACCTACG AACGTGATGT GATTAACAAT CGTGAGTTGA TTAAGGGTGT AATTACTTTC TGA
|
Protein sequence | MSTMNVLICQ QPKELVWKQR EIPIPGDNEA LIKIKSVGIC GTDIHAWGGN QPFFSYPRVL GHEICGEIVG LGKNIADLKN GQQVAVIPYV ACQQCPACKS GRTNCCEKIS VIGVHQDGGF SEYLSVPVAN ILPADGIDPQ AAALIEPFAI SAHAVRRAAI APGEQVLVVG AGPIGLGAAA IAKADGAQVV VADTSPARRE HVATRLELPV LDPSAEDFDA QLRAQFGGSL AQKVIDATGN QHAMNNTVNL IRHGGTVVFV GLFKGELQFS DPEFHKKETT MMGSRNATPE DFAKVGRLMA EGKITADMML THRYPFATLA ETYERDVINN RELIKGVITF
|
| |