Gene Information |
Locus tag | EcSMS35_2698 |
Symbol | |
ID | 6146545 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2770034 |
End bp | 2771128 |
Gene Length | 1095 bp |
Protein Length | 364 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617569 |
Product | zinc-binding dehydrogenase family oxidoreductase |
Protein accession | YP_001744734 |
Protein GI | 170681398 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
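The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and do not come from the source database.

```python
# Values copied from the Gene Information table.
START_BP = 2770034
END_BP = 2771128
GENE_LENGTH_BP = 1095
PROTEIN_LENGTH_AA = 364


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # compare with Gene Length
    print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

Under these assumptions the span 2770034 to 2771128 covers 1095 bp, and 1095 / 3 - 1 gives 364 residues, which agrees with the listed Gene Length and Protein Length.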
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTA ATCGTATTGT TAATGAGGGA TTTATGAAAA CGATGCTGGC AGCTTATTTA CCAGGAAATT CGACCGTCGA TCTGCGGGAA GTTGCGGTGC CGACGCCGGG TATTAACCAG GTACTGATCA AAATGAAATC CTCCGGAATT TGCGGAAGCG ATGTCCACTA TATCTACCAC CAGCACCGCG CTACGGCGGC GGCACCCGAT AAACCGTTAT ACCAGGGATT TATCAACGGT CATGAACCGT GCGGGCAAAT TGTGGCGATG GGGCAAGGCT GCCGCCATTT TAAAGAGGGC GACCGTGTGC TGGTGTATCA CATTTCTGGC TGTGGTTTTT GCCCGAACTG CCGTCGTGGT TTTCCTATCT CTTGTACTGG CGAAGGAAAA GCGGCTTACG GCTGGCAGCG TGACGGCGGC CATGCTGAAT ACCTGCTGGC GGAAGAAAAA GATCTGATCC TCCTGCCGGA TGCACTGAGT TACGAAGATG GTGCGTTTAT CAGTTGCGGC GTTGGTACAG CGTATGAAGG GATTTTGCGC GGCGAAGTTT CCGGCAGTGA TAACGTGCTG GTGGTCGGTC TGGGGCCGGT CGGCATGATG GCGATGATGC TGGCAAAAGG TCGCGGTGCA AAAAGGATCA TCGGCGTTGA TATGTTGCCG GAACGTCTGG CGATGGCGAA ACAGTTAGGA GTGATGGATC ACGGCTATTT AGCGACTACC GAAGGTCTGC CGCAGATAAT CGCCGAACTC ACTCACGGCG GCGCGGATGT TGCGCTTGAT TGTTCCGGTA ATGCCGCAGG TCGCTTACTG GCACTGCAAT CCACCGCTGA CTGGGGACGG GTGGTTTACA TCGGGGAAAC CGGAAAAGTG GAGTTCGAGG TTAGCGCTGA TCTGATGCAT CATCAACGGC GGATTATTGG CTCCTGGGTG ACCAGTCTGT TCCATATGGA AAAATGCGCC CACGATTTAA CGGACTGGAA ACTGTGGCCG CGTAATGCCA TTACCCATCG CTTCTCACTG GAACAGGCAG GAGATGCCTA TGCGCTGATG GCGAGCGGCA AATGCGGGAA AGTTGTGATT AACTTCCCGG ATTAA
|
Protein sequence | MKINRIVNEG FMKTMLAAYL PGNSTVDLRE VAVPTPGINQ VLIKMKSSGI CGSDVHYIYH QHRATAAAPD KPLYQGFING HEPCGQIVAM GQGCRHFKEG DRVLVYHISG CGFCPNCRRG FPISCTGEGK AAYGWQRDGG HAEYLLAEEK DLILLPDALS YEDGAFISCG VGTAYEGILR GEVSGSDNVL VVGLGPVGMM AMMLAKGRGA KRIIGVDMLP ERLAMAKQLG VMDHGYLATT EGLPQIIAEL THGGADVALD CSGNAAGRLL ALQSTADWGR VVYIGETGKV EFEVSADLMH HQRRIIGSWV TSLFHMEKCA HDLTDWKLWP RNAITHRFSL EQAGDAYALM ASGKCGKVVI NFPD
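The gene and protein sequences above can be related with a short script. This is a minimal sketch, not the database's own method. It assumes the gene sequence is written 5' to 3' on the coding strand (it begins with ATG and ends with TAA), that the spaces are only display grouping, and that translation table 11 uses the standard codon-to-amino-acid assignments, differing from the standard table only in its permitted start codons. Alternative start codons are not handled here. The helper names are hypothetical.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G); assumed to
# apply to translation table 11 for every codon except alternative starts.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        residue = CODON_TABLE[s[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 18 bases of the gene sequence, with the original spacing kept.
    print(translate("ATGAAAATTA ATCGTATT"))
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (spaces removed) is one way to check the record for consistency. `gc_content` on the same input can be compared with the listed GC content.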