Gene Information |
Locus tag | EcSMS35_1415 |
Symbol | |
ID | 6142784 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1398981 |
End bp | 1400057 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641616293 |
Product | zinc-binding dehydrogenase family oxidoreductase |
Protein accession | YP_001743473 |
Protein GI | 170684023 |
COG category | [E] Amino acid transport and metabolism [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | |
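The coordinate and length fields above are mutually consistent, which can be checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
# Values copied from the record above.
start_bp = 1398981
end_bp = 1400057


def gene_length(start, end):
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(gene_len):
    """Residues encoded by a complete, unspliced CDS, excluding the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(start_bp, end_bp)
print(length, protein_length(length))  # prints: 1077 358
```

Both values agree with the Gene Length (1077 bp) and Protein Length (358 aa) rows.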
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.297309 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCAC TGGCTCGGTT TGGCAAAGCC TTTGGCGGCT ACAAGATGAT CGATGTGCCA CAACCCATTT GTGGCCCGGA AGATGTCGTG ATTGAAATTA AAGCCGCGGC AATCTGCGGC GCAGACATGA AGCACTACAA TGTCGATAGC GGTTCTGATG AGTTTAACTC TATCCGCGGC CATGAGTTCG CAGGTTGTAT TGCGCAGGTT GGTGAAAAAG TCAAAGACTG GAAAGTGGGG CAACGCGTCG TGTCAGATAA CAGCGGTCAC GTCTGCGGCG TTTGTCCGGC CTGTGAACAA GGTGATTTTC TGTGTTGTAC AGAAAAAGTA AACCTTGGTC TGGATAACAA TACCTGGGGC GGTGGTTTTT CCAAATATTG TCTGGTTCCT GGTGAAATTC TCAAAATTCA TCGTCATGCG TTGTGGGAAA TCCCTGATGG TGTTGATTAT GAGGACGCAG CCGTACTTGA CCCTATCTGC AATGCCTACA AATCCATCGC GCAGCAATCG AAATTCCTCC CTGGTCAGGA TGTAGTCGTC ATCGGCACTG GCCCACTCGG ACTGTTCTCC GTACAAATGG CGCGGATTAT GGGGGCGGTA AATATCGTCG TCGTTGGTCT ACAAGAAGAT GTGGCGGTCC GCTTCCCGGT TGCAAAAGAA CTGGGTGCGA CGGCAGTAGT AAATGGTTCT ACCGAAGATG TGGTGGCACG CTGCCAGCAA ATTTGTGGCA AAGACAATCT GGGGCTGGTG ATTGAATGCT CCGGTGCCAA TATCGCATTG AAACAAGCCA TCGAAATGCT CCGCCCGAAT GGGGAAGTGG TACGCGTTGG AATGGGCTTC AAACCTCTTG ATTTCTCGAT TAATGACATT ACCGCCTGGA ACAAAAGCAT CATTGGGCAT ATGGCCTATG ACTCCACCTC ATGGCGTAAC GCTATCAGGC TATTAGCCAG CGGCGCTATC AAAGTCAAAC CGATGATCAC GCATCGTATC GGCCTGTCGC AATGGCGCGA AGGGTTTGAT GCGATGGTCG ATAAAACCGC AATCAAAGTG ATCATGACTT ACGACTTTGA TGAATAA
|
Protein sequence | MKALARFGKA FGGYKMIDVP QPICGPEDVV IEIKAAAICG ADMKHYNVDS GSDEFNSIRG HEFAGCIAQV GEKVKDWKVG QRVVSDNSGH VCGVCPACEQ GDFLCCTEKV NLGLDNNTWG GGFSKYCLVP GEILKIHRHA LWEIPDGVDY EDAAVLDPIC NAYKSIAQQS KFLPGQDVVV IGTGPLGLFS VQMARIMGAV NIVVVGLQED VAVRFPVAKE LGATAVVNGS TEDVVARCQQ ICGKDNLGLV IECSGANIAL KQAIEMLRPN GEVVRVGMGF KPLDFSINDI TAWNKSIIGH MAYDSTSWRN AIRLLASGAI KVKPMITHRI GLSQWREGFD AMVDKTAIKV IMTYDFDE
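Both sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a field could be cleaned, translated, and checked for GC content follows. It assumes the codon assignments of NCBI translation table 11, which are the same as the standard code for elongation codons, and it uses only the first three blocks of the gene sequence as sample input. The helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a cleaned DNA string codon by codon, stopping at a stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna):
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence shown above.
sample = clean("ATGAAAGCAC TGGCTCGGTT TGGCAAAGCC")
print(translate(sample))  # prints: MKALARFGKA
```

The translated sample matches the first block of the protein sequence. Applying `gc_content` to the full cleaned gene sequence would give the value that the GC content row reports in rounded form.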