Gene Information |
Locus tag | EcSMS35_3303 |
Symbol | |
ID | 6142919 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3379032 |
End bp | 3380048 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641618133 |
Product | zinc-binding dehydrogenase family protein |
Protein accession | YP_001745283 |
Protein GI | 170683214 |
COG category | [E] Amino acid transport and metabolism; [R] General function prediction only |
COG ID | [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases |
TIGRFAM ID | [TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase |
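The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the final codon of the unspliced CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(3379032, 3380048)
print(length)                  # 1017
print(protein_length(length))  # 338
```

Under these assumptions the computed values agree with the Gene Length (1017 bp) and Protein Length (338 aa) fields.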
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.0731443 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACAT TAATTTGTCA GCAGCCTGGC GTTATGGAAT ATGTGGAAAA GGATATTCCC ACACCAGCAG ATAATGAAGT GCTGTTAAAA ATCAAAGCTG TGGGTATTTG TGGTACTGAT ATTCACGCTT TTGCCGGCAG ACAGCCTTTT TTTAGCTACC CACGTGTATT AGGTCATGAA ATATGCGCCG AAGCGGTTTC GCGAGGCAGC CAGTGCCAAA CAGCACAATC AGGCCAGCGC TATTCCGTCA TCCCATGCAT TCCGTGTGGC GAGTGCGCAG CCTGTCGGGA AGAGAAAACG AACTGCTGCG AACGTGTTTC GCTGTATGGC GTGCATCAGG ATGGGGGTTT TAGTGAGTAC CTTGCGGTAC GTGAAGACAA CCTTGTGCCT CTCCCTGACG AGGTCAGCGA CAGTGCCGGA GCATTGGTTG AATGTTTCGC CATTGGTGCA CATGCCGTTC GTCGGGCAGA GATCAAGGCT GAACAAAACG TACTGGTGAT TGGTGCTGGG CCAATCGGTT TGGCTACCGC AGCCATCGCC AGGGCTAAAG GGGCGCATGT TGTTGTTGCT GATATTGACT GTCAACGTCG CCAGCACGTT GTGGATCATC TGGCAATTAA TGTCTTCGAC CCAACACAGG AAGGTTTTAT TGCCGCGCTT AGTGAAGTAT TTGGAGGCGA ACTGGCTTGC GTAGTACTGG ATGCGACGGG AAATAAAGCT TCAATGAGTC ATGATGTAAA TCTTATTCGT CATGGCGGCA AAATTGTTTT CATCGGTTTG TACATTGGTG AACTTGTTAT TGACGATCCG ACCTTCCATA AAAAAGAGAC AACGTTACTC AGCAGCCGCA ATGCCACACG GGAAGATTTT GCGTTGGTGA TTGAACTGAT GCGCAGCAAT AAAATTCACG AAAATTTAAT GAAAAACCAG GCGTTCAATT TCTTTAGTGT TGGCGAAGAT TACCAGCGTA ACGTTGTAGA AAATAAAAAT ATGGTCAAGG GTGTGATCAC TTTTTAA
|
Protein sequence | MKTLICQQPG VMEYVEKDIP TPADNEVLLK IKAVGICGTD IHAFAGRQPF FSYPRVLGHE ICAEAVSRGS QCQTAQSGQR YSVIPCIPCG ECAACREEKT NCCERVSLYG VHQDGGFSEY LAVREDNLVP LPDEVSDSAG ALVECFAIGA HAVRRAEIKA EQNVLVIGAG PIGLATAAIA RAKGAHVVVA DIDCQRRQHV VDHLAINVFD PTQEGFIAAL SEVFGGELAC VVLDATGNKA SMSHDVNLIR HGGKIVFIGL YIGELVIDDP TFHKKETTLL SSRNATREDF ALVIELMRSN KIHENLMKNQ AFNFFSVGED YQRNVVENKN MVKGVITF
|
| |
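The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any computation. The sketch below shows one possible way to do that and to compute a GC percentage and check for a terminal stop codon. The helper names are hypothetical, and the example input is only the first two display blocks of the gene sequence, not the full gene, so its GC value is not expected to match the GC content field of the record.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace separating the display blocks and uppercase."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def ends_with_stop(dna: str) -> bool:
    """True if the last three bases form a stop codon."""
    return dna[-3:] in STOP_CODONS


# First two display blocks of the gene sequence in this record.
head = clean_sequence("ATGAAAACAT TAATTTGTCA")
print(head)              # ATGAAAACATTAATTTGTCA
print(gc_percent(head))  # 20.0

# Last two display blocks of the gene sequence in this record.
tail = clean_sequence("GTGTGATCAC TTTTTAA")
print(ends_with_stop(tail))  # True
```

Passing the complete gene sequence through `clean_sequence` and `gc_percent` would give a value to compare with the GC content field. The result of that comparison is not asserted here.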