Gene Information |
Locus tag | EcSMS35_1420 |
Symbol | |
ID | 6143972 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1404342 |
End bp | 1405322 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641616298 |
Product | aldo/keto reductase family oxidoreductase |
Protein accession | YP_001743478 |
Protein GI | 170680323 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
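The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1404342
end_bp = 1405322

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a stop codon that is not counted in the protein length.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 981
print(protein_length_aa)  # 326
```

Under these assumptions the results agree with the Gene Length (981 bp) and Protein Length (326 aa) fields.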
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAAAAAGA TACCTTTAGG CACAACGGAT ATTACGCTTT CGCGAATGGG GTTGGGGACA TGGGCCATTG GCGGCGGTCC TGCATGGAAT GGCGATCTCG ATCGGCAAAT ATGTATTGAT ACGATTCTTG AAGCCCATCG TTGTGGCATT AATCTGATTG ATACTGCGCC AGGATATAAC TTTGGCAATA GTGAAGTTAT CGTCGGTCAG GCGTTAAAAA AACTGCCCCG TGAACAGGTT GTAGTAGAAA CCAAATGCGG CATTGTCTGG GAACGAAAAG GGAGTTTATT CAACAAAGTG GGCGATCGGC AGTTGTATAA AAACCTTTCC CCGGAATCTA TCCGCGAAGA GGTAGAAGCC AGCTTGCAAC GTCTGGATAT TGATTACATC GATATCTACA TGACGCACTG GCAGTCGGTG CCGCCATTTT TTACGCCGAT AGCTGAAACT GTCGCAGTGC TCAATGAGTT AAAGGCCGAA GGAAAAATTC GCGCTATAGG CGCTGCTAAC GTCGATGCTG ACCATATCCG CGAGTATCTG CAATATGGTG AACTGGATAT TATTCAGGCG AAATACAGTA TCCTCGACCG GGCATTGGAA AGCGAACTGC TGCCGCTATG TCGTGATAAT GGCATTGTGG TACAGGTTTA TTCCCCGCTA GAGCAGGGAT TGTTGACCGG TACCATCACG CGTGATTACG TTCCGGGCGG CGCTCGGGCA AATAAAGTCT GGTTCCAGCG TGAAAACATG CTGAAAGTGA TTGATATGCT TGAACAGTGG CAGCCACTCT GTGCTCGTTA TCAGTGCACA ATTCCCACTC TGGCACTGGC GTGGATATTA AAACAGAGTG ATTTAATCTC CATTCTTAGT GGGGCTACTG CACCGGAACA GGTACGTGAA AATGTCGCGG CACTGAATAT CAACTTATCG GATGCAGACG CAACATTGAT GAGGGAAATG GCAGAGGCCC TGGAGCGTTA A
Protein sequence | MKKIPLGTTD ITLSRMGLGT WAIGGGPAWN GDLDRQICID TILEAHRCGI NLIDTAPGYN FGNSEVIVGQ ALKKLPREQV VVETKCGIVW ERKGSLFNKV GDRQLYKNLS PESIREEVEA SLQRLDIDYI DIYMTHWQSV PPFFTPIAET VAVLNELKAE GKIRAIGAAN VDADHIREYL QYGELDIIQA KYSILDRALE SELLPLCRDN GIVVQVYSPL EQGLLTGTIT RDYVPGGARA NKVWFQRENM LKVIDMLEQW QPLCARYQCT IPTLALAWIL KQSDLISILS GATAPEQVRE NVAALNINLS DADATLMREM AEALER
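The gene and protein sequences above are printed in blocks of ten characters, so they need the whitespace removed before any analysis. The following is a minimal sketch of how GC content and a translation could be computed from the gene sequence. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table holds the standard amino acid assignments, which translation table 11 shares; alternative start codons permitted by table 11 are not handled here.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI "TCAG" order.
# Assumption: translation table 11 uses these same assignments; only its
# set of permitted start codons differs, which this sketch ignores.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100 * sum(seq.count(base) for base in "GC") / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = "ATGAAAAAGA TACCTTTAGG CACAACGGAT"
print(translate(prefix))  # MKKIPLGTTD
print(gc_percent(prefix))
```

Passing the full gene sequence to `gc_percent` and `translate` would allow a comparison against the GC content and Protein sequence fields of this record.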