Gene Information |
Locus tag | EcSMS35_0455 |
Symbol | |
ID | 6146363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 463317 |
End bp | 464291 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641615349 |
Product | aldo/keto reductase family oxidoreductase |
Protein accession | YP_001742556 |
Protein GI | 170682731 |
COG category | [C] Energy production and conversion |
COG ID | [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
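The coordinate and length fields in this record agree with each other, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions are inferred from the numbers above, not stated by the record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 463317
end_bp = 464291


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Amino acid count, assuming the last codon of the CDS is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 975 324
```

The results match the Gene Length (975 bp) and Protein Length (324 aa) fields. Strand orientation does not affect the length calculation.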
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATACA ACCCCTTAGG AAAAACCGAC CTTCGCGTTT CCCGACTTTG CCTCGGCTGT ATGACCTTTG GCGAGCCAGA TCGCGGTAAT CACGCATGGA CACTGCCGGA AGAAAGCAGC CGTCCCATCA TTAAACGCGC GCTGGAAGGC GGCATAAATT TCTTTGATAC CGCCAATAGC TATTCCGATG GCAGCAGCGA AGAGATCGTC GGTCGCGCAC TGCGGGATTT CGCCCGTCGT GAAGACGTGG TCGTTGCGAC CAAAGTGTTC CATCGCGTTG GTGATTTACC GGAAGGATTA TCCCGTGCAC AAATTTTGCG CTCTATCGAC GACAGCCTGC GCCGTCTCGG CATGGATTAT GTCGATATCC TGCAAATTCA TCGCTGGGAT TACAACACGC CGATCGAAGA GACGCTGGAA GCCCTCAACG ACGTGGTAAA AGCCGGGAAA GCGCGTTATA TCGGCGCGTC ATCCATGCAC GCTTCGCAGT TTGCTCAAGC ACTGGAACTA CAAAAACAGC ACGGCTGGGC GCAGTTTGTC AGTATGCAGG ATCACTACAA TCTGATTTAT CGTGAAGAAG AGCGCGAGAT GCTGCCGCTG TGTTATCAGG AAGGCGTGGC GGTGATTCCA TGGAGCCCGC TGGCGCGGGG CCGTCTGACG CGTCCTTGGG GAGAAACCAC CGCACGGCTG GTGTCGGATG AAGTGGGGAA AAATCTCTAC CAAGAAAGCG ATGAAAATGA CGCACAAATT GCAGAACGCT TAACGGGCGT CAGTGAAGAA CTTGGTGCAA CACGAGCACA AGTTGCGCTG GCCTGGTTGT TGAGTAAACC GGGCATTGCC GCACCGATTA TCGGAACTTC GCGGGAAGAA CAGCTTGATG AGCTGCTGAA CGCGGTGGAT ATCACTTTAA AGCCGGAACA GATTGCTGAA CTGGAAACGC CGTATAAACC GCATCCGGTA GTAGGATTTA AATAG
|
Protein sequence | MQYNPLGKTD LRVSRLCLGC MTFGEPDRGN HAWTLPEESS RPIIKRALEG GINFFDTANS YSDGSSEEIV GRALRDFARR EDVVVATKVF HRVGDLPEGL SRAQILRSID DSLRRLGMDY VDILQIHRWD YNTPIEETLE ALNDVVKAGK ARYIGASSMH ASQFAQALEL QKQHGWAQFV SMQDHYNLIY REEEREMLPL CYQEGVAVIP WSPLARGRLT RPWGETTARL VSDEVGKNLY QESDENDAQI AERLTGVSEE LGATRAQVAL AWLLSKPGIA APIIGTSREE QLDELLNAVD ITLKPEQIAE LETPYKPHPV VGFK
|
| |
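The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a minimal sketch. The gene lies on the minus strand, but the sequence as displayed begins with ATG and ends with TAG, so it appears to be given in coding orientation already; the sketch assumes that and does not reverse-complement. The codon-to-amino-acid assignments of translation table 11 are the same as the standard code (table 11 differs in the start codons it permits, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_percent` are illustrative, not part of any database tool.

```python
# Codon table in the usual TCAG ordering; amino acid assignments for
# translation table 11 are identical to the standard genetic code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGCAATACA ACCCCTTAGG AAAAACCGAC"
print(translate(first_blocks))  # MQYNPLGKTD
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein sequence and GC content fields to be compared in the same way.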