Gene Information |
Locus tag | EcSMS35_1580 |
Symbol | |
ID | 6145692 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1564624 |
End bp | 1565391 |
Gene Length | 768 bp |
Protein Length | 255 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641616457 |
Product | 7-alpha-hydroxysteroid dehydrogenase |
Protein accession | YP_001743635 |
Protein GI | 170680639 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
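The coordinate and length fields in the Gene Information table can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. Neither convention is stated in the record, but both are consistent with its values. The function names are hypothetical helpers introduced only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 1564624
END_BP = 1565391


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 768 255
```

Under these assumptions the computed values agree with the reported Gene Length (768 bp) and Protein Length (255 aa).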
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTTAATT CTGACAACCT GAGACTCGAC GGAAAATGCG CCATCATCAC AGGTGCGGGT GCAGGTATTG GTAAAGAAAT TGCCATTACA TTCGCGACAG CTGGCGCATC TGTGGTGGTC AGTGATATTA ATGCCGATGC AGCTAATCAT GTTGTAAATG AAATTCAACA ACTGGGTGGT CAGGCATTTG CCTGCCGCTG CGATATTACT TCCGAACAGG AACTCTCTGC ACTGGCAGAC TTTGCCGTCA GTAAGTTGGG TAAAGTTGAT ATCCTGGTTA ACAACGCCGG TGGCGGTGGT CCTAAACCGT TTGATATGCC AATGGCAGAT TTTCGCCGCG CTTATGAACT GAATGTATTT TCTTTTTTCC ATCTGTCACA ACTTGTTGCG CCAGAAATGG AAAAAAATGG CGGTGGCGTT ATTTTGACCA TTACTTCTAT GGCGGCAGAA AATAAAAATA TAAACATGAC CTCCTATGCA TCATCTAAAG CTGCGGCCAG TCATCTGGTC AGAAATATGG CGTTTGACCT TGGTGAAAAA AATATTCGGG TGAATGGCAT TGCGCCGGGG GCAATATTAA CCGATGCCCT GAAATCCGTT ATTACACCAG AAATTGAACA GAAAATGTTG CAACACACAC CAATCAGACG TCTGGGCCAA CCGCAAGATA TAGCTAACGC GGCGCTGTTC CTTTGCTCGC CTGCAGCCAG CTGGGTAAGC GGACAAATTC TCACCGTCTC CGGTGGTGGG GTACAGGAGC TCAATTAA
|
Protein sequence | MFNSDNLRLD GKCAIITGAG AGIGKEIAIT FATAGASVVV SDINADAANH VVNEIQQLGG QAFACRCDIT SEQELSALAD FAVSKLGKVD ILVNNAGGGG PKPFDMPMAD FRRAYELNVF SFFHLSQLVA PEMEKNGGGV ILTITSMAAE NKNINMTSYA SSKAAASHLV RNMAFDLGEK NIRVNGIAPG AILTDALKSV ITPEIEQKML QHTPIRRLGQ PQDIANAALF LCSPAASWVS GQILTVSGGG VQELN
|
| |
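The gene sequence begins with GTG rather than ATG, yet the protein sequence begins with M. This is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), where GTG can act as an initiation codon and is then read as methionine. The following is a minimal sketch of that translation, not the pipeline that produced this record. The codon table is the standard one, which table 11 shares for all non-initiator codons. `translate_cds` is a hypothetical helper, and `START_CODONS` lists only a commonly used subset of the table 11 initiators.

```python
from itertools import product

# Standard codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the initiation codon as methionine."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    if not seq or len(seq) % 3 != 0:
        raise ValueError("CDS must be non-empty and a multiple of 3 in length")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"unexpected start codon: {codons[0]}")
    residues = ["M"]  # the initiator is read as Met even when it is GTG or TTG
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
GENE_PREFIX = "GTGTTTAATT CTGACAACCT GAGACTCGAC"
print(translate_cds(GENE_PREFIX))  # MFNSDNLRLD
```

The output matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to the same function should give the complete 255-residue protein, with translation stopping at the terminal TAA.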