Gene Information |
Locus tag | EcHS_A1694 |
Symbol | |
ID | 5591017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1719226 |
End bp | 1719993 |
Gene Length | 768 bp |
Protein Length | 255 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 640920842 |
Product | 7-alpha-hydroxysteroid dehydrogenase |
Protein accession | YP_001458398 |
Protein GI | 157161080 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
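The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (the record does not state this, but the arithmetic agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1719226
end_bp = 1719993


def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


gene_len = feature_length(start_bp, end_bp)
print(gene_len, protein_length(gene_len))  # 768 255
```

Because the gene lies on the minus strand, the start and end values describe the span on the replicon rather than the direction of transcription.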
Plasmid Coverage information |
Num covering plasmid clones | 54 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTTAATT CTGACAACCT GAAACTCGAC GGAAAATGCG CCATCATCAC AGGTGCGGGT GCAGGTATTG GTAAAGAAAT CGCAATTACA TTCGCGACAG CTGGCGCATC TGTGGTGGTC AGTGATATTA ACGCCGACGC AGCTAACCAT GTTGTAGACG AAATTCAACA ACTGGGTGGT CAGGCATTTG CCTGCCGTTG TGATATTACT TCCGAACAGG AACTCTCTGC ACTGGCAGAC TTTGCTATCA GTAAGCTGGG TAAAGTTGAT ATTCTGGTTA ACAACGCCGG TGGCGGTGGA CCTAAACCGT TTGATATGCC AATGGCGGAT TTTCGCCGTG CTTATGAACT GAATGTGTTT TCTTTTTTCC ATCTGTCACA ACTTGTTGCG CCAGAAATGG AAAAAAATGG CGGTGGCGTT ATTCTGACCA TCACTTCTAT GGCGGCAGAA AATAAAAATA TAAACATGAC TTCCTATTCA TCATCTAAAG CTGCGGCCAG TCATCTGGTC AGAAATATGG CGTTTGACCT GGGTGAAAAA AATATTCGGG TAAATGGCAT TGCGCCGGGG GCAATATTAA CCGATGCCCT GAAATCCGTT ATTACACCAG AAATTGAACA AAAAATGTTA CAGCACACGC CGATCAGACG TCTGGGCCAA CCGCAAGATA TTGCTAACGC AGCGCTGTTC CTTTGCTCGC CTGCTGCGAG CTGGGTAAGC GGACAAATTC TCACCGTCTC CGGTGGTGGG GTACAGGAGC TCAATTAA
|
Protein sequence | MFNSDNLKLD GKCAIITGAG AGIGKEIAIT FATAGASVVV SDINADAANH VVDEIQQLGG QAFACRCDIT SEQELSALAD FAISKLGKVD ILVNNAGGGG PKPFDMPMAD FRRAYELNVF SFFHLSQLVA PEMEKNGGGV ILTITSMAAE NKNINMTSYS SSKAAASHLV RNMAFDLGEK NIRVNGIAPG AILTDALKSV ITPEIEQKML QHTPIRRLGQ PQDIANAALF LCSPAASWVS GQILTVSGGG VQELN
|
| |
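The protein sequence begins with M although the gene sequence begins with GTG. This is expected under translation table 11, where GTG is an alternative initiation codon and is read as methionine at the start of a CDS. The sketch below is a minimal, dependency-free translator written for illustration; the names `translate_cds` and `has_start` are hypothetical, and the codon table is the standard one in NCBI "TCAG" order, whose elongation codon assignments table 11 shares.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))

# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str, has_start: bool = True) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    If has_start is True, a recognised initiation codon in the first position
    is translated as M.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and has_start and codon in START_CODONS:
            residues.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 18 bases of the gene sequence in the record.
print(translate_cds("GTGTTTAATT CTGACAACCT GAAACTCGAC"))      # MFNSDNLKLD
print(translate_cds("GTACAGGAGC TCAATTAA", has_start=False))  # VQELN
```

Both fragments agree with the corresponding ends of the protein sequence listed above. Passing the full gene sequence to the same function is a simple way to cross-check the whole record.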