Gene EcSMS35_1580 details


Gene Information

Locus tag: EcSMS35_1580
Symbol:
ID: 6145692
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1564624
End bp: 1565391
Gene Length: 768 bp
Protein Length: 255 aa
Translation table: 11
GC content: 47%
IMG OID: 641616457
Product: 7-alpha-hydroxysteroid dehydrogenase
Protein accession: YP_001743635
Protein GI: 170680639
COG category: [I] Lipid transport and metabolism
    [Q] Secondary metabolites biosynthesis, transport and catabolism
    [R] General function prediction only
COG ID: [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases)
TIGRFAM ID:
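
The start, end, gene length and protein length fields in this record are tied together by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the function names are illustrative only.

```python
# Values copied from the record fields "Start bp" and "End bp".
START_BP = 1564624
END_BP = 1565391


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one trailing stop codon that is not translated."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 768, matching "Gene Length"
print(protein_length(length_bp))  # 255, matching "Protein Length"
```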


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 69
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTTTAATT CTGACAACCT GAGACTCGAC GGAAAATGCG CCATCATCAC AGGTGCGGGT 
GCAGGTATTG GTAAAGAAAT TGCCATTACA TTCGCGACAG CTGGCGCATC TGTGGTGGTC
AGTGATATTA ATGCCGATGC AGCTAATCAT GTTGTAAATG AAATTCAACA ACTGGGTGGT
CAGGCATTTG CCTGCCGCTG CGATATTACT TCCGAACAGG AACTCTCTGC ACTGGCAGAC
TTTGCCGTCA GTAAGTTGGG TAAAGTTGAT ATCCTGGTTA ACAACGCCGG TGGCGGTGGT
CCTAAACCGT TTGATATGCC AATGGCAGAT TTTCGCCGCG CTTATGAACT GAATGTATTT
TCTTTTTTCC ATCTGTCACA ACTTGTTGCG CCAGAAATGG AAAAAAATGG CGGTGGCGTT
ATTTTGACCA TTACTTCTAT GGCGGCAGAA AATAAAAATA TAAACATGAC CTCCTATGCA
TCATCTAAAG CTGCGGCCAG TCATCTGGTC AGAAATATGG CGTTTGACCT TGGTGAAAAA
AATATTCGGG TGAATGGCAT TGCGCCGGGG GCAATATTAA CCGATGCCCT GAAATCCGTT
ATTACACCAG AAATTGAACA GAAAATGTTG CAACACACAC CAATCAGACG TCTGGGCCAA
CCGCAAGATA TAGCTAACGC GGCGCTGTTC CTTTGCTCGC CTGCAGCCAG CTGGGTAAGC
GGACAAATTC TCACCGTCTC CGGTGGTGGG GTACAGGAGC TCAATTAA
 
Protein sequence
MFNSDNLRLD GKCAIITGAG AGIGKEIAIT FATAGASVVV SDINADAANH VVNEIQQLGG 
QAFACRCDIT SEQELSALAD FAVSKLGKVD ILVNNAGGGG PKPFDMPMAD FRRAYELNVF
SFFHLSQLVA PEMEKNGGGV ILTITSMAAE NKNINMTSYA SSKAAASHLV RNMAFDLGEK
NIRVNGIAPG AILTDALKSV ITPEIEQKML QHTPIRRLGQ PQDIANAALF LCSPAASWVS
GQILTVSGGG VQELN
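
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence under translation table 11. The sketch below is a minimal, dependency-free illustration, not the method used to produce this record. It assumes that table 11 shares the standard codon assignments and that its alternative start codons (hard-coded here as an assumption) are read as methionine at the first position, which is how the leading GTG corresponds to M. The names `translate_table11` and `gc_fraction` are hypothetical helpers.

```python
from itertools import product

BASES = "TCAG"
# Standard codon assignments in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed set of start codons accepted by table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate_table11(seq: str) -> str:
    """Translate a CDS; a valid start codon becomes M, translation stops at '*'."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "GTGTTTAATT CTGACAACCT GAGACTCGAC GGAAAATGCG CCATCATCAC AGGTGCGGGT"
print(translate_table11(first_line))  # MFNSDNLRLDGKCAIITGAG
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same functions would allow the complete protein and the GC content field to be compared in the same way.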