Gene EcolC_2011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2011 
Symbol 
ID6068056 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2218634 
End bp2219401 
Gene Length768 bp 
Protein Length255 aa 
Translation table11 
GC content47% 
IMG OID641601424 
Product7-alpha-hydroxysteroid dehydrogenase 
Protein accessionYP_001724983 
Protein GI170020029 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism
[R] General function prediction only 
COG ID[COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.617976 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.65251 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTTAATT CTGACAACCT GAGACTCGAC GGAAAATGCG CCATCATCAC AGGTGCGGGT 
GCAGGTATTG GTAAAGAAAT CGCCATTACA TTCGCGACAG CTGGCGCATC TGTGGTGGTC
AGTGATATTA ACGCCGACGC AGCTAACCAT GTTGTAGACG AAATTCAACA ACTGGGTGGT
CAGGCATTTG CCTGCCGTTG TGATATTACT TCCGAACAGG AACTCTCTGC ACTGGCAGAC
TTTGCTATCA GTAAGCTGGG TAAAGTTGAT ATTCTGGTTA ACAACGCCGG TGGCGGTGGA
CCTAAACCGT TTGATATGCC AATGGCGGAT TTTCGCCGTG CTTATGAACT GAATGTGTTT
TCTTTTTTCC ATCTGTCACA ACTTGTTGCG CCAGAAATGG AAAAAAATGG CGGTGGCGTT
ATTCTGACCA TCACTTCTAT GGCGGCAGAA AATAAAAATA TAAACATGAC TTCCTATGCA
TCATCTAAAG CTGCGGCCAG TCATCTGGTC AGAAATATGG CGTTTGACCT GGGTGAAAAA
AATATTCGGG TAAATGGCAT TGCGCCGGGG GCAATATTAA CCGATGCCCT GAAATCCGTT
ATTACACCAG AAATTGAACA AAAAATGTTA CAGCACACGC CGATCAGACG TCTGGGCCAA
CCGCAAGATA TTGCTAACGC AGCGCTGTTC CTTTGCTCGC CTGCTGCGAG CTGGGTAAGC
GGACAAATTC TCACCGTCTC CGGTGGTGGG GTACAGGAGC TCAATTAA
 
Protein sequence
MFNSDNLRLD GKCAIITGAG AGIGKEIAIT FATAGASVVV SDINADAANH VVDEIQQLGG 
QAFACRCDIT SEQELSALAD FAISKLGKVD ILVNNAGGGG PKPFDMPMAD FRRAYELNVF
SFFHLSQLVA PEMEKNGGGV ILTITSMAAE NKNINMTSYA SSKAAASHLV RNMAFDLGEK
NIRVNGIAPG AILTDALKSV ITPEIEQKML QHTPIRRLGQ PQDIANAALF LCSPAASWVS
GQILTVSGGG VQELN