Gene EcSMS35_A0098 details


Gene Information

Locus tag: EcSMS35_A0098
Symbol: hlyF
ID: 6106558
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010488
Strand:
Start bp: 72734
End bp: 73843
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 38%
IMG OID: 641614843
Product: hemolysin F
Protein accession: YP_001739984
Protein GI: 170650913
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3320] Putative dehydrogenase domain of multifunctional non-ribosomal peptide synthetases and related enzymes
TIGRFAM ID:
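
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive (an assumption that matches the reported 1110 bp) and that the CDS is complete with a single stop codon. The function names are illustrative only and are not part of any database API.

```python
# Coordinates copied from the record; assumed 1-based and inclusive.
START_BP = 72734
END_BP = 73843


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1110
print(protein_length(length_bp))  # 369
```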


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.0392616
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.000000811123
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAATTAT TATTACTTAC AGGTGCAACA GGATTTCTTG GTGGCGCGGT CCTGGATAAG 
CTGCTGGATA ACTGTAATAA TATAAATTTG CTACTTTTAG TACGAGCACC TACTCCACAA
GCGGGACTGG AAAGAATTAA AGAAAATATG CGTAAATTTA ATGTTTGTGA GGAAAGGTTG
CATGCATTAA CTAATGATAA CATCTTGCCT GGGGATCTAA ATAATCCGGA AGCCTTTCTC
ATGGATCCTC GTCTTGATGA AGTCACTCAT GTTATAAACT GTGCGGCTAT AGCTTCTTTT
GGTAATAATC CTTTTATATG GAATGTGAAT GTTACAGGTA CACTTGCTTT TGCAAGAAGA
ATGGCAAAAG TGGCAGGACT GAAACGCTTC CTTCATGTTG GTACTGCTAT GTCTTGTACA
CCTCATACGG GGTCGCTAGT TAAGGAAGAG TCTGCTTCAT CAGAAACAGG TGAACATTTA
GTGGAGTATA CGCATTCAAA AGCAACAATA GAATATCTGA TGCGTAAGCA GTGTCCTGAT
TTACCTTTGT TGGTTGCCCG ACCATCAATT ATTGTTGGCC ACAGTCGTTT AGGGTGCTTA
CCTTCAACCA GTATTTTCTG GGTATTCAGA ATGGGGTTAA TGTTGCAAAA ATTTATGTGC
TCTCTGGATG ATAAAATAGA TGTTATCCCT GTAGATTATT GTGCTGATGC ATTGCTAATG
TTGCTTGAAA GCTCGTTAAT TAATGGTGAG ATTGTTCATA TATCAGCAGG TAAAGAAAGT
AGTGTGACGT TCTCTGCTAT TGACGAAGCT GTAGCCCGTG CTTTGAACTG TGATCCTGTT
GGAGACAGAT ATACTAAAGT CAGTTATGAC ATACTGGCAA TGAGCCGTCA TGATTTTAAA
AATATTTTTG GTCCCTGTAA CGAACGCCTT ATGTTAAAAG CCATTCGTTT ATATGGAGCG
TTCAGTATGC TCAATGTTTG TTTCAGTAAC GACAAGCTAC TGAGTATCGG AATGCCTAAA
CCGCCAAAGT TTACTGATTA TATTAAATAC TGTATAGAAA CGACAAAACA CCTTTCAATT
CAACAACAAA TGGAAGTTGA TTTTAAATAA
 
Protein sequence
MKLLLLTGAT GFLGGAVLDK LLDNCNNINL LLLVRAPTPQ AGLERIKENM RKFNVCEERL 
HALTNDNILP GDLNNPEAFL MDPRLDEVTH VINCAAIASF GNNPFIWNVN VTGTLAFARR
MAKVAGLKRF LHVGTAMSCT PHTGSLVKEE SASSETGEHL VEYTHSKATI EYLMRKQCPD
LPLLVARPSI IVGHSRLGCL PSTSIFWVFR MGLMLQKFMC SLDDKIDVIP VDYCADALLM
LLESSLINGE IVHISAGKES SVTFSAIDEA VARALNCDPV GDRYTKVSYD ILAMSRHDFK
NIFGPCNERL MLKAIRLYGA FSMLNVCFSN DKLLSIGMPK PPKFTDYIKY CIETTKHLSI
QQQMEVDFK
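
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such blocks could be joined, translated, and checked against the protein sequence. The helper names (clean, gc_percent, translate) are hypothetical. The codon table is built from the amino acid assignments of translation table 11, which are the same as those of the standard genetic code; alternative start codons are not handled here.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments for translation table 11, in codon order
# TTT, TTC, TTA, TTG, TCT, ... (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last 30 bases of the gene sequence, copied from the record.
first_block = "ATGAAATTAT TATTACTTAC AGGTGCAACA"
last_block = "CAACAACAAA TGGAAGTTGA TTTTAAATAA"
print(translate(first_block))  # MKLLLLTGAT
print(translate(last_block))   # QQQMEVDFK
```

Passing the full gene sequence to translate and gc_percent allows a comparison with the listed protein sequence and the reported GC content.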