Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_A0098 |
Symbol | hlyF |
ID | 6106558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010488 |
Strand | - |
Start bp | 72734 |
End bp | 73843 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 641614843 |
Product | hemolysin F |
Protein accession | YP_001739984 |
Protein GI | 170650913 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3320] Putative dehydrogenase domain of multifunctional non-ribosomal peptide synthetases and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.0392616 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.000000811123 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAATTAT TATTACTTAC AGGTGCAACA GGATTTCTTG GTGGCGCGGT CCTGGATAAG CTGCTGGATA ACTGTAATAA TATAAATTTG CTACTTTTAG TACGAGCACC TACTCCACAA GCGGGACTGG AAAGAATTAA AGAAAATATG CGTAAATTTA ATGTTTGTGA GGAAAGGTTG CATGCATTAA CTAATGATAA CATCTTGCCT GGGGATCTAA ATAATCCGGA AGCCTTTCTC ATGGATCCTC GTCTTGATGA AGTCACTCAT GTTATAAACT GTGCGGCTAT AGCTTCTTTT GGTAATAATC CTTTTATATG GAATGTGAAT GTTACAGGTA CACTTGCTTT TGCAAGAAGA ATGGCAAAAG TGGCAGGACT GAAACGCTTC CTTCATGTTG GTACTGCTAT GTCTTGTACA CCTCATACGG GGTCGCTAGT TAAGGAAGAG TCTGCTTCAT CAGAAACAGG TGAACATTTA GTGGAGTATA CGCATTCAAA AGCAACAATA GAATATCTGA TGCGTAAGCA GTGTCCTGAT TTACCTTTGT TGGTTGCCCG ACCATCAATT ATTGTTGGCC ACAGTCGTTT AGGGTGCTTA CCTTCAACCA GTATTTTCTG GGTATTCAGA ATGGGGTTAA TGTTGCAAAA ATTTATGTGC TCTCTGGATG ATAAAATAGA TGTTATCCCT GTAGATTATT GTGCTGATGC ATTGCTAATG TTGCTTGAAA GCTCGTTAAT TAATGGTGAG ATTGTTCATA TATCAGCAGG TAAAGAAAGT AGTGTGACGT TCTCTGCTAT TGACGAAGCT GTAGCCCGTG CTTTGAACTG TGATCCTGTT GGAGACAGAT ATACTAAAGT CAGTTATGAC ATACTGGCAA TGAGCCGTCA TGATTTTAAA AATATTTTTG GTCCCTGTAA CGAACGCCTT ATGTTAAAAG CCATTCGTTT ATATGGAGCG TTCAGTATGC TCAATGTTTG TTTCAGTAAC GACAAGCTAC TGAGTATCGG AATGCCTAAA CCGCCAAAGT TTACTGATTA TATTAAATAC TGTATAGAAA CGACAAAACA CCTTTCAATT CAACAACAAA TGGAAGTTGA TTTTAAATAA
|
Protein sequence | MKLLLLTGAT GFLGGAVLDK LLDNCNNINL LLLVRAPTPQ AGLERIKENM RKFNVCEERL HALTNDNILP GDLNNPEAFL MDPRLDEVTH VINCAAIASF GNNPFIWNVN VTGTLAFARR MAKVAGLKRF LHVGTAMSCT PHTGSLVKEE SASSETGEHL VEYTHSKATI EYLMRKQCPD LPLLVARPSI IVGHSRLGCL PSTSIFWVFR MGLMLQKFMC SLDDKIDVIP VDYCADALLM LLESSLINGE IVHISAGKES SVTFSAIDEA VARALNCDPV GDRYTKVSYD ILAMSRHDFK NIFGPCNERL MLKAIRLYGA FSMLNVCFSN DKLLSIGMPK PPKFTDYIKY CIETTKHLSI QQQMEVDFK
|
| |