Gene EcSMS35_1967 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1967 
SymbolhlyE 
ID6143071 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1991319 
End bp1992230 
Gene Length912 bp 
Protein Length303 aa 
Translation table11 
GC content41% 
IMG OID641616843 
Producthemolysin E 
Protein accessionYP_001744019 
Protein GI170682110 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000212688 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.94466 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGAAA TCGTTGCAGA TAAAACGGTA GAGGTAGTTA AAAACGCAAT CGAAACCGCA 
GATGGAGCAT TAGATCTTTA TAATAAATAT CTCGATCAGG TCATCCCCTG GCAGACCTTT
GATGAAACCA TAAAAGAGTT AAGTCGCTTT AAACAGGAGT ATTCACAGGC AGCCTCCGTT
TTAGTTGGCG ATATTAAAAC CTTACTTATG GATAGCCAGG ATAAGTATTT TGAAGCAACC
CAAACAGTGT ATGAATGGTG TGGTGTTGCG ACGCAATTGC TCGCAGCGTA TATTTTGCTA
TTTGATGAGT ACAATGAGAA GAAAGCATCC GCCCAGAAAG ACATTCTCAT TAAGGTACTG
GATGACGGTA TCACGAAGCT GAATGAAGCG CAAAAATCCC TGCTGGTAAG CTCACAAAGT
TTCAACAACG CTTCCGGGAA ACTGCTGGCG TTAGATAGCC AGTTAACCAA TGATTTTTCA
GAAAAAAGCA GCTATTTCCA GTCACAGGTA GATAAAATCA GGAAGGAAGC GTATGCCGGT
GCCGCAGCCG GTGTCGTCGC CGGTCCATTT GGTTTAATCA TTTCCTATTC TATTGCTGCG
GGCGTAGTTG AAGGGAAACT GATTCCAGAA TTGAAGAACA AGTTAAAATC TGTGCAGAGT
TTCTTTACCA CCCTGTCTAA CACGGTTAAA CAAGCGAATA AAGATATCGA TGCCGCCAAA
TTGAAATTAA CCACCGAAAT AGCCGCCATC GGGGAGATAA AAACGGAAAC TGAAACAACC
AGATTCTACG TTGATTATGA TGATTTAATG CTTTCTTTGC TAAAAGAAGC GGCAAAAAAA
ATGATTAACA CCTGTAATGA GTATCAGAAA AGACACGGTA AGAAGACACT CTTTGAGGTA
CCTGAAGTCT GA
 
Protein sequence
MTEIVADKTV EVVKNAIETA DGALDLYNKY LDQVIPWQTF DETIKELSRF KQEYSQAASV 
LVGDIKTLLM DSQDKYFEAT QTVYEWCGVA TQLLAAYILL FDEYNEKKAS AQKDILIKVL
DDGITKLNEA QKSLLVSSQS FNNASGKLLA LDSQLTNDFS EKSSYFQSQV DKIRKEAYAG
AAAGVVAGPF GLIISYSIAA GVVEGKLIPE LKNKLKSVQS FFTTLSNTVK QANKDIDAAK
LKLTTEIAAI GEIKTETETT RFYVDYDDLM LSLLKEAAKK MINTCNEYQK RHGKKTLFEV
PEV