Gene ECH74115_3716 details


Gene Information

Locus tag: ECH74115_3716
Symbol:
ID: 6966624
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3436354
End bp: 3437817
Gene Length: 1464 bp
Protein Length: 487 aa
Translation table: 11
GC content: 54%
IMG OID: 643387510
Product: peptidase, M48 family
Protein accession: YP_002271963
Protein GI: 209399555
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
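
The coordinates and lengths listed above are consistent with one another if the start and end positions are 1-based and inclusive, and if the protein length excludes the stop codon. The record does not state either convention, so both are assumptions here. A minimal sketch of the check, using hypothetical helper names:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3436354, 3437817)
print(length, protein_length_aa(length))  # 1464 487
```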


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.997857
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 67
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCAGGC AGTTGAAAAA AAACCTGGTT GCAACCCTCA TTGCTGCTAT GACCATTGGT 
CAGGTAGCCC CGGCATTTGC CGACAGCGCA GACACCTTGC CGGATATGGG AACCTCCGCA
GGAAGCACGC TTTCCATTGG TCAGGAAATG CAGATGGGCG ACTATTATGT CCGCCAGCTA
CGCGGCAGCG CGCCGTTAAT TAATGACCCG CTGTTAACGC AATATATTAA TTCGCTGGGG
ATGCGTCTGG TTTCGCATGC CAATTCGGTT AAGACACCGT TTCATTTCTT TCTGATCAAC
AACGACGAAA TTAACGCCTT TGCTTTCTTT GGCGGCAACG TGGTGCTGCA CTCTGCCCTG
TTCCGTTATT CCGATAACGA AAGTCAACTG GCTTCAGTTA TGGCGCACGA AATCTCCCAC
GTCACCCAAC GTCACCTGGC GCGAGCGATG GAAGATCAGC AGCGCAACGC GCCGCTGACC
TGGGTCGGCG CGTTAGGTTC TATTTTACTG GCGATGGCCA GTCCGCAGGC GGGGATGGCG
GCGCTGACCG GTACACTGGC GGGAACGCGT CAGGGGATGA TCAGTTTCAC CCAGCAAAAT
GAACAGGAAG CGGACCGCAT TGGTATTCAG GTGCTGCAAC GCTCGGGATT CGATCCGCAG
GCGATGCCAA CCTTCCTCGA AAAATTACTC GATCAGGCGC GTTACTCCTC GCGCCCGCCG
GAAATTTTAC TGACTCACCC GTTGCCGGAA AGTCGTCTGG CAGATGCCCG CAACCGTGCT
AATCAGATGC GCCCGATGGT GGTGCAGTCG TCGGAAGATT TCTATCTGGC GAAAGCGCGC
ACACTGGGGA TGTATAATTC CGGACGTAAC CAGCTCACCA GTGATTTGCT GGATGAATGG
GCGAAAGGAA ACGTTCGTCA GCAACGAGCG GCGCAATATG GTCGTGCTTT ACAGGCGATG
GAAGCCAATA AATACGACGA GGCGCGAAAA ACGCTGCAAC CGTTACTGGC GGCAGAACCT
GGCAACGCAT GGTATCTCGA TCTGGCTACT GATATCGATC TTGGGCAAAA CAAAGCCAAT
GAGGCGATCA ATCGTCTGAA AAATGCCCGC GATTTGCGCA CCAATCCTGT GTTGCAGCTC
AACCTGGCGA ACGCTTATCT ACAAGGCGGT CAACCACAAG AAGCGGCCAA TATTCTTAAT
CGCTACACCT TTAATAATAA AGATGACAGC AACGGCTGGG ATTTGCTGGC ACAGGCGGAA
GCCGCGCTAA ATAACCGCGA TCAGGAGCTG GCTGCGCGAG CAGAAGGTTA TGCGCTCGCC
GGACGACTCG ATCAGGCCAT TTCGCTGTTG AGTAGCGCCA GTTCGCAGGT GAAATTAGGC
AGCCTGCAAC AAGCGCGTTA CGATGCGCGC ATCGACCAGT TGCGCCAGCT GCAGGAACGC
TTTAAGCCTT ATACCAAGAT GTAA
 
Protein sequence
MFRQLKKNLV ATLIAAMTIG QVAPAFADSA DTLPDMGTSA GSTLSIGQEM QMGDYYVRQL 
RGSAPLINDP LLTQYINSLG MRLVSHANSV KTPFHFFLIN NDEINAFAFF GGNVVLHSAL
FRYSDNESQL ASVMAHEISH VTQRHLARAM EDQQRNAPLT WVGALGSILL AMASPQAGMA
ALTGTLAGTR QGMISFTQQN EQEADRIGIQ VLQRSGFDPQ AMPTFLEKLL DQARYSSRPP
EILLTHPLPE SRLADARNRA NQMRPMVVQS SEDFYLAKAR TLGMYNSGRN QLTSDLLDEW
AKGNVRQQRA AQYGRALQAM EANKYDEARK TLQPLLAAEP GNAWYLDLAT DIDLGQNKAN
EAINRLKNAR DLRTNPVLQL NLANAYLQGG QPQEAANILN RYTFNNKDDS NGWDLLAQAE
AALNNRDQEL AARAEGYALA GRLDQAISLL SSASSQVKLG SLQQARYDAR IDQLRQLQER
FKPYTKM
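
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. A minimal sketch of that translation follows. It uses the standard amino-acid assignments, which table 11 shares; table 11's alternative start codons are ignored here, which is a simplification. The helper name `translate` and the `CODON_TABLE` construction are illustrative and not part of the record.

```python
from itertools import product

# Codons enumerated in TCAG order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, in the record's 10-base grouping.
print(translate("ATGTTCAGGC AGTTGAAAAA AAACCTGGTT"))  # MFRQLKKNLV
print(translate("TTTAAGCCTT ATACCAAGAT GTAA"))         # FKPYTKM
```

Pasting the full gene sequence into `translate` in place of the excerpts should give a string to compare against the protein sequence listed above.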