Gene EcSMS35_2641 details

Gene Information

Locus tag: EcSMS35_2641
Symbol:
ID: 6143586
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2700421
End bp: 2701884
Gene Length: 1464 bp
Protein Length: 487 aa
Translation table: 11
GC content: 54%
IMG OID: 641617512
Product: M48 family peptidase
Protein accession: YP_001744677
Protein GI: 170680376
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
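
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon. The record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 2700421
END_BP = 2701884


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1464 487
```

Under these assumptions the result matches the Gene Length (1464 bp) and Protein Length (487 aa) fields.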


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.938933
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 61
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCAGGC AGTTGAAAAA AAACCTGGTT GCAACCCTCA TTGCTGCTAT GACCATTGGT 
CAGGTAGCCC CGGCGTTTGC CGACAGCGCA GACACCTTGC CGGATATGGG AACCTCCGCA
GGAAGCACGC TTTCCATTGG CCAGGAAATG CAGATGGGCG ACTATTATGT CCGCCAGCTA
CGCGGCAGCG CGCCGTTAAT TAATGACCCG CTGTTAACGC AATATATTAA TTCGCTGGGG
ATGCGTCTGG TTTCGCATGC CAATTCGGTT AAGACACCGT TTCATTTCTT TCTGATCAAC
AACGACGAAA TTAACGCCTT TGCTTTCTTT GGCGGCAACG TGGTGCTGCA CTCTGCCCTG
TTCCGTTATT CCGATAACGA AAGTCAACTG GCTTCAGTTA TGGCGCACGA AATCTCCCAC
GTCACCCAAC GTCACCTGGC GCGAGCGATG GAAGATCAGC AGCGCAGCGC GCCGCTGACC
TGGGTCGGCG CGTTAGGTTC TATTTTACTG GCGATGGCCA GTCCGCAGGC GGGGATGGCG
GCGCTGACCG GTACACTGGC GGGAACGCGC CAGGGGATGA TCAGTTTCAC CCAGCAAAAT
GAACAGGAAG CGGACCGCAT TGGTATTCAG GTGCTGCAAC GCTCGGGATT CGATCCGCAG
GCGATGCCGA CCTTCCTCGA AAAATTACTC GATCAGGCGC GTTACTCCTC GCGCCCACCA
GAAATTCTGC TCACTCACCC ACTACCGGAA AGCCGTCTGG CTGATGCCCG TAACCGTGCC
AATCAGATGC GCCCGATGGT GGTGCAATCG TCAGAAGATT TCTATCTGGC AAAAGTGCGC
ACACTGGGGA TGTATAATTC CGGACGTAAC CAGCTCACCA GTGATTTGCT GGATGAATGG
GCGAAAGGAA ACGTTCGTCA GCAACGAGCG GCGCAATATG GTCGTGCTCT ACAGGCGATG
GAAGCCAATA AATACGATGA GGCGCGTAAA ACGCTGCAAC CGTTACTGGC GGCAGAACCT
GGTAACGCAT GGTATCTCGA TCTGGCTACC GATATCGATC TTGGGCAAAA CAAAGCCAAT
GAGGCAATCA ATCGCCTGAA AAATGCCCGT GATTTGCGCA CCAATCCGGT GTTGCAGCTC
AACCTGGCGA ACGCTTATCT GCAAGGCGGT CAACCACAAG AAGCGGCCAA TATTCTGAAT
CGCTACACTT TTAATAATAA AGATGACAGC AACGGCTGGG ATTTACTGGC ACAGGCGGAA
GCCGCGCTAA ATAACCGCGA TCAGGAACTG GCTGCGCGAG CAGAAGGTTA TGCGCTCGCC
GGGCGACTCG ATCAGGCCAT TTCCTTGTTG AGTAGCGCCA GTTCGCAGGT GAAATTAGGC
AGCCTGCAAC AAGCGCGTTA CGATGCGCGC ATCGACCAGT TGCGCCAGCT GCAGGAACGC
TTTAAGCCTT ATACCAAGAT GTAA
 
Protein sequence
MFRQLKKNLV ATLIAAMTIG QVAPAFADSA DTLPDMGTSA GSTLSIGQEM QMGDYYVRQL 
RGSAPLINDP LLTQYINSLG MRLVSHANSV KTPFHFFLIN NDEINAFAFF GGNVVLHSAL
FRYSDNESQL ASVMAHEISH VTQRHLARAM EDQQRSAPLT WVGALGSILL AMASPQAGMA
ALTGTLAGTR QGMISFTQQN EQEADRIGIQ VLQRSGFDPQ AMPTFLEKLL DQARYSSRPP
EILLTHPLPE SRLADARNRA NQMRPMVVQS SEDFYLAKVR TLGMYNSGRN QLTSDLLDEW
AKGNVRQQRA AQYGRALQAM EANKYDEARK TLQPLLAAEP GNAWYLDLAT DIDLGQNKAN
EAINRLKNAR DLRTNPVLQL NLANAYLQGG QPQEAANILN RYTFNNKDDS NGWDLLAQAE
AALNNRDQEL AARAEGYALA GRLDQAISLL SSASSQVKLG SLQQARYDAR IDQLRQLQER
FKPYTKM
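
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal check, not the database's own method: it builds the codon table by hand (translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons) and translates just the first and last lines of the gene sequence shown above. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in the order TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, ignoring whitespace, stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGTTCAGGC AGTTGAAAAA AAACCTGGTT GCAACCCTCA TTGCTGCTAT GACCATTGGT"
last_line = "TTTAAGCCTT ATACCAAGAT GTAA"

print(translate(first_line))  # prints: MFRQLKKNLVATLIAAMTIG
print(translate(last_line))   # prints: FKPYTKM
```

Both fragments agree with the start and end of the protein sequence listed above. Translating the last line in isolation works only because the 1440 bases before it are a multiple of three, so it begins on a codon boundary.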