Gene EcHS_A2628 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2628 
Symbol 
ID5590903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2638031 
End bp2639494 
Gene Length1464 bp 
Protein Length487 aa 
Translation table11 
GC content54% 
IMG OID640921745 
ProductM48 family peptidase 
Protein accessionYP_001459272 
Protein GI157161954 
COG category[R] General function prediction only 
COG ID[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCAGGC AGTTGAAAAA AAACCTGGTT GCAACCCTCA TTGCTGCTAT GACCATTGGT 
CAGGTAGCCC CGGCGTTTGC CGACAGCGCA GACACCTTGC CGGATATGGG AACCTCCGCA
GGAAGCACGC TTTCCATTGG TCAGGAAATG CAGATGGGCG ACTATTATGT CCGCCAGCTA
CGCGGCAGCG CGCCGTTAAT TAATGACCCG CTGTTAACGC AATATATTAA TTCGCTGGGG
ATGCGTCTGG TTTCGCATGC CAATTCGGTT AAGACACCGT TTCATTTTTT TCTGATCAAC
AACGACGAAA TTAACGCCTT TGCTTTCTTT GGCGGCAACG TGGTGCTGCA CTCTGCCCTG
TTCCGTTATT CCGATAACGA AAGTCAACTG GCTTCAGTTA TGGCGCACGA AATCTCCCAC
GTCACCCAAC GTCACCTGGC GCGAGCGATG GAAGATCAAC AACGCAGCGC GCCGCTGACC
TGGGTCGGCG CGTTAGGTTC TATTTTACTG GCGATGGCCA GTCCGCAGGC GGGGATGGCG
GCGCTGACCG GTACACTGGC GGGAACACGT CAGGGGATGA TCAGTTTCAC CCAGCAAAAT
GAACAGGAAG CGGACCGCAT TGGTATTCAG GTGCTGCAAC GCTCGGGATT CGATCCGCAG
GCGATGCCAA CCTTCCTCGA AAAATTACTC GATCAGGCGC GTTACTCCTC GCGCCCGCCG
GAAATTTTAT TGACTCACCC GTTGCCGGAA AGTCGTCTGG CAGATGCCCG CAACCGTGCT
AATCAGATGC GACCGATGGT GGTGCAGTCG TCGGAAGATT TCTATCTGGC GAAAGCGCGC
ACACTGGGGA TGTATAATTC CGGACGTAAC CAGCTCACCA GTGATTTGCT GGATGAATGG
GCGAAAGGAA ACGTTCGTCA GCAACGAGCG GCGCAATATG GTCGTGCTTT ACAGGCGATG
GAAGCCAATA AATACGACGA GGCGCGTAAA ACGCTGCAAC CGTTACTGGC GGCAGAACCT
GGCAACGCAT GGTATCTCGA TCTGGCTACC GATATCGATC TTGGGCAAAA CAAAGCCAAT
GAGGCAATCA ATCGCCTGAA AAATGCCCGT GATTTGCGCA CCAATCCGGT GTTGCAGCTC
AACCTGGCAA ACGCTTATCT GCAAGGCGGT CAACCACAAG AAGCGGCCAA TATTCTCAAT
CGCTACACCT TTAATAATAA AGATGACAGC AACGGCTGGG ATTTACTGGC ACAGGCGGAA
GCCGCGCTAA ATAACCGCGA TCAGGAGCTG GCTGCGCGAG CAGAAGGTTA TGCGCTCGCC
GGACGACTCG ATCAGGCCAT TTCGCTGTTG AGTAGCGCCA GTTCGCAGGT GAAATTAGGC
AGCCTGCAAC AAGCGCGTTA CGATGCGCGC ATCGACCAGT TGCGCCAGCT GCAGGAACGC
TTTAAGCCTT ATACCAAGAT GTAA
 
Protein sequence
MFRQLKKNLV ATLIAAMTIG QVAPAFADSA DTLPDMGTSA GSTLSIGQEM QMGDYYVRQL 
RGSAPLINDP LLTQYINSLG MRLVSHANSV KTPFHFFLIN NDEINAFAFF GGNVVLHSAL
FRYSDNESQL ASVMAHEISH VTQRHLARAM EDQQRSAPLT WVGALGSILL AMASPQAGMA
ALTGTLAGTR QGMISFTQQN EQEADRIGIQ VLQRSGFDPQ AMPTFLEKLL DQARYSSRPP
EILLTHPLPE SRLADARNRA NQMRPMVVQS SEDFYLAKAR TLGMYNSGRN QLTSDLLDEW
AKGNVRQQRA AQYGRALQAM EANKYDEARK TLQPLLAAEP GNAWYLDLAT DIDLGQNKAN
EAINRLKNAR DLRTNPVLQL NLANAYLQGG QPQEAANILN RYTFNNKDDS NGWDLLAQAE
AALNNRDQEL AARAEGYALA GRLDQAISLL SSASSQVKLG SLQQARYDAR IDQLRQLQER
FKPYTKM