Gene EcHS_A2091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2091 
Symbol 
ID5595287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2075189 
End bp2076055 
Gene Length867 bp 
Protein Length288 aa 
Translation table11 
GC content54% 
IMG OID640921232 
ProductS49 family peptidase 
Protein accessionYP_001458776 
Protein GI157161458 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0616] Periplasmic serine proteases (ClpP class) 
TIGRFAM ID[TIGR00706] signal peptide peptidase SppA, 36K type 


Plasmid Coverage information

Num covering plasmid clones51 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATCTAC CTCATCTGGC CCAGCGCCTT TTTAATACAC CGCTGGCGCT GCACCCAAGT 
AAAGCTGAAG TCATCATGGC ATCCGTTATG GACCGGTTTG GCATCAGTAA AATCGAATCC
TCTCTTGCCA TGGATGATGA CTGGTATGGA TACGACGATA ACAGGGGGCG GGAATCCCGT
AGCGACCCGG GTTATGACAA TGTGTTGGGC GTCGCTGTCA TCCCGATATG TGGGACCCTG
GTGCAGAAGC TGGGTAGCCT GCGCCCATAC AGTGGCATGA CAGGCTATGA CGGCATTCGT
CAGGCCTTCC TGACTGCGAT GGAAGACCCC GATATTACAG GGATCTGCCT GGATATTGAT
TCGCCAGGAG GCGAGGTCGC CGGATGTTTC GATCTGGTCG ATGTCATTTA TGGCGCCCGC
GGGAAAAAGC CCATCCATGC CATTCTGACG GAAAGCGCCT ATTCCGCCGC CTATGCGATT
GCCAGTGCGG CGGACCGGAT TTCTGTTCCC CGAACCGCTG GTGTGGGTTC AGTTGGTGTG
ATCACTATGC ACCTTGACTG GACCCAGCGG ATAAAAGATG ACGGCCTCAA AGTCACCATC
ATCACCTACG GTTCCCGTAA GGCTGAGGGG GCACCGCTGA GAGAGCTGTC AGATGAAGCG
CTGGCGGCTA TTCAGCAGGA CATCAACACC ATGGGCGAAT TGTTTGTGAA TACCGTCGCC
AGAAATCGGG GGATTAGCGC AAAGGTTATC AAAAGTACTC AGGCTGCCTG TTTTATGGCT
GCTGATGGTG TGGAACTTGG ACTGGCTGAT GAGGTGTGTC CTCCTGATGC TGCGTTCAGA
AACTTACTTG AAAAAACAGG AGCCTGA
 
Protein sequence
MNLPHLAQRL FNTPLALHPS KAEVIMASVM DRFGISKIES SLAMDDDWYG YDDNRGRESR 
SDPGYDNVLG VAVIPICGTL VQKLGSLRPY SGMTGYDGIR QAFLTAMEDP DITGICLDID
SPGGEVAGCF DLVDVIYGAR GKKPIHAILT ESAYSAAYAI ASAADRISVP RTAGVGSVGV
ITMHLDWTQR IKDDGLKVTI ITYGSRKAEG APLRELSDEA LAAIQQDINT MGELFVNTVA
RNRGISAKVI KSTQAACFMA ADGVELGLAD EVCPPDAAFR NLLEKTGA