Gene EcHS_A1850 details

Gene Information

Locus tag: EcHS_A1850
Symbol: sppA
ID: 5591434
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1866919
End bp: 1868775
Gene Length: 1857 bp
Protein Length: 618 aa
Translation table: 11
GC content: 52%
IMG OID: 640920994
Product: protease 4
Protein accession: YP_001458546
Protein GI: 157161228
COG category: [O] Posttranslational modification, protein turnover, chaperones
              [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0616] Periplasmic serine proteases (ClpP class)
TIGRFAM ID: [TIGR00705] signal peptide peptidase SppA, 67K type
            [TIGR00706] signal peptide peptidase SppA, 36K type
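
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The function names are hypothetical and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


gene_len = feature_length(1866919, 1868775)
print(gene_len)                  # 1857
print(protein_length(gene_len))  # 618
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.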


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.0115674
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAACCC TTTGGCGATT TATTGCCGGA TTTTTTAAAT GGACGTGGCG TCTGCTGAAT 
TTCGTCCGTG AAATGGTACT TAACCTGTTC TTTATTTTCC TCGTACTGGT TGGTGTGGGG
ATTTGGATGC AGGTCAGTGG TGGTGATTCG AAAGAAACGG CCAGTCGTGG CGCACTGCTG
CTGGACATTT CTGGTGTGAT CGTCGATAAA CCCGACAGTT CTCAGCGGTT TAGTAAATTA
AGCCGCCAGC TGCTTGGTGC CAGTTCCGAT CGTCTGCAGG AAAACTCACT GTTTGATATC
GTCAACACTA TTCGCCAGGC GAAGGACGAC CGCAATATCA CCGGTATTGT GATGGATCTG
AAAAACTTCG CAGGCGGCGA CCAACCGTCT ATGCAGTACA TCGGCAAAGC TCTGAAAGAG
TTTCGTGACA GTGGGAAACC GGTTTATGCC GTTGGCGAGA ACTACAGCCA GGGGCAATAT
TATCTCGCCA GTTTCGCCAA TAAAATTTGG CTGTCTCCGC AAGGCGTGGT TGATCTGCAC
GGCTTTGCCA CCAACGGTCT GTACTACAAA TCGTTGCTGG ATAAGCTGAA AGTTTCCACC
CATGTGTTCC GCGTGGGTAC GTATAAATCT GCCGTTGAAC CGTTTATTCG TGATGATATG
TCACCGGCAG CCCGCGAAGC TGACAGCCGC TGGATTGGTG AGCTGTGGCA AAACTATCTG
AATACTGTTG CCGCTAACCG GCAGATCCCT GCTGAGCAGG TATTCCCTGG CGCGCAAGGG
TTGCTTGAGG GTTTAACCAA AACCGGTGGC GATACCGCGA AATATGCACT GGAAAACAAG
CTGGTCGATG CACTGGCATC GAGTGCGGAA ATCGAAAAAG CACTGACCAA AGAATTCGGC
TGGAGTAAGA CTGATAAAAA TTATCGCGCC ATCAGTTATT ACGATTACGC ATTGAAAACG
CCGGCAGATA CCGGTGACAG CATCGGTGTC GTCTTTGCTA ATGGCGCAAT TATGGATGGC
GAGGAAACTC AGGGGAATGT TGGCGGTGAT ACCACTGCGG CACAAATCCG CGACGCTCGC
CTTGACCCGA AAGTGAAAGC GATTGTCCTG CGTGTTAATA GCCCAGGCGG CAGCGTTACC
GCGTCTGAAG TGATTCGCGC TGAACTGGCA GCAGCCCGGG CAGCGGGTAA GCCTGTGGTT
GTATCGATGG GCGGCATGGC GGCATCTGGT GGTTACTGGA TTTCCACGCC AGCTAATTAC
ATTGTGGCTA ACCCCAGCAC CCTGACCGGT TCTATCGGTA TCTTCGGCGT GATCACCACC
GTAGAAAATA GTCTGGATTC GATTGGTGTT CATACCGATG GTGTCTCAAC TTCACCGCTG
GCAGATGTTT CTATCACCAG GGCACTGCCG CCGGAAGCAC AGCAGATGAT GCAGTTAAGC
ATTGAGAATG GCTATAAACG CTTTATCACG CTGGTTGCTG ATGCGCGTCA TTCGACGCCG
GAGCAGATTG ATAAAATCGC CCAGGGCCAC GTCTGGACCG GTCAGGATGC AAAAGCTAAC
GGGCTGGTCG ATAGCCTCGG GGATTTCGAT GATGCAGTTG CCAAAGCAGC AGAGCTGGCA
AAAGTGAAAC AGTGGCATCT GGAATACTAC GTTGATGAAC CGACCTTCTT CGACAAAGTG
ATGGACAACA TGTCTGGTTC TGTCCGGGCA ATGTTGCCAG ATGCGTTCCA GGCCATGTTA
CCTGCACCGC TGGCCTCGGT AGCCTCTACT GTTAAAAGTG AAAGCGACAA GCTGGCCGCG
TTTAACGACC CACAAAACCG TTATGCGTTT TGCCTGACCT GCGCCAACGT GCGTTAA
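
The gene sequence is printed in blocks of 10 bases separated by spaces, so any computation on it first needs the whitespace removed. The following is a minimal sketch of a GC-content calculation on sequence text in this layout. The function name `gc_percent` is hypothetical, and the database's own rounding rule for the GC content field is not stated in this record.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Toy input in the same block layout as the record (not the full gene).
print(gc_percent("ATGC GGCC"))  # 75.0
```

Passing the complete gene sequence text to `gc_percent` gives a value that can be compared with the GC content field in the Gene Information section.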
 
Protein sequence
MRTLWRFIAG FFKWTWRLLN FVREMVLNLF FIFLVLVGVG IWMQVSGGDS KETASRGALL 
LDISGVIVDK PDSSQRFSKL SRQLLGASSD RLQENSLFDI VNTIRQAKDD RNITGIVMDL
KNFAGGDQPS MQYIGKALKE FRDSGKPVYA VGENYSQGQY YLASFANKIW LSPQGVVDLH
GFATNGLYYK SLLDKLKVST HVFRVGTYKS AVEPFIRDDM SPAAREADSR WIGELWQNYL
NTVAANRQIP AEQVFPGAQG LLEGLTKTGG DTAKYALENK LVDALASSAE IEKALTKEFG
WSKTDKNYRA ISYYDYALKT PADTGDSIGV VFANGAIMDG EETQGNVGGD TTAAQIRDAR
LDPKVKAIVL RVNSPGGSVT ASEVIRAELA AARAAGKPVV VSMGGMAASG GYWISTPANY
IVANPSTLTG SIGIFGVITT VENSLDSIGV HTDGVSTSPL ADVSITRALP PEAQQMMQLS
IENGYKRFIT LVADARHSTP EQIDKIAQGH VWTGQDAKAN GLVDSLGDFD DAVAKAAELA
KVKQWHLEYY VDEPTFFDKV MDNMSGSVRA MLPDAFQAML PAPLASVAST VKSESDKLAA
FNDPQNRYAF CLTCANVR
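
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, relying on the fact that NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as starts. The function name `translate` and the compact table construction are illustrative choices, not the database's own tooling.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base T, C, A, G; then second; then third).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS up to the first stop codon, ignoring whitespace.

    Raises KeyError on ambiguous bases such as N.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence in this record.
print(translate("ATGCGAACCC TTTGGCGATT TATTGCCGGA"))  # MRTLWRFIAG
print(translate("GCCAACGTGCGTTAA"))                   # ANVR
```

Both fragments reproduce the corresponding ends of the listed protein sequence. A non-ATG start codon, which table 11 permits, would need special handling because the initiator residue is still methionine; this gene starts with ATG, so the sketch does not cover that case.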