Gene EcE24377A_1990 details


Gene Information

Locus tag: EcE24377A_1990
Symbol: sppA
ID: 5587437
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 1974459
End bp: 1976315
Gene Length: 1857 bp
Protein Length: 618 aa
Translation table: 11
GC content: 52%
IMG OID: 640925662
Product: protease 4
Protein accession: YP_001463065
Protein GI: 157158760
COG category:
    [O] Posttranslational modification, protein turnover, chaperones
    [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0616] Periplasmic serine proteases (ClpP class)
TIGRFAM ID:
    [TIGR00705] signal peptide peptidase SppA, 67K type
    [TIGR00706] signal peptide peptidase SppA, 36K type
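
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1974459, 1976315)
print(length, protein_length_aa(length))  # prints: 1857 618
```

Both values agree with the Gene Length and Protein Length fields listed in the record.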


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0152485
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGAACCC TTTGGCGATT TATTGCCGGA TTTTTTAAAT GGACGTGGCG TCTGCTGAAT 
TTCGTCCGTG AAATGGTACT TAACCTGTTC TTTATTTTCC TCGTACTGGT TGGTGTGGGG
ATCTGGATGC AGGTCAGTGG TGGTGATTCG AAAGAAACGG CCAGTCGTGG CGCACTGCTG
CTGGACATTT CTGGTGTGAT CGTCGATAAA CCCGACAGTT CTCAGCGGTT TAGTAAATTA
AGCCGCCAGC TGCTTGGTGC CAGTTCCGAT CGTCTGCAGG AAAACTCACT GTTTGATATC
GTCAACACCA TTCGCCAGGC GAAGGACGAC CGTAATATCA CCGGTATCGT GATGGATCTG
AAAAACTTCG CAGGCGGCGA CCAGCCGTCT ATGCAGTACA TCGGCAAAGC CCTGAAAGAG
TTTCGTGACA GCGGGAAACC GGTTTATGCC GTTGGCGAGA ACTACAGCCA GGGGCAATAT
TATCTCGCCA GTTTCGCCAA TAAAATTTGG CTGTCACCGC AAGGCGTGGT TGATCTGCAC
GGTTTTGCTA CCAACGGTCT GTACTACAAA TCGTTGCTGG ATAAGCTGAA AGTTTCCACC
CATGTGTTCC GCGTGGGTAC GTATAAATCT GCCGTTGAAC CGTTTATTCG TGATGATATG
TCACCGGCAG CCCGCGAAGC TGACAGCCGC TGGATTGGTG AGCTGTGGCA AAACTATCTG
AATACTGTTG CCGCTAACCG GCAGATCCCT GCTCAGCAGG TATTCCCTGG CGCGCAAGGG
TTGCTTGAGG GTTTAACCAA AACCGGTGGC GATACCGCGA AATATGCACT GGAAAACAAG
CTGGTCGATG CACTGGCATC GAGTGCCGAA ATCGAAAAAG CTCTGACCAA AGAGTTCGGC
TGGAGTAAGA CTGATAAAAA TTATCGCGCC ATCAGTTATT ACGATTACGC ATTGAAAACG
CCGGCAGATA CCGGTGACAG CATCGGTGTC GTCTTCGCTA ATGGCGCAAT TATGGATGGC
GAGGAAACTC AGGGGAATGT TGGCGGTGAT ACCACTGCGG CACAAATCCG CGACGCTCGC
CTTGACCCGA AAGTGAAAGC GATTGTCCTG CGTGTTAATA GCCCAGGCGG CAGCGTTACC
GCGTCTGAAG TGATTCGCGC TGAACTGGCA GCAGCCCGGG CAGCGGGTAA GCTTGTGGTT
GTATCGATGG GCGGCATGGC GGCATCTGGT GGTTACTGGA TTTCCACGCC AGCTAATTAC
ATTGTGGCTA ACCCCAGCAC CCTGACCGGT TCTATCGGTA TCTTCGGCGT GATCACCACC
GTAGAAAATA GTCTGGATTC GATTGGTGTT CATACTGATG GTGTCTCAAC TTCACCGCTG
GCGGATGTTT CTATCACCAG GGCACTGCCG CCGGAAGCGC AGCAGATGAT GCAATTAAGC
ATTGAGAATG GCTATAAACG CTTTATCACG CTGGTTGCTG ATGCGCGTCA TTCGACGCCG
GAGCAAATTG ATAAAATCGC CCAGGGCCAC GTCTGGACCG GTCAGGATGC AAAAGCTAAC
GGGCTGGTCG ATAGTCTCGG GGATTTCGAT GATGCGGTTG CCAAAGCAGC AGAGCTGGCA
AAAGTGAAAC AGTGGCATCT GGAATACTAC GTTGATGAAC CGACCTTCTT CGACAAAGTG
ATGGACAACA TGTCTGGTTC TGTCCGGGCA ATGTTGCCAG ATGCGTTCCA GGCCATGTTA
CCTGCACCGC TTGCCTCGGT AGCCTCTACC GTTAAAAGTG AAAGCGACAA GCTGGCCGCG
TTTAACGACC CACAAAACCG TTATGCGTTT TGCCTGACCT GCGCCAACGT GCGTTAA
 
Protein sequence
MRTLWRFIAG FFKWTWRLLN FVREMVLNLF FIFLVLVGVG IWMQVSGGDS KETASRGALL 
LDISGVIVDK PDSSQRFSKL SRQLLGASSD RLQENSLFDI VNTIRQAKDD RNITGIVMDL
KNFAGGDQPS MQYIGKALKE FRDSGKPVYA VGENYSQGQY YLASFANKIW LSPQGVVDLH
GFATNGLYYK SLLDKLKVST HVFRVGTYKS AVEPFIRDDM SPAAREADSR WIGELWQNYL
NTVAANRQIP AQQVFPGAQG LLEGLTKTGG DTAKYALENK LVDALASSAE IEKALTKEFG
WSKTDKNYRA ISYYDYALKT PADTGDSIGV VFANGAIMDG EETQGNVGGD TTAAQIRDAR
LDPKVKAIVL RVNSPGGSVT ASEVIRAELA AARAAGKLVV VSMGGMAASG GYWISTPANY
IVANPSTLTG SIGIFGVITT VENSLDSIGV HTDGVSTSPL ADVSITRALP PEAQQMMQLS
IENGYKRFIT LVADARHSTP EQIDKIAQGH VWTGQDAKAN GLVDSLGDFD DAVAKAAELA
KVKQWHLEYY VDEPTFFDKV MDNMSGSVRA MLPDAFQAML PAPLASVAST VKSESDKLAA
FNDPQNRYAF CLTCANVR
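
The gene and protein sequences above are related by translation table 11, which assigns the same amino acid to each codon as the standard genetic code and differs only in which start codons are permitted. As a minimal sketch, the code below translates a sequence pasted in the 10-base blocks used on this page and computes its GC percentage. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and alternative start codons are not treated specially, which does not matter for this gene because it begins with ATG.

```python
from itertools import product

# Codon table in NCBI order: bases cycle T, C, A, G with the first base slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove spaces and line breaks and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence listed above.
print(translate("ATGCGAACCC TTTGGCGATT TATTGCCGGA"))  # prints: MRTLWRFIAG
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the protein sequence and the GC content field in the record.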