Gene EcE24377A_3717 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_3717 
SymboldegQ 
ID5588791 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp3710828 
End bp3712195 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content52% 
IMG OID640927340 
Productserine endoprotease 
Protein accessionYP_001464707 
Protein GI157156170 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000159943 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAC AAACCCAGCT GTTGAGTGCA TTAGCGTTAA GTGTCGGGTT AACTCTCTCG 
GCGTCATTTC AGGCCGTCGC GTCGATTCCA GGCCAGGTTG CCGATCAGGC CCCTCTCCCC
AGTCTGGCTC CAATGCTGGA AAAAGTGCTT CCGGCAGTGG TGAGCGTACG GGTGGAAGGA
ACGGCCAGTC AGGGACAGAA AATCCCGGAA GAATTCAAAA AGTTTTTTGG TGATGATTTA
CCGGATCAAC CTGCACAACC CTTCGAAGGT TTAGGTTCCG GTGTCATCAT CAACGCCAGT
AAAGGCTATG TGCTGACCAA CAACCATGTG ATTAATCAGG CACAGAAAAT CAGTATTCAG
CTCAATGATG GGCGCGAGTT TGATGCAAAA CTGATTGGTA GCGATGACCA GAGCGATATC
GCCCTGTTAC AAATTCAAAA CCCGAGCAAA TTAACGCAAA TCGCTATTGC CGACTCCGAT
AAATTGCGCG TCGGTGATTT TGCCGTAGCG GTCGGTAACC CATTTGGCCT TGGGCAAACC
GCCACCTCTG GCATTGTTTC CGCATTAGGA CGCAGCGGGT TGAATCTTGA AGGTCTGGAA
AACTTTATCC AGACAGATGC TTCCATTAAC CGCGGTAACT CCGGCGGTGC ACTGTTAAAC
CTTAACGGTG AGTTAATTGG CATCAACACT GCAATCCTTG CGCCTGGCGG CGGGAGCGTC
GGGATTGGAT TTGCCATCCC CAGTAATATG GCGCGAACAC TGGCGCAGCA GCTTATCGAC
TTTGGTGAAA TCAAACGCGG TTTGTTAGGC ATCAAAGGCA CCGAGATGAG TGCCGATATC
GCCAAAGCCT TCAACCTTGA CGTGCAGCGT GGCGCGTTTG TCAGCGAAGT GTTGCCAGGT
TCTGGCTCAG CGAAAGCGGG CGTCAAAGCG GGCGATATTA TTACCAGCCT CAACGGCAAA
CCGCTGAATA GCTTTGCTGA GTTGCGCTCT CGTATCGCGA CCACCGAGCC GGGCACGAAA
GTGAAGCTTG GCCTGCTGCG TAACGGCAAG CCACTGGAAG TAGAAGTGAC GCTCGATACC
AGCACCTCTT CGTCGGCCAG CGCTGAAATG ATCACGCCAG CGCTGGAAGG TGCAACGTTG
AGCGATGGTC AGCTAAAAGA TGGCGGCAAA GGTATTAAGA TCGATGAAGT TGTCAAAGGA
AGCCCAGCTG CTCAGGCTGG CTTGCAAAAA GACGATGTGA TCATTGGCGT CAACCGCGAT
CGGGTGAACT CGATTGCTGA AATGCGTAAA GTGCTGGCGG CAAAACCGGC CATCATCGCC
CTGCAAATTG TACGCGGCAA TGAAAGCATC TATCTGCTGA TGCGTTAA
 
Protein sequence
MKKQTQLLSA LALSVGLTLS ASFQAVASIP GQVADQAPLP SLAPMLEKVL PAVVSVRVEG 
TASQGQKIPE EFKKFFGDDL PDQPAQPFEG LGSGVIINAS KGYVLTNNHV INQAQKISIQ
LNDGREFDAK LIGSDDQSDI ALLQIQNPSK LTQIAIADSD KLRVGDFAVA VGNPFGLGQT
ATSGIVSALG RSGLNLEGLE NFIQTDASIN RGNSGGALLN LNGELIGINT AILAPGGGSV
GIGFAIPSNM ARTLAQQLID FGEIKRGLLG IKGTEMSADI AKAFNLDVQR GAFVSEVLPG
SGSAKAGVKA GDIITSLNGK PLNSFAELRS RIATTEPGTK VKLGLLRNGK PLEVEVTLDT
STSSSASAEM ITPALEGATL SDGQLKDGGK GIKIDEVVKG SPAAQAGLQK DDVIIGVNRD
RVNSIAEMRK VLAAKPAIIA LQIVRGNESI YLLMR