Gene Information |
Locus tag | Ent638_3669 |
Symbol | |
ID | 5111917 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | + |
Start bp | 3976124 |
End bp | 3977491 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640493874 |
Product | serine endoprotease |
Protein accession | YP_001178377 |
Protein GI | 146313303 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
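The start position, end position, gene length, and protein length in the table above are consistent with one another, and a little arithmetic confirms it. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record for Ent638_3669.
length = gene_length_bp(3976124, 3977491)
print(length)                     # 1368
print(protein_length_aa(length))  # 455
```

Both results match the Gene Length (1368 bp) and Protein Length (455 aa) rows, which supports the inclusive-coordinate reading.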
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0364754 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.327931 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAA AAAACCAGCT GTTAAGCGCG ATTGCGTTAA GTGTCGGGTT ATCTCTCACG GCGTCATGGC CTGCGGCTGC GGCCCTGCCT TCGCAAGTGC CTGGTGAGGC AGCCATTCCA AGCCTTGCGC CAATGCTTGA AAAAGTGTTG CCGGCGGTGG TCAGCATTAA GGTTGAGGGC ACGGCGGCAC AGAGCCAACG TATACCAGAA GAGCTAAAAA AATATTTTGG TGAAGAGGGA CCCGATCAGC AAACCCAGCC GTTTGAAGGG CTTGGCTCCG GGGTAGTGAT TGATGCGGCG AAAGGCTACG TGCTGACGAA CAATCACGTT ATCAGCCAGG CCGATAAAAT CAGCGTGCAA ATGAATGATG GTCGCGAGTT CGATGCCAAA CTGATTGGCA GCGACGATCA AAGTGACATC GCGCTACTGC AAATCCAGAA TCCCAGCAAG CTGACGCAAA TTGTTATCGC GGATTCCGAC AAATTGCGTG TCGGAGATTT TGCCGTGGCC GTGGGTAACC CTTTCGGTCT GGGCCAGACG GCAACCTCAG GCATTATCTC TGCGTTGGGT CGTAGCGGTC TAAATCTCGA AGGCCTGGAG AACTTTATCC AGACTGACGC CTCCATTAAC CGCGGGAATT CCGGCGGCGC ACTGCTAAAC CTCAACGGTG AGCTGATTGG TATTAACACC GCGATTCTTG CTCCCGGTGG CGGCAGCGTT GGTATTGGCT TTGCGATCCC CAGCAATATG GCCAAAATCC TCTCGCAACA GCTGATTGAA TCCGGCGAGG TAAAACGCGG TCTGCTGGGA ATTAAAGGCA TGGAAATGAG CGCTGATATC GCAAAAGCCT TTAATCTCGA CGTGCAGCGT GGGGCGTTTG TCAGTGAAGT TCTGGCGAAC TCCGGCTCCG CAAAAGCGGG TGTGAAATCG GGCGATATCA TCGTTAGCCT CAACGGTAAG CCGCTGAGCA GCTTCGCTGA ATTACGCTCA CGTGTTGCCA CCACGGAACC TGGGACGAAG GTGAAGCTTG GGTTGCTGCG CGATGGCAAA CCGCTGGAAG TCGAAGTTAC GCTGGATAAG AGCACCTCAT CGTCGGCCAG TGCTGAACTG ATTGCCCCTG CACTACAAGG TGCGGCGCTC AGCGATGGTC AGCTTAAAGA CGGTACAAAA GGTATTTCTA TCGAGAGCGT CGAAAAGAGC AGTCCCGCCG CGCAGGCAGG CCTGCATCAG GATGACGTGA TTATCGGTGT TAACCGCAGC CGTGTTCAGT CGATCGCCGA GATGCGCAAG GTCCTGGAAA GCAAGCCAGC AGTGATTGCG CTGCAAATCA TCCGTGGCAA CGATACGCTC TACATTCTAT TACGCTAA
|
Protein sequence | MKKKNQLLSA IALSVGLSLT ASWPAAAALP SQVPGEAAIP SLAPMLEKVL PAVVSIKVEG TAAQSQRIPE ELKKYFGEEG PDQQTQPFEG LGSGVVIDAA KGYVLTNNHV ISQADKISVQ MNDGREFDAK LIGSDDQSDI ALLQIQNPSK LTQIVIADSD KLRVGDFAVA VGNPFGLGQT ATSGIISALG RSGLNLEGLE NFIQTDASIN RGNSGGALLN LNGELIGINT AILAPGGGSV GIGFAIPSNM AKILSQQLIE SGEVKRGLLG IKGMEMSADI AKAFNLDVQR GAFVSEVLAN SGSAKAGVKS GDIIVSLNGK PLSSFAELRS RVATTEPGTK VKLGLLRDGK PLEVEVTLDK STSSSASAEL IAPALQGAAL SDGQLKDGTK GISIESVEKS SPAAQAGLHQ DDVIIGVNRS RVQSIAEMRK VLESKPAVIA LQIIRGNDTL YILLR
|
| |
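The gene and protein sequences above are printed in space-separated blocks of ten characters, so the spacing has to be removed before the sequence can be measured or compared with the GC content row. The following is a minimal Python sketch of that clean-up step, with a simple GC fraction calculation. The function names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two ten-base blocks of the gene sequence shown above.
first_blocks = clean_sequence("ATGAAGAAAA AAAACCAGCT")
print(len(first_blocks))                    # 20
print(round(gc_fraction(first_blocks), 2))  # 0.3
```

Passing the complete Gene sequence field through `clean_sequence` should give a string whose length can be compared with the Gene Length row, and `gc_fraction` on that string can be compared with the GC content row.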