Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ent638_0701 |
Symbol | |
ID | 5111386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Enterobacter sp. 638 |
Kingdom | Bacteria |
Replicon accession | NC_009436 |
Strand | + |
Start bp | 797520 |
End bp | 799010 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640490872 |
Product | serine endoprotease |
Protein accession | YP_001175439 |
Protein GI | 146310365 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DeqQ family |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACACAG CAATTTTGCG TAATCTGTTA ATCGAGATTG AGAACATGAA AAAAACAACA TTAGCAATGA GTGCACTCGC ATTGAGTTTA GGTTTAGCGT TGTCTCCTCT GTCTGCAAGC GCAGCCGAGA CCGCATCTTC GGTCACAACC GCGCAGCAGA TGCCAAGCCT GGCCCCGATG CTTGAGAAAG TGATGCCATC GGTGGTGAGT ATTAACGTTG AGGGAAGCAC AACCGTTAAT ACGCCGCGCA TGCCGCGCAA CTTCCAGCAG TTCTTTGGCG ATAATTCACC GTTCTGCCAG GACGGCTCAC CATTCCAGAG CTCACCGTTC TGTCAGGGCG GTGGTGCGGG CGATGACACC CCTGGCGGTA ACGGTGGCGG TCAGCAGCAA AAATTCATGG CGCTGGGATC GGGCGTGATT ATTGACGCGG CGAAAGGCTA TGTCGTCACC AATAACCACG TTGTCGATAA CGCCAGCACC ATCAAAGTAC AGTTGAGCGA TGGGCGTAAA TTTGATGCCA AAGTGGTGGG TAAAGACCCG CGCTCTGACA TCGCTCTGAT TCAGATTCAG GATCCAAAAA ACCTGACGGC CATTAAACTG GCTGATTCCG ATGCCCTGCG CGTCGGTGAT TATACCGTAG CCATCGGTAA CCCGTTCGGC CTGGGTGAAA CGGTGACATC CGGTATCGTT TCTGCACTGG GTCGTAGCGG CCTGAATGCG GAAAACTATG AAAACTTTAT CCAGACGGAT GCGGCCATTA ACCGCGGTAA CTCCGGCGGT GCGCTGGTTA ACCTGAACGG TGAGCTGATC GGTATCAACA CCGCGATTCT GGCACCGGAC GGCGGCAACA TCGGTATCGG TTTTGCGATT CCAAGTAACA TGGTGAAAAA CCTGACCAGC CAGATGGTTG AATTCGGCCA GGTGAAACGC GGTGAACTGG GTATTCTGGG CACGGAACTG AACTCCGAAC TGGCGAAGGC AATGAAAGTT GACGCGCAGC GCGGGGCCTT TGTCAGCCAG GTGATGCCAA ATTCCTCTGC CGCGAAAGCC GGTATCAAGG CGGGTGACGT GATCACCACC CTGAATGGTA AGCCAATCAG CAGCTTTGCG GCGCTGCGCG CTGAAGTCGG CTCAATGCCG GTGGGCAGCA AAGTGACGTT GGGTCTGCTG CGCGACGGTA AACCTGTTAG CGTTAACCTT GAACTGCAGC AGAGCAGTCA GACTCAGGTC GATTCCAGCT CTATCTTCAG CGGTATTGAA GGTGCGGATA TGAGCAACAA AGGGGCTGAT AAAGGGGTGG TGGTGAGTGA AGTTAAAGCC AACAGCCCAG CGGCCCGTAT CGGCCTGAAA AAAGGCGATG TGATTATTGG CGCTAACCAG CAGCCGGTGA AAAATATTGC TGAACTGCGT AAAATTCTCG ACAGCAAACC TAACGTGCTG GCGCTGAATA TTCAGCGTGG TGATACCACT CTGTACCTGT TGATGCAGTA A
|
Protein sequence | MYTAILRNLL IEIENMKKTT LAMSALALSL GLALSPLSAS AAETASSVTT AQQMPSLAPM LEKVMPSVVS INVEGSTTVN TPRMPRNFQQ FFGDNSPFCQ DGSPFQSSPF CQGGGAGDDT PGGNGGGQQQ KFMALGSGVI IDAAKGYVVT NNHVVDNAST IKVQLSDGRK FDAKVVGKDP RSDIALIQIQ DPKNLTAIKL ADSDALRVGD YTVAIGNPFG LGETVTSGIV SALGRSGLNA ENYENFIQTD AAINRGNSGG ALVNLNGELI GINTAILAPD GGNIGIGFAI PSNMVKNLTS QMVEFGQVKR GELGILGTEL NSELAKAMKV DAQRGAFVSQ VMPNSSAAKA GIKAGDVITT LNGKPISSFA ALRAEVGSMP VGSKVTLGLL RDGKPVSVNL ELQQSSQTQV DSSSIFSGIE GADMSNKGAD KGVVVSEVKA NSPAARIGLK KGDVIIGANQ QPVKNIAELR KILDSKPNVL ALNIQRGDTT LYLLMQ
|
| |