Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A1921 |
Symbol | prc |
ID | 5592467 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1931745 |
End bp | 1933793 |
Gene Length | 2049 bp |
Protein Length | 682 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640921064 |
Product | carboxy-terminal protease |
Protein accession | YP_001458615 |
Protein GI | 157161297 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00000000000149225 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACATGT TTTTTAGGCT TACCGCGTTA GCTGGCCTGC TTGCAATAGC AGGCCAGACC TTCGCTGTAG AAGATATCAC GCGTGCTGAT CAAATTCCGG TATTAAAGGA AGAGACGCAG CATGCGACGG TAAGTGAGCG CGTAACGTCG CGCTTCACCC GTTCTCATTA TCGCCAGTTC GACCTCGATC AGGCATTTTC GGCCAAAATC TTTGACCGCT ACCTGAATCT GCTCGATTAC AGCCACAACG TGCTGCTGGC AAGCGATGTT GAACAGTTCG CGAAAAAGAA AACCGAGTTA GGCGATGAAC TGCGTTCAGG CAAACTCGAC GTTTTCTACG ATCTCTACAA TCTGGCGCAA AAGCGCCGTT TTGAGCGTTA CCAGTACGCT TTGTCGGTAC TGGAAAAGCC GATGGATTTC ACCGGCAACG ACACTTATAA CCTTGACCGC AGCAAAGCGC CCTGGCCGAA AAACGAGGCT GAGTTGAACG CGCTGTGGGA CAGTAAAGTC AAATTCGACG AGTTAAGCCT GAAGCTGGCA GGAAAAACGG ATAAAGAAAT TCGTGAAACC CTGACACGCC GCTACAAATT TGCCATTCGT CGTCTGGCGC AAACCAACAG CGAAGATGTT TTCTCGCTGG CAATGACGGC GTTTGCGCGT GAAATCGACC CGCATACCAA CTATCTTTCC CCGCGTAATA CCGAACAGTT CAACACTGAA ATGAGTTTGT CGCTGGAAGG TATTGGCGCA GTGCTGCAAA TGGATGATGA CTACACCGTT ATCAATTCGA TGGTGGCAGG TGGTCCGGCA GCGAAGAGTA AAGCTATCAG CGTTGGTGAC AAAATTGTCG GTGTTGGTCA AACAGGCAAG CCGATGGTTG ACGTGATTGG CTGGCGTCTT GATGATGTGG TTGCCTTAAT TAAAGGGCCG AAGGGCAGTA AAGTTCGTCT GGAAATTTTA CCTGCTGGTA AAGGGACCAA GACCCGCACT GTAACATTGA CCCGTGAACG TATTCGTCTC GAAGACCGCG CGGTTAAAAT GTCGGTGAAG ACCGTCGGTA AAGAGAAAGT CGGCGTGCTG GATATTCCGG GCTTCTATGT GGGTTTGACA GACGATGTCA AAGTTCAACT GCAGAAACTG GAAAAACAGA ATGTCAGCAG TGTGATCATC GACCTGCGTA GCAATGGCGG TGGGGCGTTG ACCGAAGCGG TATCGCTCTC AGGTCTGTTT ATTCCTGCGG GTCCCATTGT TCAGGTCCGC GATAACAACG GCAAGGTTCG TGAAGACAGC GATACCGACG GGCAGGTGTT CTATAAAGGC CCGCTGGTGG TACTGGTTGA CCGTTTCAGT GCTTCGGCTT CAGAAATCTT TGCCGCGGCA ATGCAGGATT ACGGTCGTGC GCTGGTTGTG GGTGAACCGA CGTTTGGTAA AGGCACCGTT CAGCAATACC GTTCATTGAA CCGTATTTAC GATCAGATGT TACGTCCTGA ATGGCCAGCG CTGGGTTCTG TGCAGTACAC GATCCAGAAA TTCTATCGCG TTAACGGCGG CAGTACGCAA CGTAAAGGCG TAACGCCAGA CATCATCATG CCGACGGGTA ATGAAGAAAC GGAAACGGGT GAGAAATTCG AAGATAACGC GCTGCCGTGG GATAGCATCG ATGCCGCGAC TTATGTGAAA TCAGGAGATT TAACTGCCTT TGAACCGGAG CTGCTGAAGG AACATAATGC GCGTATCGCG AAAGATCCTG AGTTCCAGAA CATCATGAAG GATATCGCGC GCTTCAACGC TATGAAGGAC AAGCGCAATA TCGTTTCTCT GAACTACGCT GTGCGTGAGA AAGAGAATAA TGAAGATGAT GCGACGCGTC TGGCGCGTTT GAACGAACGC TTTAAACGCG AAGGTAAACC GGAGTTGAAG AAGCTGGATG ATCTACCGAA AGATTACCAG GAGCCGGATC CTTATCTGGA TGAGACGGTG AATATCGCAC TCGATCTGGC GAAGCTTGAA AAAGCCAGAC CCGCGGAACA ACCCGCTCCC GTCAAGTAA
|
Protein sequence | MNMFFRLTAL AGLLAIAGQT FAVEDITRAD QIPVLKEETQ HATVSERVTS RFTRSHYRQF DLDQAFSAKI FDRYLNLLDY SHNVLLASDV EQFAKKKTEL GDELRSGKLD VFYDLYNLAQ KRRFERYQYA LSVLEKPMDF TGNDTYNLDR SKAPWPKNEA ELNALWDSKV KFDELSLKLA GKTDKEIRET LTRRYKFAIR RLAQTNSEDV FSLAMTAFAR EIDPHTNYLS PRNTEQFNTE MSLSLEGIGA VLQMDDDYTV INSMVAGGPA AKSKAISVGD KIVGVGQTGK PMVDVIGWRL DDVVALIKGP KGSKVRLEIL PAGKGTKTRT VTLTRERIRL EDRAVKMSVK TVGKEKVGVL DIPGFYVGLT DDVKVQLQKL EKQNVSSVII DLRSNGGGAL TEAVSLSGLF IPAGPIVQVR DNNGKVREDS DTDGQVFYKG PLVVLVDRFS ASASEIFAAA MQDYGRALVV GEPTFGKGTV QQYRSLNRIY DQMLRPEWPA LGSVQYTIQK FYRVNGGSTQ RKGVTPDIIM PTGNEETETG EKFEDNALPW DSIDAATYVK SGDLTAFEPE LLKEHNARIA KDPEFQNIMK DIARFNAMKD KRNIVSLNYA VREKENNEDD ATRLARLNER FKREGKPELK KLDDLPKDYQ EPDPYLDETV NIALDLAKLE KARPAEQPAP VK
|
| |