Gene Information |
Locus tag | EcHS_A1936 |
Symbol | prtB |
ID | 5592883 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 1945756 |
End bp | 1947816 |
Gene Length | 2061 bp |
Protein Length | 686 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640921081 |
Product | protease 2 |
Protein accession | YP_001458630 |
Protein GI | 157161312 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1770] Protease II |
TIGRFAM ID | |
|
|
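The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene span includes the stop codon; neither convention is stated in the record, but both are consistent with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 1945756
end_bp = 1947816

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases regardless of strand.
gene_length = end_bp - start_bp + 1

# Assumption: the span includes the terminal stop codon, which does
# not code for an amino acid, so one codon is subtracted.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2061, matching "Gene Length"
print(protein_length)  # 686, matching "Protein Length"
```

Because the gene lies on the minus strand, the coding sequence reads from the end coordinate back toward the start coordinate on the replicon, but the length calculation is unaffected.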
Plasmid Coverage information |
Num covering plasmid clones | 49 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTACCAA AAGCCGCCCG CATTCCCCAC GCCATGACGC TTCATGGCGA TACGCGCATC GATAATTACT ACTGGCTGCG GGACGATACG CGTTCTCAGC CGGAAGTCCT GGACTACCTG CAACAAGAAA ATAGTTACGG TCATCGGGTG ATGGCCTCAC AACAAGCCTT GCAGGATCGC ATCTTAAAGG AAATCATCGA CCGCATTCCG CAAAGAGAAG TTTCTGCGCC CTACATCAAA AATGGCTACC GCTATCGGCA TATTTATGAA CCAGGCTGTG AATATGCTAT CTACCAGCGT CAATCGGCGT TCAGTGAAGA GTGGGACGAG TGGGAAATAT TGCTCGATGC CAACAAGCGC GCGGCTCATA GTGAGTTTTA TTCGATGGGC GGAATGGCGA TTACGCCCGA TAACACCATT ATGGCGCTGG CAGAAGATTT TCTTTCCCGA CGCCAGTACG GCATTCGTTT TCGTAATCTG GAAACAGGTA ACTGGTACCC GGAACTGCTG GATAACGTTG AACCCAGCTT TGTCTGGGCA AATGACTCCT GGACTTTCTA CTATGTTCGC AAGCATCCAG TGACGCTGCT GCCTTATCAG GTCTGGCGTC ACGCTATCGG TACTCCAGCA TCGCAAGATA AACTGATCTA CGAAGAAAAA GACGATACCT ATTACGTCAG CCTGCATAAA ACGACGTCGA AGCACTATGT AGTCATTCAT TTGGCCAGCG CCACCACCAG TGAAGTTCGC CTGCTGGACG CGGAAATGGC CGATGCCGAG CCGTTTGTTT TTCTGCCGCG CCGCAAAGAT CACGAATACA GCCTTGATCA CTACCAGCAT CGGTTTTATC TGCGTTCCAA CCGCAACGGC AAAAACTTTG GCTTATACCG TACCCGTATG CGTGATGAGC AACAGTGGGA AGAGTTAATT CCGCCACGCG AAAACATCAT GCTGGAAGGG TTTACGCTGT TTACCGACTG GCTGGTGGTT GAAGAGCGTC AGCGCGGGTT AACCAGTTTG CGCCAAATTA ACCGCAAGAC CCGGGAAGTC ATTGGTATTG CCTTTGATGA TCCGGCCTAT GTGACCTGGA TTGCCTACAA TCCAGAATCT GAAACCGCGC GATTGCGTTA TGGTTATTCT TCCATGACCA CACCAGACAC TTTGTTTGAA CTGGATATGG ATACCGGTGA GCGTCGTGTA TTAAAACAAA CGGAAGTTCC TGGTTTTGAT GCGGCGAATT ACCGCAGTGA ACACCTGTGG ATAGTCGCCC GTGATGGCGT CGAAGTTCCG GTTTCGCTGG TCTATCATCG CAAACATTTT CGCAAAGGAC ACAACCCGCT GCTGGTGTAT GGCTATGGTT CTTACGGCGC AAGTATTGAT GCCGATTTCA GTTTTAGCCG CTTGAGTTTG TTAGATCGTG GCTTTGTCTA CGCCATTGTC CATGTTCGCG GCGGTGGTGA GCTGGGGCAA CAATGGTACG AAGACGGAAA ATTTCTGAAG AAGAAAAATA CGTTTAATGA TTATCTTGAT GCCTGCGATG CATTGTTAAA ACTGGGCTAT GGCTCTCCTT CGCTTTGTTA TGCGATGGGC GGGAGTGCGG GGGGCATGTT GATGGGCGTT GCGATTAATG CACGCCCTGA ATTATTCCAC GGCGTTATCG CCCAGGTACC GTTTGTTGAT GTTGTAACAA CAATGCTTGA TGAATCAATT CCTCTTACCA CTGGTGAGTT TGAAGAGTGG GGGAATCCGC AGGATCCGCA ATATTACGAG TATATGAAAA GCTACAGCCC ATATGACAAC 
GTCACCGCAC AGGCTTATCC GCATTTACTG GTAACGACCG GTTTGCACGA TTCTCAGGTG CAATATTGGG AACCGGCAAA ATGGGTCGCT AAATTGCGCG AGCTGAAAAC CGATGACCAT CTTTTATTGC TCTGTACCGA CATGGACTCA GGCCATGGCG GTAAATCTGG TCGCTTTAAA TCGTACGAAG GCGTAGCGAT GGAATATGCT TTTCTGGTCG CGCTGGCGCA GGGAACATTA CCCGCTACGC CTGCGGATTA A
|
Protein sequence | MLPKAARIPH AMTLHGDTRI DNYYWLRDDT RSQPEVLDYL QQENSYGHRV MASQQALQDR ILKEIIDRIP QREVSAPYIK NGYRYRHIYE PGCEYAIYQR QSAFSEEWDE WEILLDANKR AAHSEFYSMG GMAITPDNTI MALAEDFLSR RQYGIRFRNL ETGNWYPELL DNVEPSFVWA NDSWTFYYVR KHPVTLLPYQ VWRHAIGTPA SQDKLIYEEK DDTYYVSLHK TTSKHYVVIH LASATTSEVR LLDAEMADAE PFVFLPRRKD HEYSLDHYQH RFYLRSNRNG KNFGLYRTRM RDEQQWEELI PPRENIMLEG FTLFTDWLVV EERQRGLTSL RQINRKTREV IGIAFDDPAY VTWIAYNPES ETARLRYGYS SMTTPDTLFE LDMDTGERRV LKQTEVPGFD AANYRSEHLW IVARDGVEVP VSLVYHRKHF RKGHNPLLVY GYGSYGASID ADFSFSRLSL LDRGFVYAIV HVRGGGELGQ QWYEDGKFLK KKNTFNDYLD ACDALLKLGY GSPSLCYAMG GSAGGMLMGV AINARPELFH GVIAQVPFVD VVTTMLDESI PLTTGEFEEW GNPQDPQYYE YMKSYSPYDN VTAQAYPHLL VTTGLHDSQV QYWEPAKWVA KLRELKTDDH LLLLCTDMDS GHGGKSGRFK SYEGVAMEYA FLVALAQGTL PATPAD
|
| |
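The relationship between the gene sequence and the protein sequence listed above can be illustrated with a small translation routine. This is a minimal sketch, not the database's own pipeline: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, which this sketch does not model). The function name `translate` and the variable names are hypothetical.

```python
from itertools import product

# Codons in the conventional T, C, A, G order, with the first base
# varying slowest, paired with the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base blocks in the record.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
first_block = "ATGCTACCAA AAGCCGCCCG CATTCCCCAC"
print(translate(first_block))  # MLPKAARIPH

# Final four codons of the gene sequence, ending in the stop codon TAA.
last_codons = "CCT GCG GAT TAA"
print(translate(last_codons))  # PAD
```

The two outputs correspond to the first ten and last three residues of the listed protein sequence. The gene sequence is already given in the coding orientation, so no reverse complement is needed even though the gene is on the minus strand.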