Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1679 |
Symbol | pqqL |
ID | 6145570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1676947 |
End bp | 1679730 |
Gene Length | 2784 bp |
Protein Length | 927 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641616555 |
Product | M16B family peptidase |
Protein accession | YP_001743733 |
Protein GI | 170681032 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAACC TCTGTTTCTT ACTGACGTTA GTGGCAACTC TGTTGCTCCC CGGGCGACTG ATTGCCGCCG CCTTACCGCA GGATGAAAAG TTAATTACCG GGCAACTGGA CAATGGCTTG CGATATCTGA TTTATCCGCA TGCTCAACCA AAGGATCAGG TAAATTTATG GCTGCAAATT CATACCGGTT CATTGCAGGA AGAGGACAAT GAGCGCGGCG TGGCTCATTT TGTAGAACAT ATGATGTTTA ACGGCACAAA AACATGGCCG GGTAATAAAG TCATCGAAAC ATTTGAGTCA ATGGGCCTTC GTTTTGGTCG CGATGTTAAT GCCTATACCA GCTATGACGA AACAGTGTAT CAGGTGAGTT TGCCGACTAC GCAGAAACAA AATCTGCAAC AAGTGATGGC AATCTTCAGT GAATGGAGTA ATGCCGCAAC CTTTGAAAAA CTCGAAGTAG ACGCTGAACG TGGCGTAATT ACTGAGGAAT GGCGTGCCCA TCAGGATGCA AAATGGCGCA CCTCTCAGGC GCGCCGCCCT TTCCTGCTGG CAAATACCCG TAATTTAGAC CGTGAACCTA TCGGCCTGAT GGATACCGTC GCCACGGTCA CACCGGCACA ATTGCGCCAA TTTTATCAAC GCTGGTATCA ACCAAATAAT ATGACCTTTA TTGTGGTCGG CGATATCGAC AGTAAAGAAG CGCTGGCGCT GATAAAGGAT AATTTAAGTA AGCTTCCGGC TAACAAAGCA GCTGAAAATC GCGTCTGGCC GACAAAAGCC GAAAACCACC TGCGCTTTAA TATCATCAAT GATAAAGAAA ACCGGGTGAA CGGCATCGCA CTCTATTATC GCCTGCCAAT GGTGCAAGTG AGCGATGAGC AAAGCTTTAT CGAACAAGCT GAATGGAGCA TGTTAGTTCA GCTGTTCAAC CAACGTCTGC AAGAACGCAT ACAGTCGGGC GAGTTGAAGA CTATTTCTGG CGGCACTGCG CGCAGCGTCA AAATTGCACC CGATTATCAG TCGCTGTTTT TCCGTGTAAA TGCACAAGAC GATAATATGC AGGATGCTGC GAATGCATTA ATGGCAGAGT TGGCAACCAT TGATCAGCAT GGCTTTTCTG CTGAAGAACT CGATGATGTT AAATCTACCC GCCTTACCTG GCTGAAAAAT GCGGTTGATC AGCAAGCTGA GCGTGATTTA CGTATGCTGA CCAGTCGCCT GGCATCCAGC TCATTAAATA ATACGCCGTT CTTGTCGCCG GAAGAGACAT ATCAACTTTC GAAACGTCTG TGGCAGCAAA TTACCGTGCA AAGTCTGGCG GAAAAATGGC AGCAGTTAAG AAAGAACCAG GACGCATTTT GGGAGCAAAT GGTAAACAAT GAGGTTGCCG CCAAAAAAGC ATTGTCTCCT GCGGCTATCC TGGCGCTGGA AAAAGAGTAC GCCAACAAAA AGCTGGCGGC TTACATCTTC CCAGGCAGAA ATTTATCGTT AACAGTAGAC GCTGACCCAC AGGCGGAAAT TAGCAGCAAA GAAACGCTGG CTGAGAATCT GACATCATTA ACACTTTCCA ATGGTGCCAG GGTTATTCTG GCAAAATCCG CGGGTGAAGA GCAAAAGCTA CAAATTACTG CCGTATCTAA TAAAGGCGAT TTAAGTTTCC CTGCGCAGCA AAAATCACTT ATCGCGCTGG CAAATAAAGC AGTAAGCGGA AGCGGCGTTG GTGAACTCTC CTCTTCCAGC CTGAAACGCT GGAGTGCGGA AAACTCGGTA ACCATGAGCA GTAAAGTCAG TGGCATGAAT ACATTGCTCT CTGTTAGCGC GCGGACTAAT AACCCTGAAC CTGGTTTCCA GTTGATTAAC CAGCGAATCA CCCACAGCAC GATTAACGAT AATATTTGGG CATCGCTACA AAATGCTCAA ATTCAGGCGT TGAAAACGCT CGACCAGCGT CCAGCGGAGA AATTCGCCCA GCAGATGTAT GAGACGCGCT ATGCTGATGA CCGCACGAAA TTACTGCAAG AAAATCAGAT TGCACAGTTT ACTGCCGCAG ATGCGCTGGC TGCCGATCGC CAGTTGTTTT CATCTCCAGC GGATATCACC TTTGTCATTG TCGGTAATGT CGCAGAAGAC AAACTCGTGG CGTTAATTAC GCGTTACTTA GGATCAATCA AACACTCTGA TTCGCCATTA GCCGCAGGTA AACCATTAAC TCGCGCGACG GACAACGCAT CGGTTACTGT AAAAGAACAA AATGAACCTG TGGCACAGGT TTCACAGTGG AAGCGTTATG ATTCCCGGAC ACCTGTAAAT CTGGAGACGC GTATGGCGCT CGATGCTTTT AACGTCGCAC TGGCAAAAGA TCTACGCGTT AATATTCGTG AACAGGCATC TGGAGCATAC AGCGTTTCTT CTCGCCTCTC GGTTGATCCT CAGGCCAAAG ATATCAGTCA TTTGCTGGCT TTTACTTGTC AACCAGAACG ACATGATGAA CTGTTAACGT TAGCGAATGA AGTGATGGTT AAGCGCCTGG CTAAAGGGAT CAGTGAGCAA GAACTGAATG AATACCAGCA AAACGTTCAG CGCAGCCTCG ATATCCAACA GCGTAGCGTT CAACAATTAG CGAACACCAT TGTAAATAGT CTTATTCAAT ATGACGATCC TGCAGCATGG ACTGAGCAGG AGCAATTATT GAAACAAATG ACGGTAGAGA ATGTTAACAC TGCCGTTAAA CAATATCTTT CTCATCCGGT GAATACCTAT ACCGGAGTAT TATTGCCAAA ATAA
|
Protein sequence | MRNLCFLLTL VATLLLPGRL IAAALPQDEK LITGQLDNGL RYLIYPHAQP KDQVNLWLQI HTGSLQEEDN ERGVAHFVEH MMFNGTKTWP GNKVIETFES MGLRFGRDVN AYTSYDETVY QVSLPTTQKQ NLQQVMAIFS EWSNAATFEK LEVDAERGVI TEEWRAHQDA KWRTSQARRP FLLANTRNLD REPIGLMDTV ATVTPAQLRQ FYQRWYQPNN MTFIVVGDID SKEALALIKD NLSKLPANKA AENRVWPTKA ENHLRFNIIN DKENRVNGIA LYYRLPMVQV SDEQSFIEQA EWSMLVQLFN QRLQERIQSG ELKTISGGTA RSVKIAPDYQ SLFFRVNAQD DNMQDAANAL MAELATIDQH GFSAEELDDV KSTRLTWLKN AVDQQAERDL RMLTSRLASS SLNNTPFLSP EETYQLSKRL WQQITVQSLA EKWQQLRKNQ DAFWEQMVNN EVAAKKALSP AAILALEKEY ANKKLAAYIF PGRNLSLTVD ADPQAEISSK ETLAENLTSL TLSNGARVIL AKSAGEEQKL QITAVSNKGD LSFPAQQKSL IALANKAVSG SGVGELSSSS LKRWSAENSV TMSSKVSGMN TLLSVSARTN NPEPGFQLIN QRITHSTIND NIWASLQNAQ IQALKTLDQR PAEKFAQQMY ETRYADDRTK LLQENQIAQF TAADALAADR QLFSSPADIT FVIVGNVAED KLVALITRYL GSIKHSDSPL AAGKPLTRAT DNASVTVKEQ NEPVAQVSQW KRYDSRTPVN LETRMALDAF NVALAKDLRV NIREQASGAY SVSSRLSVDP QAKDISHLLA FTCQPERHDE LLTLANEVMV KRLAKGISEQ ELNEYQQNVQ RSLDIQQRSV QQLANTIVNS LIQYDDPAAW TEQEQLLKQM TVENVNTAVK QYLSHPVNTY TGVLLPK
|
| |