Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_2163 |
Symbol | |
ID | 6066999 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 2368676 |
End bp | 2371459 |
Gene Length | 2784 bp |
Protein Length | 927 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641601570 |
Product | peptidase M16 domain-containing protein |
Protein accession | YP_001725129 |
Protein GI | 170020175 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.618629 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAACC TCTGTTTCTT ACTGACGTTA GTGGCAACTC TGTTGCTCCC CGGGCGACTG ATTGCCGCCG CCTTACCGCA GGATGAAAAG TTAATTACCG GGCAACTGGA CAATGGCTTG CGATATATGA TTTATCCGCA TGCTCATCCA AAGGATCAGG TAAATTTATG GCTGCAAATT CATACCGGTT CATTGCAGGA AGAAGACAAT GAGCGCGGCG TGGCTCATTT TGTAGAACAT ATGATGTTTA ACGGCACAAA AACATGGCCG GGTAATAAAG TCATCGAAAC ATTTGAGTCA ATGGGCCTGC GTTTTGGTCG CGATGTTAAT GCCTATACCA GCTATGACGA AACGGTGTAT CAGGTGAGTT TGCCGACTAC GCAGAAACAA AATCTGCAAC AAGTGATGGC AATCTTCAGT GAATGGAGTA ATGCCGCAAC CTTTGAAAAA CTCGAAGTAG ACGCTGAACG TGGCGTAATT ACTGAGGAAT GGCGTGCCCA TCAGGATGCG AAATGGCGCA CCTCTCAGGC GCGCCGCCCT TTCCTGCTGG CAAATACCCG TAATTTAGAC CGTGAACCTA TCGGCCTGAT GGATACTGTC GCCACGGTCA CACCGGCACA ATTGCGCCAA TTTTATCAAC GCTGGTATCA ACCAAATAAT ATGACCTTTA TCGTGGTCGG CGATATCGAC AGTAAAGAAG CGCTGGCGCT GATAAAGGAT AATTTAAGTA AGCTTCCGGC TAACAAAGCA GCTGAAAATC GCGTCTGGCC GACAAAAGCC GAAAACCACC TGCGCTTTAA TATCATTAAT GATAAAGAAA ACCGGGTGAA CGGCATCGCA CTCTATTATC GCCTGCCAAT GGTACAAGTG AACGATGAGC AAAGCTTTGT CGAACAAGCT GAATGGAGCA TGTTAGTTCA GCTGTTCAAT CAACGTCTGC AGGAACGCAT ACAGTCGGGC GAGTTGAAGA CTATTTCTGG CGGCACTGCG CGCAGCGTTA AAATTGCACC CGATTATCAG TCGCTGTTTT TCCGTGTAAA TGCACGAGAC GATAATATGC AGGATGCTGC GAATGCATTA ATGGCAGAGT TGGCAACCAT TGATCAGCAT GGTTTTTCTG CTGAAGAACT CGATGATGTC AAATCTACCC GCCTCACCTG GCTGAAAAAT GCGGTTGATC AGCAAGCTGA ACGTGATTTA CGTATGCTGA CCAGTCGCCT GGCATCCAGC TCATTAAATA ATACGCCGTT CTTGTCGCCG GAAGAGACAT ATCAACTTTC GAAACGTCTG TGGCAGCAAA TTACCGTGCA AAGTCTGGCG GAAAAATGGC AGCAGTTAAG AAAGAACCAG GACGCATTTT GGGAGCAAAT GGTAAACAAT GAGCTTGCCG CCAAAAAAGC ATTGTCTCCT GCGGCTATCC TGGCGCTGGA AAAAGAGTAC GCCAACAAAA AGCTGGCGGC TTACATCTTC CCAGGCAGAA ATTTATCGTT AACAGTAGAC GCTGACCCAC AGGCGGAAAT TAGCAGCAAA GAAACGCTGG CTGAGAATCT GACATCATTA ACACTTTCCA ATGGTGCCAG GGTTATTCTG GCAAAATCCG CGGGTGAAGA GCAAAAGCTA CAAATTACTG CCGTATCGAA TAAAGGCGAT TTAAGTTTCC CTGCGCAGCA AAAATCACTT ATCGCGCTGG CAAATAAAGC AGTTAGCGGA AGCGGCGTTG GCGAACTCTC CTCTTCCAGC CTGAAACGCT GGAGTGCGGA AAACTCGGTA ACCATGAGCA GTAAAGTCAG TGGCATGAAT ACGTTGCTCT CCGTTAGCGC ACGGACTAAT AACCCTGAAC CTGGTTTTCA GTTGATTAAC CAGCGAATCA CCCACAGCAC AATTAACGAC AATATTTGGG CATCGCTACA AAATGCTCAA ATTCAGGCGT TGAAAACGCT CGACCAGCGT CCAGCGGAGA AATTCGCCCA GCAGATGTAT GAGACGCGCT ATGCTGATGA CCGCACGAAA TTACTGCAAA AAAAACAGAT TGCACAGTTT ACTGCCGCAG ATGCGCTGGC TGCCGATCGC CAGTTGTTTT CATCTCCAGC GGATATCACG TTTGTCATTG TCGGTAATGT CTCAGAAGAC AAACTCGTGG CGTTAATTAC GCGTTACTTA GGATCAATCA AACACTCTGA TTCGCCATTA GCCGCAGGTA AACCATTAAC TCGCGCGACG GACAACGCAT CGGTTACTGT AAAAGAACAA AATGAACCTG TGGCACAGGT TTCACAGTGG AAGCGTTATG ATTCCCGGAC ACCTGTTAAT CTGGCGACGC GTATGGCGCT CGATGCTTTT AACGTCGCAC TGGCAAAAGA TCTACGTGTT AATATTCGTG AACAGGCATC TGGAGCATAC AGCGTTTCTT CTCGCCTCTC GGTTGATCCT CAGGCCAAAG ATATCAGTCA TTTGCTGGCT TTTACTTGTC AACCAGAACG ACATGATGAA CTGTTAACGT TAGCGAATGA AGTGATGGTT AAGCGTCTGG CTAAAGGGAT CAGTGAGCAA GAACTGAATG AATACCAGCA AAACGTTCAG CGCAGCCTCG ATATCCAACA GCGTAGCGTT CAACAATTAG CGAACACTAT TGTAAATAGT CTTATTCAAT ATGACGATCC TGCAGCATGG ACTGAGCAGG AGCAATTGTT GAAACAAATG ACGGTAGAGA ATGTTAACAC TGCCGTTAAA CAATATCTTT CTCATCCGGT AAATACTTAT ACCGGAGTAT TATTGCCAAA ATAA
|
Protein sequence | MRNLCFLLTL VATLLLPGRL IAAALPQDEK LITGQLDNGL RYMIYPHAHP KDQVNLWLQI HTGSLQEEDN ERGVAHFVEH MMFNGTKTWP GNKVIETFES MGLRFGRDVN AYTSYDETVY QVSLPTTQKQ NLQQVMAIFS EWSNAATFEK LEVDAERGVI TEEWRAHQDA KWRTSQARRP FLLANTRNLD REPIGLMDTV ATVTPAQLRQ FYQRWYQPNN MTFIVVGDID SKEALALIKD NLSKLPANKA AENRVWPTKA ENHLRFNIIN DKENRVNGIA LYYRLPMVQV NDEQSFVEQA EWSMLVQLFN QRLQERIQSG ELKTISGGTA RSVKIAPDYQ SLFFRVNARD DNMQDAANAL MAELATIDQH GFSAEELDDV KSTRLTWLKN AVDQQAERDL RMLTSRLASS SLNNTPFLSP EETYQLSKRL WQQITVQSLA EKWQQLRKNQ DAFWEQMVNN ELAAKKALSP AAILALEKEY ANKKLAAYIF PGRNLSLTVD ADPQAEISSK ETLAENLTSL TLSNGARVIL AKSAGEEQKL QITAVSNKGD LSFPAQQKSL IALANKAVSG SGVGELSSSS LKRWSAENSV TMSSKVSGMN TLLSVSARTN NPEPGFQLIN QRITHSTIND NIWASLQNAQ IQALKTLDQR PAEKFAQQMY ETRYADDRTK LLQKKQIAQF TAADALAADR QLFSSPADIT FVIVGNVSED KLVALITRYL GSIKHSDSPL AAGKPLTRAT DNASVTVKEQ NEPVAQVSQW KRYDSRTPVN LATRMALDAF NVALAKDLRV NIREQASGAY SVSSRLSVDP QAKDISHLLA FTCQPERHDE LLTLANEVMV KRLAKGISEQ ELNEYQQNVQ RSLDIQQRSV QQLANTIVNS LIQYDDPAAW TEQEQLLKQM TVENVNTAVK QYLSHPVNTY TGVLLPK
|
| |