Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0983 |
Symbol | |
ID | 4068650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1245283 |
End bp | 1248042 |
Gene Length | 2760 bp |
Protein Length | 919 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637982990 |
Product | peptidase M48, Ste24p |
Protein accession | YP_590060 |
Protein GI | 94968012 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00288747 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000195095 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGGGTCTTC GTCGTCTCGT CGCTTTCCTT CTGCTTTCCA CAGCCTCGCT TTTCTCCCAA ACCGCCCCGG TCTGCACTCT TCCGGCGATC CAGGTAAAGT CCGCGCCCGG CAACATCTTC ACCGAGGCTC AGGAGAACCA GCTTGGTGAC GTCGTCGTTG ACCAGTCGCT CCCGTATATC ACCATCATTC ACGACGAAAC ACTCGCCAGG CCCTTGCGCG AAATTGGTGC CAAGCTCATT GCCCAGCTTC CCACGACGAT GACGTATCAC TTCGATGTCA TCGAGCTCGA TGAAGCCAAT GCCTTCAGCT TGCCCGGCGG ACACATCTAT GTTTCGCGCA AGCTGATTGC GCTGACGCAG ACGGAGGACG AACTCGCCGC CGTCCTCGCC CACGAACTCG GCCATGAGGT CACGCATCAG GGAGCCGCGA CCTTCTCCCG CCTTTTTGCA ACCTTGCTTG GCGTTCACCA GGTCACTACC GCCGACGACA TCCGGCACCG CTATAACCAA ATGCTCGACC TGCAAGCGTC CAAAAAGGTA AAGCTCAGCA AGGAAGAAGA CGATCAGCTC GATGCCGACG CTGTCGCCAT CCAAGCCCTC GCCCGTGCCG GTTATAGCGC CTCGGCTTTT GCGTCCATGT TTGACCGCGT CGCCGAAACC CATGGCGGGA AGAGCAGCTT CTTCGCGGAT CTCTTCGGCA TGACCAAGCC CAACACCGCG CGCTTCCGCA AAATCCAGAA GACGATCGCC ACCCTGCCGA CAGAATGTTC AACCCAGCGC GCGACCGGCG ACTCCTTCGC GAAATGGCGT ACCAGCATCA TCGAGTACGA TCCCACCGCA GCGGCCGAGG CCCATGCTCC GGGTTTGCTC TGGCAAAAGA AGCTCACGCC CGCATTGCGT CCCGGTGTGA AAGAGATCCA GTACAGTAGC GATGGCCGTT ACCTCCTCAT CCAGGACAGC GGCGCCATTC ACGTCGCAAC CCGAGATCCG CTGAAAGAAG TTTTCCAGAT TCCTGCCCAG CGCGCCTATC CCGCCAAGTT CTCTCTTGAT GGCCAGCGCA TCAGCTTTTA CATGGGCAAC GGCGAGCCCC GCATCGAGGT GTGGAGCGTC GCCGACCAGA AGCGCGTAGA GGTGCATGAA CTCCACGTGA AGCAGCGCGA GTGTCCGCAG TCGGAACTCT CGGCCGATGG CAAGGTTTTC GCATGTATCC AGGCGCGCGA AGTCAGCGAC GGCGTTTATT TCGACCTCGT TCTCTACGAC GTTTCAAACG GCGCGGAGCT GCTGCGCTTC CCGAAGATCC GCGACGCCAC CGGCTGGAAC GCTTACGTCC TGCTGCGTCA GTACACAGCG GTCGTCAATC GCCACGGTTC CTTCGAGCTG GTTTCCATGC ATTTCTCGCC AGATGGTCAC TGGCTGGTCG CGGCGTTCCT GAACCGCATG GTGGCCTTCG ATCTGACCCA GAGAACGAAA GTTGAGTTCC CGCCGCAGAT CGCAAAACTG CTCGGTAATA GCTTCGTCTT TCTATCCCCG GACCGTGTGT TGGTGTCGAA GTTCGTCGGC GCATCCAGTG CCGATGTCCG CAGCTTCCCC GGCGGCGACC TCATCAAGCC CGACATCCTT ATTGGTGCTG GCTGGGTCCG TCCCACCGAG GACCCCAACC ATGTCCTCGT TGGTCCGCTC GACGACTACC CGCTCGGCAT CGTTGACATC AACACCAACA AGGTCGCGCT TCGTCTCAAA GGCCGCAGCG CCAACGTCTG GAAGGACGAG TTCGCGATGG AAAGGACGGG CGGCGACCTC TCGGTCAACA ACCTCCCCGA TGCCAAAGAG AAAGCGCTCC TCCATCTTTC CGACAGCCAG CTCGGCAGCG TCGATCTCGG CGTGATCTCC GGCGACCTTT CCTGGATTGC CTACACCGAA GGCGCCCGCG GCGGCGTGTG GAACCTGGCA ACCGGCGAGC GTGCTTACCA TCTGCGCGAT TTCAACGGCG CGTATTTCGC AGGCGATACC GTCCACGCGG ACATGCCGAA ATTCGAGAAG AGCCCGCGCG TCATCGCCCA CATGAAGCTG CCCGGCTCCG CCGTCACTAC CAACGAGTTA CCAGGCGACG ATCACTTCCT TCAGTACGGG CCGATCGTGC TCGCATTCCG CTCCGGTAAA ATCGGTCGCC ACCCGAAGGA CGAATGGTAC GACTCGGAAG TCACGCAAGT CCGTGGCCTC GACGTCGCCA CCCTCAAAGA GCTCTGGACC GAGCCCATCG CGAAACGTGC CGACGCCTTC CACGTGCACT CCACCGGCGA TACCTTCGTC ATTGAACGCG ACGAGAAAGA TGTCGTGAAC CTCGAGTTCC AGCAGCTATC CACCGGAAAG CCGCTCGCGA AGCTCGCCAT TAAAACCAAC AAACATTCCT TCAATGTGAT CGACGCCGTC ATCGCCGGTG ACTACCTCAT CGTGGCGGAC GATCAGTCTC GTTCGACCAT TTACAAAACC AGTGGCGAAG AGGTGGCGCG CGTCTTCGGC GGATACATCA TGCCCTCGGC GAAAGCCGGC GTCCTCGCCG TGCAATCCGA AGAGCAGGGC GTCTTCCTCT ACGAACTGGC AACGGGCAAA AAGATCGACA CCCTCCAGCT CGGCCGCCCC ATCGTGTATT ACAACTTCGA CTCGACCGGA GAAAAACTCT TCGTCCTCAC CTCCGACCAG GTGGCCTACG CCTTCGACGT CGAGAAGCTG AAAACCAGCG CGACCACCGG CGCGAACTGA
|
Protein sequence | MGLRRLVAFL LLSTASLFSQ TAPVCTLPAI QVKSAPGNIF TEAQENQLGD VVVDQSLPYI TIIHDETLAR PLREIGAKLI AQLPTTMTYH FDVIELDEAN AFSLPGGHIY VSRKLIALTQ TEDELAAVLA HELGHEVTHQ GAATFSRLFA TLLGVHQVTT ADDIRHRYNQ MLDLQASKKV KLSKEEDDQL DADAVAIQAL ARAGYSASAF ASMFDRVAET HGGKSSFFAD LFGMTKPNTA RFRKIQKTIA TLPTECSTQR ATGDSFAKWR TSIIEYDPTA AAEAHAPGLL WQKKLTPALR PGVKEIQYSS DGRYLLIQDS GAIHVATRDP LKEVFQIPAQ RAYPAKFSLD GQRISFYMGN GEPRIEVWSV ADQKRVEVHE LHVKQRECPQ SELSADGKVF ACIQAREVSD GVYFDLVLYD VSNGAELLRF PKIRDATGWN AYVLLRQYTA VVNRHGSFEL VSMHFSPDGH WLVAAFLNRM VAFDLTQRTK VEFPPQIAKL LGNSFVFLSP DRVLVSKFVG ASSADVRSFP GGDLIKPDIL IGAGWVRPTE DPNHVLVGPL DDYPLGIVDI NTNKVALRLK GRSANVWKDE FAMERTGGDL SVNNLPDAKE KALLHLSDSQ LGSVDLGVIS GDLSWIAYTE GARGGVWNLA TGERAYHLRD FNGAYFAGDT VHADMPKFEK SPRVIAHMKL PGSAVTTNEL PGDDHFLQYG PIVLAFRSGK IGRHPKDEWY DSEVTQVRGL DVATLKELWT EPIAKRADAF HVHSTGDTFV IERDEKDVVN LEFQQLSTGK PLAKLAIKTN KHSFNVIDAV IAGDYLIVAD DQSRSTIYKT SGEEVARVFG GYIMPSAKAG VLAVQSEEQG VFLYELATGK KIDTLQLGRP IVYYNFDSTG EKLFVLTSDQ VAYAFDVEKL KTSATTGAN
|
| |