Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0450 |
Symbol | |
ID | 4071697 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 534074 |
End bp | 537697 |
Gene Length | 3624 bp |
Protein Length | 1207 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637982454 |
Product | protease |
Protein accession | YP_589529 |
Protein GI | 94967481 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4934] Predicted protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.470359 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCATTC GAAATCGGCT CTTAAGCGCG ACAACACGCG TTGCGCTCTT GCTGTGTGCA TGCTCCATGG TTTTCGCCCA AGCGCCCTTA ATTCAAAATC GCGTCACAGC CCCTATCGAA AACAGTAAGA CAATCAAGAT TCGTCAGACA GTCTCGCCGT TGGTCGCGAA GAGTGCAGAC AAAGGCCGAC TCGCTGGCGA TCGTAATCTC GGGCAGATGC TGCTCATGCT GTCCCCGACG AAGGAACAGA ACACCGCGCT GGAAGCCGTC ATTAAAGCGG AGCACACTCC GGGATCGGCC AAGTACCATC ACTGGCTCAA GGCTTCGGAG ATTGCGACGA AGTATGGCGT CTCCGAACCG GACACGACTG CGGTTCGGGG ATGGCTGGCG TCGCAGGGAT TCGAGGTGAA GCACGTCGCC AACAGCCGAC GTTTTGTGGT TTTCAGCGGT ACGGTGGCGC AAGTGGAAGC GGCATTCCAC ACCCAGATGC ACCAGTACGA GCTCTCAGGG AACTCGTTTA TTGCAAACTC CCAGGAAGTC CAGATCCCTG CGGCGTTGGC GCCGGTGGTT CGTGGCGTCG TCCGCCTGAC CAGCACGCCG AAAAACAACA ACGTGAAGAT CGTAGGCAAG GCCGCGTTCG ATAAAGAGAA AGGCCAGATC ACCTTCACGA ACGGTGAGCA TGCAATCACG CCCGCGGATT TCGCGACGAT CTACAACCTC AATCCGCTTT ATCAGGCAGG CATCAACGGC GCAGGGCAGT CGATTGCGAT CGTGGCGCGC AGTGATATTT ATTCGCGCGA TGTCTTCGAC TTCATGAGCA TCTTTGGAGT TTCGTTTGGC GGCTTCTACT ACACCATCAA CGGAGACGAT CCGGGCTATG TTTCGGGATC GGACGTTGAG GCCACCCTCG ACCTCACCTG GGCGGCTGCA ATTGCACCGG GTGCAACCCC AAACATCGTG ATTTCGCAGA GCAACTTCGC TGACGGCGTC GATATCTCCG CCGCATTCAT TGTGGACAAC AATCTCGCGC CGGTAATGAG CACGAGCTTC AGCTCCTGCG AGCAGCAGAT GGGGCCGGTC GGCACTGAGT TCTATTACTC GCTCTGGGCG CAGGCAGCCG CCGAGGGCAT TACCGCCGTG GTCTCCAGTG ACGACAGTGG CGGCGCAGGT TGCGATCTTC CGGGAAGCGG TACCTTCGCG CAGAACGGCC TCGCCGTGAA TGCGCTCGCG TCCACGCCGT TCAACGTCGC CGTCGGTGGT ACGCAGTTCG ACGACACAGC GGATCCGAGC AAGTATTGGT CCTCCACGAA CGATTCCACG ACCAAAGCGT CCGTGCTTTC CTACATCCCC GAAAAAGCCT GGAACGAGAG CAGCATTGAT TCGGGCAACG TCAGCCTATG GGCGGGCGGT GGTGGTGTAA GCACGCTTTG GACGAAGCCT GAATGGCAGA TCGGAACTGG CGTTCCCGCC GATGGAATGC GTGACTTGCC CGACGTCTCG CTGACTGCGG CCGGTCACGA CGGATACGTA TTGTGCTTCG GTGGTTCTTG CGAGAGTGGA GGCATTTATA CGGTGGGCGG AACATCGGCG TCGGCGCCGG CGTTTGCCGC GATCATGGCA CTGGTCAACC AGCAGACCGG ATCTCCGCAA GGAAATCCGA ACTACGTGAT CTACCAACTC GCGGCGCAGC ACCCGGAATT CTTCCACGAC ACCACAGTTG GAGATAACAA AGTTCCGGAT ATGAACGGCG AGTTCACTGT CGGATATTCG ACCGGTGTGG GCTACGACCT TGCGACCGGC TTGGGATCAT TCGATGCGAA CTCGCTGGTG ACGAACTGGA ACAACGTCAC CTTCAGCGGA ACCAACACGA CGTTGAGTGG CCCGGCGGGC GGCTTGACCT TCGTGCATGG CGCAGGCGTG CCGGTCACGG CGAGCGTCAG CGCAGCGTCA GGCAGCAAAC TCCCGACCGG CAATGTCGCA TTCTTTACGG ACAACCCGCT CGGCCTCGCC ACTCCATTCG GCGTCGGTGC CGCTGCGCTG GATAACACGG GCGACGCGAC TACCTCCCTC GCCGCGATTC CCGGCGGCAC GCACTCGCTT ACAGCTCGGT ACGGAGGCGA CGCAACCTTC ACGGCCAGTA CCTCGAACGC GGTTACAGTG ACGGTTACGC CCGAGCCTTC GAATACGTAT TTCGTGGCTG GCGTGGGTGG AAGCACCGTC ACCTCGGCTG AAGCAAAGTA CGGCGACCCT CTGGTGATGG CCGTTCTTGT GCAAGGCAAT TCGCTCGTCG GACACCCGAC TGGCTCCGTT TCGCTGAGTG AAGGCAGCAC CGACCTTGGC ACGCGCTACC TTAATTATGG TGAACACGAG GATGCGGAGC AGGGATCGAG TTCGGTCTTC GGAGTGATCG GTTTCCCAGT CGGCGTACAC CAATTGACCG CGAGCTACAC CGGCGATCCC AGCTTCAATC CGAGTACGTC CACGAACTTC CAACTCACCA TCGTTAAAAG CGATTCAACC ATCTCATCCC TGCAATTCCA GGGCTCGGCA CTCTCCGGTG CGCCACTCCC TGTCTTCGGA CAGGTCTCCC TGGCGTCGGG CACTCTCATG CCGATCTCCG GTTCGGTAAC TTTCACAGCA GCTTCCGACA AAACAACTGT GAACCTCGGG AGCTTAACCA TCGATGCGAC CAGCGGCACG TTCGCAGGCA GGGTCAGCTT CCCATCTGCT GGAAGCTGGG TGTTGACTGC GGTTTATGGC GGTGACAGTA ACGTAACCGG CACTCAAACC CAGACTCGCG TCGCCGTCGA CAGCAGCGAA GCCACGACGA TGTCGCTGAG TTCAAATGCA CCCTCAGTTC CTGCCGGAGG TTCGGTGACA TTCACCGCGC AGGTAAGTTC TCCCGTGGTG CTCCGGCTAC CGACCGGCAC GGTGACATTC ATGGACGGCA CGGCTTCGCT CGGCACCGCA ACGTTGGATG GATACGGAAT CGGAAAATTC ACGACCACCA GCCTGACCGG TGGATCACAC TCGATTACCG CGAACTACGG TGGCGATGCG ATCTTCCGTG CTACCTCGGC AAGTGTCAGT CAGTCGATCA GTGACTTTGC GGTGCAGCCA ACGACCGCGG CTGTGTCCAT CAAGGTCGGA CAATCCGGCA CTGCGTTGAT CGCACTCACT CCGCAAGGTG GGTTTAATCA AGCCGTCACC TTTAGCTGTT CTGGTCTGCC TTCCGGCGCA AGTTGCACGT TTGCACCAGC AACGCTAACG CCAACAGGCA CGGATGTTGC GACTGACACG ATGACAATTG CGACCAGCGG AAGTGGCGCG GCTGCACATC GTGCTGAGAA CCGACGGATG AATTGGCTCG CTAGTTCCGG CTTTGGTCTG GCTGGCGTGC TGTTGCTGGT ACCGATCTGC AATCGCAAGC GGCGGGCGCG TCTCGTCGTT CTGGCGGGAC TCATGCTGAT GCTCGGACTG TGGGGATGCG GTGGCAGTTC CTCCTCATCG CCCAAGCCGC CGCCTCCGAA CCCGATGGTC GGAACTTACA GCGTAACGGT GACAGCGACC TCAGGCACTG GATCTGCGCA TGCGGCAGAT CTGTCGGTCA CCATCACTCA GTAG
|
Protein sequence | MSIRNRLLSA TTRVALLLCA CSMVFAQAPL IQNRVTAPIE NSKTIKIRQT VSPLVAKSAD KGRLAGDRNL GQMLLMLSPT KEQNTALEAV IKAEHTPGSA KYHHWLKASE IATKYGVSEP DTTAVRGWLA SQGFEVKHVA NSRRFVVFSG TVAQVEAAFH TQMHQYELSG NSFIANSQEV QIPAALAPVV RGVVRLTSTP KNNNVKIVGK AAFDKEKGQI TFTNGEHAIT PADFATIYNL NPLYQAGING AGQSIAIVAR SDIYSRDVFD FMSIFGVSFG GFYYTINGDD PGYVSGSDVE ATLDLTWAAA IAPGATPNIV ISQSNFADGV DISAAFIVDN NLAPVMSTSF SSCEQQMGPV GTEFYYSLWA QAAAEGITAV VSSDDSGGAG CDLPGSGTFA QNGLAVNALA STPFNVAVGG TQFDDTADPS KYWSSTNDST TKASVLSYIP EKAWNESSID SGNVSLWAGG GGVSTLWTKP EWQIGTGVPA DGMRDLPDVS LTAAGHDGYV LCFGGSCESG GIYTVGGTSA SAPAFAAIMA LVNQQTGSPQ GNPNYVIYQL AAQHPEFFHD TTVGDNKVPD MNGEFTVGYS TGVGYDLATG LGSFDANSLV TNWNNVTFSG TNTTLSGPAG GLTFVHGAGV PVTASVSAAS GSKLPTGNVA FFTDNPLGLA TPFGVGAAAL DNTGDATTSL AAIPGGTHSL TARYGGDATF TASTSNAVTV TVTPEPSNTY FVAGVGGSTV TSAEAKYGDP LVMAVLVQGN SLVGHPTGSV SLSEGSTDLG TRYLNYGEHE DAEQGSSSVF GVIGFPVGVH QLTASYTGDP SFNPSTSTNF QLTIVKSDST ISSLQFQGSA LSGAPLPVFG QVSLASGTLM PISGSVTFTA ASDKTTVNLG SLTIDATSGT FAGRVSFPSA GSWVLTAVYG GDSNVTGTQT QTRVAVDSSE ATTMSLSSNA PSVPAGGSVT FTAQVSSPVV LRLPTGTVTF MDGTASLGTA TLDGYGIGKF TTTSLTGGSH SITANYGGDA IFRATSASVS QSISDFAVQP TTAAVSIKVG QSGTALIALT PQGGFNQAVT FSCSGLPSGA SCTFAPATLT PTGTDVATDT MTIATSGSGA AAHRAENRRM NWLASSGFGL AGVLLLVPIC NRKRRARLVV LAGLMLMLGL WGCGGSSSSS PKPPPPNPMV GTYSVTVTAT SGTGSAHAAD LSVTITQ
|
| |