Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3141 |
Symbol | |
ID | 4070256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3733333 |
End bp | 3735351 |
Gene Length | 2019 bp |
Protein Length | 672 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637985161 |
Product | physarolisin II |
Protein accession | YP_592216 |
Protein GI | 94970168 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4934] Predicted protease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATCC CGCCTCGCTT CCTGCCATGC CTGCTGCTTG CGACTGCAAC GGTCGCAAGC TTGCCGGCAC AAACCGCCAC CAAGACCGTT CTCCCCAACA ACGTTCCGAA ATTCACCGCA TCGAGCGTCG ATCTCGGTCC GGCTGATCCC ACCCAGCAAA TCACGGTGAC GATGACGCTT GCGTCCAAGA ACGCCAGCGG ACTCCAGCAG TTCGTCAGCG ACATCCGCAC TCCCGGTACT GGTTCCTATC ACGAATTTCT AACGCCTGCG CTGTTCGCCA CCAAGTACGG CGCAGCCGAC GCGACGCTTA CCGCCGTCAA GACCTTCGCA GCCGCGAATG GTCTCACCAT TACGCACACC GCGCCCAACA AACTCGTGAT GTCGTTGCGC GGAACTGTCG CCGCCGTCGA AAACGCGTTC TCGGTTCCGA TCCACAACTA CAAGAAGAAC GGCGAGACGC TGCGCGTGAA CGTGACCAAC CCGCAGATCT CAACTTCGCT GGTTGGAAAG GTGACGGGCG TTCACATTGC CGACTTCAAC TTCAAGTCGC ATGCCGTGAT GCCGCTCGAC CCGAACGGCA AAACGCAGAA GCCGGTTCCG CTCTCTATCT CTCCGCACGG CCTCTTCTTC GCCAGCGGCT GCTTCCGTAA TCCGCAGACC ATTACCGCCA GCGGTGGTGG TGCGACGGCC ACGTATGCCG GCAACCGATA TGGATCGGAC ATCACCAGTG GACCTCCGAA TCTTCCTCCC TGCGGCTATG ACGTTGCCGA TGTGTATGCC GGCTACAACC TGTGGCCGAT GTACAACGCT GGCCTCGATG GCACCGGCGA AACTATCGTC ATTATTGACG CTTTCGGCTC ACCGACGATC CAGGCCGACG CCAATACCTT CTCGGCGATC AACGGTCTTC CCGCCCTGAA CTCCACCAAC TTCCAGGTTG TCGGCGCCAA TGCCGGCGGC AACGCCAGTT GGGCGGGTGA GACCACGCTC GACGTGGAAT GGGCGCACGC AATCGCTCCC AACGCGAAGA TCGTCCTTGA GGTCGCGCCG ACCAACAGTT TCGTGGACCT CTTTTACGCC GAAGTGGACG CCATTGCGAA TCACCGCGGT ATCGTGATCT CCAATAGCTG GGGCGGCTTT GAAACTTTCA CCGATTCCTC GCTCCGCGGT GCGTTCGACT TCATCATGAT GGAAGCGATC TCCGTGGGTA TCGACGTTAA CTTCTCCACC GGCGACTACG GCGATAACGT ATCCGTGCTT GGCTACGCCG ACGTGCAGTA CCCGGGCAGC TCACCATTCG CGACGGCCGT AGGCGGCACC AGCCTCGCGC TCACCAACAC CAAAACCAAG ACGATGAAGT TCCAAACCGG ATGGGGCAAC AACATCACCC GCCTGGTGGA CGGCACCACT GGGGCGCCGG ACGATCCGCC GCTTATGGAA GGATTCATCT TCGGCGCCGG CGGCGGGAAC AGCAACGTCT ACACCAAGCC GAGCTGGCAG GTGGGAACCA ACCAGCCTCG TCGCGCGCTG CCTGATATCG CATGGCTCGC CGATCCTTAC ACCGGTGTCG AGATTATCCA GACCATCAGC GGTAGCCAGT ACATCGAGGT CATTGGCGGA ACCAGCCTCG CTGCGCCGAT GTTCTCTGGT ATTTGGGCGA TCGCCAACCA GAAAGCAAAT ACTACGATCG GTCTCGGCGA TGCGGCATCG CAGCTCTACA GCATGCCGTC CGGCTCGATC AAAGACGTCG TGCCCTTTAA CACCGCGAAC AACGTGCGCG GCGTTCTGAC CGATGCTTAC GGAACGTACG AAGAGAGTTC AACTACTCTT GCAGCTCCGC TCGCCTACAC CCGCGGCTTC TACAGCGCGC TGTACCAGGG CGCGAGTTCG CACAGCTGGT ACGACCTGAC GTTCGGTACC GACTCCACGC TCTTCACCAA GCAAGGGTGG GACAACGTAA CCGGCTGGGG CACTCCCAAC GGCCTCAACT TCGTGACCGC CATCGCCAAC AAGAAATAG
|
Protein sequence | MKIPPRFLPC LLLATATVAS LPAQTATKTV LPNNVPKFTA SSVDLGPADP TQQITVTMTL ASKNASGLQQ FVSDIRTPGT GSYHEFLTPA LFATKYGAAD ATLTAVKTFA AANGLTITHT APNKLVMSLR GTVAAVENAF SVPIHNYKKN GETLRVNVTN PQISTSLVGK VTGVHIADFN FKSHAVMPLD PNGKTQKPVP LSISPHGLFF ASGCFRNPQT ITASGGGATA TYAGNRYGSD ITSGPPNLPP CGYDVADVYA GYNLWPMYNA GLDGTGETIV IIDAFGSPTI QADANTFSAI NGLPALNSTN FQVVGANAGG NASWAGETTL DVEWAHAIAP NAKIVLEVAP TNSFVDLFYA EVDAIANHRG IVISNSWGGF ETFTDSSLRG AFDFIMMEAI SVGIDVNFST GDYGDNVSVL GYADVQYPGS SPFATAVGGT SLALTNTKTK TMKFQTGWGN NITRLVDGTT GAPDDPPLME GFIFGAGGGN SNVYTKPSWQ VGTNQPRRAL PDIAWLADPY TGVEIIQTIS GSQYIEVIGG TSLAAPMFSG IWAIANQKAN TTIGLGDAAS QLYSMPSGSI KDVVPFNTAN NVRGVLTDAY GTYEESSTTL AAPLAYTRGF YSALYQGASS HSWYDLTFGT DSTLFTKQGW DNVTGWGTPN GLNFVTAIAN KK
|
| |