Gene Information |
Locus tag | Acid345_2676 |
Symbol | |
ID | 4071930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3160473 |
End bp | 3162089 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637984693 |
Product | carboxyl-terminal protease |
Protein accession | YP_591751 |
Protein GI | 94969703 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
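The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon; neither convention is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 3160473
END_BP = 3162089
GENE_LENGTH_BP = 1617
PROTEIN_LENGTH_AA = 538


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end_bp - start_bp + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the CDS ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks agree with the table under the assumptions stated above.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, 3162089 - 3160473 + 1 gives 1617 bp, and 1617 / 3 - 1 gives 538 aa, matching the Gene Length and Protein Length fields.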
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0914328 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTAGTT CACGTCGCAC TATCTTCCTT GTTGTATTGA TTCTCCTCGC TTGCGGCTGC CTCGGCATGC TCTTCGGACA GAAGATCACT GGCGCCAGCG ACAACGAAAT TCGCGACGAT CTGCGCACCT TCTCCAGCGT CTACGACGTT GTCGAGCAGA ATTACGCGGA ACCGGTCAGC GCCGATAAAG CCATCTACAA CGGCGCCATC CCTGGCATGC TTCGCGTCCT CGATCCGCAC TCCAATTTCT TCGACCCGAA GCAATACGCC CTGCTGCGTG AAGAGCAGCG CGGCAAATAC TACGGCGTCG GTATGCAGGT TGGCCCGCGC AATAACAAGG TCATCGTAAT CGCGCCGTTC GCGGGCGCGC CGGCCTACCG CGCCGGTATC CGCCCCGGTG ACGTCATCAT CGCCGTGGAC GGCAAGCCCA CCGACAACAT GAGCACCAGT GACGTCGCTG ACCTGCTTAA AGGGCCGAAG GGCACCACGG TTCGCATCGC GGTCATCCGC GAGGGCAGCG AAAAGCCGCT CGAGTTCAGC GTCATTCGCG ACGAGATTCC TCGCTACTCC GTAGATGTTC ACTTCATGAT TCGTCCGGGC ATTGGCTACA TGCACGTCTC CGGCTTCCAG GAAACGACCG AGCACGAAGT GCAGGAAGCC CTCGACCAGA TGGGCGATTT GAAAGGCCTG ATCCTCGACC TGCGCCAGAA CCCCGGCGGC CTGTTGAGCG AAGGCGTGGG CGTGGCCGAC AAGTTCCTGA AGAAGGGACA GGTCATCGTC TCGCACCACG GCCGCAGCAG CCCGGAGAAG ATCTACCGCG CGCCGCACGG CAACAATGGG CGCGATTATC CGCTGGTAGT GCTGGTCAAT CGCGGCACCG CCTCGGCAGC TGAGATTGTC AGCGGCGCGA TCCAGGACCA CGATCGCGGC CTGATCGCCG GCGAAACCAC GTTCGGAAAG GGTTTAGTAC AAACGGTTTA TCCGCTTTCG GAGAACACCG GTTTGGCGCT GACCACCGCG CACTACTACA CGCCGAGCGG ACGCCTGATC CAGCGTGAAT ACGCGGGCGT GTCGCTTTAC GACTACTACT ACAACCCCGC CGACAACGAT AACAACGCCA ACAAGGAAGT GAAGCTAACT GATAGTGGGC GAACGGTTTA CGGCGGCGGC GGCATTACGC CCGACGTGAA AATTGCTCCG CAAAAAGGCA ATCCCTTCCA GGATCGCCTG CTCATCAAGT ACGCGTTCTT CAACTTCTCG AAACACTACA TGGCGCTGCA TCACACGGTA GATAAAGGAT TTAACGTGGA TGATGCGGTG ATGCAGGAGT TCCGGAAGTT CCTCGATGAG CAGAAGATTG CGTTCAACGA GGCGGAGCTG AAGGACAACG ACGAATGGAT TCGCGGCAAC ATCAAGGCGG AACTTTTCGT GAACCAGTTC GGTGCGCAGG AAGGGCTTCG CGTCCATGCG GAGACCGACC CGATGGTGTT GAAAGGCTTG GACTTACTGC CCCAGGCGAA GCAACTGGCG GACAACGCGC GGAAGACGAT TGCGGAGAAA TCCGCGGGGA CCGCGACTGC GTCAGGTGCG AAGGTTGCAG CGAACCAGAA CCAGTAG
|
Protein sequence | MRSSRRTIFL VVLILLACGC LGMLFGQKIT GASDNEIRDD LRTFSSVYDV VEQNYAEPVS ADKAIYNGAI PGMLRVLDPH SNFFDPKQYA LLREEQRGKY YGVGMQVGPR NNKVIVIAPF AGAPAYRAGI RPGDVIIAVD GKPTDNMSTS DVADLLKGPK GTTVRIAVIR EGSEKPLEFS VIRDEIPRYS VDVHFMIRPG IGYMHVSGFQ ETTEHEVQEA LDQMGDLKGL ILDLRQNPGG LLSEGVGVAD KFLKKGQVIV SHHGRSSPEK IYRAPHGNNG RDYPLVVLVN RGTASAAEIV SGAIQDHDRG LIAGETTFGK GLVQTVYPLS ENTGLALTTA HYYTPSGRLI QREYAGVSLY DYYYNPADND NNANKEVKLT DSGRTVYGGG GITPDVKIAP QKGNPFQDRL LIKYAFFNFS KHYMALHHTV DKGFNVDDAV MQEFRKFLDE QKIAFNEAEL KDNDEWIRGN IKAELFVNQF GAQEGLRVHA ETDPMVLKGL DLLPQAKQLA DNARKTIAEK SAGTATASGA KVAANQNQ
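Both sequences above are displayed in space-separated blocks of 10 characters, so the spaces need to be removed before the values are used elsewhere. The following is a minimal sketch of that cleanup, with a GC-fraction helper that could be compared against the GC content field. The helper names are hypothetical, and whether the database computes its GC percentage in exactly this way is an assumption.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for 10-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases among all bases in the cleaned sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Toy input: only the first two blocks of the gene sequence.
# Paste the full "Gene sequence" value here to check the whole record.
example = "ATGCGTAGTT CACGTCGCAC"
print(len(clean_sequence(example)))
print(round(gc_fraction(example) * 100))
```

Applied to the full gene sequence, `len(clean_sequence(...))` should be comparable with the Gene Length field, and the rounded percentage with the GC content field.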