Gene Acid345_2676 details

Gene Information

Locus tag: Acid345_2676
Symbol:
ID: 4071930
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3160473
End bp: 3162089
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table: 11
GC content: 59%
IMG OID: 637984693
Product: carboxyl-terminal protease
Protein accession: YP_591751
Protein GI: 94969703
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0793] Periplasmic protease
TIGRFAM ID: [TIGR00225] C-terminal peptidase (prc)
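
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the gene ends with a single stop codon; the function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3160473
END_BP = 3162089
GENE_LENGTH_BP = 1617
PROTEIN_LENGTH_AA = 538


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))  # 1617
print(coding_codons(GENE_LENGTH_BP))  # 538
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields listed in the record.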


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0914328
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTAGTT CACGTCGCAC TATCTTCCTT GTTGTATTGA TTCTCCTCGC TTGCGGCTGC 
CTCGGCATGC TCTTCGGACA GAAGATCACT GGCGCCAGCG ACAACGAAAT TCGCGACGAT
CTGCGCACCT TCTCCAGCGT CTACGACGTT GTCGAGCAGA ATTACGCGGA ACCGGTCAGC
GCCGATAAAG CCATCTACAA CGGCGCCATC CCTGGCATGC TTCGCGTCCT CGATCCGCAC
TCCAATTTCT TCGACCCGAA GCAATACGCC CTGCTGCGTG AAGAGCAGCG CGGCAAATAC
TACGGCGTCG GTATGCAGGT TGGCCCGCGC AATAACAAGG TCATCGTAAT CGCGCCGTTC
GCGGGCGCGC CGGCCTACCG CGCCGGTATC CGCCCCGGTG ACGTCATCAT CGCCGTGGAC
GGCAAGCCCA CCGACAACAT GAGCACCAGT GACGTCGCTG ACCTGCTTAA AGGGCCGAAG
GGCACCACGG TTCGCATCGC GGTCATCCGC GAGGGCAGCG AAAAGCCGCT CGAGTTCAGC
GTCATTCGCG ACGAGATTCC TCGCTACTCC GTAGATGTTC ACTTCATGAT TCGTCCGGGC
ATTGGCTACA TGCACGTCTC CGGCTTCCAG GAAACGACCG AGCACGAAGT GCAGGAAGCC
CTCGACCAGA TGGGCGATTT GAAAGGCCTG ATCCTCGACC TGCGCCAGAA CCCCGGCGGC
CTGTTGAGCG AAGGCGTGGG CGTGGCCGAC AAGTTCCTGA AGAAGGGACA GGTCATCGTC
TCGCACCACG GCCGCAGCAG CCCGGAGAAG ATCTACCGCG CGCCGCACGG CAACAATGGG
CGCGATTATC CGCTGGTAGT GCTGGTCAAT CGCGGCACCG CCTCGGCAGC TGAGATTGTC
AGCGGCGCGA TCCAGGACCA CGATCGCGGC CTGATCGCCG GCGAAACCAC GTTCGGAAAG
GGTTTAGTAC AAACGGTTTA TCCGCTTTCG GAGAACACCG GTTTGGCGCT GACCACCGCG
CACTACTACA CGCCGAGCGG ACGCCTGATC CAGCGTGAAT ACGCGGGCGT GTCGCTTTAC
GACTACTACT ACAACCCCGC CGACAACGAT AACAACGCCA ACAAGGAAGT GAAGCTAACT
GATAGTGGGC GAACGGTTTA CGGCGGCGGC GGCATTACGC CCGACGTGAA AATTGCTCCG
CAAAAAGGCA ATCCCTTCCA GGATCGCCTG CTCATCAAGT ACGCGTTCTT CAACTTCTCG
AAACACTACA TGGCGCTGCA TCACACGGTA GATAAAGGAT TTAACGTGGA TGATGCGGTG
ATGCAGGAGT TCCGGAAGTT CCTCGATGAG CAGAAGATTG CGTTCAACGA GGCGGAGCTG
AAGGACAACG ACGAATGGAT TCGCGGCAAC ATCAAGGCGG AACTTTTCGT GAACCAGTTC
GGTGCGCAGG AAGGGCTTCG CGTCCATGCG GAGACCGACC CGATGGTGTT GAAAGGCTTG
GACTTACTGC CCCAGGCGAA GCAACTGGCG GACAACGCGC GGAAGACGAT TGCGGAGAAA
TCCGCGGGGA CCGCGACTGC GTCAGGTGCG AAGGTTGCAG CGAACCAGAA CCAGTAG
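
A GC content figure can be recomputed from a gene sequence formatted in space-separated blocks of ten bases. The following is a minimal sketch, assuming GC content means the fraction of G and C bases over all bases; the function name `gc_content` is hypothetical, and only the first block of the sequence is used here as a small worked example.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


# First ten-base block of the gene sequence: 3 G + 1 C out of 10 bases.
first_block = "ATGCGTAGTT"
print(f"{gc_content(first_block):.0%}")  # 40%
```

Passing the full sequence, pasted as a single string with its spaces and line breaks intact, gives the value for the whole gene.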
 
Protein sequence
MRSSRRTIFL VVLILLACGC LGMLFGQKIT GASDNEIRDD LRTFSSVYDV VEQNYAEPVS 
ADKAIYNGAI PGMLRVLDPH SNFFDPKQYA LLREEQRGKY YGVGMQVGPR NNKVIVIAPF
AGAPAYRAGI RPGDVIIAVD GKPTDNMSTS DVADLLKGPK GTTVRIAVIR EGSEKPLEFS
VIRDEIPRYS VDVHFMIRPG IGYMHVSGFQ ETTEHEVQEA LDQMGDLKGL ILDLRQNPGG
LLSEGVGVAD KFLKKGQVIV SHHGRSSPEK IYRAPHGNNG RDYPLVVLVN RGTASAAEIV
SGAIQDHDRG LIAGETTFGK GLVQTVYPLS ENTGLALTTA HYYTPSGRLI QREYAGVSLY
DYYYNPADND NNANKEVKLT DSGRTVYGGG GITPDVKIAP QKGNPFQDRL LIKYAFFNFS
KHYMALHHTV DKGFNVDDAV MQEFRKFLDE QKIAFNEAEL KDNDEWIRGN IKAELFVNQF
GAQEGLRVHA ETDPMVLKGL DLLPQAKQLA DNARKTIAEK SAGTATASGA KVAANQNQ
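
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch using only the Python standard library. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons; since this gene begins with ATG, that difference does not matter here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G) paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    cleaned = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(cleaned) - 2, 3):
        amino_acid = CODON_TABLE[cleaned[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCGTAGTT CACGTCGCAC TATCTTCCTT"))  # MRSSRRTIFL
```

The output matches the first ten residues of the protein sequence, and the final codons of the gene (AAC CAG AAC CAG TAG) translate to NQNQ followed by a stop, matching its last four residues.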