Gene Acid345_4232 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4232 
Symbol 
ID4073158 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5016842 
End bp5018107 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content58% 
IMG OID637986263 
Productcarboxyl-terminal protease 
Protein accessionYP_593306 
Protein GI94971258 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0793] Periplasmic protease 
TIGRFAM ID[TIGR00225] C-terminal peptidase (prc) 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGAC TCCAAAGGAG TCTTTGCCCG CCAAAGCAGC CCATGTCAAA GATCACGAAG 
TTCGTTCTGT TAAGCAGCTC TCTCCTTGTC GTTCTGTTTG TCGTATACGG AAGCCTCGGC
GTTCGCGCCG ATTCCAAGAA CGATGGCGCG TACCGCCAGC TCGGCGTGTA CAGCGAGGTA
CTCTCGCGCA TCCGTACAGA GTACGTCGTC GATCCGGACA TGAACCTCGT GACCGACGGA
GCGCTACATG GCCTGCTCGA GTCGCTCGAC GCGAACTCCA GCTACCTGAG CCCGACGGAG
TACAAGCAGT ACCAGCAGCG TAAGTCCGAG GGCAAAGCGG GCATCGGTGC TGCGATCTCT
AAGCGGTACG GTTACGCCGC GGTGGTCTCA GTCATTCCAG GTGGCCCCGC CGACAAAGCC
CAGGTGGAAA GCGGCGACAT TATTGAAGCA ATCGAGGGCA AAACGACTCG CGAGATGTCG
CTGGCTGAGA TTGATGGCAT CCTCGCTGGA CAGCCCGGCT CGGTCATTAA TTTGAGCATT
GTGCGCCCGC GCAAGGCGCA GCCACAGAAG ACGCCGATCA CGCGAGAGGT AGTCACTACC
CCGCCAGTCG CCGAGAAGCT GATGGAAGAC AGCATCGGTT ATATCAAGGT CATTACCTTT
ACCAAGGGCC GCACGCAACA GGTAGCCGAA CAGGTGAAGG CCGCGCAGAA ACAGGGCGCG
AAGAAACTTA TCCTCGACCT GCGGAATTGC GGAGCCGGCG AGGAACAGGA AGGCGTCGCT
ACGGCGAATC TCTTCCTGAA CCACGGCATG ATCGCCTACC TGCAAGGCCA GAAATTCGCC
AAGCAGACGT TCACAGCTGA AGCCTCTAAG GCCATTACAA ATCTCCCGCT GGTCGTGCTG
GTCAACAAAG GTACGGCTGG TCCAGCCGAG ATCGTCGCGG CGTCGGTATT GGAGAATGCT
CGCGGTGATG TTCTTGGCGA CAAGACATTC GGTGATGGCG CAGTCCAGCA GCTCTTCCCA
ATGAGCGATG GTTCTGCCCT CATGCTCTCG ATCGCGAAGT ACTACTCGCC GAGCGGCAAA
GCTATCCAGG ACACGGCAGT CACGCCCAAC ATCCTGGTTG CCGACAACGA CGACTACGCT
TCTCCGGATG ATGGCGACGA TACCACTGAC AATGCCAACC AGCCGGAAAC GCGCCAGAAG
GACCAGACCG ACGAGCAGCT TCGCCGCGCG GTCGAGGTTC TGAAGAATAA AGACCAGCAC
AGCTAG
 
Protein sequence
MSGLQRSLCP PKQPMSKITK FVLLSSSLLV VLFVVYGSLG VRADSKNDGA YRQLGVYSEV 
LSRIRTEYVV DPDMNLVTDG ALHGLLESLD ANSSYLSPTE YKQYQQRKSE GKAGIGAAIS
KRYGYAAVVS VIPGGPADKA QVESGDIIEA IEGKTTREMS LAEIDGILAG QPGSVINLSI
VRPRKAQPQK TPITREVVTT PPVAEKLMED SIGYIKVITF TKGRTQQVAE QVKAAQKQGA
KKLILDLRNC GAGEEQEGVA TANLFLNHGM IAYLQGQKFA KQTFTAEASK AITNLPLVVL
VNKGTAGPAE IVAASVLENA RGDVLGDKTF GDGAVQQLFP MSDGSALMLS IAKYYSPSGK
AIQDTAVTPN ILVADNDDYA SPDDGDDTTD NANQPETRQK DQTDEQLRRA VEVLKNKDQH
S