Gene Acid345_3173 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3173 
Symbol 
ID4071243 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp3762156 
End bp3764501 
Gene Length2346 bp 
Protein Length781 aa 
Translation table11 
GC content63% 
IMG OID637985193 
Productpeptidase S49 
Protein accessionYP_592248 
Protein GI94970200 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0616] Periplasmic serine proteases (ClpP class) 
TIGRFAM ID[TIGR00706] signal peptide peptidase SppA, 36K type 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.792634 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAATACG AGCTTAAATA CCAGCATGTG CTTGCCGAGG TCCTGAGTTC TCCTTGGGCC 
ATCCTTCCCG AGACGCTCAA GTCCATCACC TCGGTGCTGG CCTTCCGCGC CGCCGGTCAT
GAGTTGGGCG AAGAAGAGAT TCTCTCGCGG GTTGGCGAGC TGCGCACCGA TCGTGAACCG
TATGCCGTCA CGGCCGAAGG GCGTTCGCCC AAGGCTTCGC CGGTTACCTC GGGCGCGGTC
ATGGTCATCC CGATCTATGG CGTCATCGGC CCCAAGGCTT CGCAGTTTGA GCGCGCCTCG
AGCGGTGGCG GCACTGGCAT CGATGCGTTG ACGCAAACCT TCCGCTCTGC GCTTTCCAAT
CCTGATATTT CGGCCATCGT CTTCGACGTG GATTCCCCTG GCGGCAGCGT CTTCGGCATT
GCCGAGCTCG CGGACGAGAT CTATGCCGGC CGCGGGAAAA AGAAGATTGT CGCGCAGGTC
GCGCCGCGCG CTGCCAGCGC TGCTTACTGG CTTGCCGCTT CTGCCGGCGA AGTCGTTGTC
ACACCCAGTG GTCAGGCGGG GTCCATCGGC GTGTTTGTCG CGCATGAAGA TCTCTCGAAA
GCGCTCGATA TGCAGGGCGT CAAGGAAACT CTCATCAGCG CCGGCAAGTA CAAGGTCGAA
GGCGCGAGTT CCCAGCCACT CAGCGACGAG GCTCGGGCGG CCATGCAGAA CATGGTCGAT
CAGTATTACG GCGCATTCGT GCAGGGCGTT GCCCGCGGCC GCGCCGTCAC TGCCGCGACG
GTCCGCAATA GCTTTGGTGA AGGTCGCGTC GTCAGCGCGC AGGATGCGCT CCAGCTCGGC
ATGGTGGACC GCATTGCCAC CCTCGACCAG ACCATCGCTG CGCTGCTTGG CGGCCGTCCT
GCGAAGAGCG CCAGCGCACA AGTTCCTGTA TCAGTTCCGG CTGCTTCGGT GGCCGAAAAA
CCGGCGTCAG CCGAAATCAC AGAGGAGGCC ACCATGGCCA CTGCACCAAC TCCGGCGGCA
GGCGCCGCTG AAATCAACGT TCTCCGCGAC GTCGCAATTG CGTCGGAAGA GCAGCGCGTC
AAGGGCATCA CTGCACTCGC CCGCCACGCA AGCATGACTG ACAAGCTCAG CGGCTGGCTC
CGCGAGGGCA AGACGATCGA TGCGGTTTCG GAAGAGATCG TTGACCTTCA GAAGAAGGGC
GCGAAGCCGG TCAACATCCC CGCGCCGGGT GCCCAGGTGG ATCTCAACGA TCGCGAGCAG
AGGCAGTATT CCATCCTGCG TGCGGTCCGC AGCATGGTGC TGGCGCAGAA GCGCGACGAG
AAGCTCGGCA GCGACACGGA TGCCAGTTTC GAGCGCGAAG TCAGCGACAC CATCGCCAAG
AAGCTGAACC GCGAAACGAG CGGCATCTAT ATCCCCACGA ACCTGCGGGC GACGGTGCCG
GGCCTCGATC CCAAGGCTGT GCTCAATAGC GGATCTTCGC CGGGCACCAA CTTCGTGCAG
ACCACGATCC GGCCTGACGA ATTCATTGAC CTGCTGCGCA ATCGCCTGGT GGTGATGAAG
ATGGGCGCCC GCAAGCTCGG CGATCTCCAG GGCAACCTGC AGCTCCCCAA GCAAACGGCT
GCGGCGACTC TCTACTGGAC CGGTGAGAAC CCGGGCAGCG CGGTGACCGC CACCGATCAG
ACCACCGGCA GCGTCACGCT CTCGCCGAAG CAGGCGATGG CGCAGACGGC TTACAGCCGT
CAGTTCATCA TCCAGTCCTC GATCGACGCC GAGCAGTTCG TGCGTGAGGA CCTGGCCAAC
ATCTTCGCGT TGGGCGTGGA TCTCGCGGCG CTCGTCGGCA GCGGCACCTC GAACCAGCCG
AAGGGCATCG TGAACCAGAG CGGCGTCGGC ACCGAGGCGA TCGCCACCGA TGGCGGCGCT
ATCACGTACT CCATCATCAC CAAGGCGCAG GAAGATCTCG AGGAGAGCAG CATCCCGTTA
ATCGCCCCGG GCATCGCCAC CACGCCAGGT GTCAAGAAGA AGTTGCGCAA CACCGCCGAG
CTCTCCAACA CCATCTCGCT GCCCATCTGG CACAGCGACG ACACGGTCGC GGGTTACCCG
GCGATGTCTT CGAACCAGTT GCCCTCGAAC ACATCCAAGG GCAGCGGTAC GAACCTCCAC
ACCATGATCG TCGGCGACTG GGCCCAGCTC ATCCTCGGCG AGTGGGGCGC ACTCGAGATC
ATCGCTGATC CGTACACCCA GGCGGGCAAG GGCAACGTCG TGTTGACCGG TTCCATGCTG
GTCGATATCG CGGTGCGCTA TGCGCAGGCC TTCGTCGTCA TCAACGACAT CAACCCGACC
TCGTAA
 
Protein sequence
MKYELKYQHV LAEVLSSPWA ILPETLKSIT SVLAFRAAGH ELGEEEILSR VGELRTDREP 
YAVTAEGRSP KASPVTSGAV MVIPIYGVIG PKASQFERAS SGGGTGIDAL TQTFRSALSN
PDISAIVFDV DSPGGSVFGI AELADEIYAG RGKKKIVAQV APRAASAAYW LAASAGEVVV
TPSGQAGSIG VFVAHEDLSK ALDMQGVKET LISAGKYKVE GASSQPLSDE ARAAMQNMVD
QYYGAFVQGV ARGRAVTAAT VRNSFGEGRV VSAQDALQLG MVDRIATLDQ TIAALLGGRP
AKSASAQVPV SVPAASVAEK PASAEITEEA TMATAPTPAA GAAEINVLRD VAIASEEQRV
KGITALARHA SMTDKLSGWL REGKTIDAVS EEIVDLQKKG AKPVNIPAPG AQVDLNDREQ
RQYSILRAVR SMVLAQKRDE KLGSDTDASF EREVSDTIAK KLNRETSGIY IPTNLRATVP
GLDPKAVLNS GSSPGTNFVQ TTIRPDEFID LLRNRLVVMK MGARKLGDLQ GNLQLPKQTA
AATLYWTGEN PGSAVTATDQ TTGSVTLSPK QAMAQTAYSR QFIIQSSIDA EQFVREDLAN
IFALGVDLAA LVGSGTSNQP KGIVNQSGVG TEAIATDGGA ITYSIITKAQ EDLEESSIPL
IAPGIATTPG VKKKLRNTAE LSNTISLPIW HSDDTVAGYP AMSSNQLPSN TSKGSGTNLH
TMIVGDWAQL ILGEWGALEI IADPYTQAGK GNVVLTGSML VDIAVRYAQA FVVINDINPT
S