Gene Acid345_4142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4142 
Symbol 
ID4072333 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4901048 
End bp4903171 
Gene Length2124 bp 
Protein Length707 aa 
Translation table11 
GC content60% 
IMG OID637986173 
Productprolyl oligopeptidase 
Protein accessionYP_593216 
Protein GI94971168 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1505] Serine proteases of the peptidase family S9A 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.199893 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATTC GTTGTTTGCT CGTCCTCTTC CTGTTCTCTA TGACTCTTCA AGCTGCTCAA 
CCTGGCGTAG TCGAAGGCGG CAATGGCATC ACTCTTCCGC CGCCCCCTCC CACCGCGCAG
AAGCCCGTCA CCGAGACGAT TCACGGAGTC ACGATCACCG ACCCCTATCG TTGGCTCGAA
GACCAGCAGA GTCCCGAGAC GCGCGCGTGG ATTGATACGC AGATGAAGTA CACCGAGCAG
TACCTGTCGC AGGTGAAGGT TCGTCCGGAG ATCGAGAAAG AGTTGGGCCG CCTGGAGCGC
GTGGAGCAAT ACACCATCCC CACCGAGCGC GGCGATATGT ACTTCTTCAA AAAGCGCCTC
GCCGATGAGA ACCAGGGCTC CATCTACCTC CGCCGCGGCC TTCACGGTGA CGACCAGCGC
CTTGTAGATG CGACCAAACT CAGCGCCGAC CAGAACACCT CGATCCAGAT CAACGACATC
TCGAAAGACG GCAATCTCCT CGTGTACGGA ACGCGCTCCG GCGGCGCCGA TGAAGAAGCC
GTCCACATTC TCGACACCGC TACCGCCAAA GAGCTTCCCG ATTCCCTGCC CAGCGCGCGC
TACTTCGGCA TCCAGCTCAG CCCCGACGCG CAAGGCCTCT ACTACTCGCG CATGGAGAAG
GAAGGCTCGA GCGTTTACTA CCACAAACTC GGTAGCGACC CGAAGAGCGA CGATCTGATC
TTCGGCAAGA AATTCGAAGG CGAAGAATTC GGCCCAATGC AGCTGATCTC CGAGCACATC
ACGGAGAACG AGCGCTATCT CGTCGTCACC GTGGCGCACG GCGTTCCGCC CAAGCGCGTG
GACATTTACG CCAAAGACCT GCGCAAGCCC GACTCGCAAG TCGTGAAGGT GATTCACGGC
ATCGAGAGCC GCTTCACGCC GGTGAATTTC GGCGACGATT TCTACGTGAT GACCGACTAC
AACGCGCCCA ACTATCGCGT AGTAAAGGTC CGCATCGGCG ACTCCGACCC GCAGCACTGG
ACCACCGTCG TCCCCGAAGC CAAAGATCCT ATCAACAGCA TCTCGATTGT CGGCGGCAAG
CTCTTCGTCA GCGGCTTGCA CGACGTTGTG ACGCAGACCC GCATCTTCAC CCTCGACGGC
AAAGAGACCG GCCGCATCAA CTATCCGACG ATCGGTGAGG CCACCAACGT CTTCGGCCGC
GAAGACAGCG AGCACGGCTT CTACAGCTTC GAGTCATTCA TCATCCCGCC GACCATTTAC
CACTACGACG TAAAGACCGG CAAACCCGAG GTCTTCGCTA AACCCAACGT TCCGTTCGAC
TCCGCTCAGT ACGAAGTGAA GCAGGTCTTC TACAAGTCGA AAGACGGCAC CCGCATTCCG
ATGTTCATCT CGTCGAAGAA AGGCGCGAAG CGCGATGGCA AAACCCCGAC GCTGATGTTC
GCCTACGGCG GCTTTCTCGT GGACATGACG CCCTCGTGGA ACCCGGAGTG GGCATGGTGG
ATTGAGCAGG GCGGTTTCTA CGCGCAGCCC AACCTGCGCG GCGGCGGCGA GTACGGCGAA
ACCTGGCACA AGGCCGGCAT GTTCGAGAAG AAGCAGAACG TCTTCGACGA CTTCTTCGGC
GCGGCGCAAT ATCTCGTCGA CGAAAAATAC ACCGACACCA AGCACCTCGC CATCCGTGGC
CGCTCCAACG GCGGCCTGCT GATGGGCGTC GCGATGACCC AGCATCCCGA GATGTTCGGC
GCCATCTGGT GCGGCTATCC GCTGCTCGAC ATGCTCCGCT TCCAGAATTT CTTAGTCGGC
AAATGGTGGA CCAGCGAATA CGGCTCCGCC GAAAACGCCG ACCAGTTCCC CTACCTATTG
AAGTATTCGC CGTATCACAA CGTGAAACCG GGCACCAAGT TCCCGGCCAT CATGTTCAAC
ACCGGCGACA GTGATACCCG CGTCGCGCCA CTGCACGCGC GCAAGATGAC CGCGCTCGTC
CAGCGCGACA ACGCCAACGA CCGCCCCATC TTGCTGCATT ATCAAACCGT CAGCGGCCAC
AGCGCCGGCG TCTCAATCAC GCAAGCCATC AAAGACACCG CCGACGAATT GGCGTTCCTA
TGGAACGAGG TAAGCGGGAA GTAG
 
Protein sequence
MTIRCLLVLF LFSMTLQAAQ PGVVEGGNGI TLPPPPPTAQ KPVTETIHGV TITDPYRWLE 
DQQSPETRAW IDTQMKYTEQ YLSQVKVRPE IEKELGRLER VEQYTIPTER GDMYFFKKRL
ADENQGSIYL RRGLHGDDQR LVDATKLSAD QNTSIQINDI SKDGNLLVYG TRSGGADEEA
VHILDTATAK ELPDSLPSAR YFGIQLSPDA QGLYYSRMEK EGSSVYYHKL GSDPKSDDLI
FGKKFEGEEF GPMQLISEHI TENERYLVVT VAHGVPPKRV DIYAKDLRKP DSQVVKVIHG
IESRFTPVNF GDDFYVMTDY NAPNYRVVKV RIGDSDPQHW TTVVPEAKDP INSISIVGGK
LFVSGLHDVV TQTRIFTLDG KETGRINYPT IGEATNVFGR EDSEHGFYSF ESFIIPPTIY
HYDVKTGKPE VFAKPNVPFD SAQYEVKQVF YKSKDGTRIP MFISSKKGAK RDGKTPTLMF
AYGGFLVDMT PSWNPEWAWW IEQGGFYAQP NLRGGGEYGE TWHKAGMFEK KQNVFDDFFG
AAQYLVDEKY TDTKHLAIRG RSNGGLLMGV AMTQHPEMFG AIWCGYPLLD MLRFQNFLVG
KWWTSEYGSA ENADQFPYLL KYSPYHNVKP GTKFPAIMFN TGDSDTRVAP LHARKMTALV
QRDNANDRPI LLHYQTVSGH SAGVSITQAI KDTADELAFL WNEVSGK