Gene Information |
Locus tag | Acid345_4142 |
Symbol | |
ID | 4072333 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4901048 |
End bp | 4903171 |
Gene Length | 2124 bp |
Protein Length | 707 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637986173 |
Product | prolyl oligopeptidase |
Protein accession | YP_593216 |
Protein GI | 94971168 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1505] Serine proteases of the peptidase family S9A |
TIGRFAM ID | |
|
|
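The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: the variable names are illustrative, and it assumes that the start and end coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted as a residue.

```python
# Values copied from the Gene Information table above.
start_bp = 4901048
end_bp = 4903171
reported_gene_length = 2124      # bp
reported_protein_length = 707    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print("gene length matches:", gene_length == reported_gene_length)
print("whole number of codons:", gene_length % 3 == 0)
print("protein length matches:", protein_length == reported_protein_length)
```

Under these assumptions the reported gene length and protein length agree with the coordinates.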
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.199893 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATTC GTTGTTTGCT CGTCCTCTTC CTGTTCTCTA TGACTCTTCA AGCTGCTCAA CCTGGCGTAG TCGAAGGCGG CAATGGCATC ACTCTTCCGC CGCCCCCTCC CACCGCGCAG AAGCCCGTCA CCGAGACGAT TCACGGAGTC ACGATCACCG ACCCCTATCG TTGGCTCGAA GACCAGCAGA GTCCCGAGAC GCGCGCGTGG ATTGATACGC AGATGAAGTA CACCGAGCAG TACCTGTCGC AGGTGAAGGT TCGTCCGGAG ATCGAGAAAG AGTTGGGCCG CCTGGAGCGC GTGGAGCAAT ACACCATCCC CACCGAGCGC GGCGATATGT ACTTCTTCAA AAAGCGCCTC GCCGATGAGA ACCAGGGCTC CATCTACCTC CGCCGCGGCC TTCACGGTGA CGACCAGCGC CTTGTAGATG CGACCAAACT CAGCGCCGAC CAGAACACCT CGATCCAGAT CAACGACATC TCGAAAGACG GCAATCTCCT CGTGTACGGA ACGCGCTCCG GCGGCGCCGA TGAAGAAGCC GTCCACATTC TCGACACCGC TACCGCCAAA GAGCTTCCCG ATTCCCTGCC CAGCGCGCGC TACTTCGGCA TCCAGCTCAG CCCCGACGCG CAAGGCCTCT ACTACTCGCG CATGGAGAAG GAAGGCTCGA GCGTTTACTA CCACAAACTC GGTAGCGACC CGAAGAGCGA CGATCTGATC TTCGGCAAGA AATTCGAAGG CGAAGAATTC GGCCCAATGC AGCTGATCTC CGAGCACATC ACGGAGAACG AGCGCTATCT CGTCGTCACC GTGGCGCACG GCGTTCCGCC CAAGCGCGTG GACATTTACG CCAAAGACCT GCGCAAGCCC GACTCGCAAG TCGTGAAGGT GATTCACGGC ATCGAGAGCC GCTTCACGCC GGTGAATTTC GGCGACGATT TCTACGTGAT GACCGACTAC AACGCGCCCA ACTATCGCGT AGTAAAGGTC CGCATCGGCG ACTCCGACCC GCAGCACTGG ACCACCGTCG TCCCCGAAGC CAAAGATCCT ATCAACAGCA TCTCGATTGT CGGCGGCAAG CTCTTCGTCA GCGGCTTGCA CGACGTTGTG ACGCAGACCC GCATCTTCAC CCTCGACGGC AAAGAGACCG GCCGCATCAA CTATCCGACG ATCGGTGAGG CCACCAACGT CTTCGGCCGC GAAGACAGCG AGCACGGCTT CTACAGCTTC GAGTCATTCA TCATCCCGCC GACCATTTAC CACTACGACG TAAAGACCGG CAAACCCGAG GTCTTCGCTA AACCCAACGT TCCGTTCGAC TCCGCTCAGT ACGAAGTGAA GCAGGTCTTC TACAAGTCGA AAGACGGCAC CCGCATTCCG ATGTTCATCT CGTCGAAGAA AGGCGCGAAG CGCGATGGCA AAACCCCGAC GCTGATGTTC GCCTACGGCG GCTTTCTCGT GGACATGACG CCCTCGTGGA ACCCGGAGTG GGCATGGTGG ATTGAGCAGG GCGGTTTCTA CGCGCAGCCC AACCTGCGCG GCGGCGGCGA GTACGGCGAA ACCTGGCACA AGGCCGGCAT GTTCGAGAAG AAGCAGAACG TCTTCGACGA CTTCTTCGGC GCGGCGCAAT ATCTCGTCGA CGAAAAATAC ACCGACACCA AGCACCTCGC CATCCGTGGC CGCTCCAACG GCGGCCTGCT GATGGGCGTC GCGATGACCC AGCATCCCGA GATGTTCGGC GCCATCTGGT GCGGCTATCC GCTGCTCGAC ATGCTCCGCT TCCAGAATTT CTTAGTCGGC 
AAATGGTGGA CCAGCGAATA CGGCTCCGCC GAAAACGCCG ACCAGTTCCC CTACCTATTG AAGTATTCGC CGTATCACAA CGTGAAACCG GGCACCAAGT TCCCGGCCAT CATGTTCAAC ACCGGCGACA GTGATACCCG CGTCGCGCCA CTGCACGCGC GCAAGATGAC CGCGCTCGTC CAGCGCGACA ACGCCAACGA CCGCCCCATC TTGCTGCATT ATCAAACCGT CAGCGGCCAC AGCGCCGGCG TCTCAATCAC GCAAGCCATC AAAGACACCG CCGACGAATT GGCGTTCCTA TGGAACGAGG TAAGCGGGAA GTAG
|
Protein sequence | MTIRCLLVLF LFSMTLQAAQ PGVVEGGNGI TLPPPPPTAQ KPVTETIHGV TITDPYRWLE DQQSPETRAW IDTQMKYTEQ YLSQVKVRPE IEKELGRLER VEQYTIPTER GDMYFFKKRL ADENQGSIYL RRGLHGDDQR LVDATKLSAD QNTSIQINDI SKDGNLLVYG TRSGGADEEA VHILDTATAK ELPDSLPSAR YFGIQLSPDA QGLYYSRMEK EGSSVYYHKL GSDPKSDDLI FGKKFEGEEF GPMQLISEHI TENERYLVVT VAHGVPPKRV DIYAKDLRKP DSQVVKVIHG IESRFTPVNF GDDFYVMTDY NAPNYRVVKV RIGDSDPQHW TTVVPEAKDP INSISIVGGK LFVSGLHDVV TQTRIFTLDG KETGRINYPT IGEATNVFGR EDSEHGFYSF ESFIIPPTIY HYDVKTGKPE VFAKPNVPFD SAQYEVKQVF YKSKDGTRIP MFISSKKGAK RDGKTPTLMF AYGGFLVDMT PSWNPEWAWW IEQGGFYAQP NLRGGGEYGE TWHKAGMFEK KQNVFDDFFG AAQYLVDEKY TDTKHLAIRG RSNGGLLMGV AMTQHPEMFG AIWCGYPLLD MLRFQNFLVG KWWTSEYGSA ENADQFPYLL KYSPYHNVKP GTKFPAIMFN TGDSDTRVAP LHARKMTALV QRDNANDRPI LLHYQTVSGH SAGVSITQAI KDTADELAFL WNEVSGK
|
| |
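The gene and protein sequences above can be compared by translating the nucleotide sequence. The sketch below is illustrative and not part of the original record: the helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and it assumes the amino-acid assignments of translation table 11 are the same as those of the standard genetic code (the two tables differ in which start codons are permitted, which this sketch does not model). Only the first 30 bases of the gene sequence are used here; the full sequence can be pasted in the same way, since spaces are stripped.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 uses the same amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGACGATTC GTTGTTTGCT CGTCCTCTTC"
print(translate(first_blocks))  # MTIRCLLVLF, the first block of the protein sequence
```

Calling `gc_fraction` on the full gene sequence gives a value that can be compared with the GC content field in the Gene Information table.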