Gene Information |
Locus tag | Acid345_4193 |
Symbol | |
ID | 4072152 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4965610 |
End bp | 4967676 |
Gene Length | 2067 bp |
Protein Length | 688 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637986224 |
Product | peptidase S9, prolyl oligopeptidase active site region |
Protein accession | YP_593267 |
Protein GI | 94971219 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
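The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of the database.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumes 1-based inclusive coordinates and a single terminal stop
    codon that does not contribute an amino acid.
    """
    span = end_bp - start_bp + 1
    if span % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return span, span // 3 - 1


# Values copied from the record: Start bp, End bp.
gene_bp, protein_aa = cds_lengths(4965610, 4967676)
print(gene_bp, protein_aa)  # 2067 688
```

Under these assumptions the result agrees with the listed Gene Length (2067 bp) and Protein Length (688 aa).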
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.259535 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.164737 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTCGTC GTTTTCTCGT CGCTCTTTTA TTGCTGAGTT CTTTCGCGGT CGCGCAGTCC AAACGTGCCT TCACCTTCGA CGACATGATG AAGCTCAAGC GCGTCGCCGA GCCCTACCTC TCGCCCGACG GCAAATGGGC CGCCTTCACC GTCACCGACG TTTCGCTTGA AACCAACAAA AAGACGAACC ACATCTGGAT TGTCCCCGTA GCCGGCGGCG AAGCGCGCCA GCTCACCAAC TACAGCGGGG AAGATAACTT CCGTTTTTCC CCAGACGGCA AATCAGCCCT TGCCATCACC GACGCAGAAG GCAGCTCGCA GGTCTATGTT CAGGATTTCG ATACCACCAC CGGCACGCTC ACTGGCGATC CGCGCAAAGT GACCTCGATC TCCACCGAAG TCAGCGCAGC CACCTGGTCG CCCGACGGCA GGAGCATCCT CTTCGTCTCC GCCGTCTGGC CCGATTGCAA AGATGATGCC TGCAATAAGC AGCGCGACGA CGAACGCTCG CAGTCGAAGG TAAAGGCACA AATCTTCACC CATCTCCTCT ACCGTCACTG GAACGCCTAC GGCAACGGCA AGCGCTCGCA CCTCTTCATC CAGTCTCTCG AAGGCGGCGA ACCCCTCGAC CTCACCCCCG GCGATCACGA CGTCCCGCCG TTCTCGCTAG GCGGCCAGGA CCAGTACTCG TTCTCGCCCG ACGGCAAAGA GATCGCCTAC GCCAGCAACC TCGACGAAGT CGAAGCCACC AGCACCAACA CCGACATCTT CGTCGTCCCC GTCACCGGCG GCACGCCGAA GAAGATTTCG ACCTCGCCCG GCGCCGATTC CACGCCGCTC TACTCGCCCG ACGGCAAGTA CATCGCATTT CGGTCGCAAG CCCGCGCTGG TTACGAGAGC GACCGCTTTC GACTGATGCT GTACGAACGC GCGACGGGGA AGACGACGGA GTTGACGCAG GGATTTGATG GCTGGGTGGA ATCGATCGTG TGGCACCCAA ACTCGCGCGG ACTGTTCTTC ACGTCCGAGT TGAAGGGCGA AGCCCCAGTC TACTGGGTGG AACTACAAGG TCACCCGATC GAACTTTGGG CTGGCTTCAA TGACGGGTGC CAAGTCACGC CTGCGGGCAC GTTCCTAGTC TGCGACGTGA TGTCAATCAA AGCTCCCAAC GAGATTCAAA CGATCAAGAT CTCAAACTTG AAGGAGATCA CACACAAACC CGAAGGCGGC GGGTCCATTG ACCAGATCAC CCACATCAAC GCCCCGATCC TCGACCAAGT CCAGATGCAG CCCATTGAGC CTTTCTGGTT CACCGGCGCC GAGGGAGTAA AAGTCCAAGG CTTCCTCGTG AAGCCGCCGA ACTTCGATGC CTCCAAGAAA TACCCCGTGA AGTTTCTCAT CCACGGCGGC CCGCAAGGCG CTTGGGGCGA TGATTGGTCC TTCCGCTGGA ATCCCGAACT CTTCGCCGCC AACGGCTACC TGGTCATCAT GGTCAATCCG CGCGGGTCCA CTGGCTACGG ACAGAAGTTC ATTGACGATA TCAATGGCGA TTGGGGTGGC CGCGCGTATC AAGACCTGAT GCTAGGCCTC GACTACGCCG AGAAAAACTT CGCGAACGTC GACAAGGACC GCGAGTGCGC TCTCGGCGCC AGCTATGGCG GCTACATGGC GAATTGGCTT GAGGGTCACA CCACCCGCTT CAAGTGCATC GTCTCGCACG ACGGCATGTT CAACACCGTC TCAGCCTTCG GCACCACCGA AGAGCTTTGG TTCAACAACT GGGAGTTCAA AGGCACGCCG 
TGGACGAACC CGGAAATGTA CAAGAAGTGG TCGCCGAACC AGAGCGTCGC GAATTTCAAG ACCCCGATGC TCGTCGTGCA CGGCCAGCTC GACTACCGTC TCGACGTCAG CGAGGGCTTC CAGCTCTTCA CGTACCTGCA GTTGCAGAAG GTCCCGTCGA AGATGCTCTA CTTCCCCGAC GAAGGTCACT GGGTCCTGAA GCCGCAAAAC TCGCAGCTCT GGTACAAAAC CGTCAACGAT TGGGTCGACC AGTGGACGAA AAAATAA
Protein sequence | MVRRFLVALL LLSSFAVAQS KRAFTFDDMM KLKRVAEPYL SPDGKWAAFT VTDVSLETNK KTNHIWIVPV AGGEARQLTN YSGEDNFRFS PDGKSALAIT DAEGSSQVYV QDFDTTTGTL TGDPRKVTSI STEVSAATWS PDGRSILFVS AVWPDCKDDA CNKQRDDERS QSKVKAQIFT HLLYRHWNAY GNGKRSHLFI QSLEGGEPLD LTPGDHDVPP FSLGGQDQYS FSPDGKEIAY ASNLDEVEAT STNTDIFVVP VTGGTPKKIS TSPGADSTPL YSPDGKYIAF RSQARAGYES DRFRLMLYER ATGKTTELTQ GFDGWVESIV WHPNSRGLFF TSELKGEAPV YWVELQGHPI ELWAGFNDGC QVTPAGTFLV CDVMSIKAPN EIQTIKISNL KEITHKPEGG GSIDQITHIN APILDQVQMQ PIEPFWFTGA EGVKVQGFLV KPPNFDASKK YPVKFLIHGG PQGAWGDDWS FRWNPELFAA NGYLVIMVNP RGSTGYGQKF IDDINGDWGG RAYQDLMLGL DYAEKNFANV DKDRECALGA SYGGYMANWL EGHTTRFKCI VSHDGMFNTV SAFGTTEELW FNNWEFKGTP WTNPEMYKKW SPNQSVANFK TPMLVVHGQL DYRLDVSEGF QLFTYLQLQK VPSKMLYFPD EGHWVLKPQN SQLWYKTVND WVDQWTKK
| |
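The sequences above are displayed in space-separated blocks of 10 characters. The following is a minimal sketch of how such a block-formatted gene sequence could be cleaned, checked for GC content, and translated. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. It assumes the gene sequence is already shown in coding orientation (it begins with ATG and ends with TAA even though the strand is "-"), and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code, differing only in permitted start codons.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); amino-acid assignments
# are identical for translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq):
    """Remove the spaces and line breaks used for 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
head = clean("ATGGTTCGTC GTTTTCTCGT CGCTCTTTTA")
print(translate(head))           # MVRRFLVALL
print(gc_fraction(head[:10]))    # 0.5
```

The translated prefix matches the first block of the listed protein sequence. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare against the reported GC content.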