Gene Information |
Locus tag | Acid345_4468 |
Symbol | |
ID | 4070951 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 5301446 |
End bp | 5303317 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637986507 |
Product | peptidyl-dipeptidase A |
Protein accession | YP_593542 |
Protein GI | 94971494 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
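The coordinate and length fields above are related by simple arithmetic, and the following minimal Python sketch checks that they agree. It assumes the Start bp and End bp values are 1-based and inclusive, and that the Gene Length includes one terminal stop codon; neither convention is stated in the record itself. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming an unspliced CDS with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(5301446, 5303317)
print(length)                     # 1872
print(protein_length_aa(length))  # 623
```

Under those assumptions the computed values match the Gene Length (1872 bp) and Protein Length (623 aa) fields listed in the record.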
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.19029 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACGAC CTCACTACGA CGCGAGCACC GCCGCTTGTA CAATTCCCGC CGGAGGTACC ATGCTGCGAA AAGCATTCCT AACTGCTGTC CTTACTGCTG CAATCGTTCC GCTTTCTGTG TTCGCGCAAT CCACTCCCAC CGTCGCCGAC GCCGAAAAGT TCGTGAAAGA CGCCGAGACC AAGCTCGACG ACCTCGGTGT AAAAGCCCAG CGCGCTGAGT GGGTCGCGGA AAACTTCATT ACCGACGACA CCCAGGAAAT CGCGGCCGAA GCCAACGAGA TCGCGAACGC TGAAGCGACG AACTTCGCCA AGCAGACGAT CCAGTTTGAG AAGCTGCAAC TCCCGCCGGA GTTGGCGCGC AAGATGTTGC TCCTCAAACT CGCGGCCATT GCGCCTAGCA ACCCCAAAGA CCTCTCGGAA CTGACCCGGG TGCAGGCCTC GATGGCCGCC GACTACGGCA AGGGCAAGTA TTGCCCCACC ACCGGCAAGC ACGCCGGCGA GTGCCTCGAC ATCACCAAGA TCGAGCACAT CATGGAAACC TCCACCGATC CCGACGAACT GAAGGACCTC TGGATCGGCT GGCACAAGGT CGGCGCACCC ATGCGCCAGC GCTATGCGCG CTTCGTCGAG CTCAGCAATA ATGGTGCCCG CGAAATGGGA TGGGCCGACA CCGGCGCCTA CTGGCGTGCC GGCTACGACA TGCCTCCCGA CCAGTTCAGT GCTGAGTTGG AACGGCTGTG GCAGCAGATG CGTCCGCTGT ACGTTTCTCT GCACACCTAC GTCCGCAACC AGCTGGTAAA GAAGTATGGG GAGCAGGCCG TGAAAGACGG CATGATCCGC GCCGACCTCC TCGGCAACCC CTGGGCCCAG GAGTGGGGCA ACATCTACCC CCTCGTCGCC CCGCCCACCA AGCATCCGCA GCTCGACGTC ACCCAGATTC TGCAAGACAG AAAAGTTGAC GAGCTCGGCA TCGTTCACTA CGGCGAGAAT TTCTTCAAAT CGCTGGGCTT CCCGGCGCTG CCGCAAACTT TCTGGGAGCG CTCTCTCTTC CTGAAACCGA AAGACCGCGA CGTAATCTGC CACGCCAGCG CATGGGACAT CGACAACAAA GATGACCTCC GCATCAAGAC CTGTCTCCAG GTTCGCGCCG ATGACTTCGT CACTGTCCAC CACGAACTTG GCCACAACTT CTATCAGCGC GCCTACAAGG CCCAGTCGCC GCTTTTCGAG AACGGCGCCA ACGACGGCTT CCACGAGGCC ATTGGCGACA CCATCGCGCT CTCCATCACG CCGGAGTACC TGAAGCAGGT CGGCCTCATC GACACCGTCC CGCAACAGGA TGATGTCGCT CTGCTGCTGC GCCAGGCGCT CGATAAGGTC GCGTTCCTAC CCTTCGGATT GCTGATCGAC CAGTGGCGCT GGAAGGTCTT CAACGGGCAG ATCAAGCCGG AAGACTACGA GAAGTCGTGG GTGCAGATGC GCGAGCACTA CCAAGGCGTC TACCCCCCGA CCGACCGCAC CGAAGCCGAC TTCGATCCGG GCGCGAAGTT CCATGTGCCG GCGAACGTCC CGTACACGCG GTACTTCCTC GCGCGCGTTT TGCAATTCCA GTTTTATCGC GCGATGTGCA AAGAGGCCGG ATTCACCGGT CCGCTGCACC AGTGCTCGTT CTACAACAAC AAGAAAGCCG GCGCGAAACT GGATGCCATG CTTGAGATGG GCGCCAGCAA GCCCTGGCCG GAAGAGCTCA AAGTCCTGAC CGGCGAAGAC AAGATGGACG CCGGGGCCAT GCTCGATTAC 
TTCGCGCCGC TAAAAAAATG GCTCGACGAA CAGAACAAGG GGCAGCAAGC CGGCTGGACT GAACAGAAGT AA
|
Protein sequence | MRRPHYDAST AACTIPAGGT MLRKAFLTAV LTAAIVPLSV FAQSTPTVAD AEKFVKDAET KLDDLGVKAQ RAEWVAENFI TDDTQEIAAE ANEIANAEAT NFAKQTIQFE KLQLPPELAR KMLLLKLAAI APSNPKDLSE LTRVQASMAA DYGKGKYCPT TGKHAGECLD ITKIEHIMET STDPDELKDL WIGWHKVGAP MRQRYARFVE LSNNGAREMG WADTGAYWRA GYDMPPDQFS AELERLWQQM RPLYVSLHTY VRNQLVKKYG EQAVKDGMIR ADLLGNPWAQ EWGNIYPLVA PPTKHPQLDV TQILQDRKVD ELGIVHYGEN FFKSLGFPAL PQTFWERSLF LKPKDRDVIC HASAWDIDNK DDLRIKTCLQ VRADDFVTVH HELGHNFYQR AYKAQSPLFE NGANDGFHEA IGDTIALSIT PEYLKQVGLI DTVPQQDDVA LLLRQALDKV AFLPFGLLID QWRWKVFNGQ IKPEDYEKSW VQMREHYQGV YPPTDRTEAD FDPGAKFHVP ANVPYTRYFL ARVLQFQFYR AMCKEAGFTG PLHQCSFYNN KKAGAKLDAM LEMGASKPWP EELKVLTGED KMDAGAMLDY FAPLKKWLDE QNKGQQAGWT EQK
|
| |
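The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to do that in Python, and to compute a GC percentage and run a basic reading-frame check. The helper names are hypothetical; the stop codons used are those of translation table 11 (TAA, TAG, TGA). The record does not say how its GC content figure was rounded, so the sketch returns an unrounded percentage.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def normalize(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    s = normalize(seq)
    return len(s) >= 6 and len(s) % 3 == 0 and s[-3:] in STOP_CODONS


# Toy input in the same block layout; paste the full gene sequence here instead.
example = "ATGAAGGCCG AA"
print(normalize(example))
print(gc_percent(example))
print(looks_like_complete_cds(example))
```

Pasting the full Gene sequence value into `example` allows the result to be compared with the GC content and Gene Length fields of the record.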