Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2619 |
Symbol | |
ID | 4072028 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3088942 |
End bp | 3090786 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637984636 |
Product | peptidase M61 |
Protein accession | YP_591694 |
Protein GI | 94969646 |
COG category | [R] General function prediction only |
COG ID | [COG3975] Predicted protease with the C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.84974 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGGCGCT ACGCCGCGGT TCGACTCGTA TGCAGCGTGT TGTGGTTCAT CGTTTATTGT TCGCGATTCT CGGCCGCTGC GGTCACGTGC GCTTCGCCGG AGCCGCTGCC GGGTAATCCG CAATACGAGT ACTTCGTCTC GGTCGCCGAT CACGACCGCC ATCAACTGCA CGTGTCCATC CGATACCGCG CCACGAAGCC GACGGTTTTC CAAATGCCGG TATGGAACGC GCTCTACCAG GTACGTGACT TCGCGCAGTA CGCGACCGAA CTACAAGCAC ACGATGGCAA TGGAACCGCG TTGAGCGTCG AGGCGGAGGG GAATTCCGCT TGGAGAGTTC CGCCGTCAAA CGGCTGCGCG GTCATTGAAT ACAACCTGAA TGCGAATGTT CCCGGGCCGT TCAGCGCGCA GGCGAGCAGC GATCACGTTT TTCTAAACTG GGCGCAGGTG TTGCTCTACG GAGATCGCAA CGCTCCGTTG ATGCTGGCCG TTTCTGATCT TCCTGCGACA TGGTCGTTAC GCGACCTTGG TTTGTTCGAT GAAACCGCGC ACCGGCTCGC GCGTCCCGTG AGTTACGACG CTCTCGTGGA CAGTCCGGTG GAGATGTCGG CGAGCAAGAT CGCGGCCTTC GACGAAGATG GTGCCCAATA TCGGATTGTT GTGGATGCGG ATGACGCTGA TTACAACCTC CCCGCCATCC AGGATGCGCT GCGTAAAGTC GTCCACGCTT CGGTTGACTG GATGCACGAT CGTCCGTTCG ACCAATACAC GTTCCTGTAT CACTTCCCAC GCGGGCCCGT CGGCGGCGGC ATGGAGCACA GCTACGGTAC CGCGATCTCC GCGCCGGCGG ACCGCATGCA CGAAAACGCG CTTGCTCCCA TCAGTACCTC GGCGCACGAG TTCTTTCATC TGTGGAACGT GAAGAGGATC CGGCCGCAAT CGCTCCAGCC CGTCGACTTT CAGCATGAGC AGTACACACG CGCACTGTGG TTTGCCGAGG GCGTGACAAG TACCGCGTCG GAACTGATGC TGGTGCGGGC GGGGCTGGAG AATGAACGCG GGTATCTGTC GCATCTCTCG GCAGTGATTA GCGACTTTGA GGCTCGTCCC GCGCACAAAT TCCAGTCGCC TGAGACCTCG AGCCTGGAGG CATGGATGGA AGGCCACGCC TACTATCGGC GTCCGGAGCG CAGCGTTTCG TACTACACCA GCGGAGAATT GTTAGGCGTT TTGCTTGATC TCGAGATGCG CAAGCGGACG CGGGGAACAA AGTCGTTACG CGACCTGTTC ATTTATCTCA ACGCCGAATA CGCTAAAAAG CACCGCTACT ACGACGATTC CAACGCGGTC CAGCAGGCCG CGGAAAAAGT CGCCGGTGGC AGCTTCCAAT CCTTCTTCGA TAAGTACGTT CGCAGCACGG TGCCAATCCC GTACGATGAC TATCTCCGTT TCGTCGGGCT GACGCTGCAG CCGTTTGCGA TCCTGGGCGT AGATGCTGGG TTCGACGCAT CCGTGAACTT CACCGGCCTG CCGGAGGTGA CCAAGGTGAC GCCGGGAAGC GCGGTGGAAG CCGCGGGCGT GCATGCCGGA GATACATTGA CGGCCATCGA TGAACACGAG TACATGGGTG ATCTCTCGCA CTACCTCGTC GGCCACAAGG CGGGCGACAC GGTGACGTTT CGATTCGCCT CGCGCACCCG AACCATGGCC GTAAAGGTGA CGCTGGCGGA ATCAAAGGGC CCTGCGTTTT CGGTCGTCGA AGAGCCGGCC GCCAGCGTGG AACAGCGCGC TCAACGCGCG GCGTGGATCC GTGGCGACGA CATGGAGAGC GGAGCACACA AATGA
|
Protein sequence | MRRYAAVRLV CSVLWFIVYC SRFSAAAVTC ASPEPLPGNP QYEYFVSVAD HDRHQLHVSI RYRATKPTVF QMPVWNALYQ VRDFAQYATE LQAHDGNGTA LSVEAEGNSA WRVPPSNGCA VIEYNLNANV PGPFSAQASS DHVFLNWAQV LLYGDRNAPL MLAVSDLPAT WSLRDLGLFD ETAHRLARPV SYDALVDSPV EMSASKIAAF DEDGAQYRIV VDADDADYNL PAIQDALRKV VHASVDWMHD RPFDQYTFLY HFPRGPVGGG MEHSYGTAIS APADRMHENA LAPISTSAHE FFHLWNVKRI RPQSLQPVDF QHEQYTRALW FAEGVTSTAS ELMLVRAGLE NERGYLSHLS AVISDFEARP AHKFQSPETS SLEAWMEGHA YYRRPERSVS YYTSGELLGV LLDLEMRKRT RGTKSLRDLF IYLNAEYAKK HRYYDDSNAV QQAAEKVAGG SFQSFFDKYV RSTVPIPYDD YLRFVGLTLQ PFAILGVDAG FDASVNFTGL PEVTKVTPGS AVEAAGVHAG DTLTAIDEHE YMGDLSHYLV GHKAGDTVTF RFASRTRTMA VKVTLAESKG PAFSVVEEPA ASVEQRAQRA AWIRGDDMES GAHK
|
| |