Gene Information |
Locus tag | Acid345_1435 |
Symbol | |
ID | 4068814 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 1733349 |
End bp | 1734986 |
Gene Length | 1638 bp |
Protein Length | 545 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637983444 |
Product | peptidase S1C, Do |
Protein accession | YP_590511 |
Protein GI | 94968463 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family |
|
|
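The coordinate and length fields above are mutually consistent, and the relationship can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates, and that Gene Length includes the stop codon while Protein Length does not. The function name `lengths_consistent` is hypothetical and not part of any database API.

```python
def lengths_consistent(start_bp, end_bp, gene_len_bp, protein_len_aa):
    """Check that coordinates, gene length and protein length agree.

    Assumptions (not stated in the record itself): coordinates are
    1-based and inclusive, and the last codon of the gene is a stop
    codon that is not counted in the protein length.
    """
    # Inclusive span: both the first and the last base are counted.
    span = end_bp - start_bp + 1
    if span != gene_len_bp:
        return False
    # A complete, unspliced CDS should be a whole number of codons.
    if gene_len_bp % 3 != 0:
        return False
    # Number of codons minus the stop codon gives the residue count.
    return gene_len_bp // 3 - 1 == protein_len_aa


# Values taken from the record for locus tag Acid345_1435.
print(lengths_consistent(1733349, 1734986, 1638, 545))  # prints True
```

For this record, 1734986 - 1733349 + 1 = 1638 bp, and 1638 / 3 - 1 = 545 aa.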
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.600827 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACACCGC CCACCAGCGG TTCCTGGCAG CGCCTGAGAG CCAATCGGTT CGCTTCTGTT CTTGTGATCC TGGCCACTCT CTCGCTCGGC ATTCTGATCG GAACCGTGAT CTCTTCGACT GTGAAGGGCA ACGAAAAACA GGTCAGCAGC TCGGACGCAA CTCCGCTGCA GATCCCTGAG CCCAAGCAGC TTTCCAACCA GTTCGCGCAG ATTGCGAAAC AGCTCGAACC GGCGGTGGTC AACATCAACA CCGAGTCCAC CATGAAGCAC CCTTCCATCA AGGGACGACG CGGCCAGCAA ACGCCTCCCG ATGACGACGA GGATAATCAG GACGATCAGG ACCAAGGCCC GGGCGGCGGG CAGGACAGCC CCTTCCAGGA CTTCTTCGAT CGCTTCTTCG GTGGCCAGGG TGGCGGCGGG CAAATGCCCC AGCAGGACCT CCGTCAGCGC GCCCTTGGCT CCGGCATCAT CATTGACCCG AAGGGCTACA TCATCACCAA CGATCACGTG GTTGATAAAG CCGACAAGAT CAAGGTCAAC CTCATGGGCG ACCCTGAGAC CGTCAGCTAC GACGCTACGG TCATCGGCGT GGACAAGGAA ACCGATCTCG CCGTCATCAA GATCAACGTG AAGCACGATC TTCCTTACGC GAAGCTCGGC AACTCCGAGG GCGTACAGGT CGGTGACTGG GTTCTTGCCC TCGGCAGCCC CTTCGGTCTT AACTCGACCA TGACTGCCGG AATCGTCTCC GCCAAGGGCC GCAACATCGT CCCGCAGCGC CAGTTCCAGC AGTTCATCCA GACCGACGCC GCCATCAACC CCGGCAACTC CGGCGGTCCG CTCGTGGACA TGGCCGGTGA GGTCATCGGC ATCAACACCG CGATCTTCAC CACCGGCGGC GGCTACCAGG GTGTTGGCTT TGCGCTACCC TCCAACACGG TCATACAGGT TTATAACCAG CTCATCGCGC CCGATCACAA GGTCTCGCGC GGCTCCATCG GCGTGGAATT CAACGCGGTA GCGAATCCCG CGGTAGCGCG TGTTTACGGC GTCACCACGG GCGTTACGGT AGCCAACGTC ACTCCCAATG GACCGGCACA AAAGGCCGGC ATCCAGACGG GCGACACCAT CGTTTCTGTG GATGGCAAGC CCGTAAAGAA TGGCGATGAA CTCGTCGCTG ACATCTCTGC GCGCAAGCCG GGCTCGACCG CGAAGGTCGG CTTCGTTCGC AACGGCAAGG AACAGTCTGC AAGCGTCACG ATCGCGGATC GCTCCAAGCT CTACGCCGCG CGTCTTGGCG GCGGTGGCGA AGAGCAGGGT GAAGGCGGCG AAGGCCAGCC CCAGCCCAGC AAGTTCGGTG CGACCGTGCA GAACATTACG CCTGAGATGG CGCAGCAGTT GAAGCTGCCC AACACCAAGG GCGTTGTGGT CAGCAACGTG AAGCAGGACA GCTTTGCGGA GTCTGTCGGC CTTGGCCGCG GCGACGTGAT CCTCGAGATC AACAAACAGC CCGTCACCAA CGAAGACGAT TTCCGCCGCA TTCAGGGCAG CCTCAAGAGC GGTGCTGACG TCGTCTTCCT CGTCCGTCCT CGCGGACGCG ATAACGGAAC CATTTTCATG GCCGGAACCT TGCCGTAA
|
Protein sequence | MTPPTSGSWQ RLRANRFASV LVILATLSLG ILIGTVISST VKGNEKQVSS SDATPLQIPE PKQLSNQFAQ IAKQLEPAVV NINTESTMKH PSIKGRRGQQ TPPDDDEDNQ DDQDQGPGGG QDSPFQDFFD RFFGGQGGGG QMPQQDLRQR ALGSGIIIDP KGYIITNDHV VDKADKIKVN LMGDPETVSY DATVIGVDKE TDLAVIKINV KHDLPYAKLG NSEGVQVGDW VLALGSPFGL NSTMTAGIVS AKGRNIVPQR QFQQFIQTDA AINPGNSGGP LVDMAGEVIG INTAIFTTGG GYQGVGFALP SNTVIQVYNQ LIAPDHKVSR GSIGVEFNAV ANPAVARVYG VTTGVTVANV TPNGPAQKAG IQTGDTIVSV DGKPVKNGDE LVADISARKP GSTAKVGFVR NGKEQSASVT IADRSKLYAA RLGGGGEEQG EGGEGQPQPS KFGATVQNIT PEMAQQLKLP NTKGVVVSNV KQDSFAESVG LGRGDVILEI NKQPVTNEDD FRRIQGSLKS GADVVFLVRP RGRDNGTIFM AGTLP
|
| |
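The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block-formatted gene sequence could be cleaned, summarized as a GC fraction, and translated is given below. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA) and uses the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. The sketch does not handle the alternative start codons that table 11 permits, which is not an issue for a gene starting with ATG.

```python
# Codon table in NCBI order (T, C, A, G at each position).
# Table 11 shares these amino-acid assignments with the standard code.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
_CODON_TABLE = {
    a + b + c: _AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(_BASES)
    for j, b in enumerate(_BASES)
    for k, c in enumerate(_BASES)
}


def clean(seq):
    """Remove whitespace between blocks and normalize to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = _CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
print(translate("ATGACACCGC CCACCAGCGG TTCCTGGCAG"))  # prints MTPPTSGSWQ
```

Applied to the first three blocks of the gene sequence, this reproduces the first ten residues of the listed protein sequence. Running `gc_fraction` on the full gene sequence can be compared against the GC content field of the record.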