Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0116 |
Symbol | |
ID | 4069491 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 124797 |
End bp | 125894 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637982116 |
Product | peptidase M50 |
Protein accession | YP_589195 |
Protein GI | 94967147 |
COG category | [R] General function prediction only |
COG ID | [COG1994] Zn-dependent proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.345861 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAAGCT GGTCCATTCC GATAGGGCGA TTGTTCGGCG TCGAGGTGCG ACTGCACCTG ACGTTCTTTT TCCTGCTGAT GTTCGTGTGG TTTGCGCAGT CGGCGGCGAA CGGAACGGCG CTGGCAATGC GCGGACTTGC GCTGGTGGGC CTGGTATTCG GCTCCGTGGT GCTGCATGAG TTGGGGCATG CACTGGTCGC GATCCGTCTC GGGGTGAAGG TGCGCGGCAT TGTGCTGCTG CCCATCGGCG GCATCACGTT CATGGACGAC AACGCGCCGC ATACGCGACA GACGGCGGCA CGCGACATTC GCATCAGCGC TGCGGGACCG CTGATCAATC TCGGGATAGG GATCGGCGCC GCGATTGCCT GCGTCGGGCT GTTGCACATG AACCTGCTCG CGCGGCCGCT GATTACGCCG GTGAACCTGA CGAAGAGCTT CGTGTGGGCG AACCTGTTTC TCGGACTGTT TAATCTGCTG CCGGCGTATC CCATGGACGG CGGGCGCGTG CTTCGCTCCT GGTATGCGCA ACGCATGGAC TACGTGCAGG CCACGCGGCG GGCGGTGACG ATCGGCCAGA CCTTCGCAGC GCTGTTCATG GTCGCCGGGA TCTGGGCGCC GTGGCTGATG ATGATCGGCT TCTTCCTGTT TATCGGCGCG CAAATTGAAG ATCGGACTGC GATCTTCCAA TCAGTGCTGG AGACGGTGCG GATTGAAGAG GTCATGCTTA CGGACTTCTC GACGCTGTCT TCAGCCGACA CGCTTGAAGA CGCGCTGCAC AAGGCCGTGC ACTCGTTGCA AGACGAATTC CCGGTGATTC GCGGCAGCGA TCTGGTCGGC GTCATCTCGC GCCAACGCAT TCTCGATGCG CTGCGCGGCG GCAATGGCTA TGTGCAAGGC GTGATGAACC GAGGCTTCCA GATCGCACAG AAGTCCGAGA CGCTGGCAGC AATCTTCCGG CGCATCGGAA CCAAGGGCTT GAGCCTGGTG CCGGTGGTGG ATGGCGATCG CCTCGTGGGC ATCGTAACCA TGCAGAATTT GACGCATTCG ATGGCGCTGC TGGCGGAGTC GCGGAAATTG CAGAGGATGG AAGAGTAG
|
Protein sequence | MRSWSIPIGR LFGVEVRLHL TFFFLLMFVW FAQSAANGTA LAMRGLALVG LVFGSVVLHE LGHALVAIRL GVKVRGIVLL PIGGITFMDD NAPHTRQTAA RDIRISAAGP LINLGIGIGA AIACVGLLHM NLLARPLITP VNLTKSFVWA NLFLGLFNLL PAYPMDGGRV LRSWYAQRMD YVQATRRAVT IGQTFAALFM VAGIWAPWLM MIGFFLFIGA QIEDRTAIFQ SVLETVRIEE VMLTDFSTLS SADTLEDALH KAVHSLQDEF PVIRGSDLVG VISRQRILDA LRGGNGYVQG VMNRGFQIAQ KSETLAAIFR RIGTKGLSLV PVVDGDRLVG IVTMQNLTHS MALLAESRKL QRMEE
|
| |