Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3615 |
Symbol | |
ID | 4070135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4277040 |
End bp | 4278101 |
Gene Length | 1062 bp |
Protein Length | 353 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637985638 |
Product | TPR repeat-containing protein |
Protein accession | YP_592690 |
Protein GI | 94970642 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.242924 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.313882 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACTACC GGCCCCTTTC CCTTCTCCTC GGTATCGTGG TCGCAACCAC GGCCATCGGT ACCGCCCAGA TTGAAGAGTA CCGTACCGGC AGCGATTCTC CCGCTGCAGA CACCTTAACG CACTCGACGT TGACCGGTAC GGTGACTTCC GCCGACGGGT CGCCTCTCAA TAATATCCGC ATTGAGGTTC GCAGGATCGG GATCGGCTCA CCGGCGGACG CAACTTACAG CCATGTGAAC GGCTCGTTCG ACTTCGCGAA CCTGCGGCCG GGTTCCTACG AAGTGGTGGC GATTGACGGT GTGATGGAGG CGCGGGAACA GTTCATCGTA CAGAGCCAGT TGGTGTCTCT CAGCCTGCGG ATGCCGGTAA CCCGGTCAGC AGCACCGACC CGCGGCACGA TTTCTGTAGC GGAATTAAAG GTCCCCGATA AAGCCAAGCA CCTGCTCGAC AAGGCGCAAG GGGCGCTGTC GAAGGGTCAC AGCGACGAAG CCGAGAAGCA GGTAGAAGAG GCGCTGCAGG CAGCGCCAGA TTATGCGGCC GCACTGTCAT TCCGCGCCGC GCTGAAACTT ACCCGCAACG ATACGCAATC GGCGCTCGAT GACCTCGACC ACGCGGTAAA GGCCGATCCG AATTTTGCGC AGGCTTACAT GTTGCTGGGA GCGGCGTTTA ACCAGCTAGG CCGCTACGAC GAGGCGCTCC GCAGCTTGGA TCGTGGCTCG ATGTATGACC CTAAGTCATG GCAGGTTTCC TACGAGATGT CGAAGGCGTG GATGGGCAAG CATGATTACG TTCATGCCAT CCAGCAGCTG AACCGGACGG AGTCGTTGGG CGCAGTGAGA ATCGCGGGGC AGGTGCATCT GCTCAAGGGC TACGCGTTCA TGGGCCAGAA ACAATTTGAG CAGGCACAGA CGGAACTGCA GGCGTACTTA ACGTCCGAAC CTCAGAGCAA GATGGCGGGA TCGGTTCGCG CTGCCCTTGC ACAGATCCAG ACCCAGATGG CGCAGAGTCC TGCGGCGTTG ACGTTGCCGA CGATGACGGG GATCTTCGCG CAGGCGCACT GA
|
Protein sequence | MYYRPLSLLL GIVVATTAIG TAQIEEYRTG SDSPAADTLT HSTLTGTVTS ADGSPLNNIR IEVRRIGIGS PADATYSHVN GSFDFANLRP GSYEVVAIDG VMEAREQFIV QSQLVSLSLR MPVTRSAAPT RGTISVAELK VPDKAKHLLD KAQGALSKGH SDEAEKQVEE ALQAAPDYAA ALSFRAALKL TRNDTQSALD DLDHAVKADP NFAQAYMLLG AAFNQLGRYD EALRSLDRGS MYDPKSWQVS YEMSKAWMGK HDYVHAIQQL NRTESLGAVR IAGQVHLLKG YAFMGQKQFE QAQTELQAYL TSEPQSKMAG SVRAALAQIQ TQMAQSPAAL TLPTMTGIFA QAH
|
| |