Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0808 |
Symbol | |
ID | 4068687 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1000440 |
End bp | 1001900 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637982815 |
Product | TPR repeat-containing protein |
Protein accession | YP_589887 |
Protein GI | 94967839 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.224492 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.93988 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCGTT TGATGACCAA GCTTATGTGC TATGCCGTGC TCGCGGCACC GTGCGCGTTT GCTACCGGAT CAGGCCAGGA GGCCACTCCG GAGAAAGACA TTCCACTCAC CACTAGTTCG GACGTAGCGC GGCGCGCGTT CCAGGGCGGA CTCGAGAATA TCGAGAACCA GCAGTCCAAG CGTGCCCATG TCGACTTCCG GTCCGCAGTG CGCGCTGACA GCAACTTTGC GCTGGCGCAC TTGTTCCTGG CATACGACAA CGGAAATCCG GCGGAAGAGA AGGCGGAGCT TCAGAAGGCG CGCACGCTGG CCGCGAACGC CTCGAAGCCC GAACAGATGC TGGTGGAATG GATGGCGGGA TCGCGTGAGG GGAAGATGGT TCCGGCGATC TCTGCACTGA ACGATGTGAC GTCCGCGTAT CCCGAAGACA AGTTCCTGCT ATTTCTTGCC GGTCGCTGGA TGGTGCAACA GCAGAATTAC GAAGGTGCGC AGCGGTTCCT CGAGCGGGCA GTGACGATCG ATCCGAATTA TCCGGCGGCC CTGAACGAAC TGGCGTATGC CTACGCCGGC AACAGGATCT TCGACAAGGC ATTCGAGGCA CTCGACAAGT ACGCGAAGCT GCTTCCAGGT GAACCGAATA CGCAGGACTC CTACGGGGAG ATCAGCCTGA AGGCGGGGCG ATTCGAGCAG GCGGTTGAAC ACTACAAGAA AGCGCTGGAA TACGATTCGA CGTTCGTGTG GTCGCAGGTT GGCCTGGGCG ACAGTTACAT GCTGATGGGG AAAGAGCAGC AGGCCCGTGC AGAATATGCG AAGGCGGCTG CGATGGCACC GTCGGATGGC GATCGGCTTA CGTGGCAACT GCAGTCGGCA TTGACCTATA TGTTCGCGCA TCAGCATGAA TCCGCCGATG ATGCGTTCAC GAAGGTGGCT GAAGAAGCGA GTTCGTTGCA TATCGGCAAG CAGGAAGCCA TGGCGCACAG GCTTATGTCG GCTTACGACC CGGATTTGCC GGGGTTCCTG CGGCACACTG CTGACGCTGA AAGCGCGTTG AAGACACGCT CGGATATATC GAAGAGCGAT CGCGACCAGG AGATGGCGCT GCTGTTAAAA TCCCGCGCTG TGCGGGCGGC GGAATTCGGC CGCACGGACA TGGCGAACGA GTCCTTGAAG AAGCTCAGCC AGATGGCGGA GAGCATCCCG GACAACGTAA TTCAACAGGC GAACGAAGGC GCGCAGGGCG GCGTATTGTG GGTCCAGAAG AAATACGCCG AGGCCATACC TCATTTGGAA GAGGACCAGG GGAATCCGTT GAGTGCGGCG CGCCTGTTGC AGGCGTATCG CGAGAGCGGA GACTCGTATC GTGCCGACTC TCGGGCGATA CGGTTAAATA GTTATTACGA GCCGACGCTC GACGATTACC TGGCCAGGCA GCTGTTGAAC GGCAAACAGC GCAAGAAGTA A
|
Protein sequence | MNRLMTKLMC YAVLAAPCAF ATGSGQEATP EKDIPLTTSS DVARRAFQGG LENIENQQSK RAHVDFRSAV RADSNFALAH LFLAYDNGNP AEEKAELQKA RTLAANASKP EQMLVEWMAG SREGKMVPAI SALNDVTSAY PEDKFLLFLA GRWMVQQQNY EGAQRFLERA VTIDPNYPAA LNELAYAYAG NRIFDKAFEA LDKYAKLLPG EPNTQDSYGE ISLKAGRFEQ AVEHYKKALE YDSTFVWSQV GLGDSYMLMG KEQQARAEYA KAAAMAPSDG DRLTWQLQSA LTYMFAHQHE SADDAFTKVA EEASSLHIGK QEAMAHRLMS AYDPDLPGFL RHTADAESAL KTRSDISKSD RDQEMALLLK SRAVRAAEFG RTDMANESLK KLSQMAESIP DNVIQQANEG AQGGVLWVQK KYAEAIPHLE EDQGNPLSAA RLLQAYRESG DSYRADSRAI RLNSYYEPTL DDYLARQLLN GKQRKK
|
| |