Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0446 |
Symbol | |
ID | 4071693 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 526694 |
End bp | 527827 |
Gene Length | 1134 bp |
Protein Length | 377 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637982450 |
Product | TPR repeat-containing protein |
Protein accession | YP_589525 |
Protein GI | 94967477 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.021695 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCGCGGG GCTGTTTTGT CGTCGTATCG CTTTTTTTTC TTCCGTATTT CGTATTCGCC CAGGCCCCCG ATCCTTCCGC AGCCATCGCC TTTGAACAAC AAGGGAAACT CCAGGAAGCG GAGAGGACCT GGCGCGAGGT CATCAAGCAA AATCCCAAAG ATGCGGGTGC CTTTGCCAGT CTGGGTGTAG TCCTGTCGAA GGAACAGAAA TATCAGGAGG CCGCTGCAGC TGATCGCAAA GCGCTCACGC TTGATCCTTC TCTGCCGGGG GTGCAACTCA ATCTTGGACT GGCGGAGTTC AAGCGAAATA ACTTCGTAGC AGCCATACCG CCACTCAAAG CCGCCTACGC CGCGGATCCA GCCAGTAAGC AGGCGGTGAC TTTACTTGGT TTGAGTTACT ACGGGTCGAA GCAATTTGCC GAGGCAAGCA AGTATCTCGG AATCGCCGCC AAGGCAGATC CAGCCAACAT TGAATTGCAT CAGGTGCTGG CGCAAAGCTG TCTCTCCGCC CGGAATTACG ACTGTGCGCT CGAGGAATTC CGCCAAATCT CCGCCCAGAA TCCGCAATCG GCCGCGGTTC ATATGCTCAC CGGCGAGGCG CTCGATGGAA CAGGACACAC CGCTGCCGCC ATTGAGGAAT TCAAAGCCGC GGTCAACATC TCACCGCGCG AGCCTAATCT GCACTTTGGG CTCGGTTATC TTTTCTGGAA GTCGCACCAA TATGACGACG CCAAAGCCGA ATTTGAGAAA GAACTCGCGA TCGATTCGGA TCACGCTCTC GCCTTGGGCT ATCTTGGCGA TATCGCCATG AAACAGAATC GGCTGGAAGA AGCCTCAAAG TTCCTCCGCA AGGCGATCAG TGCCAAGCCT GATCTTCGAA TGGCATATGT AGATCTCGGT TCCGTGCTGA CCGAACAAAA GCAGTATGAG GAAGCCATGG AAGCTCTCAA GCACGCCATC AAACTCGATC CGAGCCAACC GGATGCACAC TTTAAACTCG GCCGAGTTCT GCAGCGACTC GGGCGGTCCG AGGAATCGCG CAAAGAGCTG GCCAAGGTGC GCGAACTTCA TGAGCAGGCC GACGCCCCGC TTGCCACACA ATTGCCGGAG TCTGCCGCCC CGCTGCCGAA ATGA
|
Protein sequence | MARGCFVVVS LFFLPYFVFA QAPDPSAAIA FEQQGKLQEA ERTWREVIKQ NPKDAGAFAS LGVVLSKEQK YQEAAAADRK ALTLDPSLPG VQLNLGLAEF KRNNFVAAIP PLKAAYAADP ASKQAVTLLG LSYYGSKQFA EASKYLGIAA KADPANIELH QVLAQSCLSA RNYDCALEEF RQISAQNPQS AAVHMLTGEA LDGTGHTAAA IEEFKAAVNI SPREPNLHFG LGYLFWKSHQ YDDAKAEFEK ELAIDSDHAL ALGYLGDIAM KQNRLEEASK FLRKAISAKP DLRMAYVDLG SVLTEQKQYE EAMEALKHAI KLDPSQPDAH FKLGRVLQRL GRSEESRKEL AKVRELHEQA DAPLATQLPE SAAPLPK
|
| |