Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3768 |
Symbol | |
ID | 4071052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 4452655 |
End bp | 4453554 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637985791 |
Product | TPR repeat-containing protein |
Protein accession | YP_592842 |
Protein GI | 94970794 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCACAT GGGGGTTCCT TCGGGAACAT GCCCTTTCGC CGCTGGTGTC GCGCCAGAGG AGGTGGCACG TGAATCGTCG AATCATGATC CAACTGTGTG CATTCTTATT GGGCCTGCCG CTCCTCGCGC AAACCCCGAC TCCAACTCCG CCTCCGCCGC AGCAAACGCC ACTGGAAGAC CCCGGACTTC AGAGACCCGC GCCACCAGTG ACCTTGCTGC CTCTCGATAC TTCAACCGCA GCGGAACTGG AAGCCCAGGG CGATCAACTC CGCGGACACA ATCTGTACCT CGATGCGGTA GATAGCTACA AGGCGGCGAT TCGCAAAGAA CCAACACCGT CGCTTTACAA CAAACTTGGC ATCGCACTGA TTCAACTCCG CCATCCGGAA GAATCCATCG AGACGCTGAA CCACGCGATC AAGATGCAGA AGGATTTTTC TGACGCCTGG AACAACCGTG GCGGCGCCTA CTACATGGAA GGAAATTTCA AGAAGGCCAG CAAGGACTTC GAGCACGCCA TAAAAATCAA TGCCGACAAC GCCTCGTACC ACAGCAATCT CGGTTCGGCG TACTTCAACC GGCACGACTA CATCAAGGCG TCCAAGGAAT ACACGATTGC TCTGAAGCTG GACCCCTACG TTTTCGAACG CTCTTCGAAG ATGGGGATCA CCGCTTCCAT GGGCAAGCCG AGTGACCGTG CCGAGTTCGA ATACATGATG GCAAAATTGT TCGCGCAAAC CGGTGACGCC GTGGATTGCC TGCTCCATCT GCGGAAGGCG ATGGAAGAGG GCTACAAGAA CATTAACAAC GTATACAAGG ACCAGGAATT CGCCGCGGTT CGCGCCGATC AGCGCTTCAA AGACCTCATG GCCCAGAAGC CCGAGGCGAT CCCGCAGTAG
|
Protein sequence | MATWGFLREH ALSPLVSRQR RWHVNRRIMI QLCAFLLGLP LLAQTPTPTP PPPQQTPLED PGLQRPAPPV TLLPLDTSTA AELEAQGDQL RGHNLYLDAV DSYKAAIRKE PTPSLYNKLG IALIQLRHPE ESIETLNHAI KMQKDFSDAW NNRGGAYYME GNFKKASKDF EHAIKINADN ASYHSNLGSA YFNRHDYIKA SKEYTIALKL DPYVFERSSK MGITASMGKP SDRAEFEYMM AKLFAQTGDA VDCLLHLRKA MEEGYKNINN VYKDQEFAAV RADQRFKDLM AQKPEAIPQ
|
| |