Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2528 |
Symbol | |
ID | 4072172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2984787 |
End bp | 2986394 |
Gene Length | 1608 bp |
Protein Length | 535 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637984545 |
Product | TPR repeat-containing protein |
Protein accession | YP_591603 |
Protein GI | 94969555 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.652305 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAATTTTC GCCGCGGCCT CATTCTCGTT TTCTGCTGTT TGCTGCTGAG CCTTACTGCT GTTGCCCAGG CACACGCGGG GGCACAAACG CTCCTCGTCC TCCCCTTCGA CAACGCATCC CGCGCCCCAG GCCTCGAATG GATCAGCGAG TCCTTCCCTG AGTTGCTCGG TCAGCGCATG GCCTCCCCCT CAACCTATGT CATTAGCCGT GACGAACGCC TGCTCGCCTT CGACCGCTTC GGCATCCCTC AAACCCTTCA TCCTTCCCTC GCCACGCTCT ATCGCATGGC CGAGCAAATG GACGCCGACT ACGTCGTTAT CGGCCACTAC ACGTTCGACG GCAACACTTT CACCGCAAAT GCGCAGCTTC TCGACATGAA GTCACTGAAG TTGGAGCCAT CCGTCACCGC TAGCGGCCCG CTCACCACGC TGATGAATAT CCAGAGCACC CTCGCATGGG ACCTGATCGG CGAAATGCAA AGGCAGCCGA CTGGCTCGAA GGACGAGTTC CTCCGCGCCT CCTCCGGCAT TCGACTCGAC GCCTTCGAAA ACTACGTTCG TGGCATCACC GCCGGCACCC GCCAGGAGAA GATCAATCGC CTGCGCGAGG CCAACCGCCT CAGCCCGAAT TACACCCGCG CCACGCTCGC GCTCGGCAAG GCTTATCTCG ACAATCGCGA CTACGATCAG GCCGTCAACT GGCTCTCGCG AATTCCAAAA AACGATCCGC TCGCAAATGA AGCCAGTTTC GATATCGGCA TCGCGGCCTT TTATCGTGGC GACTTCGAAC GCTCCGCTGA GGCCTTCAAT TTCCTGCTGA CCCGTTTGCC CATGCCCGCG ATCTACAACA ACCTTGGTGT CATCGCCGCC CGGCGCGGCC GGAAAACCGA AGCAGACCTG CTGCAGAAAG CTGTCGCTGC CGATCCCACC GACGCCGACT ACCGTTTCAA CCTCTCTGTT GCGCTCGCCC GTGGCGGCGA CAACGCCGGC GCAGTGCGAC AACTTCGCGA CGCCCAAAAG ACCCATCCCG ACGACGCAGA GATCAAGTCT CTTCTGGATC AGCTTCAAGG CGCGGCAGTC TCCAACGTTT CGCACACCCA GGCAGCGCAG CTCAAACTCC CTATGGAGCG CCTGAAACGC ACCTACGACG AAACTTCGTA TCAGCAAGTG GCCATGGAAA TCGAGAACGT CGCTGAGCAG CGCCTCGCGC AGGCCGATCC CAAGACTCAC GCTGCCTACC ATCTCGAACG CGGACGCGAT CTGTTGAACC AGGGCTTCGC CGCGCAAGCC GAAAAACAAT TCCGCGAGGC GTTGCAATAC GATCCAAACA ATGCCGTTGG TCATGCAGGC TTGGCACGCT CGCTTGAATC GAGCGACCCG GCGGCCTCCG CGCGCGAAGC CGACGCATCC CTCAAACTCC AGTCGAATGT CGACGCCTAC CTAGTCGTGG CACGCTTGGC CGTCGCCCGC AAAGACACCC GCAAAGCGAA TGACGCTGTC GATTCGGCCT TGAAACTTGA GCCTGCGAAT TCCGCCGCAC TGGCACTCAA ACGAAGCATA GAATCAAAGG CGGGTCAGGC CCCGCCATCC GGCAGCACGC CGGAGTAG
|
Protein sequence | MNFRRGLILV FCCLLLSLTA VAQAHAGAQT LLVLPFDNAS RAPGLEWISE SFPELLGQRM ASPSTYVISR DERLLAFDRF GIPQTLHPSL ATLYRMAEQM DADYVVIGHY TFDGNTFTAN AQLLDMKSLK LEPSVTASGP LTTLMNIQST LAWDLIGEMQ RQPTGSKDEF LRASSGIRLD AFENYVRGIT AGTRQEKINR LREANRLSPN YTRATLALGK AYLDNRDYDQ AVNWLSRIPK NDPLANEASF DIGIAAFYRG DFERSAEAFN FLLTRLPMPA IYNNLGVIAA RRGRKTEADL LQKAVAADPT DADYRFNLSV ALARGGDNAG AVRQLRDAQK THPDDAEIKS LLDQLQGAAV SNVSHTQAAQ LKLPMERLKR TYDETSYQQV AMEIENVAEQ RLAQADPKTH AAYHLERGRD LLNQGFAAQA EKQFREALQY DPNNAVGHAG LARSLESSDP AASAREADAS LKLQSNVDAY LVVARLAVAR KDTRKANDAV DSALKLEPAN SAALALKRSI ESKAGQAPPS GSTPE
|
| |