Gene Information |
Locus tag | Acid345_2021 |
Symbol | |
ID | 4070351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2419085 |
End bp | 2420365 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637984035 |
Product | TPR repeat-containing protein |
Protein accession | YP_591096 |
Protein GI | 94969048 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
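The length fields above can be cross-checked against the coordinates with a little arithmetic. The Python sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2419085, 2420365)
print(length)                  # 1281
print(protein_length(length))  # 426
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of this record.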
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCGTG CGTGGCTGGT GGCAATTCTG GCGCTTGGCT GTGCATCGGC GCAAACGCGT CAGCCTGCGA AGGTGGCTGC AGCGTCTCCG GTGGACCAGG CCGAAACGGC GATCGGCAAG CAGGATTGGG CGGCGGCCGA GACACTTCTG AAGGACGCCA CCTCGCAGGA GCCCAAGGAC TACCGGGCGT GGTTCGACCT CGGCTACGTT TACACCTCCC AAGACAAGAC CCGAGAGGCA GCCGAGGCAT ACCGGTATTC AGTCGACGCC AAGCCGGACA TCTTTGAGAG CAATCTGAAT CTCGGTATCT CACTTGCGAA ACTTGGCAAT CCAGACGCGG CGAAATACCT GGCCGTGGCT ACGACGCTGA AACCCACAAG CCATCCGGAA GAGGGATATT TCCGGGCATG GCTGTCGCTG GGGCACGTTC TGAGCAAGGA ATCCCCGCAG CGGGCGGCCG AGGCCTATCA GCAGGCCGCG AAGTTCAAGT CGAAAGATCC GGAGCCGCAT CTGAGCGCGG CGCAGATGTA TGAAATAGCG AAGGACACGG CGGGTGCGGA GCGCGAGTAT CAGGTAGTCC TGGCGCTGGA TCCTGGCTCA AAAGAGGCAA TCACCGGGCT GGCGAACATC TACCTGAACG CGAAACGGCT ACCAGAATCC GAGACCATGC TGCGGAAGAT TCTGGCGGGC GATCCAACGA ACAGCAACGC ACAGCTGCAG TTGGCGCGGG TTCTGGCAGC TGAGAACAAG GACGATGACG CGACGGCCGC GTACGACGCC GCTCTCAAGC TGCTTCCGAA TGACGGGGAA GCCCAGAAGT CGGCAGCGGA TTTCTATCTT GCGGCCAAGA AGTATAAAGA GGCGGCGGCG GCCTACGCAC AATTGGTGCA GGCGAAGCCG AATGACGCCG CCCTGCGTGA GTTGTACGGG AATGCCCTAC TGCGGCTTCA TAAGAACGCG GAGGCGCAGG AGCAGGCGCT GATTGCTATC AAGCTGAATC CGAACATGGG AGAGGCGTAC AACGACCTGG CGTTTGCTGC CGCCGAGAAC AAAGATTATG CGCTGTCGCT CAAGGCGTTG GACGCGCGGG CAAAGTTCTA TCCGGAAAAT CAGGGTACCT ACTTCCTTCG TGCGACGAAT TACGATAATC TCCGCTTAGT AAAAGACGCA ATCGCGGCCT ATAAGAAGTT CCTGGCGGTA TCGGATGGCA AATTTCCCGA CCAGGAATGG CAGGCGCGGC ACCGTCTTAT CGCAATAGAC CCTGAATCGA GAAAGAAATG A
|
Protein sequence | MRRAWLVAIL ALGCASAQTR QPAKVAAASP VDQAETAIGK QDWAAAETLL KDATSQEPKD YRAWFDLGYV YTSQDKTREA AEAYRYSVDA KPDIFESNLN LGISLAKLGN PDAAKYLAVA TTLKPTSHPE EGYFRAWLSL GHVLSKESPQ RAAEAYQQAA KFKSKDPEPH LSAAQMYEIA KDTAGAEREY QVVLALDPGS KEAITGLANI YLNAKRLPES ETMLRKILAG DPTNSNAQLQ LARVLAAENK DDDATAAYDA ALKLLPNDGE AQKSAADFYL AAKKYKEAAA AYAQLVQAKP NDAALRELYG NALLRLHKNA EAQEQALIAI KLNPNMGEAY NDLAFAAAEN KDYALSLKAL DARAKFYPEN QGTYFLRATN YDNLRLVKDA IAAYKKFLAV SDGKFPDQEW QARHRLIAID PESRKK
|
| |
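The sequence fields above are printed in space-separated groups of ten residues, so they need to be joined before any analysis. The Python sketch below is a minimal illustration, not the database's own tooling. It assumes that the gene sequence is already given in coding orientation (it begins with ATG even though the strand is "-"), and that the elongation codon assignments of translation table 11 match the standard genetic code. All names in it are hypothetical helpers.

```python
# Standard codon assignments, enumerated in T, C, A, G order.
# Assumption: translation table 11 uses these same assignments for elongation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(field: str) -> str:
    """Join the space-separated groups into one uppercase string."""
    return "".join(field.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases; 0.0 for an empty sequence."""
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    A trailing partial codon is ignored.
    """
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 18 bases of the gene sequence field, as printed in the record.
fragment = clean_sequence("ATGCGTCGTG CGTGGCTG")
print(translate(fragment))  # MRRAWL
```

The translated fragment matches the first six residues of the protein sequence field. Applying `gc_fraction` to the full cleaned gene sequence would give a value to compare with the GC content field.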