Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0424 |
Symbol | |
ID | 4069650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 494523 |
End bp | 495464 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637982428 |
Product | TPR repeat-containing protein |
Protein accession | YP_589503 |
Protein GI | 94967455 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.446862 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.0100173 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGAACGC TCATCGGCTT CTGCTGTCTT CTGATTTCTG GTGTCAATGT GCACGCGCAA GCCGCATCGA ATAGCCCCGC CAATTCGCCT GCATCGCCCA AGGCGGCTCT TCGTCTGGCG CAGGAAGGGC ATTGTAAGGA GGCGTTACCG GCGCTGAAAA AGGGCCTCGC TAGCGCGGCA AAAGACGATC ATCGGGATCT CGCGATGGCT GGCGTGCGCT GCGCGATGTT CATGAACCAG CCAGAGAGTG CGCTGGAATT TCTACGTGTG CTTGAACGCG AGTTCCCGAG CGATCCCGAC ACGCTTTACC TTCTTGTTCA TACCTACTCC GATCTCTCAA CCCATGCCGC AGCAGAGTTG GCGACAAAGC ATGGCAACAC ATACCAGGCT CGCGAATTGA ACGCAGAAGC GCTGGAGTCA CAAGGCAAGT GGCAGGAGGC GGAGAAAGAG TACAAGACGA TTCTCGAGGC GAACCCTAAA GCCGTTGGAA TCCACTTTCG CTTAGGCCGA TTGCTTCTCT CCGCGCCGAA TCCGCCAGCC GATATGGCGG AGCAAGCAAA GAGAGAATTC GAGGCAGAAC TCGCGGTCGA TCCCACAAAT GCAGGCGCGG AATACGTGCT TGGCGAGTTG GCGAAGACAG CCAATTCCTA TGACGACGCA ATCCAACACT TCTCCAAAGC CACCAAACTC GACCCCTCTT TCGCAGCCGC GTATCTCGGT ATTGGAACGA GCTTAGTCGC GCAGAAAAAA TTTGCTGAAG CTGTGACGCC GCTCGAAACA GCGGTGAAGC TACAGCCCGC TAATCCTGCT GGCCACTACA ATCTCGCAAC CGCCTATAGT CGCACCGGAC GCAAAGCAGA TGCCGACCGC GAGTTCGCGA TCCATAACGA GATGATGCAA CGCAGCGGCG GCGCTTCGGC GCCCGTCCAG CAGCCGCAGT AA
|
Protein sequence | MRTLIGFCCL LISGVNVHAQ AASNSPANSP ASPKAALRLA QEGHCKEALP ALKKGLASAA KDDHRDLAMA GVRCAMFMNQ PESALEFLRV LEREFPSDPD TLYLLVHTYS DLSTHAAAEL ATKHGNTYQA RELNAEALES QGKWQEAEKE YKTILEANPK AVGIHFRLGR LLLSAPNPPA DMAEQAKREF EAELAVDPTN AGAEYVLGEL AKTANSYDDA IQHFSKATKL DPSFAAAYLG IGTSLVAQKK FAEAVTPLET AVKLQPANPA GHYNLATAYS RTGRKADADR EFAIHNEMMQ RSGGASAPVQ QPQ
|
| |