Gene Information |
Locus tag | Acid345_0912 |
Symbol | |
ID | 4069123 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1146681 |
End bp | 1148927 |
Gene Length | 2247 bp |
Protein Length | 748 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637982919 |
Product | TPR repeat-containing protein |
Protein accession | YP_589989 |
Protein GI | 94967941 |
COG category | [R] General function prediction only; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats; [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
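
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; both assumptions are consistent with the numbers in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1146681
END_BP = 1148927


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(f"Gene length: {length_bp} bp, protein length: {length_aa} aa")
```

Under these assumptions the computed values agree with the Gene Length (2247 bp) and Protein Length (748 aa) fields.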
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.795031 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.00626987 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACTTA CATTCGAAAT TCGCGAATGG CTGTTCACGG GAGTGCTCGC CGGCATAGCC TTTGTTTCCG TTTCCTCGAT GTTTGCGCAG ACACCGCAGG CTGCCACGGT GCAAAAGGAC GCTAGTTCCC CCGAGGCTCA CTTCGATTCC GCGCAGACTT TCCAGATTGC TAGTGACCTC ACCCGCGCTG CGGCAGAGTA TCGCAAGGGG ATCTCGGTCG CACTCGAGCG CGTCGGGAAT TTGAAAGTTG CAAAAGGTGA GTTCACCTCG GCTCTCGATT TGCTTCGGAA GGCAGTGGCC ACCGACCCAG AAAACACGGA CGGCCAAATT GATCTCAGCA TCGCCTACTT CCGTGCCGGC AATTACGAGG GCGCGAGGAC CGTGCTGCTA CCTCTTGTGA AGAGCGACCC GGGCAGTGCA CGCGGTCGCA ACCTGATGGG CAAGATCCTT TTCATGCAAG GAGATTTTGA GGGTGCCTCA ACCGAGTTGC AGGCAGCTCT TTCCATCACA CCTGACTTCG ATGTCGCCTA CAGTCTCGCA CTCGCCTATC TTCAATTGAA GAAGTTGCCT CAAGTCACTC CGCTCGTCGA CGAGATGAAA GCCTCTATGG CGAAGTCCCC GGAGCTTTAT ATGCTGCTCG GACAAGCCTA CCGGCAGACT GGCTATTACG ACCAGGCGGT GAGCGAGTTC AAGACTGCGC TTGCGCTCGA TGCCGCGCGT CCGCGCCTGC ATAACCTACT TGGAACGACT TATGTGGCGT TGGGGGGCAA GCAAAATTAC GAACTCGCGC GCGCCGAATT CCAGCAGGAG CTTGCAAAGA ATCCGAAGGA CTATTCGAGC CACTATTATC TTGGCCTGAT CGAGTTGGAA GACGGGCAGT ATGCGAAGGC CGAAGCCGCG CTCAAAACCG CGCATGCCCT CGCGCCCGAC GACCCCGCGG CGATGCTCTT GCTCGGACGC CTCTACGACC AGCAGAAGAA CTGGAATGCC GCGATCGAAG TGTTGCGTCA GGTGCTCGCG CGTTCCTCAG CCCAAGGTGC ATCACCCGTG CAACTCGCGA CTACGCACGA AATGCTGTCG AAGGCGTACG CAGGCGCGGG GCAGATTCCC GAAAGCGAAA AGGAACTCGC CGCCGCGAAT GCGCTCAAAA GTCAAGACGC AAGCAAGAGC GCAACTAGTG ATCCGGCCGT GGCAATCCAA CCGGAAAATA GTGGCAAAGA GCTTCGCGCA ATGCTGATGC AGGGATCGCC CAAGGCTCAG CCATCTGACG CGTCTGAACA GAAGTACGTC GCCGATATCT CGAAACTCCT CGGAAATGCC TATAACAATC TCGGGGTCAT CGACGCGCGT GCCGGCTCCT ACAAGCAGGC CGCCGACGAA TTCAAAGAAG CTGCCAAATG GGACGATTCC ATTCCTCAAC TGGACCGTAA TAGGGGCCTT GCTGCGTTCC GGGCCCAAGC ATACGCCGAT GCGATTCCGC CTCTTGAACG GCTGTTAAAA AAATCTCCAT CTGACTCCAA TCTGCGCGAG TCTCTCGGTC TCAGCTACTA CATGACCGAC AAATTCAAGG AGAGCGCGGC GACGCTTCGA CCGATTGTGG ACACAATGTC GAACAATCCC GGCCTCCTGC TCTCCGCTGG CGTCGCGTTC GTAAAATCCG GAGATATCCC GACCGGTCAA CGCTTGTTCA CTCGCGCCTT TGAGGTCGGC AAGGCGACGC CAGAAATTCA CTTGATTATC GGTCAGGCAT ACGCGGAACA GTCCGATAAC GACGAAGCCC TCGCCGAATT CAAACAGGCT 
CTTGAACTCA ACCCGAAGTT GCCGGACGCT CACTTCTACA TCGGAATGGT GCGATTTAAG CGTGGTGAAT TCGACGACGC TGCCAAAGAA TTTCAGCAGG AGCTCGAGGT CAATCCTCAG AGCGTTCAAG CGATGTACCA GTTGGCATAC ATCCGAATGC AGCAACACCA GGCGCCCGAA GCCTCAAGTC TGCTTTCGGA AGTGATCAAG CAACAGCCGA ACAATTCAGA TGCCCACTAT CAGCTCGGGA AAGCATTGTT GGAACAAGGT GATGCAGGCG GTGCAACGCG GGAACTTGAA ACCTCGGTGA AGCTACATCC GACTGACTAT GCGTATTTCC AATTGAGTCA CGCGTACGCG CGAACAGGTC GCGAGGCGGA TTCCAAGCAA GCGCTCGAGG AATTCGAAAA GCTGAAGCCT AAACCGAAAA CACCGATGGG TCCCTGA
|
Protein sequence | MKLTFEIREW LFTGVLAGIA FVSVSSMFAQ TPQAATVQKD ASSPEAHFDS AQTFQIASDL TRAAAEYRKG ISVALERVGN LKVAKGEFTS ALDLLRKAVA TDPENTDGQI DLSIAYFRAG NYEGARTVLL PLVKSDPGSA RGRNLMGKIL FMQGDFEGAS TELQAALSIT PDFDVAYSLA LAYLQLKKLP QVTPLVDEMK ASMAKSPELY MLLGQAYRQT GYYDQAVSEF KTALALDAAR PRLHNLLGTT YVALGGKQNY ELARAEFQQE LAKNPKDYSS HYYLGLIELE DGQYAKAEAA LKTAHALAPD DPAAMLLLGR LYDQQKNWNA AIEVLRQVLA RSSAQGASPV QLATTHEMLS KAYAGAGQIP ESEKELAAAN ALKSQDASKS ATSDPAVAIQ PENSGKELRA MLMQGSPKAQ PSDASEQKYV ADISKLLGNA YNNLGVIDAR AGSYKQAADE FKEAAKWDDS IPQLDRNRGL AAFRAQAYAD AIPPLERLLK KSPSDSNLRE SLGLSYYMTD KFKESAATLR PIVDTMSNNP GLLLSAGVAF VKSGDIPTGQ RLFTRAFEVG KATPEIHLII GQAYAEQSDN DEALAEFKQA LELNPKLPDA HFYIGMVRFK RGEFDDAAKE FQQELEVNPQ SVQAMYQLAY IRMQQHQAPE ASSLLSEVIK QQPNNSDAHY QLGKALLEQG DAGGATRELE TSVKLHPTDY AYFQLSHAYA RTGREADSKQ ALEEFEKLKP KPKTPMGP
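
The sequences above are printed in space-separated blocks of ten characters. As a rough illustration of how the gene and protein sequences relate, the sketch below strips the block spacing and translates a few codons with the standard codon assignments, which translation table 11 shares for elongation. The helper names are hypothetical, and only the first 30 and last 12 bases of the gene are used, not the full sequence.

```python
# Codon table built from the usual TCAG ordering of the three codon positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def strip_blocks(text: str) -> str:
    """Remove the spacing and line breaks between the 10-character blocks."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon ('*' marks a stop)."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First three blocks of the gene sequence in this record.
head = strip_blocks("ATGAAACTTA CATTCGAAAT TCGCGAATGG")
# Final 12 bases of the gene sequence, with the block spacing removed.
tail = strip_blocks("ATGGG TCCCTGA")

print(translate(head))  # first ten residues
print(translate(tail))  # last residues followed by the stop codon
```

The translated head matches the first block of the protein sequence (MKLTFEIREW), and the tail ends in a TGA stop codon after the final residues MGP. The sketch does not handle alternative start codons, which table 11 permits; that is not an issue here because the gene begins with ATG.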