Gene Information |
Locus tag | Acid345_1682 |
Symbol | |
ID | 4069350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2034560 |
End bp | 2037943 |
Gene Length | 3384 bp |
Protein Length | 1127 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637983690 |
Product | TPR repeat-containing protein |
Protein accession | YP_590757 |
Protein GI | 94968709 |
COG category | [R] General function prediction only |
COG ID | [COG0457] FOG: TPR repeat |
TIGRFAM ID | |
| |
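The coordinate and length fields in the gene information above are related by simple arithmetic. The following is a minimal sketch of that check. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length. Both assumptions are not stated in the record, although they reproduce the listed values. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 2034560
END_BP = 2037943
GENE_LENGTH_BP = 3384
PROTEIN_LENGTH_AA = 1127


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


computed_gene_len = gene_length(START_BP, END_BP)
computed_protein_len = protein_length(computed_gene_len)

# Compare the computed values with the values listed in the record.
print(computed_gene_len == GENE_LENGTH_BP)        # True
print(computed_protein_len == PROTEIN_LENGTH_AA)  # True
```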
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.592663 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCGAAAGC CCATCGGCAT TTTTGCTGTT CTTGCGCTTA GTTGTTCGTT TGCTATTGCT GCCGACCACA AACCCGATCC TGCCGAAGCG GCGCGATTGA ACAATATCGG CGTGGCGCTG ATGAACCAGC AGCGCATGGA GAAAGCTGTC GAGAAGTTCG ATCTCGCATT GGAGAAGGAC CCGAAGCTGT CGGTGGCATA TCTCGACAAA GGCATCGCGC TGCTGAATTT GCAGAAGCTG CCGGAGTCCG AGGCCGCGCT GAACAAGGCG GGCGAGGCGA TGCCGAAGAA CCCGCGCGTC TGGTACAACC TCGGGTTGCT GAATCGTGGG GCAGGGAAGT ACGACGCTGC GATCGAGAAC TTCAATAGAG TTACGACGAT TGATCCCAAC GACTCCGACA CCTTCTACAT GATCGGCTCG TTGTACCTGC AATTGCAGAA GTACGAGGAT GCAATCGGCG CTTATAAGAG TGCTCTGAAG ATCAATCCGC TGCATGCGTC CGCTGAATTT GGGCTTGCGA AGGCCTTGCA GCGCGCAGGA AAAGTAGAAG AGGCGCGCGA TCACCTGCAC ATCTTCGAGC ACCTGACCAA AGACAAGATC TCCTCGCCGA TGACGTTGAT TTACGGCGAG CAGGGGCGTT ATTCGCTCGC GGAGGATGTG CATACTGGTG CGCCGGAAGT GGGCGCGATG ATTCCGGTGA CGTTTGAAGC GCGGCCGTTG CAAGGCGGGG CGCAAGCGGT TGCCGCGACG GCGGCCGACA CTGCCGGTCT GTGCATGATG GACGTGAATG GCGACGGAAA ATTTGGCATA GTGGCGCTCG GCTCCGGTTC AAGTGCGATT CGCGTTTTCC TGAATGACGG TAGCGGCAAG TTCAAAGAGG CGTCAGCCGC AGAGCATGGG TTGAAAGCTG AAGGGACTGC GATCTCCTGC GCGGTTGGCG ATTTCGACAA CGACGGTCAT CCGGATCTGG CGGTCGCGTT TACTGATCAA CTGCTGCTCT TCCGCAACCT AGGCAATGGC AAGTTCGAGA ATGTTACGAA GGCTGCGGGA ATTTCGGCGT TGAACCATCC GGCTGGAATG TCGTGGGTGG ATTACGACCA CGATGGCGAT CTCGATCTGT TCGTAACGGG TAGTGCAGTC TCCGCGGGAA CCAACGTGCT CTGGCGCAAT AACGGTAACG GTACGTTTAC GGAAGTTGCT GCCGAACGTG GCCTGCAGGG CATTGGCAGC ACAAAGTCCG TTGTGCTTAC CGATCTCAAT AATGATCGTG CTGTGGATCT GCTCATCACT GGCGACACGG GTGCAACGGC GTATATCAAC CCGCGCGAAG GCAAGTTCCA GACGTCCGCT CTCTATGAAG AGAAACTGCC GCCCGCGACC GGCGCGTATG TATTCGACTT CAATAAAGAT GGGTGGATGG ATGTCGTGCT TACGCACGAC GGCACCCCGG GAATTTCGCT CTGGAAGAAT TTGGATGGCA AGCACTTTGA GCGTGTAGCG CTGCCAATTT CGGACGCGCA AGCTGCGTGG GGCGTGACCG CAATTGACGT GGACAACGAT GGTTGGCTCG ACCTCGCTGC AGTCGTGCAG ACGGCGAAGG GGCCCGCGGT TCGGATCTTC CGAAATACCG GATCAGCGGG GTTTGTCGAT GTGTCGAAGG CGATTGGTCT CGACAAACTT CAGCTGCAGA ATCCGCGCGG AGTTGTTGCC GCCGATGTGG ACAGTGACGG TGCAGCCGAT CTGATCGTTT CCCAAGGGAA TGCAGCACCG GTTGTGCTGC ACAACCACGG TGGGAGTGCG 
AATCATTCTG TGCGAATTAC CCTCGCTGGT CTCGCCGACA ACAAGAGCGC GTTGGGAACG AAGGTCGAAG TCTTTGCCGA CGGTCTTTGG CAAAAATGGG AGATCGTGGG CGGTTCAGGC TACATGTCGC AAGGGCCGAA CGAGATTCTG GCTGGCATTG GCAAAAACAG CGCGGTCGAC ATCGTGCGAA TGCTCTGGCC GGGCGGTGTG GTGCAGGACG AGACAGACAT CGCGATGGAT AAGCCGGTCC ACTTCCTTGA GATCGATCGT CGCGGGAGTT CGTGTCCGAC GCTGTTTGCG TGGAACGGCG AGAAGTATGA GTTTGTCTCC GACGTGATCG GTGCGGCAGT CATCGGCCAC TGGATTTCGC CGACGGAGAA AAACCTCGCT GATCCCGACG AATGGGTGAA GGTGGAAGGT TCGCAGTTGC GCGCGCGCAA CGGCAAGCTG AGCTTGCGCT TCGGCGAACC GATGGAAGAA GTGAACTTCG TTGACCAGGT GCGGCTCGTG GCCGTCGATC ATCCCGCAAA TGCTGATGTT TATCCCGACG AGCGCTTCCT GAGCGCGCCG CCGTTCGCGA GTGGCAAGGT CTTTGTGACT GGTAGGCCAC ATCCGCCTGT GGGGGCGTGG GATGACGCGG GGAACGATGT GCTCGATCTC GTGCGCGAGA ACGATCATCA GTACGTTCGC GACTTCCGCA ATCTTACGTA CGCTGGTTAT GCCAAGCAGC ACGCATTAAC GCTTGATCTC GGTGAATGGA GTCCGAACGC GCCGTTGCGG CTGTTCCTGC AAGGCTTTAT CGAGTACTTC ACCGCAAATT CGATGTACGC GGCTTGGCAG GCGGGAATCA ACCCGGTTGC GCCTTATATT GAGGCGCAGA TGCCGGATGG TTCATGGAAG CGAGTTGTGG ATGACATGGG TTTCCCGGCT GGATTGACGC GCATGATCAC TGTAGACCTG ACCGGCAAGT TGCCGGCGAA CACGCGCAAG ATTCGTATCG TGACCAATCT TCAGATTTAT TGGGACCAGG TGCTGGTGGA CAACGCGGCT CCGGCGGCGA AGACCCGCGT AACCGAATTG CCGCTGTTGT CGTCGGACCT CCAGTTCCGC GGCTATCCAC AGCAGGTCGA CGGCGAAACT CCGGGTGATC TGACTTACAT CTACGAAAAG GCCAGTAAGA CCGGGCCCTT CACCCGTGAG CGCGGGAACT ACACGCATTA CGGCGACGTG ACCGAACTGC TGAAGCAAGT GGACGACCAT TACGTGATCT TTGGCAGCGG GGAAGATATG GACCTTGAGT TCGATCCCGC CGCCTTGCCT AAGCTGCCTG CAGGATGGAA GCGCGACTAT TTCTTCTACG CGAATGGCTT CGTGAAGGAC ATGGACTTCT ACGAGGCGAC GCCATTCACG GTGGCAGACT TGCCATTCCA CAGGATGTCG GCATATCCGT ATCCGGTGGG CGAGCATTAT CCGGATGATC TTGACTCGGT GCGTTACCGG CTGGAATGGG ACGATCGTTT TGACTCTGGC ACAAACGGAG CTGGGAACCA CTTTGGCTTC GATTATGAAA ATCGCCGTCA ATAG
|
Protein sequence | MRKPIGIFAV LALSCSFAIA ADHKPDPAEA ARLNNIGVAL MNQQRMEKAV EKFDLALEKD PKLSVAYLDK GIALLNLQKL PESEAALNKA GEAMPKNPRV WYNLGLLNRG AGKYDAAIEN FNRVTTIDPN DSDTFYMIGS LYLQLQKYED AIGAYKSALK INPLHASAEF GLAKALQRAG KVEEARDHLH IFEHLTKDKI SSPMTLIYGE QGRYSLAEDV HTGAPEVGAM IPVTFEARPL QGGAQAVAAT AADTAGLCMM DVNGDGKFGI VALGSGSSAI RVFLNDGSGK FKEASAAEHG LKAEGTAISC AVGDFDNDGH PDLAVAFTDQ LLLFRNLGNG KFENVTKAAG ISALNHPAGM SWVDYDHDGD LDLFVTGSAV SAGTNVLWRN NGNGTFTEVA AERGLQGIGS TKSVVLTDLN NDRAVDLLIT GDTGATAYIN PREGKFQTSA LYEEKLPPAT GAYVFDFNKD GWMDVVLTHD GTPGISLWKN LDGKHFERVA LPISDAQAAW GVTAIDVDND GWLDLAAVVQ TAKGPAVRIF RNTGSAGFVD VSKAIGLDKL QLQNPRGVVA ADVDSDGAAD LIVSQGNAAP VVLHNHGGSA NHSVRITLAG LADNKSALGT KVEVFADGLW QKWEIVGGSG YMSQGPNEIL AGIGKNSAVD IVRMLWPGGV VQDETDIAMD KPVHFLEIDR RGSSCPTLFA WNGEKYEFVS DVIGAAVIGH WISPTEKNLA DPDEWVKVEG SQLRARNGKL SLRFGEPMEE VNFVDQVRLV AVDHPANADV YPDERFLSAP PFASGKVFVT GRPHPPVGAW DDAGNDVLDL VRENDHQYVR DFRNLTYAGY AKQHALTLDL GEWSPNAPLR LFLQGFIEYF TANSMYAAWQ AGINPVAPYI EAQMPDGSWK RVVDDMGFPA GLTRMITVDL TGKLPANTRK IRIVTNLQIY WDQVLVDNAA PAAKTRVTEL PLLSSDLQFR GYPQQVDGET PGDLTYIYEK ASKTGPFTRE RGNYTHYGDV TELLKQVDDH YVIFGSGEDM DLEFDPAALP KLPAGWKRDY FFYANGFVKD MDFYEATPFT VADLPFHRMS AYPYPVGEHY PDDLDSVRYR LEWDDRFDSG TNGAGNHFGF DYENRRQ
|
| |
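The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with translation table 11, where GTG can serve as an initiation codon. The sketch below illustrates this on the first 30 bases of the gene sequence, and also shows one way a GC percentage could be computed. The helper names `translate_cds` and `gc_percent` are hypothetical, the start codon set is only a subset of those listed for table 11, and the record does not say how its own GC content figure was derived.

```python
from itertools import product

# Codon-to-residue mapping in NCBI order (bases T, C, A, G; third base varies fastest).
# Table 11 uses the same residue assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the initiation codons permitted by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a recognised first codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            residues.append("M")
            continue
        aa = CODON_TABLE[codon]  # raises KeyError for ambiguous bases such as N
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = "".join(dna.split()).upper()
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence in the record.
prefix = "GTGCGAAAGC CCATCGGCAT TTTTGCTGTT"
print(translate_cds(prefix))  # MRKPIGIFAV
```

Passing the full gene sequence from the record to `translate_cds` and comparing the result with the listed protein sequence would be a natural consistency check.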