Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4194 |
Symbol | |
ID | 4072153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4968550 |
End bp | 4970097 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637986225 |
Product | TPR repeat-containing protein |
Protein accession | YP_593268 |
Protein GI | 94971220 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.233641 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0358885 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCGGG TGCCCCTGCT ACTCGTCCTG CTCTGCGCAC TCCCGCTCTT CGCCCAATCG CCAACACTCG AACAACAAGC TCTCGATGCC TGGCGCACCT ACGACTACCC CACCGCCGAA CGTCTTTACC GCCAGGCTCT CACCGCGAAT CCTACTTCCG GCGACGCCCA CGGTGGACTC GTCCGCACGC TCCTGAACGA AAAGAATCTC CAAGCGGCCC GCGATGCCGC GAATACCGCT CTCCGCGCCG CCCCCGACTC CGCAGCTGTC CAGGCCGCTT CCGGCGATGT CTTCTTCCGC GATGGACGGA TCGAAGATGC CGAAATCGCC TACCGGCGTT CCGCAAAACT CGACGACAAT TGCGCCCGCG CCTGGCACGG CCTCTCGCAG ATCGCGCAGA TCACCTCGAA CTACCGCAGC GCCCGCCGCG ACATCTTCAA AGCCCACGAA CTCGATCCCA AAGATCCTGA AATCTACGAA TCCTGGGCGA GTCGCTTGCC GCGACCGGAG CGCCGCAAGG CCGTCGAATA CCTCGTGGAC CACCACGGCC ACCTCGATCC GGACCGTCTC AACATCCTGC AATCGCGGCT CGCATGGCTT ATCGTCCTTG GCAACAAAAC CGCGTGGAAG CTCGTCAGCA CCAGGGAAAC CGCCAAGCTG AGGCTTGACC ACATCGTTTC TGCCGCGCAT CTTGGCGACG GCCGTTTCAC CACTCCGCCG CGCGAAGGCG CCGTCGCCAT ACATGTCCGC TTCAACGACA AGAAGACGGT CTCCCTTCTC CTCGACACCG GCGCCAGCGG CATCGTGCTG CGCAAGAGCG ACGCCGCCAA AGCCGAGATC AAACAGGTTT ACGACATCGC CACGAAAGGT ATCGGCGACG AAAAGCCCGC TAACGGATAT CTCGGCTGGG CTCACGACGT AAAGATCGGC CCCATCGAAT TTGAGAACGT TCCTGTCACG GTCCTCGACG CAAGGTTCCC CGAAGGCTCC GACGGATTGA TGGGCATCGA TGTCTTCGAG CACTTCCTCA TCACCCTCGA CATCAAGAAC TCCGAGCTAA GTCTCGCGCC TCTGCCCGAG ATTCCCGCCA ACCTTCGCGA CGAAGCCGGC GCCGCCGATC GTTATGTCGC GCCGGCAATG CAGTCGTTCG ACCGGGTCAT GCACCTGGGC GCGCACATCC TCGTCTCCAC CAGTGTCGAC CAGCAACCCG CCGGCCTCTT CTTCCTGGAT ACCGGCGCCT TCGACACCCA GATCGATCCA AATAACGTCT CCAAATCCAA ACTTCAGCCC GCCCCCGGCT TATCTGTGCG CGGCCTCTCC GGTAACGTTC GCGACGTTTA CGTCGCCAGT AACGTTCAGA TCCAGTTCGG CCGCTTCACC CAGGACAACT TCCGCATGGT CGCCATCAGC ATGGATAAAC TCAGCGAAGG CGAAGGCATC GCCCTCGGCG GCATCCTGGG CTTCCCACTC CTAAGCCAGT TCCGCCTGAC CATCGACTAC CGCGACGGCC TCGTCAACTT CGACTACCAC AACTCGAATA AACGCTGA
|
Protein sequence | MPRVPLLLVL LCALPLFAQS PTLEQQALDA WRTYDYPTAE RLYRQALTAN PTSGDAHGGL VRTLLNEKNL QAARDAANTA LRAAPDSAAV QAASGDVFFR DGRIEDAEIA YRRSAKLDDN CARAWHGLSQ IAQITSNYRS ARRDIFKAHE LDPKDPEIYE SWASRLPRPE RRKAVEYLVD HHGHLDPDRL NILQSRLAWL IVLGNKTAWK LVSTRETAKL RLDHIVSAAH LGDGRFTTPP REGAVAIHVR FNDKKTVSLL LDTGASGIVL RKSDAAKAEI KQVYDIATKG IGDEKPANGY LGWAHDVKIG PIEFENVPVT VLDARFPEGS DGLMGIDVFE HFLITLDIKN SELSLAPLPE IPANLRDEAG AADRYVAPAM QSFDRVMHLG AHILVSTSVD QQPAGLFFLD TGAFDTQIDP NNVSKSKLQP APGLSVRGLS GNVRDVYVAS NVQIQFGRFT QDNFRMVAIS MDKLSEGEGI ALGGILGFPL LSQFRLTIDY RDGLVNFDYH NSNKR
|
| |