Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2555 |
Symbol | |
ID | 4072199 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3016823 |
End bp | 3019009 |
Gene Length | 2187 bp |
Protein Length | 728 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637984572 |
Product | TPR repeat-containing protein |
Protein accession | YP_591630 |
Protein GI | 94969582 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGCGA CTGCGACGAA AACGGCTTCC AATGCGGGAA AAATCTGGAT CCAAAGCCCT GCGATTGACT TGATCGTAGG ATGCGGAGCG TGGTCGGCGC CACTGTTGTT GGTGTCGTAC CTGGCGCTCT CGTCTCATGT GGCGACTTGG GCCATCGTAT TTTATGCGCT CGCCCTCTTC TTTAACTACC CGCATTACAT GGCGACAATC TACCGTGCCT ACCATACGGC CGAAGATTTC CAGAAGTATC GGATCTTCAC CGTGCACATC ACGGGTTTGC TACTCCTTAC GGCGCTGCTC TCGCACTTTT ACGTGCGGAT GCTGCCATGG ATTTTCACCC TCTACCTGAC CGCCAGTCCC TGGCATTACA GCGGGCAAAA TTACGGTTTG TTCATGATGT TTGCGCGCCG GGCCGGTGCT TCGCCGAGCG GACGCGAGCG GTGGTCGCTT TACTCGGTTT TCCTGCTGAC ATATGCAATT CTCTTCCTGA ATCTGCACGC TGGCCCTTCC ACTGATCAGC TTTTCGTTTC ACTGAACCTG CCAATTCTGT TTGTGCGCTG GGCGACGATC GCGCTGACCT TTGTCTGCAT TGGGCTCTCG GCGTATGGCC TGAACGCATT GCGTCGGCAA ATTGGCCCCA AAGCCATGCT GCCTTCGGTC ACGTTGCTCT CCACACAGTT CCTCTGGTTC CTCATTCCCA GTTTGCTCGC AGCTTCGGGA CGCTTGGTGC TCCCGCAAAG CCGATACAGT ACTGGCGTGC TCGCGGTGAT GCACTCCGCT CAATATTTGT GGATCACCAG CTACTACGCC AAGCGTGAAG CCACCACCGC GACGGCAAAG TGGCGGCCTG TCGCGTACTT TGCTGTGCTG ATCGCTGGCG GCATCGCATT GTTCATTCCC GGACCATGGT TGGCCAGCTA CGCCTTCCAC TTCGACTTCA CCAGCAGCTT CTTGATCTTC ACAGCGTTAG TAAACCTCCA TCACTTCATC CTCGATGGCG CTATCTGGAA GTTGCGCGAC GGGCGAATCG CTGGATTGCT CCTGAACGCG CCCGCGAAGT TCGCGAAAGC TGCCGGCGAT ACCAGCGGAG CTCTCCTGCG TTTCAGCCGC TGGACGATTG GCCCAAGTAC CGGCGCGCTC GCTCTGCGCG TCGGAATGAT CTGCGGCTTG GTTGGGCTCG CGGTCCTCGA CCAGACACGC TTCGTGCTTG GAATGAGCGT CGAACATCCG ACGCGCCTGC AACGCGCAGC GTCGCTAAAC CCGTACGACG CATCCGTTCA GTTGCGCATC GCGAAGGACG CGGCTGGTGC GGGTGACAAG GATGCCGCCT TGCTGGCGTT CAAGAATGCA CAGAGCGCTC GCCCGAACGA CACCTCCATC CGCGACCAGT ATTTGCGCTT TCTCCTCGCG CAGCAGATGT ATCCCGAAGC GTTTCAACTC ACGACAGAAT GGCTCGCGCG AACTCCGAAT GACGCGGACT TGCTCGTGAA TCGGGGAATC CTCGCGAACT ATGTAGGCAA TTCCGAAGCA GCCCATGCGA GTTGGGAGCA GGCGCTGAAG GTGGATCCTC GCCAGTGGAA CGCGCACTTA TATCTGGCGG AGCTCTTGCA GAGTGAGGGC AAACCGGCAG AGGCGCTCCA GCACTTCCGC GTGTATTTCG ATTCTGTTGC CTCGCTGAAA CCCGCAAACC GGCCAACCGC TGATGTCGTC GTCGGCGCCC TGATTCACAT GGCGGGTTGC ATGTCGGATA CCGGCGACCA GTTTCACGCC ATTCGCAGCT ATGACACCGC AGAGCGTATC GCGCGTGAAT CTGGAGACAA GCGGTTGCAA AGCCTCGTCG CGAGCAGCGC GGCAGATTTT CGCGATAAGC ACGGCAGTTT CTCTGAAGCA CTGCGCCTGT TCCAGCAAGC GCTACGAATC GATAGCTCGA TTCCTTCGGC GGAGCTCGAA GCCAGGGACT TAATCAGCTA CGCGGAACTC TTGCGCGAGC ACCACTTGGA AAACCTCGCT TACGCCTGCC TGCTGCGCGC ACATCTTTTG ATGGAAGGAA ATCAGGGAGT TCCGGAATTC GCGCGCGTCG AGAACAGCTT GAAAGAGTTG GAAAAAACCA GCGACGCCAA GCAGATCGCC GAAGTACAGC ACGATCCAGA TCGCGTCCTG GCGCAGGCAT TGAATCTTCT GAACTGA
|
Protein sequence | MPATATKTAS NAGKIWIQSP AIDLIVGCGA WSAPLLLVSY LALSSHVATW AIVFYALALF FNYPHYMATI YRAYHTAEDF QKYRIFTVHI TGLLLLTALL SHFYVRMLPW IFTLYLTASP WHYSGQNYGL FMMFARRAGA SPSGRERWSL YSVFLLTYAI LFLNLHAGPS TDQLFVSLNL PILFVRWATI ALTFVCIGLS AYGLNALRRQ IGPKAMLPSV TLLSTQFLWF LIPSLLAASG RLVLPQSRYS TGVLAVMHSA QYLWITSYYA KREATTATAK WRPVAYFAVL IAGGIALFIP GPWLASYAFH FDFTSSFLIF TALVNLHHFI LDGAIWKLRD GRIAGLLLNA PAKFAKAAGD TSGALLRFSR WTIGPSTGAL ALRVGMICGL VGLAVLDQTR FVLGMSVEHP TRLQRAASLN PYDASVQLRI AKDAAGAGDK DAALLAFKNA QSARPNDTSI RDQYLRFLLA QQMYPEAFQL TTEWLARTPN DADLLVNRGI LANYVGNSEA AHASWEQALK VDPRQWNAHL YLAELLQSEG KPAEALQHFR VYFDSVASLK PANRPTADVV VGALIHMAGC MSDTGDQFHA IRSYDTAERI ARESGDKRLQ SLVASSAADF RDKHGSFSEA LRLFQQALRI DSSIPSAELE ARDLISYAEL LREHHLENLA YACLLRAHLL MEGNQGVPEF ARVENSLKEL EKTSDAKQIA EVQHDPDRVL AQALNLLN
|
| |