Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0823 |
Symbol | |
ID | 4072349 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1021736 |
End bp | 1023244 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637982832 |
Product | TPR repeat-containing protein |
Protein accession | YP_589902 |
Protein GI | 94967854 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAAGTCA TCGTCATCCC GAGGGACTTT TCCACCCGCC CGCGGTCTGT TTCGCGATGG CATCTCTGGT TGTTCCCACT CGTCACGCTT GCGGTGGCTC TCCGGCTTCC ATCCGGCGCT GCGGCGCAGG CGAAGCCATC CAAGCCCATC GTCAAAGCCA ATGCCAGCGC CTCGCAAGCA GAAGTGGAGC TCGCCCGACG CATCAAGACC GCGGATGCCG CTCGCGCATC CGGGAATACG AAAGCTATCG GAGACGCCAA CCGGAAAGTC ATAGCACTCG CTTTACGGGA ACTTGGACAT CTGCGCTTGA GCGAACTCCT CTCTGCCCAG GCCGTCGAGC TATATCGCAA TTCTCTGAGC TTCGAGGATG CAGTCGATTC GCGGGCGGAT TTGGCAAACG CGATCAACAC CGAGGCGAAA GCGAAGACCA CTCCCACATC GCCGCCTGAC GCCGATCCAT TCGCGAGACC GGACGCCTCC ACCTTCTCAC GCGCCAAGCT CTCTCCGGAG CAACGTACAG CTGCCGAAGG CAAGGAGATC CAGCTTCGCC TCGTCCTTGG CGCGAGTCTT AACGATCTCG CGACCTCTGA GGCGGCGCGC AATCAGTACG GCATGGCGCT CACACATCTC CAGCAGGCTG AGAAGTGGAA CTCGGCGACC ACTGGCCTGG CAAGAAATCT CGGCTTCTGT GCGTTCAAGG TGGGTGACTA CCCTGAGGCG ATTCGTACGC TTTCCCGTGC TCTCGAAGAA CAGCCGCAAG ATGCTCCCGT TCGGGCGATG CTTGGCATGT CGTATTTCGG GAGCAACAAA TATGCAGACG CCGCGAAAAC CTTCGAGCCG CTCGGTGATC GTGGGATGCA GGATACCTCC GTCGGTTACG CCTGGGCGAC ATCGCTCGCT CGGACTGGCG ATCTGAAGAA AGCCGCGGAC GTACTGAATC ACTTCGAGAA CTCCAATCTC TCCCCGGATG CATTACTTCT GGTGGGCCAG CTTTGGACAG AGATGACCGA TTACCAGCAC GCGGTATCGG TTTTCCAGAA AGTTTTGCTG CGGGACCCTT CGTTGCCGAA GGCGCACTTT TTTGAAGGAT TGGCATATCT GAAGTGGGAG AAGTGGAACG AGGCCGCCTC CGACTTCCAG GCGGAACTCG CACTGGTTCC GGGCGACCTC GACGCGAAAT ACACTCTTGG TTTTATTCGT CTGCAGCAAG GCCGCGTAGA CGAAGCACTG GCGTTCTTCA ACGAAGTTCT GGCTGCAGAG CCCAATCACG CGAACGCCCA ATATCAGATC GGCAAGATCA TGCTGGATCG CGGGCGACTG GACGATGCCA TCACCCACTT GGAAATCGCA GCCCGTCTTG ATCCCCAGGC AGACTATATT CACTACCAGC TCCAAGTTGC GTATCGGAAG CGCTCGCGAA TTGCCGAGGC GGACCGCGAA CTGGAAATCT ATAAGCAGCT GAAGTCCCAG GCGCGCGAAC AGTCGTCTTC GCCCAAGCAG AATCCTTAG
|
Protein sequence | MQVIVIPRDF STRPRSVSRW HLWLFPLVTL AVALRLPSGA AAQAKPSKPI VKANASASQA EVELARRIKT ADAARASGNT KAIGDANRKV IALALRELGH LRLSELLSAQ AVELYRNSLS FEDAVDSRAD LANAINTEAK AKTTPTSPPD ADPFARPDAS TFSRAKLSPE QRTAAEGKEI QLRLVLGASL NDLATSEAAR NQYGMALTHL QQAEKWNSAT TGLARNLGFC AFKVGDYPEA IRTLSRALEE QPQDAPVRAM LGMSYFGSNK YADAAKTFEP LGDRGMQDTS VGYAWATSLA RTGDLKKAAD VLNHFENSNL SPDALLLVGQ LWTEMTDYQH AVSVFQKVLL RDPSLPKAHF FEGLAYLKWE KWNEAASDFQ AELALVPGDL DAKYTLGFIR LQQGRVDEAL AFFNEVLAAE PNHANAQYQI GKIMLDRGRL DDAITHLEIA ARLDPQADYI HYQLQVAYRK RSRIAEADRE LEIYKQLKSQ AREQSSSPKQ NP
|
| |