Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4247 |
Symbol | |
ID | 4073174 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 5036458 |
End bp | 5039625 |
Gene Length | 3168 bp |
Protein Length | 1055 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637986279 |
Product | TPR repeat-containing protein |
Protein accession | YP_593321 |
Protein GI | 94971273 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAGGC GGATCGGGCG AGGGGCATGG CTTGCGATGA TCGCAGCGTG TTTGCTGGCG ATCGCAGGGC GAAGTTTCGA GTCGCGCCTT CGGGGCAGTG CCCGAGCCGA CAACCTACTG GAAAAAGCAT ACAGCGACCC AAGGACTCTC GAACTGCGGC TGCGAGGGGC GGCGTGGTCT CTGCTTCGCG AGCGCGCAAG CGGGCGGATC CGTGCAGAGG CGCGTTCCGC GGACCTGCTT CGAGCGGAAG CGGAGGTGGC GCGGCTCTAC CAAGCGAATC CGGTTGGGAC GCTGGAGTTG CGTGATCGCG GCCGCGCCAA CCTATTGGAG TGGAGCTTTG ACGAAGCTTT GGGCGACTTC CACCTGGCTC TCGCGCATGA ACCGGAATCG CCGGAGATCC TGAACGATCT GGCCACTGCA TACTATGAAC GCGGAGAAGC GCAAGGGGAC TGGGAGTCGC TAGTAGACGC TTATGAGCTA GAGAGCCGGG CGGTGCAGGC GCGTTCGGGA GATACATTGC TCCTGTTCAA CCGCGCGGTG ATCGCGAAGC GGCTTGGCAT GTACGGCCAA AGCATGGAGG ACTGGAGACG CGTATCCGAA TTGGAAAAGG AAAAGGGATG GGCCGAGGAA GCGCGTTCTA ACTTTCAACA GCTCGCGGCG TTGCAGAAGA CGCGGCTGGA AAAGAATGGA GGACCGCTTC TGAGCGCGAG AGAGTTTGCC GACCGGGTGC AGCCGGAGGA CCCCACGACT TGGAAAGAGG TGGAGCCGCG CGTGGAAGAG TACGTTTCGG AGGCGACGCG GAGTTGGCTT CCGGCAGCGT TTCCGCGAAA AGGCGACGCG GACGAATCAG CGAGGCGCGC GCTCAAGGCG CTGGCGATCG TGCTCGAACG CGGCCACGGC GACCGTTGGT TGCGTGACCT TCTTTGGCAG CCCGGATCCG ATTCGCTGGC AGATGCGGTG GCGTCGCTCT CGACCGCTGC CCAGGCAGAC TGGACGCAAC AGAACTACAG CCTAGGCCGC GCGGCGGCGC AAGACGCCAG GCGAAGTTTC GCGAAGATGG GAAATCGCGC GGGTGAATTG CGCGCTGCGT TTGAAGAGCT GTATGCGAGC GAGTTTGCCG ACATGGGAAC CGTCTGCTCG CGGCAAGCGA GCCAGTTGCA ATCCGCCCTG CGCGAAGTTT CCTACCCCTG GTTGAGCGCG CAGACGGCGC TTGAGCGCTA CAACTGCGAA CTGGAAACGG GAAACTTCGG AGCCTCCGAA TTTCTCAGGC GCGCCCGCCA GATCTCGCAA GCCGCATCCT ATGCGGGCAT CTCTCTGCGC GCGCTCAACT TTCTTGCCGC CGATCGCTTC GCTCGCGGAG ACCTGGCCGG AGGCTGGCAC GCTTCTTCCG ACGGGATTCG CGAGTTCTGG GCGGGCTCGC AGGACCTCAC TTATGGGTAC AACCTCTATA CCACCGTGGA ATTCGGAGTG GAGGTTCGAA ACTCCTGGTT CTCCGATGTC GCGTATGGCG AGCAGGCGCT CTCGCTGGTG GAAGGAAACC AGAAGCCCTT CGCACGGGCC GAAGAACATC TCGCGCTAGC AAAGGCCAGT TTGCTCGCGA AGAGCCCGGC AGTGTCGCTC GAGCATCTTC GTGCGGCAGA GTCTCTTATT GCGGATGTGC CGCCATCCTC CGAAACCAGC AACTTCCGCA TGGACATCGC GACGCAGAGT GCTTACCTGC AAGCGCTTAC CGGCTCCGCC ACGGCGCAGA TCTTTCCGGC GCCGGCGGAG ATTTCGCAGG TCGAGAACGT GTACACCCTC GGCAGCTACT ACACCACGCG CGGCAAGACC CTGGCCTTTG AAGGCAAAGT AGAGGAAGCC AAGAGCGCTT ATCGAAGCGC GGTCGCCCTG GCAGAACACG CGCGGAGAAG CCTGTCTTCC GACGCGGACC GGCTCGCCTG GCGCCATTCC TGGACCGAAC CGTATCTGTT GTGGATTGAT CTCGAACTGA AGACCGGGAA CACGCAGAAG GCTCTCGCGA TTTGGGAGCT CTGCCGGAAC TCCGATCCAG CGGTGCTCCC GCTCGCGAAT GGCAGCCGCG GCACGAAGCT GGATGCATCG GTCATGGAGA ACTCGCTCGC CTCGGCACTT GCGGCGGAAG AAGCTCGGGA CGCCCAGGTG CGACCGAACT TGAGGGATGA TGCGTTGCTG TTGTTCACCC GTCTGCCGGA CCGCATCGTG GCCTGGGCGA TCACGGAACA AGGCATCGAG ACCTCGGTGA TACCCGCGGA CGCCTCGGAC GTGGTGATGC AGGGGCGCTT GTTCCGGGAG TTGTGCGCCC GTCCGTCTTC TTCGATGGAG CAAGTGAGCC TCCAAGGAAG ATCGCTGTAC TCGCAACTGA TTGCGCCGGT CGAAAGCCAG CTTCGCAAGG CGAAAAACGT TTTGATCGAG AACGACGATT CGCTGGCGGG AATTCCATTC CAGGCGCTGA TCGCTCCCGC AGGCAAGTAT TTCTCAGATG AACACGCAAT TCGCTATGTG TCGGGCGCTC GCGACGTCGA ACGAGAATCA GCGTCGGGCG CAGTCGTCAC GCGCGAGACG AAGATGCTGT TGGTCGCCAA CTCCGGGTCA AGCATGGACG GCGTCCAGCC GCTCGACGAT GTGGTGGCGG AGGCGCGATC GGTGTCTTAC CTCTTCCCGC GCGCCGAAGT GCTGGTCGAG CGACAGGCGA CTCTTTCCGC GGTCATGAAG AAGATGCCGC AGGCGGAATC AGTTTACTTC GTAGGCCATG CAGTTTCGGA CGGAGAGCGC ACAGCATTGC TGCTCAGCTC GGAAAGCGGC TCAAGCCAAC CGTCGCTGCT GACGAGCCAG TCCCTCGGCA ACAGCAAGCT GGGCAGCGTC CGGCTTGCGG TGCTTGCCGC GTGCTCGACG CAAGGCGGCA CCGAGCGAAG CTCCGACGAG GCCGACAGCC TGGTGCGCGC ACTTCTAGGG CGCGGCGTCC GGCATGTGGT GGCAAGCGGA TGGGACGTGG ACTCGCAGGT CACTTCAAGA ATGATGGACG CTTTCTACAA GAACCTGCTC CGCGGCGCCA CGGTCTCAGA GGCGCTGGCG GGGGCCGAAG CGGAGACCAG AAGGGCCACG CAACACCCGT ATTACTGGGC CTCTTTCGAT GCGTTTGGAA ATAACTGA
|
Protein sequence | MKRRIGRGAW LAMIAACLLA IAGRSFESRL RGSARADNLL EKAYSDPRTL ELRLRGAAWS LLRERASGRI RAEARSADLL RAEAEVARLY QANPVGTLEL RDRGRANLLE WSFDEALGDF HLALAHEPES PEILNDLATA YYERGEAQGD WESLVDAYEL ESRAVQARSG DTLLLFNRAV IAKRLGMYGQ SMEDWRRVSE LEKEKGWAEE ARSNFQQLAA LQKTRLEKNG GPLLSAREFA DRVQPEDPTT WKEVEPRVEE YVSEATRSWL PAAFPRKGDA DESARRALKA LAIVLERGHG DRWLRDLLWQ PGSDSLADAV ASLSTAAQAD WTQQNYSLGR AAAQDARRSF AKMGNRAGEL RAAFEELYAS EFADMGTVCS RQASQLQSAL REVSYPWLSA QTALERYNCE LETGNFGASE FLRRARQISQ AASYAGISLR ALNFLAADRF ARGDLAGGWH ASSDGIREFW AGSQDLTYGY NLYTTVEFGV EVRNSWFSDV AYGEQALSLV EGNQKPFARA EEHLALAKAS LLAKSPAVSL EHLRAAESLI ADVPPSSETS NFRMDIATQS AYLQALTGSA TAQIFPAPAE ISQVENVYTL GSYYTTRGKT LAFEGKVEEA KSAYRSAVAL AEHARRSLSS DADRLAWRHS WTEPYLLWID LELKTGNTQK ALAIWELCRN SDPAVLPLAN GSRGTKLDAS VMENSLASAL AAEEARDAQV RPNLRDDALL LFTRLPDRIV AWAITEQGIE TSVIPADASD VVMQGRLFRE LCARPSSSME QVSLQGRSLY SQLIAPVESQ LRKAKNVLIE NDDSLAGIPF QALIAPAGKY FSDEHAIRYV SGARDVERES ASGAVVTRET KMLLVANSGS SMDGVQPLDD VVAEARSVSY LFPRAEVLVE RQATLSAVMK KMPQAESVYF VGHAVSDGER TALLLSSESG SSQPSLLTSQ SLGNSKLGSV RLAVLAACST QGGTERSSDE ADSLVRALLG RGVRHVVASG WDVDSQVTSR MMDAFYKNLL RGATVSEALA GAEAETRRAT QHPYYWASFD AFGNN
|
| |