Gene Information |
Locus tag | Acid345_4603 |
Symbol | |
ID | 4070760 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 5450933 |
End bp | 5453101 |
Gene Length | 2169 bp |
Protein Length | 722 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637986643 |
Product | TPR repeat-containing protein |
Protein accession | YP_593677 |
Protein GI | 94971629 |
COG category | [G] Carbohydrate transport and metabolism; [N] Cell motility; [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2956] Predicted N-acetylglucosaminyl transferase; [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | |
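The coordinates and lengths listed above are mutually consistent under two assumptions that the record itself does not state: that Start bp and End bp are 1-based inclusive positions, and that the gene length includes the stop codon. A minimal Python sketch of that check follows; the function names are hypothetical and introduced only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
start_bp, end_bp = 5450933, 5453101

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 2169
print(protein_length(length_bp))  # 722
```

Both results match the Gene Length (2169 bp) and Protein Length (722 aa) fields. The strand does not affect this arithmetic, since the length is the same on either strand.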
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.524488 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCTGGTC TTTTCATGAA CCCAAAAGCT CTCTCGTCCA CTTTGGCCGT GCTGCTGTTG TCTGCCACGG CGAGCCTTGC ACTGCCGAAT GGGCCTACTT TGTTGGCTCA GCAGAAGCAG GACGCGCCGA CGCCAACCGC TCCGGCCAAG CCGAATGCGG CCCAGCCAAC TTCGAATCCG GAAGCCGACG CCAAGAAACC AGCCAATCGC GCTGACGCCT ACTACCACTA CACGATGGCG CACATGTACG AGGAAATGGT GGCGACGTAC GGCCGCGCCG AGTACGCGAA CAAGGCGATT GAGGAATACC GCGCGGCGAT CACCGCCGAT CCGTCGTCGG ACTATTTAAA CGCCGGACTG GCCGATCTTT ACTGGCGCAC GGGCCGCATT CGCGACGCGG TGCTGGAAGC GCAGGAGATC CTGAAGCGCG ATCCGAAGAA CGTGGATGCG CACCGCCTGC TCGGGCGTAT TTACCTGCGT TCTTTGGGCG ACATGCAGAG CGGCAATAAC CAGTCTCGCG ACATGCAGCG GCTGGCGATT GAGCAGTACG AAGAGATCGT GAAGCTCGAT CCGACGAGCG TGGAAGACCA TCTGCTGCTC GGTCGCCTGT ATTCGTACAG CAACGACCTG ACCAAGGCCG AGAAGGAATT CAAGACCGCC GTCCAGATCC AGCCGGATTC CGAAGAAGCT GTGACCATGC TGGCGTACCT CTATACGCAG GAAGGCGACA CCAAGAAAGC GCAAGAGGTG CTGAGCAACA TTCCCGACGA CGATCGCAGC GCGAAGCTGT ATTCGACTCT GGGCTACACC TACGAAGAGC AGAAGGATTA CAAGAAGGCG ATCGAGGCAT ACCGCAAGGC GGTGATGCTC GATAAAGAGA ACCTCGACTC CGTCCGCGGG CTGGCGCAGA ACCTGTTGAA TGACGGGCAG CTCGATGCCG CGCTGGAGCA GTACAAGATC ATCGTTGATC AGGACCCGAG TGACGCTCAG AGCTACCAGC ACATCGCGGA GATCGACCGC CGCAACGGCA AGTTCGAAGC TGCGCTCGAT GCGCTGAAGA AGGCCTCCGC ACTAGTGCAG GATTCGCAGG AGATCCCGTA CAACATGGCG GTGATCTACG AGGGCCAGGG CCGCTACGAA GACGCCATCA ATACCATCCA GCAGCTTCTT ACGAAGACCG ATAAGCCGGA TGCGTCCTAC AGCTCGGCCG ATCGCAGCAA CCGTTCGATC TTCCTCGAGC GGCTGGGCAA CATCTATCGT GAGGCGAACA AGCCGCAGCA GGCGGTGGAG ACCTTCCGGC GGATGATCGC GCTCGGTGAC GATCCGGCCT CGCGTGCTTA CCAGGAGATG GTGGAGACCT ATCGCGATAA TCGCGATTGG CCGTCAGCCA CAGCAGCGGC GCAGGAAGGC GCAAAGAAAC TTCCCAAAGA TCGCGGGCTG CAACTAGTTC TCGCCGCGCA ATTGGCGGAT GAAGGTAAGG CCGACCAGGC GCTGAGTATT GCGAAGTCGC AACTCAATGG CAAGGCCGCC GATGACCGCG AAGTGTACGT GTCTCTGGCG CAGATGTACA CGCGGTTGAA GAAGTATCCC GAAGCGGAAG ACGCGATCGC GCAGGCGATG AAGCTTGCAG GCACGCAGGA TGAACGGAAC TACGTCACGT TTGTGCAAGG CTCGATCTAC GAGCGTGAGA AGAAATTCGA ACAAGCGGAA GAGGCCTTCC GTAAGGTCAT CAATGCCGAT CCGAAGAACG CCGGCGCGCT GAACTACCTG GGTTATATGC TGGCCGACCG CGGCACGCGC 
CTCGAAGAAG CGCTTGGCAT GCTGCGCAAG GCCGTGCAGA TGGAACCGCA GAACGGCGCG TATCTCGACT CGCTGGGCTG GGCCTACTTC AAGATGGGCA ACTACGAGCA GGCGGAAGAG AACCTGCGCA AAGCGTCCGA CAAGATCGGC AGCGATCCGA CGGTGCAGGA CCACCTTGGC GATCTTTATC AGAAGACGGG GCGCCTGAAG CTGGCGGCCA CGCAGTGGGA ACGCGCGCTG GACCAGTGGA ACCACTCAGT GCCGGCAGAA GTTGACGCCG ATGATGTGGC GAAGGTGCAG AAGAAGCTGG AGTCGGCGAA GATCAAGCTG GCACAGCAGA CCTCGACGAC TTCTAACACG AAGCAGTGA
Protein sequence | MAGLFMNPKA LSSTLAVLLL SATASLALPN GPTLLAQQKQ DAPTPTAPAK PNAAQPTSNP EADAKKPANR ADAYYHYTMA HMYEEMVATY GRAEYANKAI EEYRAAITAD PSSDYLNAGL ADLYWRTGRI RDAVLEAQEI LKRDPKNVDA HRLLGRIYLR SLGDMQSGNN QSRDMQRLAI EQYEEIVKLD PTSVEDHLLL GRLYSYSNDL TKAEKEFKTA VQIQPDSEEA VTMLAYLYTQ EGDTKKAQEV LSNIPDDDRS AKLYSTLGYT YEEQKDYKKA IEAYRKAVML DKENLDSVRG LAQNLLNDGQ LDAALEQYKI IVDQDPSDAQ SYQHIAEIDR RNGKFEAALD ALKKASALVQ DSQEIPYNMA VIYEGQGRYE DAINTIQQLL TKTDKPDASY SSADRSNRSI FLERLGNIYR EANKPQQAVE TFRRMIALGD DPASRAYQEM VETYRDNRDW PSATAAAQEG AKKLPKDRGL QLVLAAQLAD EGKADQALSI AKSQLNGKAA DDREVYVSLA QMYTRLKKYP EAEDAIAQAM KLAGTQDERN YVTFVQGSIY EREKKFEQAE EAFRKVINAD PKNAGALNYL GYMLADRGTR LEEALGMLRK AVQMEPQNGA YLDSLGWAYF KMGNYEQAEE NLRKASDKIG SDPTVQDHLG DLYQKTGRLK LAATQWERAL DQWNHSVPAE VDADDVAKVQ KKLESAKIKL AQQTSTTSNT KQ
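The sequences above are displayed in space-separated blocks of 10 characters. The following is a minimal sketch, in plain Python with hypothetical helper names, of stripping that layout and computing a GC fraction. The example uses only the first two blocks of the gene sequence, so its result is not the whole-gene GC content reported in the record.

```python
def strip_blocks(seq: str) -> str:
    """Remove the display whitespace between 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    bases = strip_blocks(seq)
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First two blocks of the gene sequence shown above (illustration only).
first_blocks = "ATGGCTGGTC TTTTCATGAA"

print(strip_blocks(first_blocks))  # ATGGCTGGTCTTTTCATGAA
print(gc_fraction(first_blocks))   # 0.4
```

Passing the full gene sequence to `gc_fraction` in the same way would give the value to compare against the GC content field.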