Gene Information |
Locus tag | Acid345_3005 |
Symbol | |
ID | 4071560 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3559710 |
End bp | 3563000 |
Gene Length | 3291 bp |
Protein Length | 1096 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637985024 |
Product | TPR repeat-containing protein |
Protein accession | YP_592080 |
Protein GI | 94970032 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
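The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes a single terminal stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 3559710
END_BP = 3563000
GENE_LENGTH_BP = 3291
PROTEIN_LENGTH_AA = 1096


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid count for a CDS, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the listed values: the span covers 3291 bp, which is 1097 codons, or 1096 amino acids plus the stop codon.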
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000321187 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGCGCATTT CTTCTGTTCG GCTCTTTGGC ATCACGCTTC TTTCGTCAAT ACTCTCGGTC GCTTCCTTTG CTCTCGCGCA AGACCAGAAG CCTGCTCCGC CGGCGCCTAC CGGCCCGAAC GTCACTCCGC CGAAGGTCGA TAGATCCGAC GACGCCGATT ACGCCAAGCA GGCCTATGTT TACGAGAAGT TTGAAGACAA ATGGCGCTTT GAGACCGACG GCACCGGCCA GGAAACCACC ACGCTGCGCG TCAAGATTCA AAGCGATGCC GGCGTGAAGG CGTGGGGCCA GCTCGTCTTC GGCTACAACT CCGACAGCGA CGAGATGCAG ATCGCCTATG CGCGCGTCCG CAAGCCCGAC GGCAAAGTCA TCGACACACC GCCCGATTCC GTTCGCGATA TGACCAGTTC CGTTGAACGT GAAGCCCCGG TCTACACTGA CTATCGTGAG AAGCACCTGA CCGTTTCTTC GCTCCAGGTC GGCGACATCC TTGAGTACCA GCGCATTCGC AAAATCACGA AGCCCGCTAC GCCGAATGAG TTCTTCACCG AGCACACCTT CATGAAGAAC TACATCATCC TCGACGAACA GATCGAGTTC GATGTTCCCG TCAAGGCCAA CGCCAAGCTC AAATCTCTCC CGGGCTTCGA ACCTACCTCC ACGCGTACCG AGGGCGATCG CACCATATAT TCCTGGAAGC GCTCAAACCT GAAGGTCGAG GACGAAGAAG AGAAGCAGAA GCGCGAAAAG AAAAAGGGGA AGAAGCCGCA GGAATTTGCC GACGTCCAAC TCACCACCTT CAACAGTTGG GAGCAAATCG GCAAGTGGTA CCAGGACCTC CAGCGTGATC GCGTTGCGCC CACGCCTGAA ATCAAAGCCA AAGCCGCGGA GCTGACCAAG GGCCTCACTA CCGACGAAGA CAAGATCGCT GCTCTCTATC GCTACGTCGC CACCGGCTAT CGCTACGTGA GCCTCTCGCT CGGCGTCGGA CGCTTCCAAC CCCGTGCTGC CTCCGTCACC ATGCAGGACA AGTACGGCGA CTGCAAAGAC AAGGCGACGC TTCTCTCATC GCTACTGATC GCATCCGGCT ATAAGCCCGC GAACGTCCTC ATCCACACCT TCGTCAAGCT CCAGGATGAC TTCCCGACCC CAGCCTCCTT CAACCACGTC ATCACCGAAG TCAAAGCCGG CGATAAAGAA TTCTGGATGG ACAGCACGAC GGAGGTCGCA CCCTTCCGCC TCCTCACCTG GAACATCCGC AAAAAGAAAG CCCTGCTCGT TCCCGTGGAT GGCCAGCCGC ACGTCGTGGA AACGCCCGCC GATCCGCCAT TCACCAGCCT TGAGACCATC AACGTCGCCG GCAAGATCAA CGAGCTCGGC ACCGCTGACC TCCACCTGCA GATCATCTCG CGCGGCGACA GCGAGCTCCA GCTTCGTAGC GTCTTCCGCA ACTACGGCCA GGCAAACTAT CAGAAATTGA TGGAGAACAT CTCGCGCGTC CTCGGCGTCC CCGGCGACGT CAGCGACGTC AAGGTCTCCG ATCCCGCCGA CACCGTGAAG CCGTTCAGCA TGGAGTGCAA CGTCAAAGTC CAGAACGCGA TCGAGTGGAA AGACAAGACC GGCACCCTCG GCGTTCCGTT CGGCTCCATG AATCTCTCCG AAGAGCCGTC GGATCCCGGC CCTGACACCG AACCGCTCCC GCTCGGCGGA TCTCCCGGCG AGTATCGCGT CATCATGAAA GTTGACCTGC CGGAGAAGTA CACCCTCCGC CTGCCTGCCT CCATGAGCGT CAAGCGCGAT 
TACAGCGAGT ACTCCTCGAA CTACACGCAG GACAAGTCCA CATTCGTCGC CGAACGATAC CTGCACATCA TGCAGCGTGA AGTGCCGATC AAACGTTTCG GTGATTACCA CGCCTACCGC CTCGCGGTGA ACTCCGATCA CGGGCAAGCG CTCACCCTCA CCCGCACCGA CGCCAGCGTC GCCGGCGCCG AAAAAGACGC CAAGGCCGAC GACCTTTTCG ACGCCGCGCA GGCCGCCGTT CGCGCCGAAA ATTATCAGAA CGCCATCGAG CTTCTGCAGC GCGCGCTCGT CCTCGAGCCT GAGCACAAGT ACGGATGGGA CGCCCTCGCC GAGACCTACT ACAACGCCGG CGACCTCAAC AAAGCCATCG AGTACTACAA GAAGCAGCTT GAGGTGAATC CCTACGACGA CCTCGCCAAC ACCGGTCTGG CGCAGGTCTA CATGACGCAG TACAAGTACG ACGACGCTCT CGCGGCCTTC AAGAAGCAGG CCGAAATCAA TCCGCTCGAC AAGACTGCGC ACCTCGGCAT CGGTCAGGTC GACATCATTC GTGAAGACTA CAAGGCCGCC GTGCCTGAAC TCGAGCGCGC CGTATCGATC CTTCCGCAGT CGTCGGTCGC TCGCTACATG CTGGGCAACG CGTATCTCAA CACTGGTCAG ACCGAGAAGG CGATCACCGC CTTCGAAGAA TCCGTCAAGC TCGACGCCAA CAATCCCATG ACGTGGAACG ACATCGCCTA CGCGCTCGCC GACAAAGACG TCAAACTCGA CAAGGCCGAG CAGTACGCGC AGAGTTCCGT CAGTACCACG CAGTCCTACC TGCGCAACCT GCCCGCGGAG CAGGCCCTCA AAGCTGGCCC GCAAATGACC GCCAGCCTCG CCGCTGCCTG GGACACCCTC GGCTGGGTCT ACTACAAACA GGGCAAACAG AAAGAAGCCG AAGAATTCAT CCACGCCGCT TTCGATCAGG ACCCGCACTC CGAAGTCGCG GAGCACCTCG CGATCTTCGC CGAGAAGCGT AACGACAAGA AGGCAGCCGC CGAGTACTAC GCCATGGCCC TCGCTGGCGA TCGTCCTGCG CCTCGCTATC GTGAGAAGCT GATCACCCTC GCTTCGATCA AGGATGCCGA CGTCGAGGCG AAGATCAAAG AAGCCAAGGT CAAGCTCGAC GCTGAACGAT TCCTCAAGCT CAACAACGCT GGTTGGACCG GTAAAGCCGA ATTCGTACTG ACTTTTACCG CTTCCAAACA AGCGTCAGAT GCGCAGTGGA AGTCTGGCGC CGACTCGCTC AAGCCCGCGG CCAAAGCGCT TATGGCGATG AGCTACCCCA TCACTCTGCC GTCAGGGGAG TACCGCATCT TCCGTCGCGT GCTCGTCAGT TGCGAAGCAG GGAAGGACTG CAGCGTGTTG CTGTATGGCG CCGAAGATCG CGAGAGCACC GTTGATGTGC CCACCGCAGC GTCAATGAGC GACACCAAAC CCGCGAATTA A
|
Protein sequence | MRISSVRLFG ITLLSSILSV ASFALAQDQK PAPPAPTGPN VTPPKVDRSD DADYAKQAYV YEKFEDKWRF ETDGTGQETT TLRVKIQSDA GVKAWGQLVF GYNSDSDEMQ IAYARVRKPD GKVIDTPPDS VRDMTSSVER EAPVYTDYRE KHLTVSSLQV GDILEYQRIR KITKPATPNE FFTEHTFMKN YIILDEQIEF DVPVKANAKL KSLPGFEPTS TRTEGDRTIY SWKRSNLKVE DEEEKQKREK KKGKKPQEFA DVQLTTFNSW EQIGKWYQDL QRDRVAPTPE IKAKAAELTK GLTTDEDKIA ALYRYVATGY RYVSLSLGVG RFQPRAASVT MQDKYGDCKD KATLLSSLLI ASGYKPANVL IHTFVKLQDD FPTPASFNHV ITEVKAGDKE FWMDSTTEVA PFRLLTWNIR KKKALLVPVD GQPHVVETPA DPPFTSLETI NVAGKINELG TADLHLQIIS RGDSELQLRS VFRNYGQANY QKLMENISRV LGVPGDVSDV KVSDPADTVK PFSMECNVKV QNAIEWKDKT GTLGVPFGSM NLSEEPSDPG PDTEPLPLGG SPGEYRVIMK VDLPEKYTLR LPASMSVKRD YSEYSSNYTQ DKSTFVAERY LHIMQREVPI KRFGDYHAYR LAVNSDHGQA LTLTRTDASV AGAEKDAKAD DLFDAAQAAV RAENYQNAIE LLQRALVLEP EHKYGWDALA ETYYNAGDLN KAIEYYKKQL EVNPYDDLAN TGLAQVYMTQ YKYDDALAAF KKQAEINPLD KTAHLGIGQV DIIREDYKAA VPELERAVSI LPQSSVARYM LGNAYLNTGQ TEKAITAFEE SVKLDANNPM TWNDIAYALA DKDVKLDKAE QYAQSSVSTT QSYLRNLPAE QALKAGPQMT ASLAAAWDTL GWVYYKQGKQ KEAEEFIHAA FDQDPHSEVA EHLAIFAEKR NDKKAAAEYY AMALAGDRPA PRYREKLITL ASIKDADVEA KIKEAKVKLD AERFLKLNNA GWTGKAEFVL TFTASKQASD AQWKSGADSL KPAAKALMAM SYPITLPSGE YRIFRRVLVS CEAGKDCSVL LYGAEDREST VDVPTAASMS DTKPAN
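The relationship between the two sequence fields can be illustrated by translating the first and last few codons of the gene sequence. This is a minimal sketch, not the database's own pipeline: the `translate` helper is hypothetical, it assumes the gene sequence is already given in the coding orientation (the record lists the gene on the minus strand), and it uses the standard codon assignments, which translation table 11 shares for all non-start codons.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Table 11 uses these same amino-acid assignments; it differs from the
# standard table only in the set of permitted start codons.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product("TCAG", repeat=3), AMINO)}


def translate(grouped_dna: str) -> str:
    """Translate a DNA string, ignoring the spaces used to group bases.

    Stops at the first stop codon and does not include it in the result.
    """
    dna = "".join(grouped_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases and last 21 bases of the gene sequence shown above.
    print(translate("ATGCGCATTT CTTCTGTTCG GCTCTTTGGC"))
    print(translate("GACACCAAAC CCGCGAATTA A"))
```

The two fragments translate to the first ten residues (MRISSVRLFG) and the last six residues (DTKPAN) of the listed protein sequence, with the final TAA codon acting as the stop.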