Gene Information |
Locus tag | Acid345_3234 |
Symbol | |
ID | 4072569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3830390 |
End bp | 3831670 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637985255 |
Product | type IV pilus assembly PilZ |
Protein accession | YP_592309 |
Protein GI | 94970261 |
COG category | |
COG ID | |
TIGRFAM ID | |
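
The coordinate, gene-length, and protein-length fields above are mutually consistent, and the following minimal sketch shows the arithmetic. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length = gene_length(3830390, 3831670)
print(length)                  # 1281
print(protein_length(length))  # 426
```

The strand does not affect this calculation, since the length depends only on the span between the two coordinates.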
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0626406 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACTAG CATCTGAGTC AACATTTCCC CCAAATCTCC TTCACGTCGG GGTTCATACC CCGAGCTTGG AAAAAATCAT GGCTCGCAGG CGGGAAGAAC GAATCATCAT CAGCTTGCCC GTGCGTTTAT GGGGCATGGA CGTAAATGGG AAACCGTTTA CGCAAAATGC CTCGTCCGTT GACATCACCC GAATGGGCGC CCGCATTCAG GGTGTCACGG CACAGATTCA GCATGGGGAC ATCATCGGCG TCCAGCACGG CAGCGACAAA GCTCGCTTCC GCGTTATCTG GGTCGCTCGG CCGGGATCGC GTAACGAAGG CCAGATCGGC GTGCATTGCG TCGAGGCCAA CAAGTACATC TGGGGGACCG TCGAGCCGAG CAATCAGGCC GATACCTGGG ATCCCGAAGC GGCTTCGGCC TCAACCCACG CGGTGGGCGC CGGCGGCGGA GTCGCCGCCG TCATGGCTGC CTCGCCATCG CACAAAGACA ACTTCGTGAA CGACGCCACC CGCGAGGGCC GGCGCCGCAT GCCACGCTAC GCGTGCCGCG GTGGTGGTGA AATTCGCCAG CCCGGAATGA AGACCGTGGT GTGGGGCTCG ATGACCGACA TCAGCCGCAG CGGATGCTAT CTCGAAACGC TGACCACGCT GCCGCGCAAT GCGAAGTGCG AACTCATGCT GAACGTAGAA GGCATTGAGG TACGCGCCGG CGCGGAAGTC CGCGTCTCGC ATCCTTCGAT GGGCATGGGC TTGCAGTTCA TTGACGTCGA TCCGACCGAT CAGAAGAAGC TCGATGATCT GCTCGTGAAA CTTGCGGGTG GCAAAGAGCC GGAAGATCGC ATCGTGCATC CGGTGAGCAA TGAGTTCGCG ACCGCGATCG CCTCAGCGGC GTCGCAGCTT CGTGACCTCG AAGCCTGCGT ATCGGAAAAC GAGGAAAACG TCGATCCACG GCTGCTCTCC GAGTTCCGCA GCGCCGTCGA TCATGCGCGT TCCACCACTG CAGCAATTGA GCAGTGGGTC GATTTGCAGG AGCAGGACCG CGATCCATTC CCGGTTCTCG CGGCGATTGA AACGAGCAGG ATCCGTTTGA CTGCGAGCTT CATGCGCGAG CTGGTGATGG ACATTGACGC CGCAACTTTG CATCTCGGCA GCGAAGGTGT GAAGGAGTTG TACGAGGCGG CACGTCAGTT GCACCTGCGC ATTGAGCAGA TGATTGCAGA TGCGACCGAG CCGGAAGATC TGCTCGACGC AGACGACGAC CACGCACAAT CAGCCGACTA G
|
Protein sequence | MRLASESTFP PNLLHVGVHT PSLEKIMARR REERIIISLP VRLWGMDVNG KPFTQNASSV DITRMGARIQ GVTAQIQHGD IIGVQHGSDK ARFRVIWVAR PGSRNEGQIG VHCVEANKYI WGTVEPSNQA DTWDPEAASA STHAVGAGGG VAAVMAASPS HKDNFVNDAT REGRRRMPRY ACRGGGEIRQ PGMKTVVWGS MTDISRSGCY LETLTTLPRN AKCELMLNVE GIEVRAGAEV RVSHPSMGMG LQFIDVDPTD QKKLDDLLVK LAGGKEPEDR IVHPVSNEFA TAIASAASQL RDLEACVSEN EENVDPRLLS EFRSAVDHAR STTAAIEQWV DLQEQDRDPF PVLAAIETSR IRLTASFMRE LVMDIDAATL HLGSEGVKEL YEAARQLHLR IEQMIADATE PEDLLDADDD HAQSAD
|
| |
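
The gene and protein sequences above can be cross-checked with a short script. This is a minimal sketch using only the Python standard library; it assumes the sequence is supplied as printed in this record (10-base blocks separated by spaces) and uses the amino acid assignments of NCBI translation table 11, which match the standard code. The helper names `clean`, `translate`, and `gc_percent` are illustrative. Alternative start codons, which table 11 allows, are not handled; this gene begins with ATG, so that does not matter here.

```python
from itertools import product

# Codon order used by the NCBI genetic code tables: first base varies slowest,
# third base fastest, each in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence from this record.
print(translate("ATGCGACTAG CATCTGAGTC AACATTTCCC"))  # MRLASESTFP
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field with its spaces removed, and `gc_percent` on the same input can be compared against the GC content field.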