Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_1396 |
Symbol | |
ID | 8731836 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 1462134 |
End bp | 1464062 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646502015 |
Product | Endopygalactorunase-like protein |
Protein accession | YP_003393200 |
Protein GI | 284042860 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.230855 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACGGT TGCTGGTCGG AGCCGTATTG GCCGTACTGC TGGGTTGCCT GCTGCCTGCG GCCGCGAGCG CGTTGGACGT CGATGTGACG GCGGCGCCGT ACAGTGCCGC CGGTGACGGG GTGACGAACG ATCGGCTCGC GATCCAGAGC GCGATCGACG CGGTCAACGC GGCCGGCGGC GGGACGGTGA CGCTGCCGGC GTCGAGAACG TTCCTGTCCG GCAACCTGCG GCTGAAGTCG AACGTAGAGC TGATCATCGC CAGAGGGGCG ACGCTGAAGC AGAGCCAGAC GGTCGCACAC TACGACTACA CCCCTCTCAG AGGGATGATC ATCGACCTCA CGATCCCGTG GAACTTCACC TTCTACCGGA ACTTTCCGCT CGTCTACGCG GCATCGGCGA GAAACGTGAA GGTGACCGGC AGAGGCACGA TCGAGATGAC CCACCTGCCG AACGTCGCCG ACACGATCCT CAACGACGCG ATCGGCCTTT ACCAGGTTAC GGGCTTCGAG CTGGCGGACC TGCACGTGAT CGGCGGCGGC GTCCCCTACA TCGGCATCTA CTTCAGCGAC AACGGCTCGA TGCACGACAT GACGCTCAAC GAGCCCTACA AGGACCCGGT CAGCGGCCTC GAGCTGATCA GCTCGCAGCA CGTCGTCGTC GAGGACAACG TGATGCGCAA CCCGGACGGC AGACCAGGGG TCACCGATGA CGGGATCGCG CTCGGCACCA TACACAGAGA TCCGCGCTCG ACCGGCTGGT GGAGAAGCGA CGTGTCGGAG CCGCTCTCCG ACGTGATCGT GCGCAACAAC ATCGTCGAGA CGGAGTGCTG CAGCGCGCTC GCGTTCATCC CGTGGGCGTC GGTCACCGCC GACAAGCGCG ACGGGGAGAT GCGCAACATA CTGATCGAGA ACAACGTCCT CAACGCGCCG GCGACGTGGG ACTCCGTCAA GTGCTGGTGC GACAACCCCT GGAACGGCCC GGGCGGCGCC GCGTACACCG CGACCGACAG CGATCAGGCG CAGATGACCA ACATCCGCTT CTCCGGCAAC ACCTACGCCG GAAGAGTGAA CGAGTTCATC AGAGCCAACA TCGCGGGCGT TCGCGGCGCT CCGTTCCATC CCGGCGAACC GTTCATCCAG AACCCAGGGT TCGAGACGAC CGGGACGTCC AGCTGGTCGA CGGTCGGCAC CGCGAGCCAG GTCGGTGCGA CCGACGGGCT CTCGGTCGGC CAGTCGGGCA GATTCTACGG GTACATCCAG GACTTCAGAA CGGGCTACAC GGCGCTCGGC CAGGGCGTGG GGCTCGCGGC CAGCACGAGC TATCGCTATC GGGCGAGAGT CCAGACCAGC GGCGCGAACG TGCGCCTCTT CGTCCACAAC CAGTGCACCA ACACCACGGT CGCCGTGCTG AACGTCTCGG CGACGAGCTG GACGACGTAC GACCTGCCGT TCACGACGAC GACGGCCTGC TCGAACTATG AGATCGGGAT CGACCCGGCC GGGTCGACGA GCGGGTGGGC GCGGATCGAC GACGTCGAGC TGCACGGGCC GGCGATGGAC GAGGGCGACC CCAAGTTCAC CTACTCGGGC ATGTGGCAGC AATGGCTGGA CCCGGTCGAC TTCGGCGGCA CGCACCTGAC GACGTCGAGC GCGGGCGCGA CGGCGAACGT CACCTTCTAC GGCACTGAGG CCAGATGGCG TGGCACGAGA GGCCCGAACA TGGGCTACGC CGACATCTGG CTGGACGGCG TCTTCAGAGG CACCGTCGAC CTCTACAGCG GCTCCTTCGC GACCGGTGCC GACCTGTGGA GCACGGGCAC CGTCGCGAGA GGCTGGCACG TGCTGGGCAT CCGGGCGCAG GGCGCGCGCA ACCCGCTCTC GAGCGCCGAC TACGTCGCGA TCGACTCCGT CGCCGTCAGT GGGTACTGA
|
Protein sequence | MRRLLVGAVL AVLLGCLLPA AASALDVDVT AAPYSAAGDG VTNDRLAIQS AIDAVNAAGG GTVTLPASRT FLSGNLRLKS NVELIIARGA TLKQSQTVAH YDYTPLRGMI IDLTIPWNFT FYRNFPLVYA ASARNVKVTG RGTIEMTHLP NVADTILNDA IGLYQVTGFE LADLHVIGGG VPYIGIYFSD NGSMHDMTLN EPYKDPVSGL ELISSQHVVV EDNVMRNPDG RPGVTDDGIA LGTIHRDPRS TGWWRSDVSE PLSDVIVRNN IVETECCSAL AFIPWASVTA DKRDGEMRNI LIENNVLNAP ATWDSVKCWC DNPWNGPGGA AYTATDSDQA QMTNIRFSGN TYAGRVNEFI RANIAGVRGA PFHPGEPFIQ NPGFETTGTS SWSTVGTASQ VGATDGLSVG QSGRFYGYIQ DFRTGYTALG QGVGLAASTS YRYRARVQTS GANVRLFVHN QCTNTTVAVL NVSATSWTTY DLPFTTTTAC SNYEIGIDPA GSTSGWARID DVELHGPAMD EGDPKFTYSG MWQQWLDPVD FGGTHLTTSS AGATANVTFY GTEARWRGTR GPNMGYADIW LDGVFRGTVD LYSGSFATGA DLWSTGTVAR GWHVLGIRAQ GARNPLSSAD YVAIDSVAVS GY
|
| |