Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_0124 |
Symbol | |
ID | 8730552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 121203 |
End bp | 123128 |
Gene Length | 1926 bp |
Protein Length | 641 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646500738 |
Product | Endopygalactorunase-like protein |
Protein accession | YP_003391935 |
Protein GI | 284041595 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.778802 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.020189 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGTGGT CCCTGCGCGG GAGCCTGCTG GCTGCCGCGG CGCTGATGCT CGGCCTGCCG GCCGCGGCCA GCGCACTCGA CGTGGACGTC ACCGCGGCGC CGTACAGCGC CGCCGGTGAC GGCGTGACGA ACGACCGGGC GGCGATTCAG AGAGCGATCG GCGACGTCGC CTCCGCCGGT GGCGGCAAGG TGACGTTGCC GGCGCCGAAG ACATACCTGA CCGGCAACCT GCATCTCAGA ACGAACGTCG AGCTGCACAT CGCCAGAGGC GCGAAGGTCA AGCAGAGCCA GGAGATCACC CACTACGCCG ACAGACCGCT GCGCGGGGCG CTGATCGACG AGACGATCCC GTGGAACTTC ACCTTCTACC GCAACTACCC GCTGATCTAC GGCGGCAGCG TCAGAAACGT GAAGCTGACC GGTGCCGGCA CGATCGAGAT GACGCAGCTC CCGGACGCGA GCCAGACGAT TCGCACCAAC GCGATCGGGA TGTGGGACGT CTCCGGGTTC GAGATCTCCG GCATCCATGC GATCGGCGGC ACGACGCCGT ACGTCGGCCT CTACTTCAAC GACAACGGGG TCGTGCGCGA CACGACGCTC AACGAGCCGG GCGGACCGAC CGTCTCCGGC ATCGAGGTCG TCAGCTCACA GCACATCCTC GTCGAGAGAA ACCTGATGCG CGACCCGGAC GGCACCCCCG GCGTCACCGA CGACGGGATC GCCCTGTCGA CGACCTACGG CGACCCGCGC GCCACCGGCT GGTGGAGAAG CGACGTGCCG AGACCGCTCG TCGACGTCGT GATCCGCGAC AACATCGTGG AGGCGGAGTG CTGCAGCGCG CTGGCGTTCA TCCCGTGGGG CACCTCGGCG CCCGACCAGC GCGACATCGA GTTCCGCGAC ATCCGGATCG AGAACAACGT CCTCAACGCG CCGGGCAGCT GGGACTCCGT CAAGTGCTGG TGCGACAACC CGTGGAGAGG TTCCTACGGG CCGGCGTACA CGGCGGTCGA GGACGACCAG GCGCCCATGA CCAATGTGAC GTTCTCCGGC AACAGCTACG CCGGCAGTCT CGACGGGTTC GTGAGAGCGA GAATCAGCGG GATCCAGGGC GCGTCGTTCC ACCCCGGCAA CCCGTACCTC AAGAACGGCG GCTTCGAAGA GACCGGGATC GCGTACTGGT CGAAATCGGG CACCGCGAGA CAGGTCGGCG CTGAGACCTT CGCGGCCGGT CAGACCGGCA GATGGTTCGG CTACATCACC AACTTCGGCG ACGGCTACAC CGCGCTCGCG CAGGGCGTCG GCCTGCCGGC GAGCACGAGC TACACGTTCC GCGCCAGGGT GCAGACGAGC GGCGACCCGG TGCGGATGTA CGTCCACAAC CAGTGCACGG GCGCGACGGT CGCGGCCAAA TGGGTCAGCG GGACGGGCTG GTCGACCGAG GACCTGTCGT TCACGACCAC CTCGGCCTGC TCGAACTACC ACGTCGGGAT CGACTCCTCC GGCCAGACGA GAGGCTGGGG CCGGATCGAC GACGCGCGGC TGCTCGGCAG CGTGATCGAG GACGGCGACC CGCGGATCGG CTACGCCGGC CTCTGGCACC AGAACGTCCA TCCGAGCGAC TCCGGCGGCA CGCACATGCT CGCCGTCGCC GACAGAACGA CCGCGAACGT CACGTTCGAG GGCAACCGCG CCAGATGGCG GACGATCGTC GGCCCCAACG GCGGCTATGC CGACGTCTGG CTCGACGGCG TCTTCAAGGG CACGGTCGAC ACGTACAGCG CGAGCTACGG CTGGGCCGAG ATCTACGACA CCGGGGTGCT GAGCAGAGGC ACGCACGTGC TCGGGATACG GCCGCAGTGG GCGAAGAACC CGCTCTCGAC GTACACCTAC GTGACGATCG ACGCGATCAA CGTCTCGGGG TGGTGA
|
Protein sequence | MRWSLRGSLL AAAALMLGLP AAASALDVDV TAAPYSAAGD GVTNDRAAIQ RAIGDVASAG GGKVTLPAPK TYLTGNLHLR TNVELHIARG AKVKQSQEIT HYADRPLRGA LIDETIPWNF TFYRNYPLIY GGSVRNVKLT GAGTIEMTQL PDASQTIRTN AIGMWDVSGF EISGIHAIGG TTPYVGLYFN DNGVVRDTTL NEPGGPTVSG IEVVSSQHIL VERNLMRDPD GTPGVTDDGI ALSTTYGDPR ATGWWRSDVP RPLVDVVIRD NIVEAECCSA LAFIPWGTSA PDQRDIEFRD IRIENNVLNA PGSWDSVKCW CDNPWRGSYG PAYTAVEDDQ APMTNVTFSG NSYAGSLDGF VRARISGIQG ASFHPGNPYL KNGGFEETGI AYWSKSGTAR QVGAETFAAG QTGRWFGYIT NFGDGYTALA QGVGLPASTS YTFRARVQTS GDPVRMYVHN QCTGATVAAK WVSGTGWSTE DLSFTTTSAC SNYHVGIDSS GQTRGWGRID DARLLGSVIE DGDPRIGYAG LWHQNVHPSD SGGTHMLAVA DRTTANVTFE GNRARWRTIV GPNGGYADVW LDGVFKGTVD TYSASYGWAE IYDTGVLSRG THVLGIRPQW AKNPLSTYTY VTIDAINVSG W
|
| |