Gene Information |
Locus tag | Cwoe_0127 |
Symbol | |
ID | 8730555 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 126012 |
End bp | 128807 |
Gene Length | 2796 bp |
Protein Length | 931 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646500741 |
Product | PKD domain containing protein |
Protein accession | YP_003391938 |
Protein GI | 284041598 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG5434] Endopolygalacturonase |
TIGRFAM ID | |
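The coordinate and length fields above agree with each other, which the short sketch below checks. It assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length counts the terminal stop codon while Protein Length does not. Both assumptions fit the numbers in this record but are not stated in it. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 126012
END_BP = 128807
GENE_LENGTH_BP = 2796
PROTEIN_LENGTH_AA = 931


def gene_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Because the gene is on the minus strand, the sequence shown in the Sequence section would normally be the reverse complement of the replicon between these coordinates. The length arithmetic is the same on either strand.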
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00188311 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGTCCCTGT GGTCCACCGC GCGACGCGCG GTACCCACCG CGCTCGTCGC CGCCGCCGCT GTCGGCGCCG TCGCGAGCGC ACCCGCCGCC GCGGCGCCGG GAGACCCGTC GGCCGACGTG ACGGCGGCGC CGTACAACGC GGCCGGCGAC GACACGACGA ACGATCGCGC CGCGATCCAA CAGGCGATCG ACGACATCGG CGCCAGAGGC GGCGGCGAGG TGATCGTGCC GGCCGGCAGA ACGTTCCTGA CGGGCAGCCT GAAGCTGAGA TCCGGCGTCA CGCTGCGGCT CGACGGCACG CTGAGACAGA GCCAGGACGT CGCCCACTAC GCGAAGGCGC CGATCCCGGG CCTCGACATG CCGACGGACA TCCCCTGGCA TCGCGCGTAC TTCCACAACG ATCCGCTGAT CGGCGCGATA GGCGTCCACG ACGTCGCGAT CGTCGGCAGC GGCAAGCTGC AGATGACCGT CCAGGCCGAT CACACGAGAC GGATCGACAT CATCCCCGTC GGCTTCTTCG AGGCACAGCG CTTCACCCTC CGCGGCATCA CGATCCGCGA GGCGCGCACG ACGATCATCA CGATGCTGCA CTCCAGCGAC GGCCTCGTCG CGAACAACGA CGTCGCGACC TCGCCCGACC TCGGCGCCGA CGGCATCGTC GTCAGCGGCT CGCAGCGCGT GAGAGTGCTG CACAACAGAG TCCGCACGAG CGACGACAAC TTCGTTCTGG AGACGAAGCA CCTCGACCCG CGTGACCAGG ATTCGTGGTG GAGCTCGGCG ACCTCGACGC CGCTGAAGGA CATCGAGCTG GCGTACAACG ACGCCGAGAA CTTCGGCGCG CGCTACTCGA TCGCGTTCCT CCCGTGGGCG AGCACGCCGG CCGACAAGCG CGACGCCGCG ATCTCCGACA TCTGGATCCA CGACAACAGA CTCGTGTCGA GAAACGACAC GTGGGTCTAC TGCGACTGCG ACAACCCGTT CAACCCGGAG AGAAGACCGT TCATCCAGCC GACCCGGCGC GGCGAGCAGG CGCCGATGAC GCGGATCGTG ATGGAGAGAA ACACCTACGA CGGCAGAATG TGGCCGTTCT ACTCGTGGGT CCCGCCCGTC TTCACCGACT CGCGCTTCGA GATCGCGCGC GCGAACGCAC CGGCGCTGAT GAACGGCGAC TTCGAGAGAA CCGCGGCCGT CTGGTGGAGC CCCGAGGGCA ACGCCGGCGC GACCGACGAC CCGGCCGAGG TGCCGGAAGC GGCCGAGGCC GCGTTCGCGA GACGCGGCGG CAGCTTCGCC GCGTACCTGG AGACGCGCCG CAATCAGAGA GCCGCGCTCT ACCAGGGAAT CAGCCTGAAG AACGCCGCCA CGCTGCAGAT GCCGCGTCTC GGCCTCGGCA ACAGAGCGGT CTACACGTTC GAGGCGAACG TCGTCACGAG CGGCCACGCG TTCGCGCTCG TCGCGCGCGA CACCTGCGCG AACAGAGAGC TGGCACGACG GCTCGTCGCC CCGCGCAGCT TCCGCCGCGA GCGGCTGCTG ATCCCGGTCG AGACGAACTG CGACCTCGTC CGCGTCGGGA TCGAGCGGAC CGCCGCGAAC GGCGCCTGGG CGCTGCTCGA CGACGCCGAG CTGCACATGC CGGTGATCGA CAGCGAGGAC CCGCGCTGGG TCCTGAGAGG CATGTGGATG CGCGACTGGT CGGGCTGGGA CGTCGGCGAG ACCGACCACC ACGCCGGCGA CGACGGCGGC TCCGCGGCGC TGACGTACAC CGGCGGGCAG GCGAAGCTGA TCTCCGCCAG AAAGCCGTGG 
GGAGGAATCG TCGAGATGTC GGTCGACGGG GCGGCGAGAG GCGACGTCGA CCTCTACGGC AGCCAGCAGA TCGGCGGGCT CGAGGTGTTC GACACGGGCA CCCTGAGCAG CGGCAGACAC ACGCTGAGAC TCACCATCAC GGGCCGCAAG AACGCGGCGT CGGAGAACGA CTACGGCATC TTCGACGCGA TCGTCGTGCC GGAGTGGGTC AACCCCGCCG CGCCCGCCGC GCACCTCGAG GAGCTGCGTC CGGACCTCGG CGCGGTCGTC GTGCCGGGCA GAGCGCTCGT CGGCCACCCG GTCTACGTGG AGGCGCCCGC GTACGCGCCG CGCGGCGGCG GCGTGACGGT CGGCTGGAGA CTCGGCGACG GCACGACGAC GAGCGGTTCG AAGGCCAACC ACGTCTACGA CGCGCCCGGC ACGTACACCG TGACGGTCGA GGCGAGAGAC ACGCACGGCG AGACGACGAC GGCGACGCGC ACGATCGTCG TCAGAGCGGG CCCGTCGGGC CTCGCAGGCC TGGACGGCGC GGCCGGCGCT GACGGAACCG ACGGGGCGCC CGGCGAGCGT GGGGCCGACG GTCCGGCAGG GCCTGCGGGC GCGGCCGGCG CGACGGGCGC GACCGGTCCG GCCGGCGCAA CGGGCGCAGG CGGCCCGACC GGGGCGCGCG GCGCCGCTGG GCCGAAGGGC GACGCGGGCG ATCCGGCGAA CGTGTCGGTC AGCTGCACGC TCGTCAAGCG CCGCACGGCG GTGCGCTGCA CCGTGGCGGC GACGAGAGCG CGGGCGCGCG GAGCGGCGCG CGTGAGCGGA ACGTTCCGCA CGGCCGGGAC GACGGCGCGC GCCGCCGGCT CGGGCCGCGT CGCCGTCACG CTGCGTCCGG CGCAGCGGCC CACGAAGAGC GCGCGCGTCG CGGTGCGGCT GCGCGTCGAC GGGATCGCGA AGGACCTGGT CGTGCCGCTC GGCACGACGC GAAGCGTCGC GCTGCCGCGT AGGTAG
|
Protein sequence | MSLWSTARRA VPTALVAAAA VGAVASAPAA AAPGDPSADV TAAPYNAAGD DTTNDRAAIQ QAIDDIGARG GGEVIVPAGR TFLTGSLKLR SGVTLRLDGT LRQSQDVAHY AKAPIPGLDM PTDIPWHRAY FHNDPLIGAI GVHDVAIVGS GKLQMTVQAD HTRRIDIIPV GFFEAQRFTL RGITIREART TIITMLHSSD GLVANNDVAT SPDLGADGIV VSGSQRVRVL HNRVRTSDDN FVLETKHLDP RDQDSWWSSA TSTPLKDIEL AYNDAENFGA RYSIAFLPWA STPADKRDAA ISDIWIHDNR LVSRNDTWVY CDCDNPFNPE RRPFIQPTRR GEQAPMTRIV MERNTYDGRM WPFYSWVPPV FTDSRFEIAR ANAPALMNGD FERTAAVWWS PEGNAGATDD PAEVPEAAEA AFARRGGSFA AYLETRRNQR AALYQGISLK NAATLQMPRL GLGNRAVYTF EANVVTSGHA FALVARDTCA NRELARRLVA PRSFRRERLL IPVETNCDLV RVGIERTAAN GAWALLDDAE LHMPVIDSED PRWVLRGMWM RDWSGWDVGE TDHHAGDDGG SAALTYTGGQ AKLISARKPW GGIVEMSVDG AARGDVDLYG SQQIGGLEVF DTGTLSSGRH TLRLTITGRK NAASENDYGI FDAIVVPEWV NPAAPAAHLE ELRPDLGAVV VPGRALVGHP VYVEAPAYAP RGGGVTVGWR LGDGTTTSGS KANHVYDAPG TYTVTVEARD THGETTTATR TIVVRAGPSG LAGLDGAAGA DGTDGAPGER GADGPAGPAG AAGATGATGP AGATGAGGPT GARGAAGPKG DAGDPANVSV SCTLVKRRTA VRCTVAATRA RARGAARVSG TFRTAGTTAR AAGSGRVAVT LRPAQRPTKS ARVAVRLRVD GIAKDLVVPL GTTRSVALPR R
|
| |
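The sequences above are shown in space-separated blocks of 10 characters. Below is a minimal sketch of how to strip that display formatting and compute a GC percentage comparable to the GC content field. Whether the source database computes the field in exactly this way is an assumption, and the helper names are hypothetical.

```python
def clean_sequence(display_text):
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(display_text.split()).upper()


def gc_fraction(display_text):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(display_text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence above, used only as a small example.
example = "ATGTCCCTGT GGTCCACCGC"
print(len(clean_sequence(example)))
print(round(gc_fraction(example) * 100))
```

Pasting the full Gene sequence field into `gc_fraction` gives a value that can be compared with the reported 72%.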