Gene Cwoe_0124 details

Gene Information

Locus tag: Cwoe_0124
Symbol:
ID: 8730552
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 121203
End bp: 123128
Gene Length: 1926 bp
Protein Length: 641 aa
Translation table: 11
GC content: 68%
IMG OID: 646500738
Product: Endopolygalacturonase-like protein
Protein accession: YP_003391935
Protein GI: 284041595
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG5434] Endopolygalacturonase
TIGRFAM ID:
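
The coordinate and length fields above agree with one another, which a short check makes explicit. This is a minimal sketch that assumes 1-based inclusive coordinates and a CDS ending in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 121203
end_bp = 123128

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon,
# which is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1926 641
```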


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.778802
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.020189
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGTGGT CCCTGCGCGG GAGCCTGCTG GCTGCCGCGG CGCTGATGCT CGGCCTGCCG 
GCCGCGGCCA GCGCACTCGA CGTGGACGTC ACCGCGGCGC CGTACAGCGC CGCCGGTGAC
GGCGTGACGA ACGACCGGGC GGCGATTCAG AGAGCGATCG GCGACGTCGC CTCCGCCGGT
GGCGGCAAGG TGACGTTGCC GGCGCCGAAG ACATACCTGA CCGGCAACCT GCATCTCAGA
ACGAACGTCG AGCTGCACAT CGCCAGAGGC GCGAAGGTCA AGCAGAGCCA GGAGATCACC
CACTACGCCG ACAGACCGCT GCGCGGGGCG CTGATCGACG AGACGATCCC GTGGAACTTC
ACCTTCTACC GCAACTACCC GCTGATCTAC GGCGGCAGCG TCAGAAACGT GAAGCTGACC
GGTGCCGGCA CGATCGAGAT GACGCAGCTC CCGGACGCGA GCCAGACGAT TCGCACCAAC
GCGATCGGGA TGTGGGACGT CTCCGGGTTC GAGATCTCCG GCATCCATGC GATCGGCGGC
ACGACGCCGT ACGTCGGCCT CTACTTCAAC GACAACGGGG TCGTGCGCGA CACGACGCTC
AACGAGCCGG GCGGACCGAC CGTCTCCGGC ATCGAGGTCG TCAGCTCACA GCACATCCTC
GTCGAGAGAA ACCTGATGCG CGACCCGGAC GGCACCCCCG GCGTCACCGA CGACGGGATC
GCCCTGTCGA CGACCTACGG CGACCCGCGC GCCACCGGCT GGTGGAGAAG CGACGTGCCG
AGACCGCTCG TCGACGTCGT GATCCGCGAC AACATCGTGG AGGCGGAGTG CTGCAGCGCG
CTGGCGTTCA TCCCGTGGGG CACCTCGGCG CCCGACCAGC GCGACATCGA GTTCCGCGAC
ATCCGGATCG AGAACAACGT CCTCAACGCG CCGGGCAGCT GGGACTCCGT CAAGTGCTGG
TGCGACAACC CGTGGAGAGG TTCCTACGGG CCGGCGTACA CGGCGGTCGA GGACGACCAG
GCGCCCATGA CCAATGTGAC GTTCTCCGGC AACAGCTACG CCGGCAGTCT CGACGGGTTC
GTGAGAGCGA GAATCAGCGG GATCCAGGGC GCGTCGTTCC ACCCCGGCAA CCCGTACCTC
AAGAACGGCG GCTTCGAAGA GACCGGGATC GCGTACTGGT CGAAATCGGG CACCGCGAGA
CAGGTCGGCG CTGAGACCTT CGCGGCCGGT CAGACCGGCA GATGGTTCGG CTACATCACC
AACTTCGGCG ACGGCTACAC CGCGCTCGCG CAGGGCGTCG GCCTGCCGGC GAGCACGAGC
TACACGTTCC GCGCCAGGGT GCAGACGAGC GGCGACCCGG TGCGGATGTA CGTCCACAAC
CAGTGCACGG GCGCGACGGT CGCGGCCAAA TGGGTCAGCG GGACGGGCTG GTCGACCGAG
GACCTGTCGT TCACGACCAC CTCGGCCTGC TCGAACTACC ACGTCGGGAT CGACTCCTCC
GGCCAGACGA GAGGCTGGGG CCGGATCGAC GACGCGCGGC TGCTCGGCAG CGTGATCGAG
GACGGCGACC CGCGGATCGG CTACGCCGGC CTCTGGCACC AGAACGTCCA TCCGAGCGAC
TCCGGCGGCA CGCACATGCT CGCCGTCGCC GACAGAACGA CCGCGAACGT CACGTTCGAG
GGCAACCGCG CCAGATGGCG GACGATCGTC GGCCCCAACG GCGGCTATGC CGACGTCTGG
CTCGACGGCG TCTTCAAGGG CACGGTCGAC ACGTACAGCG CGAGCTACGG CTGGGCCGAG
ATCTACGACA CCGGGGTGCT GAGCAGAGGC ACGCACGTGC TCGGGATACG GCCGCAGTGG
GCGAAGAACC CGCTCTCGAC GTACACCTAC GTGACGATCG ACGCGATCAA CGTCTCGGGG
TGGTGA
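
The gene sequence is printed in blocks of ten bases, so the spacing has to be stripped before any downstream use. A minimal sketch, assuming the listing has been pasted into a string; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above; paste the full listing to
# work with the whole gene.
first_line = "ATGAGGTGGT CCCTGCGCGG GAGCCTGCTG GCTGCCGCGG CGCTGATGCT CGGCCTGCCG"
seq = clean_sequence(first_line)
print(len(seq), gc_fraction(seq))
```

Applied to the full 1926-base listing, the result can be compared with the GC content field in the record.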
 
Protein sequence
MRWSLRGSLL AAAALMLGLP AAASALDVDV TAAPYSAAGD GVTNDRAAIQ RAIGDVASAG 
GGKVTLPAPK TYLTGNLHLR TNVELHIARG AKVKQSQEIT HYADRPLRGA LIDETIPWNF
TFYRNYPLIY GGSVRNVKLT GAGTIEMTQL PDASQTIRTN AIGMWDVSGF EISGIHAIGG
TTPYVGLYFN DNGVVRDTTL NEPGGPTVSG IEVVSSQHIL VERNLMRDPD GTPGVTDDGI
ALSTTYGDPR ATGWWRSDVP RPLVDVVIRD NIVEAECCSA LAFIPWGTSA PDQRDIEFRD
IRIENNVLNA PGSWDSVKCW CDNPWRGSYG PAYTAVEDDQ APMTNVTFSG NSYAGSLDGF
VRARISGIQG ASFHPGNPYL KNGGFEETGI AYWSKSGTAR QVGAETFAAG QTGRWFGYIT
NFGDGYTALA QGVGLPASTS YTFRARVQTS GDPVRMYVHN QCTGATVAAK WVSGTGWSTE
DLSFTTTSAC SNYHVGIDSS GQTRGWGRID DARLLGSVIE DGDPRIGYAG LWHQNVHPSD
SGGTHMLAVA DRTTANVTFE GNRARWRTIV GPNGGYADVW LDGVFKGTVD TYSASYGWAE
IYDTGVLSRG THVLGIRPQW AKNPLSTYTY VTIDAINVSG W
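
The start and end of the protein sequence can be cross-checked against the gene sequence by translating codon by codon. A minimal sketch, assuming the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in the permitted start codons); `translate` is a hypothetical helper, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, then its final two codons.
print(translate("ATGAGGTGGT CCCTGCGCGG GAGCCTGCTG"))  # MRWSLRGSLL
print(translate("TGGTGA"))  # W
```

Both results match the first ten residues and the final residue of the protein sequence listed above.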