Gene Cwoe_0125 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_0125 
Symbol 
ID8730553 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp123264 
End bp125165 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content68% 
IMG OID646500739 
ProductEndopygalactorunase-like protein 
Protein accessionYP_003391936 
Protein GI284041596 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5434] Endopolygalacturonase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.300956 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0208856 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACGAC TGCTTGTCAG CGGCCTGGTG GCCGCGGTGA CGGCATGCCT TGCCCTGCCC 
GCGGCCGCCT CCGCGCTCGA CGTGGACGTG ACCGCAGCGC CGTATGGAGC TGCCGGGGAC
GGCGTGACGA ACGACCGTGC GGCGATCCAG AGCGCGATCG ACGACGTCAG CGCGGCGGGT
GGCGGGAAGG TGACGCTGCC CGCGCCGAGG ACGTTCCTGT CCGGCGACGT CTCGGTCAAG
AGCAACGTGA CGCTGGAGAT CGCCGGCGGC GCGACGCTGA AGATGAGCCA GAACCAGTCG
CACTGGGCGC ACAGACCGGT GCTCGGCCAC ATGATCGACG GCACGATCGA GTGGAACATC
GCGATGTACC GGAACTATCC GCTGATCCAT TCCGGCAACG CGAGAAACGT CACCGTGACG
GGCGGCGGCA CGATCGAGAT GACGCGCGCG CTGACCGACG AGACGACGAT CCACGCGATG
GGGATCGGCT TCCACAAGGT CGACGGGTTC AGAATCTCCA ACCTGACGAT GATCGGCGCG
AGCGCGAGCA ACGTCTCGCT CTACACCGTC GACAACGGGA TCGTCTCCGG GATGACGATG
AGAGACGACC TCGACACCAA CGTCGAGGGC GTCGCGATCG GCAACTCGCA GCACGTGCGC
GTCACCGGCA ACACGATGGA CGCGACCAGC GACGACACGA TCGTGCTGTG GACGCAGTGG
AACGACCCGC GCGGCACGAC GTGGTGGTCG ACGGCCGTGC CCGAGCCGAT CAACGACATC
GAGATCGACA ACAACTACGC GACCTCGACC TGCTGCAAGG CGATCGCGCT GATCCCCTGG
GGCACCGCGT ACGGCGACCA GCGCCAGGTC GAGATGAGCG ACGTGCGCGT CCACGACAAC
ACGCTCGCGG CGACCGACGC GGTCGGCTGC TGGTGCGACG ACCCGTACAC CGGCGAGCCG
CCGTCGAGAT TCACCAACCA GGAGCAGGAC CAGGCGCCGA TGAAGGACCT GACGTTCGCC
AGAAACAGAT ACACCGGATC GACCGCGCTG ACGAAGGCGC ACATCCAGAA CCTCGTCGCC
GACTTCGCCA AGACGAGCCC GACGTTCATC CGCAACGGCG GCTTCGAGCG CACCGGCCAC
GCGTACTGGT CGCGCGTGAG CAGAGCCGGC CAGTCCGACG CCGCCGACTA CTCCGTCGGC
CAGAACGGCT CGTGGTACGG CTACATCCAG TACTTCGACG TCGGCTACAC GACGCTCTAC
CAGGGCGTGT CGCTGACCGG CGGGCAGACG TACACGCTGC GCGCGCGGAT CCAGACACCG
GACGGGAGCC CGGCGCGGAT GTATGTGTAC GACACGTGCG GCGGCACGGG CGTGACGCAG
AACGTCAGCT CGCTCGCGTG GACCGACGTC AGCCTCAGAT TCACCGCGGC GTCGAGCTGC
GGCCGGTACA TAGTCGGGTT CGACTCGGGC ACGTTCACCT CAGGCGGCCT GCGCGTCGAC
GACGTGCTGC TCGAAGGGCC GCGGATCGAC AACGACGACC CGGTGATCTC GTACGAGGGC
CTCTGGTACC TCTACGAGCG CGCCGGCGAC CACGGCGGGA CGGAGCGGAT GGCGACGCAG
GCCGGCACCA GATCGAGCGC GACCGTCACG TTCCGTGGCA CGCGCGCGAG AGTGCTCGCC
GTCCGCGGCG GCGACCGCGG CAGAGCCGAC GTCTACCTCG ACGGCGTCTA CAAGACGACG
ATCGACCAGT ACAGCCCGAC GACCGACCTG CAGTACGTCA CGTACGACAC CGGCACGGTC
GCGGCCGGGA CGCACGAGCT GAGAGTGGTG CCGACCTGGA CCAAGCACCC CGCGTCGGCG
AACACGGTGA TCTCGGTCGA CGCGGTCGAG GTCGTACGGT AG
 
Protein sequence
MRRLLVSGLV AAVTACLALP AAASALDVDV TAAPYGAAGD GVTNDRAAIQ SAIDDVSAAG 
GGKVTLPAPR TFLSGDVSVK SNVTLEIAGG ATLKMSQNQS HWAHRPVLGH MIDGTIEWNI
AMYRNYPLIH SGNARNVTVT GGGTIEMTRA LTDETTIHAM GIGFHKVDGF RISNLTMIGA
SASNVSLYTV DNGIVSGMTM RDDLDTNVEG VAIGNSQHVR VTGNTMDATS DDTIVLWTQW
NDPRGTTWWS TAVPEPINDI EIDNNYATST CCKAIALIPW GTAYGDQRQV EMSDVRVHDN
TLAATDAVGC WCDDPYTGEP PSRFTNQEQD QAPMKDLTFA RNRYTGSTAL TKAHIQNLVA
DFAKTSPTFI RNGGFERTGH AYWSRVSRAG QSDAADYSVG QNGSWYGYIQ YFDVGYTTLY
QGVSLTGGQT YTLRARIQTP DGSPARMYVY DTCGGTGVTQ NVSSLAWTDV SLRFTAASSC
GRYIVGFDSG TFTSGGLRVD DVLLEGPRID NDDPVISYEG LWYLYERAGD HGGTERMATQ
AGTRSSATVT FRGTRARVLA VRGGDRGRAD VYLDGVYKTT IDQYSPTTDL QYVTYDTGTV
AAGTHELRVV PTWTKHPASA NTVISVDAVE VVR