Gene Cwoe_0159 details


Gene Information

Locus tag: Cwoe_0159
Symbol:
ID: 8730587
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 156079
End bp: 157707
Gene Length: 1629 bp
Protein Length: 542 aa
Translation table: 11
GC content: 75%
IMG OID: 646500773
Product: Cellulase
Protein accession: YP_003391970
Protein GI: 284041630
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
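
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the helper name `cds_lengths` is hypothetical, not part of any database API):

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the last codon is a stop codon and is not translated.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


print(cds_lengths(156079, 157707))  # (1629, 542)
```

Both values agree with the Gene Length and Protein Length fields in this record.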


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCGCC TCGCCCCCCT CATCCTCGCC GGCCTGCTCC TCGCCTCGCT CGGCGCCGCG 
ACCGCCCATG CGGGTGGCCG GATGATCTCC ACCCCGCTCT CGACGAAGGG CGCGCGGATC
GTCGACGCCG GCGGCCGCAC GGTCGTGCTG CAGGGGGTCA ACTGGTTCGG CTTCGAGACC
GCCAACCATC TCGTCCACGG GCTGTGGGCG CGCGACTACA GAGACGTCCT CGCGCAGGTC
CGCCGGCTCG GCTTCAACAC GATCCGCCTG CCGTTCTCGC TGGAGGCGAT CCGCTCGACG
GCACCGGTCT CCGGCGCCGA CTTCTCCGGC GGGCGGAACG CCGCGCTGAA GGGCGCGACG
CCGCTCGAGG CGATGGACGC GGTGGTCGAG GAGGCCGGCC GCCAGGGCTT GCTGATCCTG
CTCGACAACC ACTCGCACGC GGACGACGCC TACCAGCAGG GCCTCTGGTA CGGCCAGGGC
TTCAGCGAGG ACGACTGGGT CGCGACCTGG AAGCGGCTCG CCGCTCGCTA CCGCGACCAG
CGCAACGTGA TCGGCGCCGA CCTCAAGAAC GAGCCGCACG CCGAGGCGAC GTGGGGCACC
GGCGGGCCGA CCGACTGGCG CCGCGCGGCC GAGCGCGCCG GCAACGCGGT GCTGTCGGTC
GCGCCGCAGT GGCTCGTCGT GGTCGAGGGC GTCGGCGGCG GCGCGCCGGT CCCCGGCCAG
CGGCTGGACA CGCACTGGTG GGGCGGAAAC CTCGAAGGGG TCCGCACGCA TCCGGTGCGG
CTCGACCGCG CCAACCGGCT CGTCTACTCG CCGCACGAGT ACGGGCCGGG CGTCTTCCCA
CAGCCGTGGT TCGGCAAGCC GAACACGCCG GCGCTGCTGG AGGAGCGTTG GAGAACCGGC
TTCGGCTTCA TCGCCGAGCA GGGGATCGCG CCGATCCTCG TCGGCGAGTT CGGCGGTCGC
AACGTCGACC GGGAGAGCGC CGAGGGCCGC TGGCAGCGGC AGTTCTTCGA CTTCATCGGC
CGCACCGGCG CGTCGTGGAC GTACTGGGCG CTGAACCCGA ACTCGGGCGA CACCGGAGGC
GTGCTGAAGG ACGACTGGTC GAGCGTGCAG CCGGCGAAGA CCGCGCTGCT CCAGCGGATG
ATCGCGCGCC AGCGGATCGC GTTCCGCGGC AGCGGCGCCG TCTTCACCGC TCCGCGCCGC
GCGACGACCC CGAGACGGGG CGGGAAGGCG GCGCCGAAGA CGCCGGCGAG ATCGCAGACG
GCGGCCCCGA CGCAGCCGAG CGCGCCCGCG CAGCCGCCCG CACAGCCGCC CGCCGACGAC
GCGCCCGGCC CGCCGGCGCC CGGCTCGCTG AGCGCGCGCG TCGTCGTCGA GAACCGCTGG
GACGCCGGCT GGTGCGGTCA CCTCGAGGTC AGCGGACCGG ACGCCACGCT CGCGGCGGCA
CGCGCGACGC TGACGCTCCC GCCCGGCACG CGGATCGCGC AGTCGTGGAA CGCGCAGCGC
TCGGGCGACG GCGGCCGCGT CGAGCTGCGC TTCCCGGCGT GGGCGAAGGT CGCCGGTGGC
GCGCCGTACG CGGCGACCGG CTTCTGCGTC GACGGCTCGG GCGAGGCCGC CGACGTGACC
GTCGGCTGA
 
Protein sequence
MRRLAPLILA GLLLASLGAA TAHAGGRMIS TPLSTKGARI VDAGGRTVVL QGVNWFGFET 
ANHLVHGLWA RDYRDVLAQV RRLGFNTIRL PFSLEAIRST APVSGADFSG GRNAALKGAT
PLEAMDAVVE EAGRQGLLIL LDNHSHADDA YQQGLWYGQG FSEDDWVATW KRLAARYRDQ
RNVIGADLKN EPHAEATWGT GGPTDWRRAA ERAGNAVLSV APQWLVVVEG VGGGAPVPGQ
RLDTHWWGGN LEGVRTHPVR LDRANRLVYS PHEYGPGVFP QPWFGKPNTP ALLEERWRTG
FGFIAEQGIA PILVGEFGGR NVDRESAEGR WQRQFFDFIG RTGASWTYWA LNPNSGDTGG
VLKDDWSSVQ PAKTALLQRM IARQRIAFRG SGAVFTAPRR ATTPRRGGKA APKTPARSQT
AAPTQPSAPA QPPAQPPADD APGPPAPGSL SARVVVENRW DAGWCGHLEV SGPDATLAAA
RATLTLPPGT RIAQSWNAQR SGDGGRVELR FPAWAKVAGG APYAATGFCV DGSGEAADVT
VG
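
The protein sequence can be cross-checked against the gene sequence by translation. The following is a rough sketch, assuming the standard amino acid assignments that translation table 11 shares with the standard code; it does not handle the alternative start codons that table 11 allows, which is not an issue here because this gene begins with ATG. The function names `clean`, `gc_percent`, and `translate` are hypothetical helpers written for illustration.

```python
# Codon table in NCBI order (bases T, C, A, G); amino acid assignments
# used by translation table 11 for non-initiator codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to lay out sequence blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCGCCGCC TCGCCCCCCT CATCCTCGCC"))  # MRRLAPLILA
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after `clean`) would check the whole record; `gc_percent` on the same input can be compared with the GC content field.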