Gene Cwoe_5076 details


Gene Information

Locus tag: Cwoe_5076
Symbol:
ID: 8735542
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 5424440
End bp: 5426581
Gene Length: 2142 bp
Protein Length: 713 aa
Translation table: 11
GC content: 75%
IMG OID: 646505701
Product: Glycoside hydrolase family 42 domain protein
Protein accession: YP_003396860
Protein GI: 284046520
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1874] Beta-galactosidase
TIGRFAM ID:
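
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence is complete and ends in a single stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5424440
END_BP = 5426581
GENE_LENGTH_BP = 2142
PROTEIN_LENGTH_AA = 713


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 5426581 - 5424440 + 1 gives 2142 bp, and 2142 / 3 - 1 gives 713 aa, matching the listed gene and protein lengths.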


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.946735
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.386552
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATCCTCG CGACGCAGTA CCACCGCCCG CCGTTCCCGC GCCGCGACCG CTGGCGCGAC 
GACCTCGCGC GGATCCGCGC GACCGGCTTC GACACGATCG TGCTGACCGC GCCGTGGGCG
TGGATCGAGC CGGAGCCGGG CGCGTACGAC TTCGCCGACC AGGACGAGCT GGTCGCGCTC
GCGGGCGAGA TCGGGCTGAA GGTCGTGATC AACCTCTGGA CCGAGCTGCA GCCGGTCTGG
ATCGAGCGCG AGCTGCCCGA CGCCCGGCTC GTCGACCACA CCGGCCGGCC GGTCGTCTCC
TCGCCGCTCG CCTACGCGCA GTTCGGCGTG ATGCCCGGCG GCTGCACCGA CCACCCGGGC
GTGCGCGAGC GCGCGAGTGC GTTCATGACC GCGACGGCGG AGCGCTTCGC CGGCGCCGAG
AACCTGCTGC TGTGGGACTG CTGGAACGAG ATCCGCTGGA TGACGCAGGC CGACGGGCAC
GTCTGCCACT GCGAGCACAC CGTCGCGCGC TTCCGCGACT GGCTGCGCGA GCGCCACGGC
GACCTCGACG GCCTCAACGC CGCCTGGCAG CGCCGCTACC GCTCGTGGGA CGACGTTGCG
ATGGCGAAGC TGCCGACGCG CACCTACACC GACGTGATGG CCTACCAGGC GTTCCTGACG
CGCCGCGCCG CGGTCGACCT GCGCTGGCGT CGCGACGCGG TCCACGCCGG CGACTCTTCG
CGCCCGATCC TCGCGCACAC GGCGTTCCCG TCGGCGTTCT CCAGCGGCGA ATGGTTCGAG
TACGAGCCGG CGCTCGCGCG CGGCAACGAC TTCGAGCTGG CCGACCAGGT CGACGGGCTC
GGCTCCTCGC ACTTCCCGGC GTTCATCCAC ACGAGCGCGG TCGAGTACGC GACGCGGCTG
GAGGCGAGCC GCAGCTCGGC CGGCGGCTTC AACTGGATCG CCGAGCTGCA GGGCGGCGCG
GCCGGCCACG GCCTGCAGCC GATGCGAGCG GTCCCCGGCC GTCTGCAGGC GCGCTGGGTC
TGGAACGGGA TCGCGCGCGG CGCCAAGGCG GTCAGCTTCT GGTGCTGGCG CGACGAGGTC
TTCGGGCGCG AGGCGGCCGG CTTCGGGATC GTCGGCGACG ACGGCCACCG CGACGAGCGG
CTCGCCGAGC TGCGCAGAAC GGCGGACCTG CTGGAGGCGC ACGGTCCGCT GCTCGACGCG
TACGCGCCCG CGCCCGCGCG CGTGGGGGTC GTGCTCGAAC CGAGCGCGTA CCAGCTCGAC
TGGGCCGGCT CCGGCAGAAC GGGCGGGTTG AAGGTCGGGG CCGGCAGCGG CTACCAGGCC
GCCCACGACC TCCAGGGCCA TCTGCTCGCG CTGGAGCGGC TGCAGATCCC CTACGACGTC
GTCGACCCGT CGCACGCGCA CGACCTGTCC GGCTACGCGC TGCTGGTGAT GCCATGGCCG
CTCGTCGTCG ACCCCGCCTT CGGCGAGCGT GTGCTCGCCT GGACGCGCGC CGGCGGCACG
CTGCTGACGG GGGCGGAGCT GGACGCGTTC GACGCCGCCG GTCTCTACCG CTACCAGGAC
GAGCGCCCGT TCGCGAACGC GCTCGGCCTG CGCGGCAGCG GGCTGCGTCA GCCGGACGGC
CGCGCGCTCG AGTACGAGCT GGACGGGACG CGCGGCGAGC TGCGCACCGC GACCTGGGTC
GTGCCGCAGG ATGCCGCGGT GAGCGCCGGC GCGGACGTGC TCGCGGCCGA CGAGCGCGGC
GCGACGGTCG TGCGCCGCGC GGTCGGCGAC GGGCACGTCG TGGCGGTCGG CACCAACGCC
GGGCTCGCCT ACTTCGAGCA GCGCGACGCC GGCTTCGAAC GGTTCCTGCG GACGCTCGCG
GAGGGCGCCG GCGCGCTGGC GCCGCTGCGC TGCTCGATCG AGGACGGCGA GCGGGTGCAG
TGGCGCCACG GCGCCGCGGG CGAGCACGGC GAGCTGCTGT TCGTGATCAA CGAGGGCGCG
GCGGCCGACG TCACGTTCGA CTGGCCGGCG GATCGGCTGC CGGCCGCGGC GGCGCACGAC
CTGACGAGCG GGACGGAGCT TGAGCTGGCG CGCGACGGCG AGCGGCTGAC GCTGCGCCTG
CCGCTGCGGG AAGAGGGCTA CCACGTGGTC CGGCTCACCT AG
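
A GC content figure such as the one listed for this gene is the fraction of G and C bases in the nucleotide sequence. The following is a minimal sketch of that calculation, assuming the sequence is pasted as text with the 10-base blocks separated by whitespace, as in this record. The helper name `gc_fraction` is hypothetical.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return gc_count / len(cleaned)


if __name__ == "__main__":
    # First two 10-base blocks of the gene sequence above (12 of 20 bases are G or C).
    print(gc_fraction("GTGATCCTCG CGACGCAGTA"))
```

Passing the full gene sequence to the same function would give the value for the whole gene; the short input here only demonstrates the calculation.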
 
Protein sequence
MILATQYHRP PFPRRDRWRD DLARIRATGF DTIVLTAPWA WIEPEPGAYD FADQDELVAL 
AGEIGLKVVI NLWTELQPVW IERELPDARL VDHTGRPVVS SPLAYAQFGV MPGGCTDHPG
VRERASAFMT ATAERFAGAE NLLLWDCWNE IRWMTQADGH VCHCEHTVAR FRDWLRERHG
DLDGLNAAWQ RRYRSWDDVA MAKLPTRTYT DVMAYQAFLT RRAAVDLRWR RDAVHAGDSS
RPILAHTAFP SAFSSGEWFE YEPALARGND FELADQVDGL GSSHFPAFIH TSAVEYATRL
EASRSSAGGF NWIAELQGGA AGHGLQPMRA VPGRLQARWV WNGIARGAKA VSFWCWRDEV
FGREAAGFGI VGDDGHRDER LAELRRTADL LEAHGPLLDA YAPAPARVGV VLEPSAYQLD
WAGSGRTGGL KVGAGSGYQA AHDLQGHLLA LERLQIPYDV VDPSHAHDLS GYALLVMPWP
LVVDPAFGER VLAWTRAGGT LLTGAELDAF DAAGLYRYQD ERPFANALGL RGSGLRQPDG
RALEYELDGT RGELRTATWV VPQDAAVSAG ADVLAADERG ATVVRRAVGD GHVVAVGTNA
GLAYFEQRDA GFERFLRTLA EGAGALAPLR CSIEDGERVQ WRHGAAGEHG ELLFVINEGA
AADVTFDWPA DRLPAAAAHD LTSGTELELA RDGERLTLRL PLREEGYHVV RLT
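
The gene sequence begins with GTG while the protein sequence begins with M. This is consistent with translation table 11, in which GTG can act as a start codon and is then read as methionine. The sketch below illustrates the relationship on the first and last codons of this record. It is a simplified, assumption-laden example, not the database's own pipeline: it treats only ATG, GTG, and TTG as initiators (table 11 lists further, rarer ones), and the function name `translate_cds` is hypothetical.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional TCAG ordering.
# Table 11 uses the same assignments as the standard code; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplification: only the common bacterial initiators are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(sequence: str) -> str:
    """Translate a complete CDS, reading an initiator codon as M and stopping at '*'."""
    cleaned = "".join(sequence.split()).upper()
    if len(cleaned) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for index in range(0, len(cleaned), 3):
        codon = cleaned[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First ten codons of the gene sequence in this record.
    print(translate_cds("GTGATCCTCG CGACGCAGTA CCACCGCCCG"))  # MILATQYHRP
```

The output matches the first ten residues of the protein sequence above. The final codons of the gene, CGG CTC ACC TAG, translate the same way to RLT followed by a stop, matching the end of the protein.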