Gene Cwoe_2049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_2049 
Symbol 
ID8732492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp2151634 
End bp2153367 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content73% 
IMG OID646502668 
ProductX-Pro dipeptidyl-peptidase domain protein 
Protein accessionYP_003393850 
Protein GI284043510 
COG category[R] General function prediction only 
COG ID[COG2936] Predicted acyl esterases 
TIGRFAM ID[TIGR00976] putative hydrolase, CocE/NonD family 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.641906 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCGC CGATCCGCAT CGATCGCGAC GTCGAGATGG CGATGCGCGA CGGCGTCGCG 
CTGCGCGGCG ACGTCTGGCG CGTCGACGAC GAGACGCCGC GCCCGGCGCT GGTGCTGCGC
ACGCCCTACG ACCGCGCGAA CACGAACAGC GACCTGCTGC GCCCGCTCGA CGCGGCGACC
GCCGGCTACG CCTGCGTCGT CCAGGACACG CGCGGGCGCT ACGGCTCCGA CGGCGACTGG
GACATCCTGA TGTGGGAGCA GGAGGCGCGC GACGGCTACG ACACGATCGA GTGGGCGGCC
GCGCAGCCAT GGTGCGACGG CAACGTCGGC ACGTTCGGCG CCTCCTACCT CGGCATCGTG
CAGTGGATGA GCGCGGGCGA GCGGCCGCCG CACCTGCGCG CGATGGCGCC GGCGATGACG
ACCAGCGGCG AGCTGGAGGC GCTGGAGACC GGTGGCGCGC TGCGGCTCAA CCACGTCGTC
TGCTGGCTCG CCTACATGAC GCTCGACTGG CTCGGCAAGC AGCTGGCGGC CGGTCGGCCC
GTCGACCCGG CCGCGGTCCC ACGCCTGATG GAGCTGGTCG GCGACGCCAG TCCCGCGCTC
GAGCACCTGC CGCTCGGCGA GATCCCGCAC TTCGACTTCC CCGACTTCCC GCTTCCGCTG
CGCACGCTGC TGCAGCCGGG GCTCGGCATC GCGCAGCGGT TCGACTACGA GCGGATCGAC
GCGCCGACCC TGTCCGTCGT CGGCTGGTAC GACTTCCTCT GCACGGCGAC GATCGAGAGC
CACATGCGGC TCGTGGAACG CGGCGGGGGC GGCGCCGACG CACGTGCGCG CCACCGCCTG
ATCGTCGGGC CGTGGATCCA CGACGGCCGC CTGCCGGGAT TGCAGGGCGA GCTGAACTTC
GGCGTCGGCG ACGGTCCGTT CGCCGGCATC CATCGCCAGC ACCTGCAGTT CTTCGACCAC
CATCTGAAGG GCGCGGCCGA GCCGCTCGCG TCCGTCCAGT ACTTCCTGAT GGGCGCCGAC
GAGTGGCGTG CTGCGGAGGC GTGGCCGCCG CCGGAGGCCG CCGCCGAGAC GTGGCTGCTC
GCGAGCGGCG GCGCGGCCAA CACCGCTGAC GGCGACGGGA CGCTGGCGCC GGAGCGGCCC
GCCGGAGGCG CCGAGCAGGA TCGCTTCGCG TACGACCCCG CCGACCCGGT GCCGACCCAC
GGCGGCCGCA CGCTGCCGAT GGGGACGCAG ATCGCGGGCC CGTTCGACCA CGCGCGGGTC
GAGTCGCGCG CCGACGTGCT CTGCTACTCG TCGGAGCCGC GCGCCGAGCC GCTCGACCTC
GCCGGGCCGG TGTCGGTCCG CCTGTTCGCG GCCTCCAGCG CGCGCGACAC CGACTTCGTC
GTGCGGCTGC TCGACGTCGA CCCGCAGGGT CGCGCGATCC CGTTCGCGGA GGGCATCCAG
CGGGCGCGCT TCCGCAACGG GCTCGGCGAC GAGGTGCTGC TGGAGCCCGG CGCGGTCGAG
GAGTACGCGA TCGCGCTCGG TCACACGGCG TGGCGGGTCC GTCCGGGCCA CCGTCTGCGA
CTCCACGTCA CGAGCAGCAG CTTCCCCGCG TTCGATCGCA ACATGAACAC GGGTGGCCCG
GTCGGCGACG ACGCCGCGGG CGTCGTCGCC CAGCAGACGG TCCTGCACAG CGCGGCGCAT
CCGTCGGCGC TGATCGTTCA CACAATCCGA CAGGTGGTAC CTGATGGCCG TTGA
 
Protein sequence
MTPPIRIDRD VEMAMRDGVA LRGDVWRVDD ETPRPALVLR TPYDRANTNS DLLRPLDAAT 
AGYACVVQDT RGRYGSDGDW DILMWEQEAR DGYDTIEWAA AQPWCDGNVG TFGASYLGIV
QWMSAGERPP HLRAMAPAMT TSGELEALET GGALRLNHVV CWLAYMTLDW LGKQLAAGRP
VDPAAVPRLM ELVGDASPAL EHLPLGEIPH FDFPDFPLPL RTLLQPGLGI AQRFDYERID
APTLSVVGWY DFLCTATIES HMRLVERGGG GADARARHRL IVGPWIHDGR LPGLQGELNF
GVGDGPFAGI HRQHLQFFDH HLKGAAEPLA SVQYFLMGAD EWRAAEAWPP PEAAAETWLL
ASGGAANTAD GDGTLAPERP AGGAEQDRFA YDPADPVPTH GGRTLPMGTQ IAGPFDHARV
ESRADVLCYS SEPRAEPLDL AGPVSVRLFA ASSARDTDFV VRLLDVDPQG RAIPFAEGIQ
RARFRNGLGD EVLLEPGAVE EYAIALGHTA WRVRPGHRLR LHVTSSSFPA FDRNMNTGGP
VGDDAAGVVA QQTVLHSAAH PSALIVHTIR QVVPDGR