Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_2049 |
Symbol | |
ID | 8732492 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 2151634 |
End bp | 2153367 |
Gene Length | 1734 bp |
Protein Length | 577 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 646502668 |
Product | X-Pro dipeptidyl-peptidase domain protein |
Protein accession | YP_003393850 |
Protein GI | 284043510 |
COG category | [R] General function prediction only |
COG ID | [COG2936] Predicted acyl esterases |
TIGRFAM ID | [TIGR00976] putative hydrolase, CocE/NonD family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.641906 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCCGC CGATCCGCAT CGATCGCGAC GTCGAGATGG CGATGCGCGA CGGCGTCGCG CTGCGCGGCG ACGTCTGGCG CGTCGACGAC GAGACGCCGC GCCCGGCGCT GGTGCTGCGC ACGCCCTACG ACCGCGCGAA CACGAACAGC GACCTGCTGC GCCCGCTCGA CGCGGCGACC GCCGGCTACG CCTGCGTCGT CCAGGACACG CGCGGGCGCT ACGGCTCCGA CGGCGACTGG GACATCCTGA TGTGGGAGCA GGAGGCGCGC GACGGCTACG ACACGATCGA GTGGGCGGCC GCGCAGCCAT GGTGCGACGG CAACGTCGGC ACGTTCGGCG CCTCCTACCT CGGCATCGTG CAGTGGATGA GCGCGGGCGA GCGGCCGCCG CACCTGCGCG CGATGGCGCC GGCGATGACG ACCAGCGGCG AGCTGGAGGC GCTGGAGACC GGTGGCGCGC TGCGGCTCAA CCACGTCGTC TGCTGGCTCG CCTACATGAC GCTCGACTGG CTCGGCAAGC AGCTGGCGGC CGGTCGGCCC GTCGACCCGG CCGCGGTCCC ACGCCTGATG GAGCTGGTCG GCGACGCCAG TCCCGCGCTC GAGCACCTGC CGCTCGGCGA GATCCCGCAC TTCGACTTCC CCGACTTCCC GCTTCCGCTG CGCACGCTGC TGCAGCCGGG GCTCGGCATC GCGCAGCGGT TCGACTACGA GCGGATCGAC GCGCCGACCC TGTCCGTCGT CGGCTGGTAC GACTTCCTCT GCACGGCGAC GATCGAGAGC CACATGCGGC TCGTGGAACG CGGCGGGGGC GGCGCCGACG CACGTGCGCG CCACCGCCTG ATCGTCGGGC CGTGGATCCA CGACGGCCGC CTGCCGGGAT TGCAGGGCGA GCTGAACTTC GGCGTCGGCG ACGGTCCGTT CGCCGGCATC CATCGCCAGC ACCTGCAGTT CTTCGACCAC CATCTGAAGG GCGCGGCCGA GCCGCTCGCG TCCGTCCAGT ACTTCCTGAT GGGCGCCGAC GAGTGGCGTG CTGCGGAGGC GTGGCCGCCG CCGGAGGCCG CCGCCGAGAC GTGGCTGCTC GCGAGCGGCG GCGCGGCCAA CACCGCTGAC GGCGACGGGA CGCTGGCGCC GGAGCGGCCC GCCGGAGGCG CCGAGCAGGA TCGCTTCGCG TACGACCCCG CCGACCCGGT GCCGACCCAC GGCGGCCGCA CGCTGCCGAT GGGGACGCAG ATCGCGGGCC CGTTCGACCA CGCGCGGGTC GAGTCGCGCG CCGACGTGCT CTGCTACTCG TCGGAGCCGC GCGCCGAGCC GCTCGACCTC GCCGGGCCGG TGTCGGTCCG CCTGTTCGCG GCCTCCAGCG CGCGCGACAC CGACTTCGTC GTGCGGCTGC TCGACGTCGA CCCGCAGGGT CGCGCGATCC CGTTCGCGGA GGGCATCCAG CGGGCGCGCT TCCGCAACGG GCTCGGCGAC GAGGTGCTGC TGGAGCCCGG CGCGGTCGAG GAGTACGCGA TCGCGCTCGG TCACACGGCG TGGCGGGTCC GTCCGGGCCA CCGTCTGCGA CTCCACGTCA CGAGCAGCAG CTTCCCCGCG TTCGATCGCA ACATGAACAC GGGTGGCCCG GTCGGCGACG ACGCCGCGGG CGTCGTCGCC CAGCAGACGG TCCTGCACAG CGCGGCGCAT CCGTCGGCGC TGATCGTTCA CACAATCCGA CAGGTGGTAC CTGATGGCCG TTGA
|
Protein sequence | MTPPIRIDRD VEMAMRDGVA LRGDVWRVDD ETPRPALVLR TPYDRANTNS DLLRPLDAAT AGYACVVQDT RGRYGSDGDW DILMWEQEAR DGYDTIEWAA AQPWCDGNVG TFGASYLGIV QWMSAGERPP HLRAMAPAMT TSGELEALET GGALRLNHVV CWLAYMTLDW LGKQLAAGRP VDPAAVPRLM ELVGDASPAL EHLPLGEIPH FDFPDFPLPL RTLLQPGLGI AQRFDYERID APTLSVVGWY DFLCTATIES HMRLVERGGG GADARARHRL IVGPWIHDGR LPGLQGELNF GVGDGPFAGI HRQHLQFFDH HLKGAAEPLA SVQYFLMGAD EWRAAEAWPP PEAAAETWLL ASGGAANTAD GDGTLAPERP AGGAEQDRFA YDPADPVPTH GGRTLPMGTQ IAGPFDHARV ESRADVLCYS SEPRAEPLDL AGPVSVRLFA ASSARDTDFV VRLLDVDPQG RAIPFAEGIQ RARFRNGLGD EVLLEPGAVE EYAIALGHTA WRVRPGHRLR LHVTSSSFPA FDRNMNTGGP VGDDAAGVVA QQTVLHSAAH PSALIVHTIR QVVPDGR
|
| |