Gene Information |
Locus tag | Cwoe_3347 |
Symbol | |
ID | 8733796 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 3559037 |
End bp | 3560749 |
Gene Length | 1713 bp |
Protein Length | 570 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 646503964 |
Product | X-Pro dipeptidyl-peptidase domain protein |
Protein accession | YP_003395140 |
Protein GI | 284044800 |
COG category | [R] General function prediction only |
COG ID | [COG2936] Predicted acyl esterases |
TIGRFAM ID | [TIGR00976] putative hydrolase, CocE/NonD family |
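The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start/End values are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and do not belong to any IMG or NCBI tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(3559037, 3560749)
print(length_bp, protein_length(length_bp))
```

Under those assumptions the result agrees with the listed Gene Length (1713 bp) and Protein Length (570 aa).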
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0310858 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCCGT TCGCGCCACC GCTGCCGCCC GATCCCGTCG GGCGCTCGCT CCACGTCGAG CTGAGCGATG GCGTGCGGCT CGCGGTCGAC GTCTGGCTGC CGGCCGGGCT GCCCGACGGC GAGCGGATCG GGACGGTGCT GCGCGCGACG CGCTATCACC GCGCCGACGA AAGCGACGGC GTCACCGCCG AGGCGCGCCG CTGGACCGGC TGGGGCTACG CGCTCGTGCT GGTCGACGCG CGCGGCAGCG GCGCCTCGTT CGGCTCGCGC GACGCCGAGT TGTCGCGTCG CGAGATCGAG GACTACGGCG AGGTGCTCGA TTGGATCGCC CGGCAGCCGT GGTCCAACGG CCGCGCCGGC GCCTACGGCC ACTCCTACGA CGCCGACACC GCCGAGCTGA TGGCGTCGCT CGGCAACCCG GTGCTGCGCG CGGTCGCGCC GCTGTTCCCC GACTACGACG TCTACGAGGA CCTGATGGTG CCGGGCGGCG TGCCCAACCG GCTGATGACG GACAGCTGGC TGCACATGAC GCGCGCGCTC GACGGGATCG ACGGCGCGCT GGAGGAGGTC GCCGCGCTCG GCGAGACGAT CGCGACCGAG ATCGCGCCGG TCAAGCCGGT CGACGGGCCG GACGGGCCGG CGCTGCGCGA GGCCGCGATC CGCGAGCATC AGGCGAACGC CGACCTGCGC GCGGCGATCG CGCGGACGCC GTTCAAGGAC GACGTCGACG GCGGCTGGAG CTGGGCGGCC GCGTCGCCGC AGACGTACCG GGAGGCGACC GAGCGCGCCG GCGTGCCGAC GATGCCCGTC GCGAGCTGGT TCGACGCCGG TACCGCGGCC GGCACGCTCA CGCGCTTCGC GGTGCTCGAC GTGCCGCAGG AGGCGTACGT CGGCGCCTGG AGCCATGGCG CCAAGTACAC CTGCGACGCG TTCCGCGCCG AGGACGAGCG CAGTGAGTTC GAGGAGGCGG AGGTCTACCG CCGCGTGCGC GACTTCTTCG ACCGCTACGT GCAGCGCGGC GAGGAGCCGC GCCCCGGCCG CCGGCTGCAC TACCTCACGC TCGCGAGCGG CGAGTGGGCG ACGACCGAGA CGTGGCCGCC GCGCGGCACC GCGACGACGC GCTGGTACCT CGGCGCCGAG GGCGCGCTGA CGCAGGAGCC GCCCGCGCAG GAGGTGGCGG TCGACGTCTA CCGCCCGGAC CCGTCCGCGA CCGCCGGCGC GAGCAGCCGC TGGGGCACGC AGGTCAGCGG CGGCGGCGCG GTCGTCTACC CCGACCGCGC GGCGCAGGAC GCCCGGCTGC TGGCGTACAC GAGCGCCCCG CTGGAGCGTG ACGCGCACGT CGCCGGAACC GTCGCGATCG TGCTGGAGCT GTCCTCCTCG CAGCCCGACG GGACGCTGTT CGCCTATCTG GAGGACGTCG CGCCGAACGG CCGCGTGACC TACCTGACCG AAGGTCAGCT GCTGCTGTCC CAGCGCGCGC GCAGCTTCGC CCGCGCCGAC GCGCGGCCGC TGCAGCCGGG CACGGTCGAG ACCGTCCGGA TCGAGCTGTT CCCGGTCTCG GCGGTCGTGC GCGCCGGGCA CCGGCTGCGG ATCGCGCTCG CCTCCAACGA CGCCAGCCAC TTCGAGACGG TCCCGGCCGA CGGCGAGGTG ACGTACGAGG TGCACCGCGA ACGCGACCGG CCGTCGTGGG TCGAGGTGCC GGTCGCGCCG TGA
|
Protein sequence | MSPFAPPLPP DPVGRSLHVE LSDGVRLAVD VWLPAGLPDG ERIGTVLRAT RYHRADESDG VTAEARRWTG WGYALVLVDA RGSGASFGSR DAELSRREIE DYGEVLDWIA RQPWSNGRAG AYGHSYDADT AELMASLGNP VLRAVAPLFP DYDVYEDLMV PGGVPNRLMT DSWLHMTRAL DGIDGALEEV AALGETIATE IAPVKPVDGP DGPALREAAI REHQANADLR AAIARTPFKD DVDGGWSWAA ASPQTYREAT ERAGVPTMPV ASWFDAGTAA GTLTRFAVLD VPQEAYVGAW SHGAKYTCDA FRAEDERSEF EEAEVYRRVR DFFDRYVQRG EEPRPGRRLH YLTLASGEWA TTETWPPRGT ATTRWYLGAE GALTQEPPAQ EVAVDVYRPD PSATAGASSR WGTQVSGGGA VVYPDRAAQD ARLLAYTSAP LERDAHVAGT VAIVLELSSS QPDGTLFAYL EDVAPNGRVT YLTEGQLLLS QRARSFARAD ARPLQPGTVE TVRIELFPVS AVVRAGHRLR IALASNDASH FETVPADGEV TYEVHRERDR PSWVEVPVAP
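Both sequences are displayed in space-separated blocks of 10 characters, so the spaces need to be removed before any downstream use. As a rough sketch, the helpers below strip that formatting and compute the G+C fraction of a nucleotide string. The names are hypothetical, and whether the record's GC content field was derived in exactly this way is an assumption.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display whitespace (10-character blocks) and upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two display blocks of the gene sequence, used only as sample input.
sample = clean_sequence("ATGAGCCCGT TCGCGCCACC")
print(len(sample), gc_fraction(sample))
```

Passing the full gene sequence through `clean_sequence` first also makes it possible to confirm that its length matches the Gene Length field.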