Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_3979 |
Symbol | |
ID | 8734437 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 4220366 |
End bp | 4222978 |
Gene Length | 2613 bp |
Protein Length | 870 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 646504604 |
Product | protein of unknown function DUF608 |
Protein accession | YP_003395771 |
Protein GI | 284045431 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4354] Predicted bile acid beta-glucosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0113578 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCGACG AGCCGACCAA CTCCCGCTTC GCGGGCGACG CGCGCCGCGC CACGGCGCTG CCGCTCGGCG GCGTCGGCGC GGGCCACGTC GCGCTGCACG GCGACGGCTC GCTGCGCCAG TGGCAGCTCC ACGGCCGGCC GAATCACACC GCGTTCGCGC CCGGCGCGCT GTTCGCCGTG CGCGCCTGCT GCATCGAGCC GCCGGCCGAC GTGCGCCGGG TCCTGCAGAC CGCGCCGCCG CCGGCTGCCG CAGACCCCGC GCCGTGCGTG GACGACGACC ACGTTCCGGA CGGGCTGACC GATCCGCTCG CCGTCTGGCC GCCGATGGCG AGCAGCGCGC TGTCGGTCGC CTACCCGTTC GCGCGCGTGG CGTTCGACGA CCCCGAGCTT CCGCTGGCGG TCGAGCTGGA GGCGCACACG CCGTTCGTCC CGTTGGACGA CGCCGACAGC GGCCTGCCGC TCGTCGTCTT CAGTGCGCGG TTGCGCAACA GCGGCACGGA GCAGCTGCAC GGCTGGCTGC TCGCGACGCT GCCGAACCTG ATCGGCTGGG ACGGGCTGAC GGAGTTCCGC GACGGGCGCT GCAGCGTGCT CGGCGGCAAC GTCAACCGGA TCGTGCGCGA TGGCGAGCGG ACCGCGGTCG TGATGGACAA CCCGTCGCTC GCGGAGGACG ACCCGCGCTT CGGCGAGATG GCGCTGTCCT CGACGGCGCC GTGCGTCCCG CTGCCGCGCT TCGCCGATGT CGCCGACGCG CTGCGCATCG CGCAGACGCT GAAGCTGCTC GCGCCGAGCC AGCGGGGCGA CTTCTCGCCG GAGGCGGTGC GCGCGGCGGT GCGCGACCAC CAGCCGCCGC CGGGGCCGGT CGGCGCGAGC CCGGCCGGGA CGACCTGGGG CGGCGGGCTC GCGCTGCCGT TCGCGCTGGC GCCCGGGGAG GAGACGACGA TCGAGGTCGT GCACGCCTGG TGGTTGCCGA ACCGGCTCGC GGACTTCGAC CAGTTCGGCC CCGAGCGCGG CCTGAACCGC CACCGGCTGT GGCTCGGCAA CGCCTACGCC GAGCGGCACC GCGGCGTGCT GCAAGTGCTC GGCCACCACG CGCGCGAACG CGACGAGCTG CTGGCGGCGT CGCGTGCGTG GGCGCAGTCG ACCGCGACGG CCACGCTGCC GCAGCCGGTG CGGGAGCTGC TCGACGTGCA GGCGTCGCTG ATCCGGTCGC CGACGCTGAT GGTCACCGGC GACGGCCGCT GCTACGGCTT CGAGGGCGGC CTCGGCGCGT CGACGACGAA CTGGAACGGC GACTGGGGCG GCTCCTGCCC GCTCACGTGC ACGCACGTCT ACAACTACGA GCAGGCGCTC TCGCGGCTGT TCCCCCGCTT GGCGCGGACG ATGCGCGAGG TCGAGCTCGA CCACGTCCTC GGCGACGACG GCTCGCTGCC GCACCGCGTC GTGCTGCCGC TGTGGCTGCC CCAGCTGCAC GGGGTCGCGA TCGGCGGGCC GGCTGCGCCG GCGCTCGACG GGATGCTCGG CGCGGTCCTG AAGACCTATC GCGAGGCGCG GCAGGGCGGC GGCGAAGCGT GGCTCGCCGA CCGCTGGGAC GCGCTGGTGC GGCTGATGGA CCACGTCGAC CGCACCTGGA ACAGCGCGGG CGACGGCGTG CTGCGCGGCG AGCAGCCGTG CACGTACGAC ATCGCACTGC ACGGACCGAA TCTGCTCATC GGCGGCCTCT GGCTGGCGGC GCTGCGGTCG ATGGAGGAGA TCGCGCTGCG GCTCGGGGTC GCCGGCGCGG CCGGCTACCG CGTCCGCTTC GAGCAGGCGC GCGGCGGCTA CGAGAAGCTG TTCAACGGCG AGTACTACGC GCAGCCGGTG ACCGGCGAGC CGCACGACTT CGGCGACGGC TGCCTGTCGG ACCAGCTGCT GGGCCAGTGG TGGGCGCACC AGCTGGAGCT CGGGCACCTG CTCGACCCCG AGCGCGTCCG CAGCGCGTTG CGCGCGATCG TCGCCCACAA TCTGCGAGAG GGCTTCGGCG CGCGCGTCGC GGACGAGCAG CCGCCCGGCC ACCGGGTCTT CGCCGACGGC GAGGACAGCG GGCTCGTCGT CTGCTCGTGG CCGCGCGGCG GCCGGCCGGA CGTCCCGCTG CGGTACTGCG ACGAGGTCTG GAGCGGCGTC GAGTACGCCG TCGCCGCGCA CTGCATCGAC GAGGGGCTGG AGGACGAGGG GCTGGCGCTG GTCGAGGCGG TTCGCCGCCG GCACGACGGC ACGCGGCGCA ACCCCTACAA CGAGATCGAG TGCGGCGACC ACTACGCCCG CGCGATGTCC GGCTGGTCTG TGCTGGAGGC GCTGACCGGC TTCCGCTACG ACGCGCTCGC CCGCCGCATC GCCCTGCGCG GCCGCGCAGG GCGCTTCCCG TTCGTCGCCG GAACCGCCTG GGGCACGATC GCGGTCGGCG ACGCGGGAGA CGTCGAGCTG ACGGTCTCGC GCGGCGAGCT GGAGCTGGAC GTCGTCGCGG TCACCGACGG CACGGGGTCA GAGCGCACGC ACGCTGTCGG CCGCCGCATC GGCGCCGGCC GGTCGCTGGC GCTCGGAGAG CCGGGCGGCG CGCCTACGGA CGGGCGCGCA TGA
|
Protein sequence | MTDEPTNSRF AGDARRATAL PLGGVGAGHV ALHGDGSLRQ WQLHGRPNHT AFAPGALFAV RACCIEPPAD VRRVLQTAPP PAAADPAPCV DDDHVPDGLT DPLAVWPPMA SSALSVAYPF ARVAFDDPEL PLAVELEAHT PFVPLDDADS GLPLVVFSAR LRNSGTEQLH GWLLATLPNL IGWDGLTEFR DGRCSVLGGN VNRIVRDGER TAVVMDNPSL AEDDPRFGEM ALSSTAPCVP LPRFADVADA LRIAQTLKLL APSQRGDFSP EAVRAAVRDH QPPPGPVGAS PAGTTWGGGL ALPFALAPGE ETTIEVVHAW WLPNRLADFD QFGPERGLNR HRLWLGNAYA ERHRGVLQVL GHHARERDEL LAASRAWAQS TATATLPQPV RELLDVQASL IRSPTLMVTG DGRCYGFEGG LGASTTNWNG DWGGSCPLTC THVYNYEQAL SRLFPRLART MREVELDHVL GDDGSLPHRV VLPLWLPQLH GVAIGGPAAP ALDGMLGAVL KTYREARQGG GEAWLADRWD ALVRLMDHVD RTWNSAGDGV LRGEQPCTYD IALHGPNLLI GGLWLAALRS MEEIALRLGV AGAAGYRVRF EQARGGYEKL FNGEYYAQPV TGEPHDFGDG CLSDQLLGQW WAHQLELGHL LDPERVRSAL RAIVAHNLRE GFGARVADEQ PPGHRVFADG EDSGLVVCSW PRGGRPDVPL RYCDEVWSGV EYAVAAHCID EGLEDEGLAL VEAVRRRHDG TRRNPYNEIE CGDHYARAMS GWSVLEALTG FRYDALARRI ALRGRAGRFP FVAGTAWGTI AVGDAGDVEL TVSRGELELD VVAVTDGTGS ERTHAVGRRI GAGRSLALGE PGGAPTDGRA
|
| |