Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_0112 |
Symbol | |
ID | 8730540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | + |
Start bp | 108727 |
End bp | 110301 |
Gene Length | 1575 bp |
Protein Length | 524 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646500726 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_003391923 |
Protein GI | 284041583 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.214652 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.175509 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGTGGA AGAGGACAGT TGCGGCGGCG ACGGTCGCGT GTGCGCTCGC CGTCGCGGGC TGCGGTGGTG ACAGCAGCAG CACGTCGGGC GACTCCGGCG GCGACACGTC CACCGGCGAC ACGCCGGCGC AGGTCGCGCA GGGCGGAACG CTCACCGTCG CCCATACGGA CGGCATCCCG CAGCTGAACC CGGTGATCCG GACGTTCGGC TGGGAAGAAG TGCTCTTCCC GCTGCTGTGG AACGGGCTCT CGAAGGCCGA CGAGAACGGC GACGTCGTCC CCGACATCGC CAGATCGTGG AGCGCCTCGC CCGATCAGAG AACGTGGACG TTCGAGCTGC GGACCGACGT CAGATACTCC AACGGCAGAC CGCTCACGTC GAGAAACGTC GTGAAGGCGT TCGAGTACTA CCTCGACCCC AGAACGACGA CGCAGGAGGC GAACAAGATC GCCACGATCG CGAGCGTCAG AGCGGACGGG CCCGCGAGAG TCGTCGTGCA GCTGAGAGAG CCGAACGCGC TCTTCCCCGA GGCGATCGTG TGGGTCAAGA TCGTCGACGT CGACAACGTC GGCCAGATCG ACAAGAGACC GGTCGTGACC GGTCCGTACA CCGTGAAGGA CTTCGTCCCG GACGACCACG TCACGCTCGT CCCCAACCCC GAGTACTGGG GCGACCCGGC GCCGCTGAGA GAGATGGAGA TCGTCAAGGC GACCGACACG ACCTCCGCGC TCGGCGCGCT GCGCGCCGGC GACATCGACG TGATCTGGGC GACCAACCCG ACCGACGTCG CCTCGCTGGA GGGCGATCCC GACCTCAAGA CGGTCGAGCC GGAGGTGCCG AGCAAGTACG TCGACTGGGA GTTCGACACG ACCGCGCCGC CGTTCGACAA CGTCAAGGCG CGGCAGGCCG TCGCCTACGC CGTCGACCGC GAGGCCGTGC TGCAGAGCGC CTACTACGGC CTCGGCGACC TCGCCCCGAC GAACAACCCG CTCAGCACGA ACAACCCGTA CTACGGCGGC GAGCTGACCG ACTACAGCTA CGACCTCGAC AAGGCCAGAG CGCTGTTCGA GGAGGCCGGC ATCAGAGCCG GCGACACGAT CACCTGGTGG GGCACCGCCG GCACGAACGC CGAGTGGACG ACCGCCGCCG AGATCGTGCA GGCGAGCCTG AGAGAGATCG GCATCGAGCT GAAGATCGAG AACCGCGAGA TCTCCACCTG GGCCGACAAG TTCTACCCGG CCGGCAAGAG ATTCCCGAAC TTCCTCGTGC CGAACCTCGC CTCGTTCCCG CCCTCGCCGG CCGACGCCTT CGGCTTCTAC CGCAGAGGCC GCTGCGAGTG CAACTGGGAC AACCCGGAGT TCGAGAGCGC CTACGACGCC GCTCTCGCCG AGCCCGACGA GACGAAGGCG AAGGAGAAGT GGGCGACGGT GCAGGAGATC GTCAACAGAG AGGTCCCGCT GATCATCCCG CTGCAGGTCA AGGTGGTGTC GTCCATGCGG TCCAACGTCG AAGGCGTCTG GATGGAGGGC GGCGGCCAGC TGCACCTCGA GCAGGCGGGC GTCGCCGCGG AGTAG
|
Protein sequence | MRWKRTVAAA TVACALAVAG CGGDSSSTSG DSGGDTSTGD TPAQVAQGGT LTVAHTDGIP QLNPVIRTFG WEEVLFPLLW NGLSKADENG DVVPDIARSW SASPDQRTWT FELRTDVRYS NGRPLTSRNV VKAFEYYLDP RTTTQEANKI ATIASVRADG PARVVVQLRE PNALFPEAIV WVKIVDVDNV GQIDKRPVVT GPYTVKDFVP DDHVTLVPNP EYWGDPAPLR EMEIVKATDT TSALGALRAG DIDVIWATNP TDVASLEGDP DLKTVEPEVP SKYVDWEFDT TAPPFDNVKA RQAVAYAVDR EAVLQSAYYG LGDLAPTNNP LSTNNPYYGG ELTDYSYDLD KARALFEEAG IRAGDTITWW GTAGTNAEWT TAAEIVQASL REIGIELKIE NREISTWADK FYPAGKRFPN FLVPNLASFP PSPADAFGFY RRGRCECNWD NPEFESAYDA ALAEPDETKA KEKWATVQEI VNREVPLIIP LQVKVVSSMR SNVEGVWMEG GGQLHLEQAG VAAE
|
| |