Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_5097 |
Symbol | |
ID | 8735563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 5451830 |
End bp | 5453164 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646505722 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003396881 |
Protein GI | 284046541 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.59554 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTCAGA CCCATCCCGC GCGGGGATCG CTCGCGCTGA GCACACCGCT GTCGCGCCGC AGCCTGCTGA AGGCGGCCGG AGCGGCCGGC GCCGCGCTGA CCGGCGCGCC GCTGCTGGCG GCGTGCGGCT CCTCCGGCGG AGGCGGCGGC TCCGGCGGCG CGGCGACGAT CGAGTTCTGG GACATGCTGT GGGGCCTCGA CAGATACGAG CCGACGGCGC GCGCGCTCGT CGCCGAGTGG AACAGAGCGA ACCCCGACCT CCAGGTCAAG TACCGCCTGA TCCCGTGGGC GAGCTTCTAC GAGGTCTTCT CGACCGCCGT CGCCAGCGGC ACCACGCCGG ACGTCAGCAC CGGCGCGACC TATCAGGCGT TCCAGTTCGA GCAGGCGATC GAGCCGATGA ACGACGCCGT CGCGCAGTGG AGAAGAGACG GCACCTACGA CCAGGTGATC CCGCAGTCGA TCACGGCGCA GGCGACGGAG GACGGCGAGC AGACGGGCCT GCCCTGGGGG ATGACGCTGC GCACGCTCAG CTGCAACAGA AAGCTGTTCG GTGCCGCGGG TGTGACACAG CCGAGATCGT TCGACGAGCT GCGCGCCGCC GCCAGAAGAC TGACCGGCGG CGGGCGCTAC GGGATGGGCT TCTGCGGCCA GGGCGCGCTC GGCTGGCAGA TGCTGCTGTC GCTGATGGTC AACAACGGCG GCGGCCTCTA CGACGCGAAG TGCGGGCCGG CGCTGGTGAC CGATCGCAAC CGCGAGGCGT GCCAGCTCGT GCAGGACATG GTCCGCGACG GCTCGATCCC GAAGGCCGCG GTCGGCTGGG ACCAGACCGA CGTCTCGGCC GCGATGACGC GCGGCGACAT CGCGATGGCG ATAACCGAGC CGGCGCTGTT CAACTCGTTG CCCAACGGCG CCGACATCGA CATCGCCTCG CCGTACGAGG GCTTCCACGG CGACAAGGGC ACGCTCCTCT GGTACCTCGC GATGTGGCAG TACCGCACCA GCGAGGACAA GCCCGGGGCG ACCGAGTTCA TGAACTGGTG GCTGAGCAAC GAGCAGCCGC TGTGGTCCAG AGGCGGCACG ACGCAGCTGC CGGTGCGCAC GCCGTTCTAC GACGAGATCA GAACGCTGCA GGACCCGCGC TACAGAAAGG TGCTCGACGA GTGGGTGCCG GTCGGCAAGA TCATGTCGAC GCCCTGCGAG TACGCCCTGC CGACGCTCAA CCAGGTCGAG GGGCAGGCGT TCATGCCGAC GCTCGTGCAG GACGTCCTGT CGCTGAAGCC GATCGACGAA TCGCTGCAGA CCGCGCAGGA CGCGCTGTCG CAGCTGAGAG CGTGA
|
Protein sequence | MRQTHPARGS LALSTPLSRR SLLKAAGAAG AALTGAPLLA ACGSSGGGGG SGGAATIEFW DMLWGLDRYE PTARALVAEW NRANPDLQVK YRLIPWASFY EVFSTAVASG TTPDVSTGAT YQAFQFEQAI EPMNDAVAQW RRDGTYDQVI PQSITAQATE DGEQTGLPWG MTLRTLSCNR KLFGAAGVTQ PRSFDELRAA ARRLTGGGRY GMGFCGQGAL GWQMLLSLMV NNGGGLYDAK CGPALVTDRN REACQLVQDM VRDGSIPKAA VGWDQTDVSA AMTRGDIAMA ITEPALFNSL PNGADIDIAS PYEGFHGDKG TLLWYLAMWQ YRTSEDKPGA TEFMNWWLSN EQPLWSRGGT TQLPVRTPFY DEIRTLQDPR YRKVLDEWVP VGKIMSTPCE YALPTLNQVE GQAFMPTLVQ DVLSLKPIDE SLQTAQDALS QLRA
|
| |