Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_4454 |
Symbol | |
ID | 8734916 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 4750222 |
End bp | 4751622 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 646505080 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003396243 |
Protein GI | 284045903 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTGCTGA GCAAATCATT CCGTGGCCGG GCAGCGGCCG GCCTCGCGGT GCTGGGTGCG TGCGCCCTGG CCGCCTGCGG CGGTCCCGGC ACGAGCGCAT CGACGTCGAC GGACTCCTCG GCGTCGGCCG ACACGAGCGT TCCGAGACAG GACCTCACCG TCACGATGTG GGACCAGGAG GCGCAGAACC CCATCAACCC CGGGCTCGAG GCCGCGATCT CCCAGTTCGA GAAGCTGCAT CCGAACATCT CGATCAAGCG GGTGGCGCCG CCCGGGATCG GCAACGAGTC GACCGTCGAG TCCACGACGA AGACGAAGCT GGCCGTGTCG GGGTCGAATC CCCCGACCCT CGTCGAGGGC GACACGCTCC CCGGAGGGGT GCTCGCGCCG CTCATCCAGG CAGGGCTCAT GCGCGACATG GGACCGTACG TCGACGCCTA CGGCTGGGAC CGGCGGTTCC GCTCGATCGG CGGCCTGACG CTCGACCGCT ACTCCAACAA CACCAGACGC TGGGGAGAGG GCCCGGTGTA CGGGGTCCCG TGGTACGGCT CGGAGATCGG CGTCTTCTAC AACCGCAGAC TGCTGGAGCA GATCGGCGTC CCGCTGCCGA AGACCTTCGC GGACTTCGAG GCGTCGCTGG CTGCCGCCAA GCGCGCGGGC ATCACGCCCA TCGTGCTCGG CAACTCCGAT CGCGACGGCG GCGGCTGGCT GTTCAACGAG CTGCTCGCCG ACTTCCTCGG CGCACCCGAG ATGCTCAAAT GGGTCCTCAA CAAGCCGGGA GCGACGATCA CCGGGCCTGA CGGGGTGCAA GCGGCTCAGA CGATGAAGGA CTGGGCCGAC GCGGGCTACT TCACGAAGGG CTACGAAGGC ATCAGCCTCA CGCAGCAGGC GGCCCAGTTC GGTCGCGGCG AGGGCCTGTA CATGATCGCC CTCCACGTGC TGGCACCCAC CAAGGACCCT GACGGCTTCG GCTTCTTCTA CGTGCCCGAC CCCAACGACG CCGGCAGAAC GCCCGCAACC GTGAACAACG CGATGTGGAG CTTCGGCATC TCGGCGAACG CGCCCCAGGA AGAGGCGAAC GCCGCCGCTG CGTTCCTCGA CTACCTGAGC TCGCCGCAGA TGCAGCGCCA ACTCGCGCTC AGACATGGGT CGCTCCCGGT CGTGCCGGTG GACGGGCATG TCGATCGCGG TGCGGTCTAC GACGACCTGC TGCAGGCGTG GAACGAACAG CGGGCGTCCG GGACGGCGCT GCCCTTCCTC GGCAGCGGCA CGCCGCTCAA CTACACCTAC ACGACCTCGA TCCAGGGCGT GCAGGACCTG CTGGCCGGAC GCGTGAGCCC TGCGGAGCTG CTGGACAAGA TCGAGGAGCA GCAGCAGGCG TATGTCAGAG CCGGCAAGTG A
|
Protein sequence | MVLSKSFRGR AAAGLAVLGA CALAACGGPG TSASTSTDSS ASADTSVPRQ DLTVTMWDQE AQNPINPGLE AAISQFEKLH PNISIKRVAP PGIGNESTVE STTKTKLAVS GSNPPTLVEG DTLPGGVLAP LIQAGLMRDM GPYVDAYGWD RRFRSIGGLT LDRYSNNTRR WGEGPVYGVP WYGSEIGVFY NRRLLEQIGV PLPKTFADFE ASLAAAKRAG ITPIVLGNSD RDGGGWLFNE LLADFLGAPE MLKWVLNKPG ATITGPDGVQ AAQTMKDWAD AGYFTKGYEG ISLTQQAAQF GRGEGLYMIA LHVLAPTKDP DGFGFFYVPD PNDAGRTPAT VNNAMWSFGI SANAPQEEAN AAAAFLDYLS SPQMQRQLAL RHGSLPVVPV DGHVDRGAVY DDLLQAWNEQ RASGTALPFL GSGTPLNYTY TTSIQGVQDL LAGRVSPAEL LDKIEEQQQA YVRAGK
|
| |