Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_4425 |
Symbol | |
ID | 8734887 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 4715171 |
End bp | 4716511 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 646505051 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003396214 |
Protein GI | 284045874 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.446855 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.265456 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTACAA GGCTCAAGCG CATCTGCGCG CTGGCGCTCG TCGGCGCGAT CGCGCTGTTC GCCGTCGCGT GCGGAGGCTC GAACGACGAC AACGGCGGCG GTGGTGGCGG CAGCAGCGGC AGCAGTACGG CGGGCACGCC GGACCCGGCT GACGTCAGCG GCAAGATCGT CGTCTGGGAC GTCTTCTACA GATCGTTCCC GAGCTACACG AGAGCGGTCC CGGAGCTCGA CGCGGCGTTC AAGAAGAAGT ACCCGAACGT GACCGTCGAG CACGTCGCGC AGCCGTTCGC CGAGTACCAG GCGCTGCAGC AGGCGGCGTT CACCGGGCGC GAGGGCCCGG ACGTGATGAT GATGCCGAAC GCCGGGGGCA TCCGCCAGTT CCAGAAGGGG CTGGAGGAGC TCAATGACCG GATCACGCCG GAGATGCAGG AGCAGCTGAC CGGCTGGGAT GCGGTGACGC CGGGGTACAG AACCGAGGGT CCGCACTACG GCGTGCCGAT CGCCCAGACC GACTTCATGT TCTATTTCAA CAAGAGACTG TTCAGAAGAG CCGGTCTGCC GACCGACTTC GAGCCGAGAA CCTGGGAGGA CCTCCGCGAC GCCGGCCTGA AGCTGAAGGC CGCCGGCATC CAGCCGTTCA CCGGGGGCAA CAAGGAGGGC TACGAGTCGC AGTGGTGGTG GCACATGGCG TGGCCGACCT ACAACACCCA GGAGCAGGCG GTCGCGCTCG CCGACGGCGA GCTGCCCTTC ACCGACCCGG CCGTCGCCCA GACGTTCGAG CCCGAGATGA TGATGCAGAG AGCGGGGCTG TTCGAGAGAG ACCGCTTCAC CACGCCCTTC TTCAATGACG GCTGGATGCG CTTCGCCGAC GGCAAGGCCG CGATGATCCT CGGCGGCTCG ACGAACACCG CCTACTGGGG TGACTTCAAC AGAGCGCTCG GCGAGGAGAA CGTCGGGATG TTCCTGCCGC CGGGCAGCAA GTACATATCG ATGAACCCCG AGTGGAGCTG GTCGATCCCG AAGTTCGCCG AGAACAAGGA CGCCGCATGG GCGTACATCG ACTTCATGGC GAGCAGAGAG GGTCTCGAGA TGCTCTTCGA GATGACCGGC GAGCTGCCCA ACCGCAAGGA CGCGGAGCTG CCGGCCGACG CGCCGTCCCA GGCCAGACAG ATCCGCGACT GGTACCGCGA CGGGCCGACG TTCCTCGCGA CCGACGTGCT GACTCCGAGC CAGGTCACGC AGACGATGAA CACGGAGGCG AAGGAGTGGC TCCAGGGCCG CAAGTCGAAG GAGGAGATGC TGGAGTCGAT CCAGGCGACG TCCGAGCAGG TCAACCGGTG A
|
Protein sequence | MSTRLKRICA LALVGAIALF AVACGGSNDD NGGGGGGSSG SSTAGTPDPA DVSGKIVVWD VFYRSFPSYT RAVPELDAAF KKKYPNVTVE HVAQPFAEYQ ALQQAAFTGR EGPDVMMMPN AGGIRQFQKG LEELNDRITP EMQEQLTGWD AVTPGYRTEG PHYGVPIAQT DFMFYFNKRL FRRAGLPTDF EPRTWEDLRD AGLKLKAAGI QPFTGGNKEG YESQWWWHMA WPTYNTQEQA VALADGELPF TDPAVAQTFE PEMMMQRAGL FERDRFTTPF FNDGWMRFAD GKAAMILGGS TNTAYWGDFN RALGEENVGM FLPPGSKYIS MNPEWSWSIP KFAENKDAAW AYIDFMASRE GLEMLFEMTG ELPNRKDAEL PADAPSQARQ IRDWYRDGPT FLATDVLTPS QVTQTMNTEA KEWLQGRKSK EEMLESIQAT SEQVNR
|
| |