Gene Information |
Locus tag | Cwoe_3983 |
Symbol | |
ID | 8734441 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 4226841 |
End bp | 4228211 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646504608 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003395775 |
Protein GI | 284045435 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
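The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 4226841
END_BP = 4228211
GENE_LENGTH_BP = 1371
PROTEIN_LENGTH_AA = 456


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons should hold if the assumptions above are right.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the span is 4228211 - 4226841 + 1 = 1371 bp, and 1371 / 3 - 1 = 456 codons, which agrees with the listed gene and protein lengths.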
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCGCCAG ATCGTCAGAC GTCGCGCATG GACGCCGCCG GGGCCGCGCC CGGCGGAGGT CCGCGCGTCT CGCGACGCGC CTTCGCGCGG ATCGCCGCCT CCGGGCTCGC GGTCTCCGTC GCCGCACCGC TGCTCGCCGC CTGCGGCGAC GACGGGGAGA GCAGCGCGGA CGGCAAGGTC ACGCTGAAGT TCTGGAAGTA CGAGGACCCG GCCACGAAGT CCGTGCTCGA GCAGCTCGTC GCGAAGTACA ACCGCGAGCA GCAGAACGTC AAGGTCGTGA TGCAGACCTT CCCGTTCGAC CAGTACCTGG CCGAGAAGAT CACGACCGCG CTGTCGGCGG GCTCCGGGCC CGACGTCTTC TGGGTCAGCG CAGCGACGCT GCTGAACTTC GCGCCGAAGC AGCTGCTGCT GCCGCTCGGC GACACCTTCA CGCCGAAGGA GCAGAGCGAC TTCCTGCCGC AGAGCTTGAG AGGCATCACG CTCAGAGGCG ACGTCTACGG CGTCCCGCAC GAGATGGGCG TCCAGGGCCT GCTCTACGAC CAGCGCCTGA TGGAGAGACT GCGGCTCGAG CCCCCGAAGA CGTGGGACGA GCTGAAGGAG GTCGCGGCCA AGATCAAGAC CGACACGCGC TGGGGCATCA TGCTGCCGAC CGCCCCCGAC GTGTTCCAGA ACTTCATCTG GTGGCCGTTC CTGTGGATGG GCGGCGGCGA GGTCGTCAGC GCCGACTACA GCCACGCGAC GATCGCCGAG CCTGCCGGCG TGCAGGCACT GGCGCTGTGG GGCGACCTCG TGCGGGACGG GCTCGCGGCG CCGAAGTCGT CCGGTCCCTT CGGCGAGGAG CTGGCGCAGG GCAAGGCCGG GATGGCGGCG CTCGGGATGT GGGTCGTCGG CAACTACCGC ACGACGTACC CGAACGTCGC GCTCGGCGCG GCGCCGCTGC CGACGCCGAC CGCCGGCGGG AGATCGCTCG CGGCGTTCGG CGGCTGGTAC ACCGCCGTCA GCGCGGCGAC GAAGCACGCC GAGGAGGCGC GCAGATTCGC CGTCTGGCTG TTCGGCGAGA ACCCGGCCAA CGCCGTCGAG CTGACGAAGG CGATGACGGT GCTCTCGCCG CGCAGATCGG TCACCGCGAC GCTCGAGACG CTGCCCGCGT TCAGAAAGGC GCCGATCCCG GAGTTCACGC GGATCTGGCC GAGCACGCGT GCGGAGCCGG CGTACCCGCC GGAGATCCAG ACCGCCGTCA CGAACGCGCT GCAGGCGGTC ATGTTCAGCA AGGCGGAGCC CGAGCGGGCG GCGGAGGACG CCGCGAAGGC GATCGACAGC TACCTCGCGA GTCCCGACGG GAGCCTGCTC AAGGAGCTCA TGGGCTCGTG A
Protein sequence | MSPDRQTSRM DAAGAAPGGG PRVSRRAFAR IAASGLAVSV AAPLLAACGD DGESSADGKV TLKFWKYEDP ATKSVLEQLV AKYNREQQNV KVVMQTFPFD QYLAEKITTA LSAGSGPDVF WVSAATLLNF APKQLLLPLG DTFTPKEQSD FLPQSLRGIT LRGDVYGVPH EMGVQGLLYD QRLMERLRLE PPKTWDELKE VAAKIKTDTR WGIMLPTAPD VFQNFIWWPF LWMGGGEVVS ADYSHATIAE PAGVQALALW GDLVRDGLAA PKSSGPFGEE LAQGKAGMAA LGMWVVGNYR TTYPNVALGA APLPTPTAGG RSLAAFGGWY TAVSAATKHA EEARRFAVWL FGENPANAVE LTKAMTVLSP RRSVTATLET LPAFRKAPIP EFTRIWPSTR AEPAYPPEIQ TAVTNALQAV MFSKAEPERA AEDAAKAIDS YLASPDGSLL KELMGS
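The relationship between the gene sequence, the GC content, and the protein sequence can be sketched in a few lines of standard-library Python. This is a minimal illustration, not the method used to produce the record. It assumes the sequence is pasted in with the 10-character spacing shown above, and it uses the standard amino-acid assignments, which translation table 11 shares with table 1. Table 11 also allows alternative start codons, which this sketch ignores because this gene begins with ATG. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical.

```python
# Codon table built in TCAG order; amino-acid assignments are those of the
# standard code, which translation table 11 shares (assumption stated above).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGTCGCCAG ATCGTCAGAC GTCGCGCATG"))
```

The printed peptide can be compared with the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value to compare against the rounded GC content reported in the Gene Information table.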