Gene Cwoe_5441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_5441 
Symbol 
ID8735916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp5817512 
End bp5818870 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content67% 
IMG OID646506071 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003397221 
Protein GI284046881 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGCTGA CGAGGGCGCT GCTGCTCGCC GTCGCCGCAC TGCTGGCGGC GGTTGCCGTG 
GGGTGCGGTT CACCGGGAGA GTCCGACGAC GATTCGTCGT CGACCACTGC CACGACAAGA
GCGACCGAGA GAGTCGACGT CGCCGCCGCC GGCGACGTCA CGCTGACGAT CTGGGACCAG
GAGGTCCGCG GAGGTCAGGC GGCACAGATC AGAAGGCTGA ACGCCGAGTT CCAGGAGAAG
TACCCGAACG TCACGATCAA CCGCGTGGCG AAGTCGTTCG AGGACCTCAA CACGACGCTC
AAGCTGGCCG TCTCGGGCCC GAGAGCGCCG GACGTCGTGC AGGCCAACCA GGGTCGTCCC
GTGATGGGCC AGCTCGTGAG AGCGAGATTG CTGCGGCCGC TGACCGACTA CGCCGAGGTC
TACGGCTGGG ACGACCGCTA CTCGGACCTG CTGCTGAACC TGAACAAGTT CTCGCCCGAC
GCGAGACAGT TCGGCAGCGG CGACCTCTAC GGCCTCTCCC AGATGGGTGA GATCGTCGGC
GTCTTCTACA ACAGAAGACT CGTGAGAAGA CTGCCCGAGA CGCTCGAGGA GTTCGAGGCC
TCGCTCGCCG AGACCAAGAG AGCCGGCGGC GTGCCGATCC AGTTCGGCAA CCTCGACAAG
TGGCCCGGCA TCCACGAGTA CGAGACCGTG CTCGGCCGCA ACGCCACGCC GCAGGCCGTG
AGCGACTTCG TCTTCGCCGC CAGCGGCGCG TCGTTCGACA CGCCTGAGTT CACCGCCGCC
GCCAGAACGC TGCAGGACTG GGCGAAGGCG GGCTACTTCA CGCCCGACTT CAACGGCGTC
GGCTACGACC CCGCGTGGCA GCGCTTCGCG AGAGGCGAGA GCCCCTACCT GATCGCCGGC
ACGTGGCTCG TGGCGGACCT GATCAGAGCG ATGGGCGACG ACGTCGGCTT CTTCGTCCTC
CCGGGCGAGA CGGCCGGCGA CGATCCGGTC GCGCTCGGCG GCGAGGGCCT GCCGTTCACG
GTCACGACCG CGTCGAAGAA CCCCGACGTC GCGGCCGCGT ACATCGACTT CATCACCGAC
GCCAACGCGG CGAGAGTGCT GGTCGAGACC GACAACCTGC CGGCGATGTC GCTGCCGGAC
GGGATAGCGC CGACGGAGGG CCTGACCGGC GATGTCTTCG CCGCCTGGAG AAGCCTCAAC
GAGGCGAACG GGATCATCCC GTACATCGAC TACGCGACCC CGACGTTCTA CGACGACATC
AGCGGCGCGA TCCAGGAGCT GCTGGCTGAG AAGCAGTCGC CGGAGGAGTT CACGTCCGGC
GTCGAGGCGA AGTACAGCGA GTTCACCGGC TCGCTCTGA
 
Protein sequence
MKLTRALLLA VAALLAAVAV GCGSPGESDD DSSSTTATTR ATERVDVAAA GDVTLTIWDQ 
EVRGGQAAQI RRLNAEFQEK YPNVTINRVA KSFEDLNTTL KLAVSGPRAP DVVQANQGRP
VMGQLVRARL LRPLTDYAEV YGWDDRYSDL LLNLNKFSPD ARQFGSGDLY GLSQMGEIVG
VFYNRRLVRR LPETLEEFEA SLAETKRAGG VPIQFGNLDK WPGIHEYETV LGRNATPQAV
SDFVFAASGA SFDTPEFTAA ARTLQDWAKA GYFTPDFNGV GYDPAWQRFA RGESPYLIAG
TWLVADLIRA MGDDVGFFVL PGETAGDDPV ALGGEGLPFT VTTASKNPDV AAAYIDFITD
ANAARVLVET DNLPAMSLPD GIAPTEGLTG DVFAAWRSLN EANGIIPYID YATPTFYDDI
SGAIQELLAE KQSPEEFTSG VEAKYSEFTG SL