Gene Cwoe_3957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_3957 
Symbol 
ID8734414 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp4199059 
End bp4200357 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content71% 
IMG OID646504581 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003395749 
Protein GI284045409 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGACGGT GGATGGCGAC GGGCGCGGCG GTCATGCTGA GCGTCCTGGC GGTGCTGAGC 
GCCGGCTGCG GCGGCGACAG CGCGGGCGGG TCGGGCGACG ACGGCGTCGT CGAGATCACC
TTCTGGCACG GCCAGACGGA GGAGACGGCG ACGGAGCTGA ACACGCTGAT CGACGAGTTC
AACGAGACGC ACCCGAGAAT CCGCGTCTCC AAGGACGCCG GCGGCGTGCA GGCCGACGAC
ATGCTGACGA AGGTCACCGC CGCCCTGGCG GGCGGCTCCT ACCCGGACAT CGCGTACATC
TTCGGCCCCG ACGTCGCCAA CGTCGCGCGC AGCCCGAAGG TGCTCGACCT GACCGAGACG
GTCAGAGCGC CGGACTGGAC CTGGGACGAC TTCTACAAGC CCGAGCGCGA CGCGGTCACC
GTCGACGGGA AGGTGCGCGC CGTCCCGGCC CTGGCCGACG TGCTCGCCGT CATCTACAAC
AAGAGACTGT TCGCAGACGC TGGCGTGCCC GAGCCCGACG GCACCTGGAC GTGGGACGAG
TTCCGCGCCA CGGCCAAGCA GCTGACCGAC CGCGGCAAGG GCGTCTTCGG CACCTCCTGG
CCGGGCGTCG GCGGCGAGGA CACGGTCTGG CGGCTGTGGC CGATGGTGTG GCAGCTCGGC
GGCGAGATCC TCTCGCCCGA CGGGACCAGC GTCGGCTTCG ACGACGACTC GGGGCTCAGG
TCGCTGACGA CGATCCACGA CCTCGCGGTC ACCGACGAGT CGACCTACAT CGATCCCGAC
CCCAGCAGCG ACCGCACCGG CCAGCTGTTC CAGAACGGCA AGCTCGCGAT GTGGACGGCC
GGACCGTGGG CGCTGCTCGA CGTCAGAGTC TCCGGCGTCG AGGCCGGCGT CGCGCGGCTG
CCCTCCTACG ACGGCGAGCC GCTCTCGATC GCTGGCCCGG ACAACTACGT CCTGTTCGAC
AACGGCTCGG CGCGCTCCAG AGCGGCGATC GAGTTCGTCA AGTGGCTGAC CGCCGCCGAG
CAGGACCTGC GCTACTCGGA GGCGACCGGC CACCTGCCGT TGCGCAAGAG CGCGACGAGA
CTGCCCGGGT ACCCCGCGTT CGCGAGAACG TGGCCGGGCA TCCCGGTCTT CGTCGAGAGC
CTCGAGACGG CGAAGGTGCG GCCGACGATC GAGGCCTATC CCGAGCTGTC CGACGCGGTC
GGCAAGGGGA TCGTCTCGGT CCTGCTCGAC CGCGCGAGCC CCGCGGACGC GCTGAAGACG
ATGTCCGAGC AGGCGAACGA CGTGCTCGCG TCGGAGTGA
 
Protein sequence
MRRWMATGAA VMLSVLAVLS AGCGGDSAGG SGDDGVVEIT FWHGQTEETA TELNTLIDEF 
NETHPRIRVS KDAGGVQADD MLTKVTAALA GGSYPDIAYI FGPDVANVAR SPKVLDLTET
VRAPDWTWDD FYKPERDAVT VDGKVRAVPA LADVLAVIYN KRLFADAGVP EPDGTWTWDE
FRATAKQLTD RGKGVFGTSW PGVGGEDTVW RLWPMVWQLG GEILSPDGTS VGFDDDSGLR
SLTTIHDLAV TDESTYIDPD PSSDRTGQLF QNGKLAMWTA GPWALLDVRV SGVEAGVARL
PSYDGEPLSI AGPDNYVLFD NGSARSRAAI EFVKWLTAAE QDLRYSEATG HLPLRKSATR
LPGYPAFART WPGIPVFVES LETAKVRPTI EAYPELSDAV GKGIVSVLLD RASPADALKT
MSEQANDVLA SE