Gene Cwoe_0171 details


Gene Information

Locus tag: Cwoe_0171
Symbol:
ID: 8730599
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 168198
End bp: 169604
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 67%
IMG OID: 646500785
Product: extracellular solute-binding protein family 1
Protein accession: YP_003391982
Protein GI: 284041642
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
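
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which is consistent with the listed 1407 bp) and that the last codon of the CDS is a stop codon. The function names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
# Coordinates copied from the record above.
START_BP = 168198
END_BP = 169604


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming its final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints "1407 468"
```

Both values agree with the Gene Length and Protein Length fields of the record.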


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGACC ATCAGGACGC CGACGCGCGT GAGCGCGCCG ACGCCGAACA GGGCTTGAAG 
GTCTCGCGCG CCACGCTGCT GCGCGGTGCT GCGGCCGCCG GCATCGCGCT GCCGGCGATG
GGCCTGCTCT CCGCCTGCGG GAGCGGCGAC GACGACAGCT CGGCCAGCAC GAGCGGCGAC
TCCGGCGGCA CCTCCGAGGC GTACACCGGC GCGCTCGCGA TGACCGCGTG GGAGGCGTAT
CCGGACCAGA TCAGAGAGAA CCTGGCCGCG TTCAAGCAGC AGTACGGCAA CCAGGTCGAC
CTCACACTGA TCCCCAACAT CGGCTACGGG CCCGCGATCC AGACGCGCCT GCAAGGCGGC
CAGGAGATCG ACGCCTACTA CAACTTCGCG TACAACTCGA CGAAGTTCGT CGACGCCGGC
TGGGCGAAGG AGCTGAACGA CCTCCCGGGC GTCGAGGAGA TGCTCGCGGA CATGTTCGAG
ACCTCCGCCG CGCGGCACAA GCTGCCCGAC GGCCGCATCG TCTCGGTCCC GTACTTCTCC
GCCGTGCACC TCCTGATGTA CAACGAGGCG CAGCTGAGAG AGAACGGCAT CTCCGCCGCG
CCGCAGTCGT ACAGCGAGAT CTACGACCAG TGCGAGAAGC TCAAGGCCGG CGGCGTCCGC
GCCCCCTACG CCGCCTACTG GACCAAGCAG TTCTCGGAGG AGTACTTCAT CCTCTACCTC
GTCTCCGAGG GCATCGTCCC GTTCGACGAC GACGGCAACC CGACGTTCCA GGACGACCCC
AAGACCGAGG GCGTCCTCGA CTGGTGGACG TCGATGTACC AGGACGGCCT CACCGCCAGA
TCGATCCTCA CCGACGATCC CGGCAAGCAC GTCGCGGCGA TGGCGCAGGG CACCTCCAGC
TTCTTCACCC TGCACCACTA CTTCCTCAAG GAGATCCGCA ACGCCAGAGG ACCGCAGTCG
AGAAACGTCA CGATGAGCTA CCGGATCCCC GGCAGCTCGG GCGAGAGCCT CCAGATCGGC
GAGGTCGTCC AGATGGGCAC CAAGGCCGAC GGCGGTCGTG CCGACAGAGC GTGGGAGCTG
CTCAAGTTCT ACGGCTGGAA GGACAAGGAC GGCCGCTACG GCACGTTCAT ATCGTGGGCC
GAGTCGGCCG CGCTGCTCGG GCCCTACCCC GGCCTCTTCA AGGACCCGCA GTTCAGAAGA
GCGTTCCCCG CCTACTACGA CCTCGGCGAG CTGGAGAGAG CGTTCGAGGC CTCGCAGGTC
GTCCCCGCCC GCGTGCTGCC GTGGTACTCG TCCTTCCAGA CGAAGGTCGG CGACCGCATC
CAGGCGATGC TGCTCGGCCA GGCGAGCGTG AAGGACACGA TCTCGTCGCT CGCCGACGAC
GCGAAGAGCT TTGCCGCCGC AGGCTGA
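
The GC content field of the record can in principle be recomputed from the gene sequence. The sketch below assumes the sequence is supplied in the space-separated 10-base blocks shown above. The helper names `clean_sequence` and `gc_fraction` are illustrative. Only the first line of the sequence is used as sample input, so its result is not expected to equal the 67% reported for the full gene.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record.
FIRST_LINE = "ATGCACGACC ATCAGGACGC CGACGCGCGT GAGCGCGCCG ACGCCGAACA GGGCTTGAAG"

if __name__ == "__main__":
    seq = clean_sequence(FIRST_LINE)
    print(f"{len(seq)} bp, GC = {gc_fraction(seq):.1%}")
```

Passing the complete gene sequence to `clean_sequence` and `gc_fraction` gives the value to compare with the GC content field.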
 
Protein sequence
MHDHQDADAR ERADAEQGLK VSRATLLRGA AAAGIALPAM GLLSACGSGD DDSSASTSGD 
SGGTSEAYTG ALAMTAWEAY PDQIRENLAA FKQQYGNQVD LTLIPNIGYG PAIQTRLQGG
QEIDAYYNFA YNSTKFVDAG WAKELNDLPG VEEMLADMFE TSAARHKLPD GRIVSVPYFS
AVHLLMYNEA QLRENGISAA PQSYSEIYDQ CEKLKAGGVR APYAAYWTKQ FSEEYFILYL
VSEGIVPFDD DGNPTFQDDP KTEGVLDWWT SMYQDGLTAR SILTDDPGKH VAAMAQGTSS
FFTLHHYFLK EIRNARGPQS RNVTMSYRIP GSSGESLQIG EVVQMGTKAD GGRADRAWEL
LKFYGWKDKD GRYGTFISWA ESAALLGPYP GLFKDPQFRR AFPAYYDLGE LERAFEASQV
VPARVLPWYS SFQTKVGDRI QAMLLGQASV KDTISSLADD AKSFAAAG
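
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The following minimal sketch relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code and differs only in which codons may act as start codons. The names `CODON_TABLE` and `translate` are illustrative, and the two inputs in the usage section are the first three and the last three blocks of the gene sequence.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code, shared by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    print(translate("ATGCACGACC ATCAGGACGC CGACGCGCGT"))  # prints "MHDHQDADAR"
    print(translate("GCGAAGAGCT TTGCCGCCGC AGGCTGA"))     # prints "AKSFAAAG"
```

The two outputs match the first ten and the last eight residues of the protein sequence above, and the final codon TGA is a stop codon.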