Gene Cwoe_5097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_5097 
Symbol 
ID8735563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp5451830 
End bp5453164 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content69% 
IMG OID646505722 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003396881 
Protein GI284046541 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.59554 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCAGA CCCATCCCGC GCGGGGATCG CTCGCGCTGA GCACACCGCT GTCGCGCCGC 
AGCCTGCTGA AGGCGGCCGG AGCGGCCGGC GCCGCGCTGA CCGGCGCGCC GCTGCTGGCG
GCGTGCGGCT CCTCCGGCGG AGGCGGCGGC TCCGGCGGCG CGGCGACGAT CGAGTTCTGG
GACATGCTGT GGGGCCTCGA CAGATACGAG CCGACGGCGC GCGCGCTCGT CGCCGAGTGG
AACAGAGCGA ACCCCGACCT CCAGGTCAAG TACCGCCTGA TCCCGTGGGC GAGCTTCTAC
GAGGTCTTCT CGACCGCCGT CGCCAGCGGC ACCACGCCGG ACGTCAGCAC CGGCGCGACC
TATCAGGCGT TCCAGTTCGA GCAGGCGATC GAGCCGATGA ACGACGCCGT CGCGCAGTGG
AGAAGAGACG GCACCTACGA CCAGGTGATC CCGCAGTCGA TCACGGCGCA GGCGACGGAG
GACGGCGAGC AGACGGGCCT GCCCTGGGGG ATGACGCTGC GCACGCTCAG CTGCAACAGA
AAGCTGTTCG GTGCCGCGGG TGTGACACAG CCGAGATCGT TCGACGAGCT GCGCGCCGCC
GCCAGAAGAC TGACCGGCGG CGGGCGCTAC GGGATGGGCT TCTGCGGCCA GGGCGCGCTC
GGCTGGCAGA TGCTGCTGTC GCTGATGGTC AACAACGGCG GCGGCCTCTA CGACGCGAAG
TGCGGGCCGG CGCTGGTGAC CGATCGCAAC CGCGAGGCGT GCCAGCTCGT GCAGGACATG
GTCCGCGACG GCTCGATCCC GAAGGCCGCG GTCGGCTGGG ACCAGACCGA CGTCTCGGCC
GCGATGACGC GCGGCGACAT CGCGATGGCG ATAACCGAGC CGGCGCTGTT CAACTCGTTG
CCCAACGGCG CCGACATCGA CATCGCCTCG CCGTACGAGG GCTTCCACGG CGACAAGGGC
ACGCTCCTCT GGTACCTCGC GATGTGGCAG TACCGCACCA GCGAGGACAA GCCCGGGGCG
ACCGAGTTCA TGAACTGGTG GCTGAGCAAC GAGCAGCCGC TGTGGTCCAG AGGCGGCACG
ACGCAGCTGC CGGTGCGCAC GCCGTTCTAC GACGAGATCA GAACGCTGCA GGACCCGCGC
TACAGAAAGG TGCTCGACGA GTGGGTGCCG GTCGGCAAGA TCATGTCGAC GCCCTGCGAG
TACGCCCTGC CGACGCTCAA CCAGGTCGAG GGGCAGGCGT TCATGCCGAC GCTCGTGCAG
GACGTCCTGT CGCTGAAGCC GATCGACGAA TCGCTGCAGA CCGCGCAGGA CGCGCTGTCG
CAGCTGAGAG CGTGA
 
Protein sequence
MRQTHPARGS LALSTPLSRR SLLKAAGAAG AALTGAPLLA ACGSSGGGGG SGGAATIEFW 
DMLWGLDRYE PTARALVAEW NRANPDLQVK YRLIPWASFY EVFSTAVASG TTPDVSTGAT
YQAFQFEQAI EPMNDAVAQW RRDGTYDQVI PQSITAQATE DGEQTGLPWG MTLRTLSCNR
KLFGAAGVTQ PRSFDELRAA ARRLTGGGRY GMGFCGQGAL GWQMLLSLMV NNGGGLYDAK
CGPALVTDRN REACQLVQDM VRDGSIPKAA VGWDQTDVSA AMTRGDIAMA ITEPALFNSL
PNGADIDIAS PYEGFHGDKG TLLWYLAMWQ YRTSEDKPGA TEFMNWWLSN EQPLWSRGGT
TQLPVRTPFY DEIRTLQDPR YRKVLDEWVP VGKIMSTPCE YALPTLNQVE GQAFMPTLVQ
DVLSLKPIDE SLQTAQDALS QLRA