Gene Cwoe_4883 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_4883 
Symbol 
ID8735349 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp5209689 
End bp5210999 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content70% 
IMG OID646505511 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003396670 
Protein GI284046330 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.504344 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACGAA TGCTGATGGT GGGCGCGGTG ATCGGCGCCC TCGCCGTCGC CGGCTGCGGC 
AGCAGCGACG ACGACGGCGG CGGGGGAAGC GGGAGCGCGG GCGGCGTCAC CGAGGTGACG
TTCTGGCACG GCCAGAACGA CATCACGCAG AGAGCGCTGG AGAGACTGGT CGACGAGTTC
AACGCGACCC ATCCGAGCAT CAAGGTCGAC GCCAACTCCG GCGGCGTGCT CGCCGACGAG
ATGCTGACGA AGCTGACGGC CAGCCTGGCG GGCGACTCCT ACCCGGACGT CGCGTACGTG
TTCGGCTCCG ATCTCGCCAA CATCTCGCGC AGCGACAAGG TCCAGAACCT GACCGAGGCG
GTCGCCGAGC CGGGCTGGAG ATGGAACGAC TTCTGGCAGG GCGAGCGCGA GGGCGCGACC
GTCGACGGCA GAGTCCGCGC GGTGCCGGCG CTCGCCGACA ACCTCGGGAT CATCTACAAC
AAGAGACTGT TCGACGACGC GGGCGTCGCG TACCCCGAGG CCGACTGGAC GTGGGACGAC
TTCCGCGCGA CGGCGAAGCA GCTGACCGAC GAGGGCAAGG GCCAGTTCGG CTACTCGTGG
CCGGGCGGCG GCGGCGAGGA CACGACCTGG CGCCTGTGGC CGATGATCTG GCAGCAGGGC
GGCGACATCC TCACGCCCGA CGGCAGCAGA GCCGCGTTCA ACTCGCCCGC TGGCGTCAGA
GCGCTGGAGC TGATCGGCGA GATGGCGACC GACGACAAGT CGATCTTCGT CGACTCCGAC
CCCAACGGCG AGCGCGCGAT CAGACTGTTC CAGGGCGGCA AGCTCGCGAT GATCGAGGCC
GGTCCGTGGG TGCTGCCCGA CGTGATCGAC GCGAAGGTCG ACTACGGCGC TCAGCGCCTG
CCCGGCTTCG ACGGCGACCA CACCACGATC GCCGGCGCCG ACAACTGGGT CCTGTTCGAC
AACGGCGACG AGCGCTCCAG AGCCGCGCAG GAGTTCATCC AGTGGCTGAC GGCCGAGAGA
CAGGACCTCG CATGGGTCGC CGCGACGCAG TCGCTGCCGC TGCGCAGAAG CACCGAGGAG
ACCGCTGAGT ACAGAAGGCT CGCGAGCAGA GTCCCCGGCA CCGACGTCTT CGCGATGAGC
CTCGACACGG CGCGCGCACG GCCGTCGCTG GAGGTCTACC CCGAGATCTC CAAGGCGGTC
GCCGAGGGCG TCGTCGCGGT GCTGCTCGGC CGCGCTGACG CGCAGGAGGC GCTCGACGCC
GCCGCGCAGA GAGCCGACGC CGTGCTCGCC GGGGCCGGGG CGGGAGGCTG A
 
Protein sequence
MRRMLMVGAV IGALAVAGCG SSDDDGGGGS GSAGGVTEVT FWHGQNDITQ RALERLVDEF 
NATHPSIKVD ANSGGVLADE MLTKLTASLA GDSYPDVAYV FGSDLANISR SDKVQNLTEA
VAEPGWRWND FWQGEREGAT VDGRVRAVPA LADNLGIIYN KRLFDDAGVA YPEADWTWDD
FRATAKQLTD EGKGQFGYSW PGGGGEDTTW RLWPMIWQQG GDILTPDGSR AAFNSPAGVR
ALELIGEMAT DDKSIFVDSD PNGERAIRLF QGGKLAMIEA GPWVLPDVID AKVDYGAQRL
PGFDGDHTTI AGADNWVLFD NGDERSRAAQ EFIQWLTAER QDLAWVAATQ SLPLRRSTEE
TAEYRRLASR VPGTDVFAMS LDTARARPSL EVYPEISKAV AEGVVAVLLG RADAQEALDA
AAQRADAVLA GAGAGG