Gene Cwoe_4425 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_4425 
Symbol 
ID8734887 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp4715171 
End bp4716511 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content66% 
IMG OID646505051 
Productextracellular solute-binding protein family 1 
Protein accessionYP_003396214 
Protein GI284045874 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.446855 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.265456 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTACAA GGCTCAAGCG CATCTGCGCG CTGGCGCTCG TCGGCGCGAT CGCGCTGTTC 
GCCGTCGCGT GCGGAGGCTC GAACGACGAC AACGGCGGCG GTGGTGGCGG CAGCAGCGGC
AGCAGTACGG CGGGCACGCC GGACCCGGCT GACGTCAGCG GCAAGATCGT CGTCTGGGAC
GTCTTCTACA GATCGTTCCC GAGCTACACG AGAGCGGTCC CGGAGCTCGA CGCGGCGTTC
AAGAAGAAGT ACCCGAACGT GACCGTCGAG CACGTCGCGC AGCCGTTCGC CGAGTACCAG
GCGCTGCAGC AGGCGGCGTT CACCGGGCGC GAGGGCCCGG ACGTGATGAT GATGCCGAAC
GCCGGGGGCA TCCGCCAGTT CCAGAAGGGG CTGGAGGAGC TCAATGACCG GATCACGCCG
GAGATGCAGG AGCAGCTGAC CGGCTGGGAT GCGGTGACGC CGGGGTACAG AACCGAGGGT
CCGCACTACG GCGTGCCGAT CGCCCAGACC GACTTCATGT TCTATTTCAA CAAGAGACTG
TTCAGAAGAG CCGGTCTGCC GACCGACTTC GAGCCGAGAA CCTGGGAGGA CCTCCGCGAC
GCCGGCCTGA AGCTGAAGGC CGCCGGCATC CAGCCGTTCA CCGGGGGCAA CAAGGAGGGC
TACGAGTCGC AGTGGTGGTG GCACATGGCG TGGCCGACCT ACAACACCCA GGAGCAGGCG
GTCGCGCTCG CCGACGGCGA GCTGCCCTTC ACCGACCCGG CCGTCGCCCA GACGTTCGAG
CCCGAGATGA TGATGCAGAG AGCGGGGCTG TTCGAGAGAG ACCGCTTCAC CACGCCCTTC
TTCAATGACG GCTGGATGCG CTTCGCCGAC GGCAAGGCCG CGATGATCCT CGGCGGCTCG
ACGAACACCG CCTACTGGGG TGACTTCAAC AGAGCGCTCG GCGAGGAGAA CGTCGGGATG
TTCCTGCCGC CGGGCAGCAA GTACATATCG ATGAACCCCG AGTGGAGCTG GTCGATCCCG
AAGTTCGCCG AGAACAAGGA CGCCGCATGG GCGTACATCG ACTTCATGGC GAGCAGAGAG
GGTCTCGAGA TGCTCTTCGA GATGACCGGC GAGCTGCCCA ACCGCAAGGA CGCGGAGCTG
CCGGCCGACG CGCCGTCCCA GGCCAGACAG ATCCGCGACT GGTACCGCGA CGGGCCGACG
TTCCTCGCGA CCGACGTGCT GACTCCGAGC CAGGTCACGC AGACGATGAA CACGGAGGCG
AAGGAGTGGC TCCAGGGCCG CAAGTCGAAG GAGGAGATGC TGGAGTCGAT CCAGGCGACG
TCCGAGCAGG TCAACCGGTG A
 
Protein sequence
MSTRLKRICA LALVGAIALF AVACGGSNDD NGGGGGGSSG SSTAGTPDPA DVSGKIVVWD 
VFYRSFPSYT RAVPELDAAF KKKYPNVTVE HVAQPFAEYQ ALQQAAFTGR EGPDVMMMPN
AGGIRQFQKG LEELNDRITP EMQEQLTGWD AVTPGYRTEG PHYGVPIAQT DFMFYFNKRL
FRRAGLPTDF EPRTWEDLRD AGLKLKAAGI QPFTGGNKEG YESQWWWHMA WPTYNTQEQA
VALADGELPF TDPAVAQTFE PEMMMQRAGL FERDRFTTPF FNDGWMRFAD GKAAMILGGS
TNTAYWGDFN RALGEENVGM FLPPGSKYIS MNPEWSWSIP KFAENKDAAW AYIDFMASRE
GLEMLFEMTG ELPNRKDAEL PADAPSQARQ IRDWYRDGPT FLATDVLTPS QVTQTMNTEA
KEWLQGRKSK EEMLESIQAT SEQVNR