Gene Cwoe_5802 details

Gene Information

Locus tag: Cwoe_5802
Symbol:
ID: 8736278
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 6210367
End bp: 6211701
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 69%
IMG OID: 646506429
Product: extracellular solute-binding protein family 1
Protein accession: YP_003397578
Protein GI: 284047238
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
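
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the source record; it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The function name check_record is hypothetical.

```python
def check_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return consistency checks for a CDS record as a dict of booleans.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the final codon is a stop codon that is not translated.
    """
    span = end_bp - start_bp + 1           # inclusive coordinate span
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_matches_length": span == gene_length_bp,
        "length_is_multiple_of_3": remainder == 0,
        "protein_length_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the Gene Information fields above.
result = check_record(6210367, 6211701, 1335, 444)
print(result)
```

Under these assumptions the fields agree: 6211701 - 6210367 + 1 = 1335 bp, and 1335 / 3 = 445 codons, of which 444 are translated.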


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0809589
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAGAGG TCTACGAAGT GCGCACACGG AACAGAGGGC TCACACGCGC CGCGCTCGCC 
GCCCTGCTGG TGGCGCTGCT CGCACTCGTC GCCGCCGGCT GCGGCAGCGG CGACGACGAC
AGCGGCGGCG ATGGTGGTGA CGGCCCAGTC GAGATCACGT TCTGGCACGG CCAGAACCAG
ACCGCGCAGC AGACGATCGA AGGGCTCGTC GACAGATTCA ACGCCTCGCA TCCCGACGTG
AAGGTCAAGG CCGAGGTCGG CGCGCTCGCC GACAGCCTCT ACCAGAAGAC GACGGCCGCG
CTGGCCGGCG GCAAGTACCC CGACGTCGTC TACCAGTTCG GCCCCAACAT CGCATCGCTC
GCGCGCAGCC CGAAGGCGCT CGACCTGACC GACGCCGTCA GAGACGCGGC GTGGAGATGG
GACGACTTCT ACCCGCCCGC GCGGGAGGCC GTCACGGTCG ACGGCAAGGT CCGCGCCGTG
CCCGCGCTGA TCGACTCCTT GGCCGTCGTC TACAACAGAA GACTGTTCAG AGAGGCGGGC
ATCCCGGCGC CGAGAGCCGG CTGGACGTGG GACGACTACC GCGCGATCGC CAGACAGCTG
ACCGACTCCT CCAAGGGGCA GTTCGGCAGC GCGTGGCCGG GCGTCGGCGA CGAGGACACC
GTCTGGCGGC TGTGGCCGAT GGTGTGGCAG CTCGGCGGCG ACGTCACCTC GCCGGACGGC
GAGCAGGCCG GCTTCGAGGG CGAGAGCGGG CTGACCTCCT TCACGACGAT CAACGACATG
GCGGTCACGG ACAGATCGCT CTACATCGAC AAGACTGCCG GCAGCGAGAA GATGTACGCC
ATCTTCAACA CCGGTCGCAT CGGCATGGTC CCGACGGGTC CGTGGCAGGT CCCCGAGTTC
GTCAAGGCGA GAGTCGACTA CGGCGTCGTT CCGATGCCGA GCTACTCGGA CAGACCGACG
ACGATCTCGG GCCCGGACGC GTGGATGCTG TTCGACAACG GCGACGCGCG CGCCAGAGCG
GCGCAGGAGT TCGCGCAGTG GCTGACGCTG CCCGAGCAGG ACGCCGTGTG GGACGTGGAC
GCCGGCTCGC TGCCGCTGCG CAGATCGACC GCGCAGCAGC CGATATGGAG AAGACACGCG
CAGGAGGTCG TCGGGCTCGA CGTCTTCACC GCTGCGCTCG AGCAGGCGCG TGTGCGCCCG
ACGATCCAGG CCTACCCGAA GCTGTCCGAG GCGGTCGGGT CGGGGATCGT CGACGTCCTG
CTCGGCACCG CCGACCCGCA GGAGGCGCTC GACAAGGCCG TCGACGGCGC GAACGAGGCG
CTCGCCGGCG ACTGA
 
Protein sequence
MREVYEVRTR NRGLTRAALA ALLVALLALV AAGCGSGDDD SGGDGGDGPV EITFWHGQNQ 
TAQQTIEGLV DRFNASHPDV KVKAEVGALA DSLYQKTTAA LAGGKYPDVV YQFGPNIASL
ARSPKALDLT DAVRDAAWRW DDFYPPAREA VTVDGKVRAV PALIDSLAVV YNRRLFREAG
IPAPRAGWTW DDYRAIARQL TDSSKGQFGS AWPGVGDEDT VWRLWPMVWQ LGGDVTSPDG
EQAGFEGESG LTSFTTINDM AVTDRSLYID KTAGSEKMYA IFNTGRIGMV PTGPWQVPEF
VKARVDYGVV PMPSYSDRPT TISGPDAWML FDNGDARARA AQEFAQWLTL PEQDAVWDVD
AGSLPLRRST AQQPIWRRHA QEVVGLDVFT AALEQARVRP TIQAYPKLSE AVGSGIVDVL
LGTADPQEAL DKAVDGANEA LAGD
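
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below is illustrative only: the helper names (clean_sequence, gc_fraction, translate) are hypothetical and not part of the source record. It joins the blocks, computes a GC fraction, and translates a coding sequence. It relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the two tables differ in which start codons are permitted), and it only demonstrates the first row of the gene sequence rather than the whole gene.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# The amino acid assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence and first row of the protein sequence.
dna_row = "ATGAGAGAGG TCTACGAAGT GCGCACACGG AACAGAGGGC TCACACGCGC CGCGCTCGCC"
protein_row = "MREVYEVRTR NRGLTRAALA"

dna = clean_sequence(dna_row)
print(translate(dna) == clean_sequence(protein_row))
print(round(gc_fraction(dna), 2))
```

Applying the same functions to the full gene sequence should reproduce the 444-residue protein and a GC fraction close to the reported 69%, although that full-length check is left to the reader here.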