Gene Cwoe_5639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_5639 
Symbol 
ID8736115 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp6039775 
End bp6040884 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content69% 
IMG OID646506269 
Productputative sugar ABC transporter, substrate- binding protein 
Protein accessionYP_003397418 
Protein GI284047078 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4213] ABC-type xylose transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.420367 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGATCA TCCGCACGGG GACCGCGGTT GTGCTCGCCG GGGCGCTCGC CCTCGGTGCC 
AGCGCCTGCG GCAGCGACGA CGACGGGGGC GGCTCGACGA CCGCCGCAAG CAGCGGCGGG
GGCGGCGGGG GCGAGAAGAT CGCGCTGCTG CTGCCCGAGT CCAAGACGGC GCGCTACGAG
AATCAGGATC GACCGCGCTT CGTCGAGAAG GTCAGAGAGC TGTGCCCAGA CTGCGAGGTG
CTCTACTCGA ACGCCGAGCA GGACCCCGCC CAGCAGCAGC AACAGGCCGA GCAGGCGATC
ACCAACGGCG CGAGAGTGCT CGTCGTCGAC GCCGTCGACG TGAAGTCGGC CGCGGCGATC
GCGACGAACG CGAAGTCCCA GGGCGTGCCG GTCGTCAGCT ACGCGCGCCT GATCTCCGAC
GCCGAGCTCG ACGCGTACGT CTCGATCGAC CCGTTCAGAG TCGGCCAGCA GCAGGGCGAG
GCGCTCGTGA GAGCGCTCAG AGGCGGCAGA AGAATCGTGA TGGTCAACGG TTCGCCGACC
GACTCCAACT CGGCGCCGTA CAAGGAGGGC GCGCACGACG TCTTCGACAG ATCCGGCATC
GACGTCGTCA AGGAGTACGA CACGCCCGAC TGGAGCCCAG ACAGAGCCCA GACCGAGATG
GAGCAGGCGA TCACGAGCGC CGGCAAGGAC GGCTTCGACG GCGTCTACTC GGCCAACGAC
GGCATGGCCG GCGGCGTGAT CGCGGCGATG AAGTCGGCCG GCGTCGACCC CAGAACGCGG
CCCGTCACCG GACAGGACGC CGAGGTCGCG GCGCTGCAGC GGATCCTCAC CGGCGAGCAG
CTGATGACGA TCTACCAGCC GATCAGCGAG ATCGCCGCGA CCGCCGCCGA GCTGGCGGTG
CCGCTCGCCA GAGGCGAGGG CGTCCCGTCG ATCACGACGA CCGAGGTCGA CAACGGCGGC
CCCAGAAGAG TGCCGGCCGT CCTGCTCGAC ACGATCGTGA TCACGAGAGA CAACATCCAG
GACGTGATCA TCAGAGACGG CTTCGCGACC GCCGAGCAGA TCTGCACCGA CGAGTACAGA
GCGGCGTGCG CCGAGGCGGG TATCAGATAG
 
Protein sequence
MSIIRTGTAV VLAGALALGA SACGSDDDGG GSTTAASSGG GGGGEKIALL LPESKTARYE 
NQDRPRFVEK VRELCPDCEV LYSNAEQDPA QQQQQAEQAI TNGARVLVVD AVDVKSAAAI
ATNAKSQGVP VVSYARLISD AELDAYVSID PFRVGQQQGE ALVRALRGGR RIVMVNGSPT
DSNSAPYKEG AHDVFDRSGI DVVKEYDTPD WSPDRAQTEM EQAITSAGKD GFDGVYSAND
GMAGGVIAAM KSAGVDPRTR PVTGQDAEVA ALQRILTGEQ LMTIYQPISE IAATAAELAV
PLARGEGVPS ITTTEVDNGG PRRVPAVLLD TIVITRDNIQ DVIIRDGFAT AEQICTDEYR
AACAEAGIR