Gene Cwoe_4981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_4981 
Symbol 
ID8735447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp5310438 
End bp5311994 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content78% 
IMG OID646505608 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003396767 
Protein GI284046427 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.547375 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCACG CTCCGGCCCA CCCCGACCAC GACGCTCACG GCCACCATGG CCCGCCACCG 
CCGCGCCGTT CGTGGACGCT GCTCGCGCTG CTGTGCGCGG CGCAGTTCAT GGTCATCCTC
GACGTCACCG TCGTGAACGT CGCGCTGCCG AGCATCGGCA GCGACCTCGG CTTCGCTCCC
GCCGACCTGC AATGGGTCAT CACCGCCTAC GTGCTGTTCA CCGGCGGCCT GATGCTGCTC
GGCGGCCGCG CCGCCGACGT GCTCGGCCGG CGCCCCGTCT TCCTCGCCGG CCTCGTCGTC
TTCACGGCCG CGTCGCTCGC CAGCGGCCTC GCGCCGACGG CCGATGTGCT CGTCGTCGCG
CGGGCCGTGC AGGGCGTCGG CGCGGCCCTG CTCGTGCCGG CCGCGCTGGC GCTCGTGACG
ACCGCCTACG ACGGCCACCA GCGCGCGGTC GCGCTCGGCG TCTGGGGCGC GATCGGCAGC
GCCGGCGCCG CCGTCGGCGT GCTCGTCGGC GGCGTGCTGA CCTCGGCGCT GAGCTGGGAG
TGGGTCTTCT ACGTCAACGT CCCGATCGGG ATCGGCGCCG GCCTCGGCGT CGCCGCGCTC
GTGCCGGCGC GAGCCGCCCC AGACTCGCAC GGCGCCGCGG ACTCGCACGG CGCCGCGGAC
TCGCACGGCG CTGCGGCCGA GGGGCCGAGC CACGCGCCGA GCGTCGCCGC CGCGCCGAGC
GCCGCCGCCT CGCGCTCGCG CGGCCGGCTC GACGTCGCCG GCGCGCTGAC CGTGATGGCC
GGGCTCGTGC TCGTCGTGCT GGGGCTCGAC GGCGCCGCCG AGCACGGTTG GGGCTCGCTG
CGCACGCTCG GCCTGCTCGG CGGCGGCGCG CTGCTGCTGG CCGCCTTCGC CGCCGTCGAG
CGGAGCGCCG CCGCTCCGCT CGTGCCGCCC GCGACCTGGC GCGAGCGGCC GCTCGTCGCG
AGCGCGGCGG TGATGCTCGG TGCGACCGGC ATCCTCGTCG GCGCGTTCTT CCTCAACACG
CTCTACCTGC AGGAGGTGCT CGGCGCGAGC GCACTGGAGA CCGGGCTCGC GTTCCTGCCG
CTCGCGCTCG TGATCCTGGC CGGCGCCCAC GCCGCGTCGC GGCTGCTGCC GCACGCCGGC
TCGCGCACGG TCGCGGTCGG CGGGCTCGTG CTCGTCACGG CCGGTGCGCT GCTGCTCGCC
GCGGCGCCGC CGCGCGCCGC CTACGGCAGC GACCTGCTGC CGGGGCTGCT GCTGCTCGGC
GCCGGCGTCG GGCTCGCGTT CGTCGCGGTC TCGGTGACGG CGATGGCGGA GGTCCGCCAC
GAGCAGGCCG GGCTCGCCTC GGGGCTGATG ACGACCGCCC ACGAGCTCGG CGCCGCGCTC
GGCGTCGCGG TCCTCTCCGC CGTCGCCGCC GGCGCGCTGG CGGACGGCGA CCCGGCCGCC
GGCTACGGCG ATGGCTTCCT CGCCGCCGGC CTGATCGCGG CCGCGCTCAC CGTCGTCGCG
CTCGTCGCGC TCCCGGCCGT GCGGCCCGCG CCGGGCGCAC GCGCCGCGAT GCACTGA
 
Protein sequence
MHHAPAHPDH DAHGHHGPPP PRRSWTLLAL LCAAQFMVIL DVTVVNVALP SIGSDLGFAP 
ADLQWVITAY VLFTGGLMLL GGRAADVLGR RPVFLAGLVV FTAASLASGL APTADVLVVA
RAVQGVGAAL LVPAALALVT TAYDGHQRAV ALGVWGAIGS AGAAVGVLVG GVLTSALSWE
WVFYVNVPIG IGAGLGVAAL VPARAAPDSH GAADSHGAAD SHGAAAEGPS HAPSVAAAPS
AAASRSRGRL DVAGALTVMA GLVLVVLGLD GAAEHGWGSL RTLGLLGGGA LLLAAFAAVE
RSAAAPLVPP ATWRERPLVA SAAVMLGATG ILVGAFFLNT LYLQEVLGAS ALETGLAFLP
LALVILAGAH AASRLLPHAG SRTVAVGGLV LVTAGALLLA AAPPRAAYGS DLLPGLLLLG
AGVGLAFVAV SVTAMAEVRH EQAGLASGLM TTAHELGAAL GVAVLSAVAA GALADGDPAA
GYGDGFLAAG LIAAALTVVA LVALPAVRPA PGARAAMH