Gene Cwoe_1038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_1038 
Symbol 
ID8731473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp1093996 
End bp1095516 
Gene Length1521 bp 
Protein Length506 aa 
Translation table11 
GC content73% 
IMG OID646501655 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003392845 
Protein GI284042505 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.531658 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACCGA CACCTTCGAC CACCCCATCC GCACGCGAGC CGGCCGATCC GCGCCGCTGG 
TGGCTGCTCA CGCTCGTCGC GATCGCCGAG CTGATGATCA TCCTCGACAT CTACATCGTC
AACATCGCGC TGCCGTCTGC CCAGAGCGCG CTGGAGATAC CCGACGCGAG CCGCCACTGG
GTCGTGACGG CGTACGCGAT CACCTTCGGT GGACTGCTGC TGCTCGGCGG TCGCGTCGGC
GACTACTGGG GCCGCAAGCG CACGTTCACG TTGTCGCTTG TCGGCTTCGG CGTCGCGTCG
GCGCTCGGCG GCGCCGCCTG GTCGCCTGAG CTGCTGTTCG CGGCCCGCGC GCTGCAGGGC
GTCTTCGCCG CGCTGATGGC GCCGGCGATC CTGTCACTGC TGTTCGTCAC GTTCACCGAC
CCGCGCGAGC GAGCGAAGGC GTTCGGCGTC TGGGGCGCGG TCGCCGGCAC CGGCAGCGCG
ATCGGCCTGC TGCTCGGCGG CGTCTTCACC GAGTACGCCT CGTGGCGCTG GACGCTGCTC
GTCAACGTCC CGGTCGCGGT CGCGCTCGCC GTCGCCGCAT GGTGGATCGT CAGGGAGAGC
CGCACCGACG GCGAGACGCG CTACGACATC CCGGGCGCGC TGACCTCGAC GCTCGGCGTG
ACCGCGCTCG TCTACGGCTT CACCCGCGCC GAGAGCGACG GCTGGGGCGC CCCGGTCACG
CTCGCGCTGC TGGCCGCCGG CGTCGCGCTG ATCGTCGCGT TCGTCGCGAT CGAGCGGCGC
TCGGCGAACC CGCTGCTGCC GCTGCGTGTG CTGACCGATC GCAACCGCGC CGGCTCGTTC
GCCGCGAACG CGCTGTTCGC GGGGGCGATC TTCTCCTACG GCGTCTTCCT CGTGTATTAC
CTGCAGGGCA GCCGCGGCTA CTCGGCGATC GAGTCCGGTC TCGCGATCCT GCCGCTCACG
CTGGCTGCGA TCGCGTTCGT CTCGATCGGC GCACGCCTGC TGCCGCGCGT CGGGCCGCGG
CCGCTGACGG TCGGCGGCTT CGCGATCGGC GCCGTCGGGC TCGGCTGGCT GGCGTTGATC
GGCGAGGACA CGTCCTACAT GGCGGTCGTC TTCCCCGGGC TCGTCCTGCT CGGCATCGCG
GCTGGGCTGG TGTGGCCGGT GCTGAGCAAC ACGGCGCTCG TCGGCGTGCA GCCGCGCGAC
GCCGGCGCGG CGAGCGGGAT GGTGAGCGTC GCCCAGCAGC TCGGCGGCGC GCTCACGGTC
GCGTTCCTCA ACACGCTCGC GGCGAGCATC GCCGAGGGGC GGGTCGAGCG CGACGGCGAC
GCCGCGCTCG CCGCGGGCCT GATCGACGGC TACGCGGCGA CGTTCGCGGT CGGCGGCGGG
CTGATGCTGC TCGGCGCGGT CGTCTCGCTG CTGACGATCA CGCGCCGGCT GCCGGCGTCC
GAGCAGGACG CCGAGCTGCC GGACCCCGAG CTGCCGGCCG TGAACGAGGA GCCGGCCGTC
GCCGGCGCGC GGGCGGCGTG A
 
Protein sequence
MQPTPSTTPS AREPADPRRW WLLTLVAIAE LMIILDIYIV NIALPSAQSA LEIPDASRHW 
VVTAYAITFG GLLLLGGRVG DYWGRKRTFT LSLVGFGVAS ALGGAAWSPE LLFAARALQG
VFAALMAPAI LSLLFVTFTD PRERAKAFGV WGAVAGTGSA IGLLLGGVFT EYASWRWTLL
VNVPVAVALA VAAWWIVRES RTDGETRYDI PGALTSTLGV TALVYGFTRA ESDGWGAPVT
LALLAAGVAL IVAFVAIERR SANPLLPLRV LTDRNRAGSF AANALFAGAI FSYGVFLVYY
LQGSRGYSAI ESGLAILPLT LAAIAFVSIG ARLLPRVGPR PLTVGGFAIG AVGLGWLALI
GEDTSYMAVV FPGLVLLGIA AGLVWPVLSN TALVGVQPRD AGAASGMVSV AQQLGGALTV
AFLNTLAASI AEGRVERDGD AALAAGLIDG YAATFAVGGG LMLLGAVVSL LTITRRLPAS
EQDAELPDPE LPAVNEEPAV AGARAA