Gene Cwoe_4784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCwoe_4784 
Symbol 
ID8735250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameConexibacter woesei DSM 14684 
KingdomBacteria 
Replicon accessionNC_013739 
Strand
Start bp5105404 
End bp5106900 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content74% 
IMG OID646505413 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_003396572 
Protein GI284046232 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0122812 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.792953 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTCGC TTGATGACTT GGGTCAGAAG CGCATGCTGG CGGCGCTCGT GTTCGCCGGG 
CTGACGGTCT CGGTCATGTC GACGCTCGGC ACGCCGCTGA TCCCGACGAT CGCGGACGAG
CAGCACGTCT CGCTCGACAC CGCGCAGTGG ATGCTCACGG TCACGCTGCT CGTCGGCGCG
ATAGCGACGC CGGTGCTCGG GCGCCTCGGC GACGGGCCGC AGCGGCGGCG GGTCCTGCTC
GTCACGCTCG GGTCGGCGTT CGCCGGGTCG GTCGTCGCCG CCACCTCGAC GCGCTTCCCG
CAGCTGCTCG CCGGCCGCGC ACTCCAGGGC GTCGGCTACG GCACCGTCCC GCTGGCGATC
GCGCTCACGC GCGAGCACGT CACCGGCGAC CGTCTGCGCT CGGGGATCGC GATGCTGTCG
ATCACGGTCG CGGTCGGCGC CGGCCTCGGC TTCCCAGTCA CCGGTCTGAT CGCGCAGACG
CTCGACTTCC ACGCCGCCTT CTGGTTCGGC GCGATCTTCG CCGCCGCGGC GTTCGTCTCG
GTCGCGCTCG CCGTTCCGCG CGCGACGGGG GAGGCGAAGC GGGTGGCGCT CGACGTGCCC
GGCGCGCTGT TGCTCGCCGG CGGCCTCGCG TCGCTGCTGC TCGGCATCAG CCAGGGCGAG
TCGCTCGGCT GGGCCTCCGC GGCGGTGGTC GCGCTGTTCG CCGGCGCGGC GGCGCTGCTC
GCGGCGTGGG TGCACGTCGA GTTGCGCCGC GACGCGCCGC TGGTCGACCT GCGCCTCGTC
TCCCAGCGCG CGATCCTCGG CGCCAACGTC GCCGCGCTGC TGCTCGCGAT GGGCATGTAC
ATCGCGATGT CGCTCGTCAA CCGCCTGATG CAGACGCCCG AGTCGACCGG CTACGGCTTC
GGCGCCACGC TCGTCACGAC GGGCCTGATG CTGCTCCCGC TCTCGCTCGG CAGCCTCGTC
TCGCAGCAGA TCACCCGCCG CGTGATCCGC CGCTACGGGA TCGGCGTCGT GCTGCCCGCC
GGCGCGCTGA TCGTCGCCGC GACGCTGCTG TGGCTCGCCG TCGCGCACGG CAGGCTGCTC
GACATCGCGA TCGCCACGGC GCTGCTCGGC GTCGGCGTCG GCTGTTCGTT CGCGGCGATG
CCGGCGCTGA TCGTCGCCAG CGTCCCGGAG GAGCGGACCG GCAGCGCGAC GAGCCTCAAC
CAGGTGCTGC GCTCCGTCGG CGGGGCGCTC GGCAGCGCGG TCGGCGTCGC GATCCTCGCC
GCCCACCATC CCGCCGCGAC GCCGTTCCCG CAGGAGAGCG GGTACACGAT CGCGTTCCTC
GTCGGCGCCG TCGTCTGCGT CGTGACCGCC GGGCTTGCGA TCCTGCTGGC GCCGCGCCGC
CGGCCGGCCG CGGTCGTCGC GCCAAGGTTC GAGCTGGAGC AGGAGCTGTT GATGGAAGAG
GCAGCCGTCG GAGCAGGTGT CGGACCGAGC GTCTTCGACG GGGAGCGGAG GCGATGA
 
Protein sequence
MSSLDDLGQK RMLAALVFAG LTVSVMSTLG TPLIPTIADE QHVSLDTAQW MLTVTLLVGA 
IATPVLGRLG DGPQRRRVLL VTLGSAFAGS VVAATSTRFP QLLAGRALQG VGYGTVPLAI
ALTREHVTGD RLRSGIAMLS ITVAVGAGLG FPVTGLIAQT LDFHAAFWFG AIFAAAAFVS
VALAVPRATG EAKRVALDVP GALLLAGGLA SLLLGISQGE SLGWASAAVV ALFAGAAALL
AAWVHVELRR DAPLVDLRLV SQRAILGANV AALLLAMGMY IAMSLVNRLM QTPESTGYGF
GATLVTTGLM LLPLSLGSLV SQQITRRVIR RYGIGVVLPA GALIVAATLL WLAVAHGRLL
DIAIATALLG VGVGCSFAAM PALIVASVPE ERTGSATSLN QVLRSVGGAL GSAVGVAILA
AHHPAATPFP QESGYTIAFL VGAVVCVVTA GLAILLAPRR RPAAVVAPRF ELEQELLMEE
AAVGAGVGPS VFDGERRR