Gene Cwoe_0876 details

Gene Information

Locus tag: Cwoe_0876
Symbol:
ID: 8731310
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 918570
End bp: 920168
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 72%
IMG OID: 646501493
Product: major facilitator superfamily MFS_1
Protein accession: YP_003392684
Protein GI: 284042344
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
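
The coordinate and length fields above are mutually consistent, and the check can be sketched in a few lines of Python. This is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record does end in TGA). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(918570, 920168)
print(length_bp)                  # 1599
print(protein_length(length_bp))  # 532
```

Both results match the Gene Length and Protein Length fields reported for Cwoe_0876.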


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.757243
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGCAAGT GGCTCCCGAT GGCCATCCTG GCCTCCGCCC AGTTCATCAT GGTGCTCGAC 
TCGAGCGTCA TGAACGTGGC GATCTCGCAG ATCGTCGCGG ACCTCGACAC GACGATCCAG
GGCGTGCAGA CCGCGATCAC GCTGTACACG CTCGTGATGG CCGCGTTCAT GCTGCTGGGC
GCGAAGCTCG GCGACATGAT CGGCCGCAAC CGCGCGTTCG CGATCGGACT CGCGATCTAC
GGGGTCGGCT CGCTGACGAC GGCGCTGAGC CCGAACCTCG CCGTCCTGCT GATCGGCTGG
TCGGGAATCG AGGGGTTCGG CGCGGTGCTC GTGGTGCCGG CGATCGCGGC GCTCACCGCC
GCCACCTACG AGGGCAAGGA CCGCGCGCTC GCGTACGCGC TGCTCGGCGG GATCGCCGCC
GTCGCGGTCG CCGCCGGCCC GCTGATCGGC GGCTGGGTGA CGACCGAGTT CACGTGGCGC
TACGTCTTCG CCGCCGAGAC CGTCGTCGTG ATCCTGATCC TGCTCCTGCG CGGGCAGCTC
GCGCAGGCGC CGGCCGCCGC GCACCGCCCG CGGCTCGACG TCGTCGGCGT CGCGCTGTCA
TCCGCCGGCC TCGGCCTGAT CGTGTTCGCG ATCCTGCGCA GCAGCGTCTG GGGCTTCGTG
CAGCCGCGCT CCGCACCGAC GATCGGCGGC ACGGAGATCA CGCCGCTCGG CTTCTCCGTC
GTCCCGTTCA TGGTGCTCGG CGGACTCGCG CTGCTCGCGG CGTTCGTGTC GTGGGAGGAG
CGACGCGCCG AGCGCGGCCT GGACCAGCTG CTCGACACCG CGCTGCTGAA GATCGCGCAG
CTGCGCGCGG GGCTCTCGAC GCTGGTCGGG CAGCAGCTCG TCCTGATGGG GACGTTCTTC
GTCATCCCCG TGTACCTGCA GGTCGTGCTG GGCCTCGACG CGTTCGAGAC CGGCAAGCGC
CTGCTGCCGC TGTCGATCGC GATGCTCGTC TTCGCCCTGC TGGGACCTGG GATCGCGGGC
CGGCGCTCGC CGCGCACCGT CGCGCAGCTC GGGCTCGTCG CGGTCAGCGT CGGCGCGGTC
GTGATGCTCG CGACGCTCGA CGTGAGACTG AACGACACCG GCTTCAAGGT CGCGCTCGCG
TTGATGGGCG CGGGCGCCGG GCTGCTCGCC TCGCAGCTGG GGAACGTGAT CATGTCGTCG
GTCGTGCCGA CCCAGACGAG CGAGGCCGGC GGACTCCAGG GGACCGCCCA GAATCTCGGC
TCCTCGCTCG GCACCGCGAT CGTCGGCGCG GTGCTGCTCG CGTCGCTCGC GACCGGCTTC
AGCGACCGCA TCGCCGACAA CCCGGACATC CCGCCCGCCG CGCGCGAGAC GATCGTCGCC
AACACCGAGC AGGGCATCGA CATCGTGCCG GTCACCTCCG TCGAGCAGGC CGCGGTCGAC
GGCGGCCTGA CGCCCGACCA GGCGAGCGCG GTCGCCGACG ACTACGGCGA CGCCCAGCTC
GACGCGCTGC GTCTCTCGCT CGGAGCGGTG GCGCTCGCGG CGCTGCTGTC GCTGTGGCTG
ACGCGGCGGC TGCCGACACG ATCGCTCGCG GACCCGTGA
 
Protein sequence
MRKWLPMAIL ASAQFIMVLD SSVMNVAISQ IVADLDTTIQ GVQTAITLYT LVMAAFMLLG 
AKLGDMIGRN RAFAIGLAIY GVGSLTTALS PNLAVLLIGW SGIEGFGAVL VVPAIAALTA
ATYEGKDRAL AYALLGGIAA VAVAAGPLIG GWVTTEFTWR YVFAAETVVV ILILLLRGQL
AQAPAAAHRP RLDVVGVALS SAGLGLIVFA ILRSSVWGFV QPRSAPTIGG TEITPLGFSV
VPFMVLGGLA LLAAFVSWEE RRAERGLDQL LDTALLKIAQ LRAGLSTLVG QQLVLMGTFF
VIPVYLQVVL GLDAFETGKR LLPLSIAMLV FALLGPGIAG RRSPRTVAQL GLVAVSVGAV
VMLATLDVRL NDTGFKVALA LMGAGAGLLA SQLGNVIMSS VVPTQTSEAG GLQGTAQNLG
SSLGTAIVGA VLLASLATGF SDRIADNPDI PPAARETIVA NTEQGIDIVP VTSVEQAAVD
GGLTPDQASA VADDYGDAQL DALRLSLGAV ALAALLSLWL TRRLPTRSLA DP
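
The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG can act as an initiation codon and is then read as methionine. The sketch below translates the first 60 bases of the gene sequence to illustrate this. It is a minimal illustration, not the pipeline used to produce this record: the codon table is built from the standard base ordering (table 11 assigns the same amino acids as the standard code and differs in its start codons), `START_CODONS` lists only three of the initiators that table 11 allows, and the `translate` helper is a hypothetical name.

```python
from itertools import product

# Standard amino-acid assignments in TCAG codon order; table 11 uses the same
# assignments and differs from the standard code only in permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a subset of the initiation codons allowed by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a CDS fragment that begins at the start codon."""
    dna = dna.replace(" ", "").upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # an initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence in this record (60 bases).
first_60 = "GTGCGCAAGT GGCTCCCGAT GGCCATCCTG GCCTCCGCCC AGTTCATCAT GGTGCTCGAC"
print(translate(first_60))  # MRKWLPMAILASAQFIMVLD
```

The output matches the first 20 residues of the protein sequence listed above.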