Gene Cwoe_4092 details


Gene Information

Locus tag: Cwoe_4092
Symbol:
ID: 8734554
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 4345778
End bp: 4346968
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 74%
IMG OID: 646504719
Product: major facilitator superfamily MFS_1
Protein accession: YP_003395882
Protein GI: 284045542
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
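
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The helper names `span_length` and `expected_protein_length` are hypothetical and not part of any database tooling.

```python
# Values copied from the record; everything else is an illustrative assumption.
START_BP = 4345778
END_BP = 4346968
GENE_LENGTH_BP = 1191
PROTEIN_LENGTH_AA = 396


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of residues, assuming one stop codon that is not counted."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the assumptions match the record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the span from start to end gives 1191 bp, and 1191 / 3 = 397 codons, one of which is the stop codon, leaving 396 residues.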


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.156772
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGGGCG TTCGCGCGCG GCTCGCGATC GCGACCGCGG CGGCGGTCGG CATCCTCTAC 
GGCGTCCAGG GGCTGACGGC GGCGCTGCCG AGCGTGCAGA CGGAGTTCGG GGTCTCCGAC
ACCGCGCTCG CGCTCTTCAC GGCGGCGTAC ATGCTGCCCG CGGTCGTCTT CGCGATACCG
CTCGGCTGGC TCGCCGACAC GCTCGGCCGC CGGCGCGTCT TCGTCGTCGC GGCGGTGCTG
TTCAGCGTCG CCGGCGGGGC GCAGGCGTGG GCGCCGGACT ACGAGACGCT GCTCGCGCTG
CGGTTCGTGC AGGGCATCGG CTTCGGCGCG CTGATGCCGC TGACGGTGAC GCTGATCGGC
GACGCGCTGC GTGGTGCGCA GCAGCTGCGC GCGCAGGCGT CACGGCAGGT CGCGATGACG
CTCGGCGAGT TCGCGATGCC GCTGATCGGT GCGGCGCTGC TCGCGATCTC GTGGAGAGCG
CCGCTCGCCG CGCAGTTCGT CCTGTTGCTG CTCGCGGTCG GCGGGGCGCT CGTGCTCGAC
GACGAGCACG AGCCAGGTGG GTCCTCGCGC GCGTACGCGC GCGTGCTGAC CGGTGCGGTG
CGCGGGCCGG GGATGGGCGG CGTGCTGATC GCCGGCTTCC TGCGCTTCTG GTGCAAGTTC
GCGCTGCTGA CGTACGCGCC GACGCTCCTG ATCCAGGAGC GTGGTGCCTC GCCGCTGGAG
GCCGCGCTGG TCGTCAGCGT CGCGTCGCTC GTCGCGGCGG TCTCGGGCAC GCAGGCGGTG
CGTGTCCTGC GCCGCGTCCC GGCCTCGCGC CTGCTCGCGA CCGCGATCGT GATGAGCGGC
GCCGGGCTGG TCGCGATCGC GATCGCCCCG AGCTGGCAGC TCGCGCTGGC GGCGTCGGTG
CTGTTCGGCG TCGGCGACGG CTGGCTGATG GTGATGCAGA ACTCGATCGT CACCGAGGCG
GCGCCCCCGG CGGTCCGGGC GGGATTGATC GGCGTCAACA GCATGGTGCG CAACGCCGGC
AAGCTCGCCG CGCCGCTCGC GATCGGCGCG ATCGTGCTCG TCGCGCCGCT GTCACTCGCG
CTCGTCGCCG TCGCCGGGAC AGCATGGGCG CTCGTCCCCG TCGTCGCCCG CGCCCGGGAG
TTCGACGATG TCCTCGGCGG CCACGTGCGC AAGGACGACG ACGGCGCATA G
 
Protein sequence
MQGVRARLAI ATAAAVGILY GVQGLTAALP SVQTEFGVSD TALALFTAAY MLPAVVFAIP 
LGWLADTLGR RRVFVVAAVL FSVAGGAQAW APDYETLLAL RFVQGIGFGA LMPLTVTLIG
DALRGAQQLR AQASRQVAMT LGEFAMPLIG AALLAISWRA PLAAQFVLLL LAVGGALVLD
DEHEPGGSSR AYARVLTGAV RGPGMGGVLI AGFLRFWCKF ALLTYAPTLL IQERGASPLE
AALVVSVASL VAAVSGTQAV RVLRRVPASR LLATAIVMSG AGLVAIAIAP SWQLALAASV
LFGVGDGWLM VMQNSIVTEA APPAVRAGLI GVNSMVRNAG KLAAPLAIGA IVLVAPLSLA
LVAVAGTAWA LVPVVARARE FDDVLGGHVR KDDDGA
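
The gene and protein sequences above can be spot-checked against each other. The following is a minimal sketch, not the database's own method: it translates only the first 60 nucleotides of the gene sequence and computes their GC fraction. It assumes the standard codon assignments, which translation table 11 shares for codons within a reading frame. The names `CODON_TABLE`, `translate`, and `gc_fraction` are hypothetical helpers introduced for illustration.

```python
# First 60 nt of the gene sequence and first 20 aa of the protein sequence,
# copied from the record with the spacing removed.
GENE_FRAGMENT = (
    "ATGCAGGGCG" "TTCGCGCGCG" "GCTCGCGATC"
    "GCGACCGCGG" "CGGCGGTCGG" "CATCCTCTAC"
)
PROTEIN_FRAGMENT = "MQGVRARLAI" "ATAAAVGILY"

# Codon table built in TCAG order; '*' marks a stop codon.
_BASES = "TCAG"
_AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in _BASES for b in _BASES for c in _BASES), _AMINO)
)


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = dna.upper().replace(" ", "")
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring spaces."""
    dna = dna.upper().replace(" ", "")
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    print(translate(GENE_FRAGMENT) == PROTEIN_FRAGMENT)
    print(f"GC fraction of fragment: {gc_fraction(GENE_FRAGMENT):.2f}")
```

Applying the same functions to the full 1191 bp sequence, with the spaces and line breaks removed, would allow the whole protein sequence and the listed GC content to be compared in the same way.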