Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_4092 |
Symbol | |
ID | 8734554 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 4345778 |
End bp | 4346968 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 646504719 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003395882 |
Protein GI | 284045542 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.156772 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGGCG TTCGCGCGCG GCTCGCGATC GCGACCGCGG CGGCGGTCGG CATCCTCTAC GGCGTCCAGG GGCTGACGGC GGCGCTGCCG AGCGTGCAGA CGGAGTTCGG GGTCTCCGAC ACCGCGCTCG CGCTCTTCAC GGCGGCGTAC ATGCTGCCCG CGGTCGTCTT CGCGATACCG CTCGGCTGGC TCGCCGACAC GCTCGGCCGC CGGCGCGTCT TCGTCGTCGC GGCGGTGCTG TTCAGCGTCG CCGGCGGGGC GCAGGCGTGG GCGCCGGACT ACGAGACGCT GCTCGCGCTG CGGTTCGTGC AGGGCATCGG CTTCGGCGCG CTGATGCCGC TGACGGTGAC GCTGATCGGC GACGCGCTGC GTGGTGCGCA GCAGCTGCGC GCGCAGGCGT CACGGCAGGT CGCGATGACG CTCGGCGAGT TCGCGATGCC GCTGATCGGT GCGGCGCTGC TCGCGATCTC GTGGAGAGCG CCGCTCGCCG CGCAGTTCGT CCTGTTGCTG CTCGCGGTCG GCGGGGCGCT CGTGCTCGAC GACGAGCACG AGCCAGGTGG GTCCTCGCGC GCGTACGCGC GCGTGCTGAC CGGTGCGGTG CGCGGGCCGG GGATGGGCGG CGTGCTGATC GCCGGCTTCC TGCGCTTCTG GTGCAAGTTC GCGCTGCTGA CGTACGCGCC GACGCTCCTG ATCCAGGAGC GTGGTGCCTC GCCGCTGGAG GCCGCGCTGG TCGTCAGCGT CGCGTCGCTC GTCGCGGCGG TCTCGGGCAC GCAGGCGGTG CGTGTCCTGC GCCGCGTCCC GGCCTCGCGC CTGCTCGCGA CCGCGATCGT GATGAGCGGC GCCGGGCTGG TCGCGATCGC GATCGCCCCG AGCTGGCAGC TCGCGCTGGC GGCGTCGGTG CTGTTCGGCG TCGGCGACGG CTGGCTGATG GTGATGCAGA ACTCGATCGT CACCGAGGCG GCGCCCCCGG CGGTCCGGGC GGGATTGATC GGCGTCAACA GCATGGTGCG CAACGCCGGC AAGCTCGCCG CGCCGCTCGC GATCGGCGCG ATCGTGCTCG TCGCGCCGCT GTCACTCGCG CTCGTCGCCG TCGCCGGGAC AGCATGGGCG CTCGTCCCCG TCGTCGCCCG CGCCCGGGAG TTCGACGATG TCCTCGGCGG CCACGTGCGC AAGGACGACG ACGGCGCATA G
|
Protein sequence | MQGVRARLAI ATAAAVGILY GVQGLTAALP SVQTEFGVSD TALALFTAAY MLPAVVFAIP LGWLADTLGR RRVFVVAAVL FSVAGGAQAW APDYETLLAL RFVQGIGFGA LMPLTVTLIG DALRGAQQLR AQASRQVAMT LGEFAMPLIG AALLAISWRA PLAAQFVLLL LAVGGALVLD DEHEPGGSSR AYARVLTGAV RGPGMGGVLI AGFLRFWCKF ALLTYAPTLL IQERGASPLE AALVVSVASL VAAVSGTQAV RVLRRVPASR LLATAIVMSG AGLVAIAIAP SWQLALAASV LFGVGDGWLM VMQNSIVTEA APPAVRAGLI GVNSMVRNAG KLAAPLAIGA IVLVAPLSLA LVAVAGTAWA LVPVVARARE FDDVLGGHVR KDDDGA
|
| |