Gene Information |
Locus tag | Cwoe_2718 |
Symbol | |
ID | 8733161 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 2900117 |
End bp | 2901328 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 646503330 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003394512 |
Protein GI | 284044172 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
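The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the reported 1212 bp) and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the final stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 2900117, End bp 2901328.
length = gene_length_bp(2900117, 2901328)
print(length, protein_length_aa(length))  # prints "1212 403"
```

Both results agree with the Gene Length and Protein Length fields of the record.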
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00439118 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.195622 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCCT CCCTCTCCCC TCCGCGGTCC GCGACCGATC CCGCGCGCAC CACGCGTTCG CAATGGCTCG CGGTGGCGGT GCTGTCGCTG AGCACGTTCG TGGTGGTGAC CTCGGAGATG CTTCCCGTGG GCGTCCTGAC CCCGATGGCC GACGGCCTCG GGATCACGCA GGGCGCGGCC GGCTACAGCC TGTCGATCAC CGGCCTCGTC ACGGCGGTCA CCGCGCCGCT CGTCCCGCGC CTGCTCGGTC GGCTCGACCG GCGCCTGGTG CTGGCGTCGG CGATGGTGGT CCTCGCTGCC GGCAACGCGC TGACCGCGGT GGCGCCGGGG TTCGGGCTGC TGGTCGTCTC GCGTGCCCTC CTCGGGATCG GGATGGGCGT CGTGTGGGGG TTGGCCGCCG TGATCGCCAG CCGGCTGGTC GCACCGCGCA ACGCCGCGCT GGCGGTCTCC TTCGCCGTCA GCGGGGTCGC GGCGGCGTCG GTGATCGGGG TGCCGCTGGG CACGGTCGTC GGGAACGCCT TCGGTTGGCG GACGGCGTTC GCCGTGCTCG CCGGCGGCGG CATCGCCCTC GCCGCGGGTC TCGCGCTGGC ATTGCCCCGC CTGCCTCGCC CGGCCGCCCC GGCCGGCGCA GGCGCCGGTA CCGGCGGCCA ATCGCTGTTG GGCACGCCCG CGGTCGCCGC CGGACTTGCG CTGATCGTGC TGCTCGTCAC CGCGCACTTC GCCGCCTACA CCTACGTGCG CCCCGTCCTC GAAGAACGGA CCGGACTGGC TCCTGGGTCG ATCGCACTCG CGCTCCTCGC GTATGGCGCC TGCGGGCTGG CCGGCAACTT CGCGGCCGGC GCCCTCGCCG CGCGACGCGC GCGCTTCACG CTCCTGAGCC TGGCCACGGG GATCGGCGCG GCGATCGCGC TGCTGGCGTT GCTGGGCAGC GTGGCCGGCG TCGCCTACGC AGCCGTCGCG CTGTGGGGGC TGACCTATGG CGGGCTGTCG GTGGGCGGTC AGATCTGGAT GACCCAGTCC GCACCGCACC GCACCGAGCA CGTCACCGGG CTCTACGTCG GGGTCTTCAC GGCCGCGATC GCCTTGGGCG CCTTCCTCGG CGGCACCGTC GTCGAGGCGT CCGGGGTCAC GCCCCTGCTG TGGAGCGCAG CCGCGCTGGC GCTCGCCGGC CTCGCCGTCG GCGCGGTCGG CCCCGGGCCG GCGCGCCGAT GA
Protein sequence | MSASLSPPRS ATDPARTTRS QWLAVAVLSL STFVVVTSEM LPVGVLTPMA DGLGITQGAA GYSLSITGLV TAVTAPLVPR LLGRLDRRLV LASAMVVLAA GNALTAVAPG FGLLVVSRAL LGIGMGVVWG LAAVIASRLV APRNAALAVS FAVSGVAAAS VIGVPLGTVV GNAFGWRTAF AVLAGGGIAL AAGLALALPR LPRPAAPAGA GAGTGGQSLL GTPAVAAGLA LIVLLVTAHF AAYTYVRPVL EERTGLAPGS IALALLAYGA CGLAGNFAAG ALAARRARFT LLSLATGIGA AIALLALLGS VAGVAYAAVA LWGLTYGGLS VGGQIWMTQS APHRTEHVTG LYVGVFTAAI ALGAFLGGTV VEASGVTPLL WSAAALALAG LAVGAVGPGP ARR
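The GC content field of the record can in principle be recomputed from the gene sequence. The sketch below is one simple way to do that, assuming the sequence is given as plain A/C/G/T letters in space-separated blocks of ten, as it is displayed here. The function name `gc_fraction` is hypothetical, and the example call uses only the first two blocks of the gene sequence rather than the full gene.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two 10-base blocks of the gene sequence in this record.
print(f"{gc_fraction('ATGAGCGCCT CCCTCTCCCC'):.0%}")  # prints "70%"
```

Passing the full gene sequence string to the same function gives a value that can be compared with the GC content reported in the record.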