Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cwoe_3061 |
Symbol | |
ID | 8733507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Conexibacter woesei DSM 14684 |
Kingdom | Bacteria |
Replicon accession | NC_013739 |
Strand | - |
Start bp | 3263892 |
End bp | 3265181 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 646503676 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_003394855 |
Protein GI | 284044515 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00310805 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCGCT TTTCGCTGCT GTGGAGTCAC CGGGACTTCC GCTGGCTGTT CCTGAGCCAG ACGATCTCCA CGACCGGCGA CCGGATCGTG CTCGTCGCGC TCGCGCTGCT CGTGACCGAG CGAACCGGCT CGACGACCGA TCTCGGCCTC GTGCTCGGCG CCTACACGCT CGCGCACGTC TCGTTCGTCC TGCTCGGCGG TGTCTGGGCC GACCGCCTCC CCCGCCATCG GATCATGTTC TCGACCGACC TGATCCGCGG CGGGCTGCAC GCGCTGCTCG CGGTCCTGAT CCTCACCGAC GTCGTGACGA TCTGGCATCT GATCGCGATC GAGGCGCTGT TCGGCATGGC GGAGGCGTTC TTCCGCCCCG CCTTCAGCGG CCTCGTCCCG CAGACGGTGC CGGAGGAGCT GATCCAGGAG GCGAACGCGC TCAACAACCT GACGCAGACG ATCGCGGAGT TCGCCGGGCC AGCGATCGCG ACCGCGCTCG TGCTGACGGT CGGGACCGGC GCCGCGTTCG GCGTCGACGC GGCGACCTTC TTCGTCAGCG CCGCGCTGCT GACGCTCGTG CGCCCGCGCG AGCGCGGCAA GCCGGCGCCG CGGGCGCCGT GGCGGCGCGA GCTGCGCGAG GGCTTCGAGG AGGTCCGCTC CCGCACCTGG GTGTGGGTCA CGATCTCCGT CTTCTCCTTC CAGTTGCTGG GCGCGTTCGC GCCGTACGTC GTGCTCGGCC CGACCGTCGC CGAGCAGCAG TACGGCGACG CGGCGTTGTA CGGCTGGCTG GCAGCGAGCG TCGGCTTCGG CACGGCGCTC GGCTCGCTGC TCGCGCTGCG CTGGCGTCCG CGGCGGCCGC TCGTCGCCGG GATCCTGCTC GTGTTGCCGT TCTGCCTGCT GCTGACGGCG TTCGCGATCG GGATCCCGCT CGCCGTCGCG CTGCCGATCG GCGCGCTCAC CGGGATCGGC GTCGCGCTGT TCGGCGTCTG GTGGCAGACG GCGCTCGCGC AGCGGATCCC CCCGCACGCC CTGTCGCGCG TGACCTCCTA CGACTGGCTC GGCTCGCTCG CGCTGCTCCC GATCGGCTAC GTGCTCGTCG GCGTGCTCGC GGAGCACGTC GGGGCGACCG AGGTGATGGC CGTCGGCGGC GTGCTCGCCG CGTCGGTCCT GCTGCTCGGA CTGCTGCCGC GCGAGTCGCG CGAGCTGGGC AGCGGCGGCT CCGGCGACGT GCCGCCGACG GCGCCGGGCG CCCCGGCGGC GGCGGGCGCC GCCGGCGTCA CGCCGCGCGC GGACGCCTAG
|
Protein sequence | MARFSLLWSH RDFRWLFLSQ TISTTGDRIV LVALALLVTE RTGSTTDLGL VLGAYTLAHV SFVLLGGVWA DRLPRHRIMF STDLIRGGLH ALLAVLILTD VVTIWHLIAI EALFGMAEAF FRPAFSGLVP QTVPEELIQE ANALNNLTQT IAEFAGPAIA TALVLTVGTG AAFGVDAATF FVSAALLTLV RPRERGKPAP RAPWRRELRE GFEEVRSRTW VWVTISVFSF QLLGAFAPYV VLGPTVAEQQ YGDAALYGWL AASVGFGTAL GSLLALRWRP RRPLVAGILL VLPFCLLLTA FAIGIPLAVA LPIGALTGIG VALFGVWWQT ALAQRIPPHA LSRVTSYDWL GSLALLPIGY VLVGVLAEHV GATEVMAVGG VLAASVLLLG LLPRESRELG SGGSGDVPPT APGAPAAAGA AGVTPRADA
|
| |