Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_3656 |
Symbol | |
ID | 5704620 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 4218808 |
End bp | 4220355 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641273081 |
Product | major facilitator transporter |
Protein accession | YP_001538445 |
Protein GI | 159039192 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00788445 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.022371 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACTCCC GGCCTGGCCG GTCAACGCCG TCCCGACGGG CCGCCTTGGT CGGCCTGTGT ACCGCTGCCA CCCTGGTGTG GCTCGCGTTC TCCGACCTGG GTGTGGCGCT ACCGACGATC GCCACCGAGT TGAGCGTCAA CCTGACCGAC ATGCAGCGGG CGAACAACGC CCTGAGTATC GCCTGCGGCA CACTCCTACT GGCCGGCGGA CGCCTCACCG ACCTCTACGG CCATCGACGG ATGCTGCTCC TCGGCCTACT GATCTTCGGT ATTGCCACCT TGGCGACCGC GTTCACACCC AACCTCGCCG GCCTGGTCGC CGGTCGGGCG ATGATGGGCG TCGGCAGCGC ACTCATCCTG CCCGCCTCGC TAGCCATGAT CCCGGCGCTG TTCGACCGGG CCAGGCAGCC GTCGGCATTC GCCGCATGGG CAGCGACCAC CTGGGCGGGG CAGGCGGTCG GGCCCGCCAT CGGCGGAGGA CTCACCACCC TGTTGGGCTG GCGGTCGCTG TTCTGGCTCA CCGCGCCGGT GGTGCTTGTC GTGTACGTGA TAACCAGCCG ATACGCACCC AAGGCAAGCA GGCGCCGCGG GCGGGTCGAC CTGGTCGGGC TGGCCACCGG CGCCGGAGCA GCGCTGTGTC TGCTCTTCGC GTTGACCGAG AGCCAGCAGG TCGGCTTCAA CGATCCTTTG ATCATCGTTT TGTTCGCCGC GACGCCGGTA CTCGGCGCAG CGTTCGTGTT CATCGAGACA AAGGCCGCTG ATCCGCTGGT GGATCTGCGG CTGTTCCGCA CCCGCAGCTT CACTGCCGCT CTCATCGTCA ACCTGGCGAT GAGCATGTCC TTCGCCGGCA CGCTGTTCGT GCTGTCCCTC TACCTCCAGG ATGTCCGCGG CTACACCGCG TTCGTGGCCG GCCTGCTGCT GATCCCCGCC GCCGGAACGA TCCTGGGGTT CAACACTGTC GGAGCGCGGC TGGTCACCCG ACACAACGCC CGCTCCCCCT CGCTCTGGGG CCTCGTCCTG GTCGGTCTCG GCGGTTTCGC CATCAGCGCC CTGCTGCCCT CCCTGTCCGT CCTGGCAGTG ATCCTGGGCC TGCTCATCAT CGGCGCCGGG CTGGGCCTGC TGTCCGTGCC CGTGGCCGAC ACCGATGTCG GAGGTCCACC GGCCTCCCTC GCCGGCGCCG CGTCCGGGGC GTACAGGAGC AGCAGCATGC TGGGTGGCGC ACTCGGCGTC GTCCTCCTGA CCACGGCGAC AACCCGCTTC GGCCGCGCCG AGGCCGCACC GGTCAGCACC GCGGCCGGAC TCACCGAAGC GGAATCCAAC CAGGTCGTCA ACGCACTGAC CAACTCCCAG ACCGCGAGCG CCATCCTCGA CAAACTGCCG GCAAACGAAC GGTCCCTCGT CGTCGGTGTC TACAACCAGG CGTTCACGGA CGGAGTTTCG ACCGCCCTCA TCCTCGGTGG TGTGATCGCG GTGGCGGGCA CGGTGCTGGC TGGTTGGATC TGGCCCCGCA CCCACAGAGC CCGACACACG ACGAACCCCG GACCTTAG
|
Protein sequence | MYSRPGRSTP SRRAALVGLC TAATLVWLAF SDLGVALPTI ATELSVNLTD MQRANNALSI ACGTLLLAGG RLTDLYGHRR MLLLGLLIFG IATLATAFTP NLAGLVAGRA MMGVGSALIL PASLAMIPAL FDRARQPSAF AAWAATTWAG QAVGPAIGGG LTTLLGWRSL FWLTAPVVLV VYVITSRYAP KASRRRGRVD LVGLATGAGA ALCLLFALTE SQQVGFNDPL IIVLFAATPV LGAAFVFIET KAADPLVDLR LFRTRSFTAA LIVNLAMSMS FAGTLFVLSL YLQDVRGYTA FVAGLLLIPA AGTILGFNTV GARLVTRHNA RSPSLWGLVL VGLGGFAISA LLPSLSVLAV ILGLLIIGAG LGLLSVPVAD TDVGGPPASL AGAASGAYRS SSMLGGALGV VLLTTATTRF GRAEAAPVST AAGLTEAESN QVVNALTNSQ TASAILDKLP ANERSLVVGV YNQAFTDGVS TALILGGVIA VAGTVLAGWI WPRTHRARHT TNPGP
|
| |