Gene Information |
Locus tag | Sare_2609 |
Symbol | |
ID | 5703364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2973883 |
End bp | 2975433 |
Gene Length | 1551 bp |
Protein Length | 516 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641272070 |
Product | major facilitator transporter |
Protein accession | YP_001537440 |
Protein GI | 159038187 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
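The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative only and do not come from the record or from any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(2973883, 2975433)
print(length)                  # 1551, matching the Gene Length field
print(protein_length(length))  # 516, matching the Protein Length field
```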
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.151615 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCTCCC GGCCTGGTCG GTCATCGCCG TCGCGACGGG CCGCCCTAGT CGGCCTGTGT ACCGCCGCCG CTCTGGTGTG GATCGCGTTC TCCGACTTGG GTGTGGCGTT ACCGACGATC GCCACCGAGT TGAGCGTCAA CCTGACCGAC ATGCAGTGGG CGAACAACGC CCTGAGTATC GCCTGCGGCA CACTCCTACT GGCCGGCGGA CGCCTCACCG ACCTCTACGG CCACCGACAG ATGCTGCTCC TCGGCCTACT GATCTTCGGC GTCGCCGCGC TGGCAACCGC GTTCACACCC AACCTCGCCG GCCTGGTCAC CGGTCGGGCG ATGATGGGCG TCGGCAGCGC ACTCATCCTG CCCGCCACGC TAGCCATGAT CCCGGCCCTG TTCGACCGAG CCGAGCAACC CTCAGCATTC GCCGCATGGA CCGCAGCGAG CTGGGCCGGT CAGTCGGTCG GGCCCGCCAT CGGCGGAGGA CTCACCACCC TGTTGGGCTG GCGGTCGCTG TTCTGGCTCA CCGCACCCGT GGTCCTCGTG CTGTACCTGA CAATCAGACG CGACGCCCCC ACCGCAACCA GACGCTACGG ACGAGTTGAC CTCCTTGGCG TGGTCACTGG CGGCGGAGCA GCGCTGTGTT TGCTCTTCGC GTTGACCGAA GGCCAACGAG TCGGCTTCGA CGACCCACTG ATCATTTCGC TGTTCGCCGC GAGCTTGGCC CTCACCGCAG CCTTCATCTT CGTCGAACGA CGGACCAGCG ACCCGCTGGT GAACCTACGG CTGTTTCGCA CCCGCAGCTT CGACGGCGCC CTCATCGTCA ACCTCACGAT GAACATGTCC TTCGCCGGCG CACTGTTCGT GCTGTCCCTC TACCTCCAAG ACGTCCGCGG CTACACCGCA TTCATCGCCG GCCTGATCCT CATCCCCGCC GCCGCAACAA TCCTGATCTT CAACACCATC GGCGCCCGAA TTCTCACCCG ACACGACCCC CGCGCCCCCT CAATCTGGGG CCTCGTCCTG GTCGGCATCG GCAGCATCGC CATCAGCACC CTCCTACCCG CCCTGTCCGT CCTCGCAGTA ATCCTGGGCC TGCTCGTCGT CGGCGCCGGA CTGGGCCTGC TGTCCGTACC CGTCGCCGAC ACCATCGTCG CAGGCCCACC AACCACCCTC GCCGGCACCG CATCCGGGGT ATACAAAACC AGCAGCATGC TGGGCGGCGC ACTCGGCGTC GTCCTACTCA CCGCCGCAAC AACCCGCTTC GGCCGCGCCG AAGCCGCACC AGTCAGCACC GCCGCCGGAC TCACCGAAGC GGAATCCAAC CAGGTCGTCA ACGCACTGAC CAACTCCCAG ATCGCGAGAG AGATTCTTGA CAAACTGCCA ACAGGGGAGC GGGCCGTCGT CGTCAGGGTC TATGACCAGG CGTTCTCGGA CGGAGCCTCG ATCGCCCTCG TACTCGGAGG CGTGATCGCA CTGGCCGGCG CGGTGCTGGC CGGCTGGATC TGGCCCCGCC CCCGCAGGAA GGGCCAACGC ACGAAGAATC CCAGACCCTA G
|
Protein sequence | MSSRPGRSSP SRRAALVGLC TAAALVWIAF SDLGVALPTI ATELSVNLTD MQWANNALSI ACGTLLLAGG RLTDLYGHRQ MLLLGLLIFG VAALATAFTP NLAGLVTGRA MMGVGSALIL PATLAMIPAL FDRAEQPSAF AAWTAASWAG QSVGPAIGGG LTTLLGWRSL FWLTAPVVLV LYLTIRRDAP TATRRYGRVD LLGVVTGGGA ALCLLFALTE GQRVGFDDPL IISLFAASLA LTAAFIFVER RTSDPLVNLR LFRTRSFDGA LIVNLTMNMS FAGALFVLSL YLQDVRGYTA FIAGLILIPA AATILIFNTI GARILTRHDP RAPSIWGLVL VGIGSIAIST LLPALSVLAV ILGLLVVGAG LGLLSVPVAD TIVAGPPTTL AGTASGVYKT SSMLGGALGV VLLTAATTRF GRAEAAPVST AAGLTEAESN QVVNALTNSQ IAREILDKLP TGERAVVVRV YDQAFSDGAS IALVLGGVIA LAGAVLAGWI WPRPRRKGQR TKNPRP
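The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a minimal sketch such as the one below. It assumes the gene sequence is given in coding orientation (it begins with ATG) and uses the amino-acid assignments of translation table 11, which are the same as the standard code for sense codons. The helper names `clean`, `gc_content`, and `translate` are hypothetical and are not part of any database tool.

```python
# Codon table in the conventional NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGTCCTCCC GGCCTGGTCG GTCATCGCCG"))  # MSSRPGRSSP
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison with the Protein sequence and GC content fields of the record.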