Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0417 |
Symbol | |
ID | 5708231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | + |
Start bp | 477160 |
End bp | 478422 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641269942 |
Product | major facilitator transporter |
Protein accession | YP_001535337 |
Protein GI | 159036084 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.00934872 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAGCAGTG TCGTCGAGGC GTTCGTGCCG GCCCGGCTTG GCAACAGGTT CCGTTGGTTG CTGGCCTCGT CGTGGGTGAC GAACCTGAGC GACGGAATCG CGGTGGCGGC CGGGCCGCTA CTGGTGGCGT CGCTGACCAC CAACCCGATC CTGGTCTCGC TGGCAGCCCT GCTGCGCTGG GCACCTCCGT TGGTGTTCGG CCTGTGGGCC GGCGTGCTCT CCGATCGGCT CGACCGGCGG CGCATCGTGT TGGTGGCCAA CACAGTCCGA CTCGTCACCC TGGTCGTGCT GGCCGGGGCA TTGGTGACCG ACCGGGTGTC GGTGTCGGCG GTGCTGCTGA TGCTGGCGCT GCTCGCCACC GCCGAGGTGT TTGCGGACAA CACGACGGGC ACGCTGACGC CGATGCTGGT GCGTCGGGAG GATCTGGCGC TCGCCAACGC CCGCGTCCTG GCCGGGTTCA TCACGCTGAA CACACTGGCC GGGCCGGCAG TTGGGGCGGC ACTCTTCGCG GCCGGGCGGT CCTTGCCGTT CGCCACCAAC GCGGTTCTCA TCGCGGCCGG GCTGGTGCTG GTGTCCCGGC TGTCCCTGCC GCCACGCGAG CCGGCAGCGG AGAACCGCGG CGTCCGGCGG GACATCGTGG CGGGTATCCG ATGGACCGTC CGTCATCCGG CCGTTCGGAC GCTCTGCCTG ACCACCCTGG TCTTCAACAT CACGTACGGT GCCGCGTGGT CGATCCTGGT GCTCTACGCC ACCGAGCGGC TCGGTCTGGG CGCTGTCGGC TTCGGCCTGA TCAGCACGGC GACGGCGGTC GGCGGCCTGC TCGCCACCGT CGGCTACGGG TGGCTCACCC GTCGGATGAG CCTCGGGGGG ATCATGCGGG CCGGCCTGGT CATCGAGACT CTCACCCACT TCGGTCTCGC CGTCACCACC GCACCCTGGG TCGCCTCGGC CATTCTCTTC GTCTTCGGGG CGCACGCCTT CGCCTGGGGC ACCACCTCGA TGACGATCCG TCAGCGGGCG GTTCCGGCCC ACCTCCTGGG CCGGGTCAAC AGCATCCACA CCATCAGTGC GTACGGTGGG CTGGTCATCG GCTCGGCAAT CGGTGGCCCG CTGGTTGCCC TCCTGGGTGT GACCAGCCCG TTCTGGTTCG CTTTCGCTGG TTCGGCCGTC CTCGTCGTGC TGCTGTGGCG CGAGTTCGCC CACATCGCAC ACACCGACGA TCCCGCTCCG ACCCCGGCCC CGGCCGGTTC AACGGCGGCG TGA
|
Protein sequence | MSSVVEAFVP ARLGNRFRWL LASSWVTNLS DGIAVAAGPL LVASLTTNPI LVSLAALLRW APPLVFGLWA GVLSDRLDRR RIVLVANTVR LVTLVVLAGA LVTDRVSVSA VLLMLALLAT AEVFADNTTG TLTPMLVRRE DLALANARVL AGFITLNTLA GPAVGAALFA AGRSLPFATN AVLIAAGLVL VSRLSLPPRE PAAENRGVRR DIVAGIRWTV RHPAVRTLCL TTLVFNITYG AAWSILVLYA TERLGLGAVG FGLISTATAV GGLLATVGYG WLTRRMSLGG IMRAGLVIET LTHFGLAVTT APWVASAILF VFGAHAFAWG TTSMTIRQRA VPAHLLGRVN SIHTISAYGG LVIGSAIGGP LVALLGVTSP FWFAFAGSAV LVVLLWREFA HIAHTDDPAP TPAPAGSTAA
|
| |