Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_4030 |
Symbol | |
ID | 5706434 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4583214 |
End bp | 4584455 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641273455 |
Product | major facilitator transporter |
Protein accession | YP_001538811 |
Protein GI | 159039558 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00161616 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGCCGTCCA TCCTCGCGGT CCTGCGTCGC AACCGAAACT TTCGCATGCT GCTCCTGGCC GAGCTGATGG TCTTCGGCGT CGACTGGTTC GTCATGGTGC CGCTGCTGGT GCTCCTGCCG GCACTGACCG GCAGCGGCGT GTGGGGTGCG CTGGTCCTCG CCATGGACAC CGGAGTCGTG GCGCTGCTGC TGCCGTACAC CGGGGCGGTG GCGGACCGGT TCGATCGCCG CCGGATCATG ATCGGCGCCA ACATCGCCGC ACTGGTCGGC GTACTGCTGC TGCTGGGTGT GCGCGATGCC GGCACGGCCT GGCTGGCCCT GGTCGGGATC GGGGTGGTGG CGGTGGCCAA GGCGTTCTAC TCTCCGGCCG CGCAGGCCGC CTTGCCGAAC GTGCTCGACC CAGATGAGTT GGCCGCGGGT AATGCGGTCG CAGGTTCGGC ATGGGGCACG ATGACGATCG TCGGGGCGTC GCTGGGGGGT GTCCTGAGCA GCGCAGCTGG GCCATACGTC GCCTTCTGGG CGGCCGCTGG CGGCCTGGTT CTGGCCGGGG TCCTGGCGGG GCTGATCCGT CGGCCGTTGC AGGCCCCACG GGACCAGGAC CGACCGGTGC AGCAGACCTG GGCGGCCATC CGGGAGGCAC TCGGCTACAT CGGCCACCGG CCGCGGGTGC TGGCGTTGGT GACCGTGAAG TCGGCGGTCG GCCTCGGCAA CGGCGTGTTG ACGGTGTTTC CTTTGCTGGC GGTGGCCTAC GGGGTGGGTC CGATCGGCAC CGGGCTGCTC TTCGGGGTGC GAGGCGCGGG TGCTCTGGTC GGTCCGATCC TGATGCGGCG GGTTCTGGGT AACCGGTCCT GGCTGCTGCC CGGCCTGGCG GCATCCATGT CGTTGTATGG GCTGGCCTAT CTGGGCACCT CGGCGGTGAA CTGGTTCCCG CTGGTGCTTG CGTTGGTCTT CGTGGCGCAC TTCGCCGGGG GTAGTAACTG GGTCATGTCC AACTACGCCC TCCAGGGCGA GGTCCCGGAT CGGTTACGGG GACGGGTCTT CGCCACCGAC ATGATGCTGG CGACCCTCGC CATCTCGGTG AGTCAGCTGG TGGTGGCATC GGTGATCGAT GTGGTTGACG CGCGGGTGGT GTTGGCCGGT GGTGGACTGG TCACCCTGGT CTACGCGGTT GGCTGGCGAA TCGCGACCCG CCGCCTGTCG TTGACCGACC CGGTCGCGGC GCCGGAGTCG GTCGTTCGCT GA
|
Protein sequence | MPSILAVLRR NRNFRMLLLA ELMVFGVDWF VMVPLLVLLP ALTGSGVWGA LVLAMDTGVV ALLLPYTGAV ADRFDRRRIM IGANIAALVG VLLLLGVRDA GTAWLALVGI GVVAVAKAFY SPAAQAALPN VLDPDELAAG NAVAGSAWGT MTIVGASLGG VLSSAAGPYV AFWAAAGGLV LAGVLAGLIR RPLQAPRDQD RPVQQTWAAI REALGYIGHR PRVLALVTVK SAVGLGNGVL TVFPLLAVAY GVGPIGTGLL FGVRGAGALV GPILMRRVLG NRSWLLPGLA ASMSLYGLAY LGTSAVNWFP LVLALVFVAH FAGGSNWVMS NYALQGEVPD RLRGRVFATD MMLATLAISV SQLVVASVID VVDARVVLAG GGLVTLVYAV GWRIATRRLS LTDPVAAPES VVR
|
| |