Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2963 |
Symbol | |
ID | 5707793 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 3370512 |
End bp | 3372050 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641272412 |
Product | major facilitator transporter |
Protein accession | YP_001537780 |
Protein GI | 159038527 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0316634 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000555172 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCCACCG CGGACGCGCC CGCGCCACGC GCCGCCCGAC TGGTACTGCT CATCCTGCTC GTGGCGCAGA TCATGGCGAC GATGGACAAT TCCATCGTCG CCGTGGCCAC GAAGACGATT CGGGACGACC TGCAGACCTC CGGAGCGGCG CTCCAACTCA TCCTGTCCGG CTACACACTG ATGTTCGCCG TACTCGTCGT CACCGGCGCC CGGCTCGGCG GCGACATGGG CCACCGCCGG CTCTTCATGA TCGGCCTCGC CGGTTTCACC GTCAGTTCGC TGATCTGTGG GCTGGCGCCC ACCGCCGGGA CGCTGGTCGC GGCCCGGCTC GTGCAGGGCG CCTTCGGCGC CCTGATGGTG CCCCAGGTTC TTTCGGTCAT CCAGATCGTG TTCACCGGCG AGGCGAGGGC CCGCGCCATC GGCCTGTACT CGATGGTGCT GGCCCTGGGC GTGGCCGCCG GCCAGATCGC CGGCGGCCTG ATCGTCAGTG CCGACGTGCT CGGCAGCGGT TGGCGGGCGG CGTTCATCGT CAATGTGCCG GTCGGCCTCG TTCTGTTGGC GGCTGCGCCG CGCTACCTTC CCGGTAGCCA CGAACGCGGC GAGTTCCGAC CGGACCTCGC GGGCATCGGG CTGCTCGGCG CGTCCATGGC GGCCGTGGTG GCGCCACTCG TCTTCGGCCG GGAGCAGGGC TGGCCGGCGT GGACACTGGC CACCGTCGCC ACCGGTGCCG TCGGGCTCGT GCTGTTCGTC CTCTACGAGC TGCGCCTCGC CGGGCGGGGC GGGCAGCCCG TCCTCGAGTT GGACGCGCTG CGTCCGCCCG GCGTCAAGTC CGGCCTGCTC GCCTGCTGCA TCCTCAACTT CGCGTTCGCC GGCGTGCTGT TCCCCCTCAC CCTGCACGCC CAGAACGGGT TGGGCTACAG CCCTCTGCAG GCAGGCCTGA TGTTCATTCC GTACCCGGTC GGGTTCGCCA CGGTCAGTCT CACCTGGACC CGCCTACCGA AGCGGTTCCA CCAGGTGTTG CCGGTGGTGG GTCTGGTCGT GTTCGCGATC GCCTTGGCCG CACTGGCCGT GGTGGTCGCC GGAGGCTGGC CCGTTCCGCT CGTCGCGGCA CTTCTGATGC TTGCCGGAGC CGGCATGGCG GCCGGGTTCA GCACGTTGGT GGAGCAGACC GCCGCCACGG TCGGGCCCCG GTACGCGGCG GCGCTGTCCG CCCTCGTGTC CACCGGCACG CTGCTGGCCA GCGTCATCAG CGTGGTCGTC GTCGGCGGCA TCTATCTGGC CGTCGCCGAG CAGGACCCGT CCCGGTCGGC GCAGGGGCTC AGCCGCAGCC TCTGGGTGGA CAGTGCGCTG CTGGTCGTGG GGTGCCTGCT GGCGTACCGC ACCTGGCGGC TGGTGGCCCG GCAGCCGCCG GTCGACGCGA CGGACACCGG TGACGGCGGC TCCGACGCCG GGCAGGAACT GCCGGCGACG GACACCGCCA CCGCCGGGGT CACCGCCGGA ACCGGCGACG ACCCCCCGGC CGACAGTCGC ACCGGATGA
|
Protein sequence | MPTADAPAPR AARLVLLILL VAQIMATMDN SIVAVATKTI RDDLQTSGAA LQLILSGYTL MFAVLVVTGA RLGGDMGHRR LFMIGLAGFT VSSLICGLAP TAGTLVAARL VQGAFGALMV PQVLSVIQIV FTGEARARAI GLYSMVLALG VAAGQIAGGL IVSADVLGSG WRAAFIVNVP VGLVLLAAAP RYLPGSHERG EFRPDLAGIG LLGASMAAVV APLVFGREQG WPAWTLATVA TGAVGLVLFV LYELRLAGRG GQPVLELDAL RPPGVKSGLL ACCILNFAFA GVLFPLTLHA QNGLGYSPLQ AGLMFIPYPV GFATVSLTWT RLPKRFHQVL PVVGLVVFAI ALAALAVVVA GGWPVPLVAA LLMLAGAGMA AGFSTLVEQT AATVGPRYAA ALSALVSTGT LLASVISVVV VGGIYLAVAE QDPSRSAQGL SRSLWVDSAL LVVGCLLAYR TWRLVARQPP VDATDTGDGG SDAGQELPAT DTATAGVTAG TGDDPPADSR TG
|
| |