Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_0103 |
Symbol | |
ID | 5707052 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 114315 |
End bp | 115727 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641269629 |
Product | major facilitator transporter |
Protein accession | YP_001535029 |
Protein GI | 159035776 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.360848 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000281581 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGAGTTGG GCATGGTGGA ACGGCGGGCC GCCTGGACGG TGATCGCGCT GTGTGCGGCG CAGTTCATGC TGATCCTCGA CGTAGTGATC ATCAATGTGG CCGTCCCCTC GATCCGGCAA GACCTCGGCC TGCTCGACAG CCGCATCCAG CTGACCGCGA CGGCGTACAC CATCACGTTC GGTAGCTTGT TGATCATCAG TGGCCGCGTT GGTGATCTCC TCGGCCGCAA GAGGCTCCTG CTGACCGGCC TCACCTTCTT CGTCGCGGCA TCGCTCGGCG CTGGTGCCGC CCAGGTCGAT TGGCATCTGT TCGTCTCCCG GGGCCTGCAA GGCGTCGGCG CGGCGATGGT TTCCGCGAAC GCGTTGGCCG CCATCACCGC GAGCCTCGCC GAAGGTCCGG CCCGGAACTG GGCGCTCGGG CTGTGGGCGG CCGTCAGCTC AGCCGGTGCC ATCGCCGGCC AACTCGTCGG CGGTGCGATC ACCCAGTTTC TCGGTTGGCG CTGGATCTTC TTCATCAACA TCCCGGTCGG CTTGGCGGTC GTAGCCGTGC TCGCGCTACT CCTTCGCGAT ACCCCTGCCA CCAACCGACC CCGAATCAAC CTGGCCGGCG CGTTCCTGCT GGCTGGGGGT CTGGCCAGCG GCATCATGGC GTTGACCTGG CTGGCCGAGG ACGGCGGCCG GGACCGGTCG TTCGCCGCGG CCGTCACGGC GTTCGTGTTG CTCGCAAGCT TCGCCCTCGT GGAACGCAGC GAGTCGACAC CGGTCCTGCG GTACGCGCTG CTGCGCCTGC CCGGAGTACG GGCAGCCAAC GCCACGCTGC TGCTCAACGC CGGCGCGCTC GGCGCGACCC TCTTCTTCCT GACGCTCTAC CTCCAGATCG TCCTCGGCTA CTCGCCACTG GCCGTGGGTG TCGCATTCGC ACCGATCACC CTGCTCATCA TGCTGCTGTC ACCGCGCGCT GCGAAACTCG TCACCCGATT CGGCGCCCGG CGGGTACTGG TCAGCGGATT GACAGTCCTC GCCGCCGGCG CGCTCCTGCT CGCCCGGCTA CCCGTCCACG GCGACTACTG GACCGACGTC CTGCCCGGCA TGCTTCTGCT CGCGATCGGT AGTGGTCTGA CCTACGCCCC GACATACATC GCCGCCTCCA GCGGTGTGAC GGCGGAGGAC CAGGGCGCGG CATCAGGGCT GATCAACTCG GCGCAGGAGA TAGGTGCCGC GGTGTGCCTC GCGATGCTCG CGCTCATCGC CACCACGGCC GCCGGACCCG GCGGCAGTGC GACCAGCCTC GCCGAGGGGT ACCGCGCCGG TGTGCTCGCC GCAGCCGTGC TGTTCGCCAT CGGAGCGACG ATCGCCGTCA CCGTGCCACG CCGGCTCGGT CAGGCAACCG AAGCCGAGAA GGTCGCGAGC TGA
|
Protein sequence | MELGMVERRA AWTVIALCAA QFMLILDVVI INVAVPSIRQ DLGLLDSRIQ LTATAYTITF GSLLIISGRV GDLLGRKRLL LTGLTFFVAA SLGAGAAQVD WHLFVSRGLQ GVGAAMVSAN ALAAITASLA EGPARNWALG LWAAVSSAGA IAGQLVGGAI TQFLGWRWIF FINIPVGLAV VAVLALLLRD TPATNRPRIN LAGAFLLAGG LASGIMALTW LAEDGGRDRS FAAAVTAFVL LASFALVERS ESTPVLRYAL LRLPGVRAAN ATLLLNAGAL GATLFFLTLY LQIVLGYSPL AVGVAFAPIT LLIMLLSPRA AKLVTRFGAR RVLVSGLTVL AAGALLLARL PVHGDYWTDV LPGMLLLAIG SGLTYAPTYI AASSGVTAED QGAASGLINS AQEIGAAVCL AMLALIATTA AGPGGSATSL AEGYRAGVLA AAVLFAIGAT IAVTVPRRLG QATEAEKVAS
|
| |