Gene Sare_0103 details


Gene Information

Locus tag: Sare_0103
Symbol:
ID: 5707052
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 114315
End bp: 115727
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 69%
IMG OID: 641269629
Product: major facilitator transporter
Protein accession: YP_001535029
Protein GI: 159035776
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
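
The start, end, gene length, and protein length reported above are mutually consistent, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive and that the gene length counts the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    return gene_len_bp // 3 - 1


# Values copied from the record for Sare_0103.
start_bp, end_bp = 114315, 115727
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # prints: 1413 470
```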


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.360848
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000281581
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGAGTTGG GCATGGTGGA ACGGCGGGCC GCCTGGACGG TGATCGCGCT GTGTGCGGCG 
CAGTTCATGC TGATCCTCGA CGTAGTGATC ATCAATGTGG CCGTCCCCTC GATCCGGCAA
GACCTCGGCC TGCTCGACAG CCGCATCCAG CTGACCGCGA CGGCGTACAC CATCACGTTC
GGTAGCTTGT TGATCATCAG TGGCCGCGTT GGTGATCTCC TCGGCCGCAA GAGGCTCCTG
CTGACCGGCC TCACCTTCTT CGTCGCGGCA TCGCTCGGCG CTGGTGCCGC CCAGGTCGAT
TGGCATCTGT TCGTCTCCCG GGGCCTGCAA GGCGTCGGCG CGGCGATGGT TTCCGCGAAC
GCGTTGGCCG CCATCACCGC GAGCCTCGCC GAAGGTCCGG CCCGGAACTG GGCGCTCGGG
CTGTGGGCGG CCGTCAGCTC AGCCGGTGCC ATCGCCGGCC AACTCGTCGG CGGTGCGATC
ACCCAGTTTC TCGGTTGGCG CTGGATCTTC TTCATCAACA TCCCGGTCGG CTTGGCGGTC
GTAGCCGTGC TCGCGCTACT CCTTCGCGAT ACCCCTGCCA CCAACCGACC CCGAATCAAC
CTGGCCGGCG CGTTCCTGCT GGCTGGGGGT CTGGCCAGCG GCATCATGGC GTTGACCTGG
CTGGCCGAGG ACGGCGGCCG GGACCGGTCG TTCGCCGCGG CCGTCACGGC GTTCGTGTTG
CTCGCAAGCT TCGCCCTCGT GGAACGCAGC GAGTCGACAC CGGTCCTGCG GTACGCGCTG
CTGCGCCTGC CCGGAGTACG GGCAGCCAAC GCCACGCTGC TGCTCAACGC CGGCGCGCTC
GGCGCGACCC TCTTCTTCCT GACGCTCTAC CTCCAGATCG TCCTCGGCTA CTCGCCACTG
GCCGTGGGTG TCGCATTCGC ACCGATCACC CTGCTCATCA TGCTGCTGTC ACCGCGCGCT
GCGAAACTCG TCACCCGATT CGGCGCCCGG CGGGTACTGG TCAGCGGATT GACAGTCCTC
GCCGCCGGCG CGCTCCTGCT CGCCCGGCTA CCCGTCCACG GCGACTACTG GACCGACGTC
CTGCCCGGCA TGCTTCTGCT CGCGATCGGT AGTGGTCTGA CCTACGCCCC GACATACATC
GCCGCCTCCA GCGGTGTGAC GGCGGAGGAC CAGGGCGCGG CATCAGGGCT GATCAACTCG
GCGCAGGAGA TAGGTGCCGC GGTGTGCCTC GCGATGCTCG CGCTCATCGC CACCACGGCC
GCCGGACCCG GCGGCAGTGC GACCAGCCTC GCCGAGGGGT ACCGCGCCGG TGTGCTCGCC
GCAGCCGTGC TGTTCGCCAT CGGAGCGACG ATCGCCGTCA CCGTGCCACG CCGGCTCGGT
CAGGCAACCG AAGCCGAGAA GGTCGCGAGC TGA
 
Protein sequence
MELGMVERRA AWTVIALCAA QFMLILDVVI INVAVPSIRQ DLGLLDSRIQ LTATAYTITF 
GSLLIISGRV GDLLGRKRLL LTGLTFFVAA SLGAGAAQVD WHLFVSRGLQ GVGAAMVSAN
ALAAITASLA EGPARNWALG LWAAVSSAGA IAGQLVGGAI TQFLGWRWIF FINIPVGLAV
VAVLALLLRD TPATNRPRIN LAGAFLLAGG LASGIMALTW LAEDGGRDRS FAAAVTAFVL
LASFALVERS ESTPVLRYAL LRLPGVRAAN ATLLLNAGAL GATLFFLTLY LQIVLGYSPL
AVGVAFAPIT LLIMLLSPRA AKLVTRFGAR RVLVSGLTVL AAGALLLARL PVHGDYWTDV
LPGMLLLAIG SGLTYAPTYI AASSGVTAED QGAASGLINS AQEIGAAVCL AMLALIATTA
AGPGGSATSL AEGYRAGVLA AAVLFAIGAT IAVTVPRRLG QATEAEKVAS
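
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the tool used to generate this record: it assumes the sequence is given 5' to 3' in frame, strips the spacing used in the listing, and uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (first base varies slowest, bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks of the listing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases and the final line of the gene sequence above.
print(translate("ATGGAGTTGG GCATGGTGGA A"))              # prints: MELGMVE
print(translate("CAGGCAACCG AAGCCGAGAA GGTCGCGAGC TGA"))  # prints: QATEAEKVAS
```

Passing the full gene sequence to `translate` should yield the 470-residue protein listed above, assuming the listing is complete and in frame.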