Gene Sare_0401 details


Gene Information

Locus tag: Sare_0401
Symbol:
ID: 5703794
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 461832
End bp: 463184
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 70%
IMG OID: 641269926
Product: major facilitator transporter
Protein accession: YP_001535321
Protein GI: 159036068
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
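
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 461832
END_BP = 463184
GENE_LENGTH_BP = 1353
PROTEIN_LENGTH_AA = 450


def span_length(start_bp, end_bp):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp):
    """Codon count minus one, assuming the CDS ends with one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 463184 - 461832 + 1 gives 1353 bp, and 1353 / 3 - 1 gives 450 aa, which agrees with the listed values.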


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0878696
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0116181
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGGACGA TGCGAGGTTG GCTACACGAT ACGGCCGGCG GCCTTCCCCG CACGTTCTGG 
TATCTGTGGA CCGGCACCCT GATCAACCGG CTCGGCTCGT TCGTCATCAT CTTCCTCGCC
ATCTACCTCA CCCAGGAGCG AAACTTCTCC GCCTCCCAGG CGGGCCTGGT GCTGGGTCTC
TGGGGGGTCG GCGGCGCGGC GGGCACCACC ATCGGTGGCA CGCTCGCGGA CCGGTGGGGC
CGCCGCCCCA CCCTGCTCAC CGCGCACCTC GGCGCCACGA GCATGATGCT CGCCCTCGGC
TTCGCCCGGG ACCTGTGGTC GGTCGCACTC GGCGCCCTGC TGCTCGGACT CTTCGCCGAG
GCCGCGCGAC CCGCGTTCGG CGCCATGATG ATCGACGTCG TGCCGGACAA GGACCGGTTG
CGGGCCTTCA GCCTGAACTA CTGGGCGATC AATCTCGGCT TCGCCTGCGC CGCGGTCCTC
GCGGGATTCG CGACCGAGGC TGGCTACCTG CTGCTCTTCG TGGTGAACGC GGCCACCACC
CTGACCACCG CACTGATCAT CTTCGTCAAG GTCAGCGAGA CCCGCAAGCC GCTGGTCACC
GCCGCCGGGC GACCGACCGC ACCGCCCCGG GCGCTGCGCA CCATCCTGGC CGACCGCGTC
TACCTCGGGT TCGTGGCGTT GAACCTCTTC GCCGCGTTGG TCTTCCTCCA GCACATCTCG
ATGCTGCCGA TCGCCATGGG CGACTCGGGG CTGAGCCCCG CCACGTACGG CTCGGTGATC
GCACTCAACG GTGTGCTGAT CGTGGTGGGC CAACTCTTCG TACCACGGTT GATCAAGAAC
CGTAGCCGCT CTCACGTGCT GGCGCTGTCG GCGGTGGTGC TGGGGGTCGG ATTCGGCCTG
ACCGCCTTCG CCGAGACCGC CTGGTTCTAC GGTCTGACCG TCCTGATCTG GACTCTCGGC
GAGATGCTCA ACTCGCCGTC CAACGCCACC CTGATCGCCG AACTCTCCCC GAGTGAACTG
CGCGGTCGAT ACCAGGGAGT CTTCTCGCTC TCCTGGCAGG TAGCCGGCGC CACAGCGCCA
GTGCTCGGCG GGGTCGTCCG GGAGCGGGCC GGCGACGACA TCCTCTGGCT GGGCTGCGCC
CTGATCGGCG GGTTGGTGGC GGCGGCGCAC CTGATCTCCG GGCCGACGCG GGAGCGCCGG
GTCACCGCCC TGCGGGCGGC CAACCAGTCG GTGCAGCCGG CCGCGGTCGG GGGCCGGCGC
GCGGCCGAGG CGGAAGAGGC CGTCACGACC GCACCGGCCG AATCGCTCCC GACGGGATCC
GCCGAGAGCA CGGCGGCCGG TCGGGTTCAG TGA
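
A GC percentage like the one reported in the record can be computed directly from a sequence printed in space-separated blocks of 10. This is a minimal sketch; `gc_percent` is a hypothetical helper, and whether the record's own value was computed or rounded this way is an assumption.

```python
def gc_percent(dna):
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # Only the first 10-base block of the gene sequence, as a small example.
    print(gc_percent("GTGCGGACGA"))  # 70.0
```

To check the whole gene, pass all of the sequence lines joined into one string; the function strips the spaces and line breaks itself.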
 
Protein sequence
MRTMRGWLHD TAGGLPRTFW YLWTGTLINR LGSFVIIFLA IYLTQERNFS ASQAGLVLGL 
WGVGGAAGTT IGGTLADRWG RRPTLLTAHL GATSMMLALG FARDLWSVAL GALLLGLFAE
AARPAFGAMM IDVVPDKDRL RAFSLNYWAI NLGFACAAVL AGFATEAGYL LLFVVNAATT
LTTALIIFVK VSETRKPLVT AAGRPTAPPR ALRTILADRV YLGFVALNLF AALVFLQHIS
MLPIAMGDSG LSPATYGSVI ALNGVLIVVG QLFVPRLIKN RSRSHVLALS AVVLGVGFGL
TAFAETAWFY GLTVLIWTLG EMLNSPSNAT LIAELSPSEL RGRYQGVFSL SWQVAGATAP
VLGGVVRERA GDDILWLGCA LIGGLVAAAH LISGPTRERR VTALRAANQS VQPAAVGGRR
AAEAEEAVTT APAESLPTGS AESTAAGRVQ
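
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The example below is an illustration, not the database's own method. It assumes the amino-acid assignments of translation table 11 match the standard code, and that the first codon is a valid initiator and is therefore read as M (which is how GTG at the start of this gene yields M). `translate_cds` is a hypothetical helper.

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino-acid assignments
# (assumed here to be the same assignments used by translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_cds(dna):
    """Translate a CDS, reading the first codon as M and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        # Assumption: the first codon is an initiator and is translated as M.
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate_cds("GTGCGGACGA TGCGAGGTTG GCTACACGAT"))  # MRTMRGWLHD
```

The output for the first 30 bases matches the first block of the protein sequence, MRTMRGWLHD.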