Gene Sare_2113 details


Gene Information

Locus tag: Sare_2113
Symbol:
ID: 5704967
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2434412
End bp: 2435941
Gene Length: 1530 bp
Protein Length: 509 aa
Translation table: 11
GC content: 70%
IMG OID: 641271598
Product: major facilitator transporter
Protein accession: YP_001536969
Protein GI: 159037716
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00711] drug resistance transporter, EmrB/QacA subfamily
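
The start, end, and length values listed above agree with one another if the coordinates are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein length. Both of these are assumptions, not statements made by the record itself. A minimal sketch of that arithmetic, with hypothetical helper names:

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS, assuming one trailing stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2434412, 2435941)
print(length)                  # 1530
print(protein_length(length))  # 509
```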


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.117023
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00370364
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGACGAGCC TCGGTGCGCC GGCCCGGGCG GGGCGGCGGG AGTGGACCGG GCTGGCCGTG 
CTGTGCCTGC CCACCATGCT CTCCCAGGTG GACATCAACG TGCTGATCCT CGCCCTGCCG
CAGTTGACTG CCGATCTGGG GACCAGCGCG ACCCAGCAAC TCTGGGTCAC CGACATCTAC
GGCTTCATGA TTGCCGGCTT CCTGCTCACG ATGGGAACGC TCGGCGATCG GGTCGGCCAC
CGTCGGGTGC TGATGGCCGG CGCGGTCGGG TTCATCGCCG CGTCCCTGCT GGCGGCGTAC
TCGTCATCCA CGGAGATGCT GCTGGTGGCG CGGGCCCTGC TGGGCATCGC CGCGGCGACG
GTCATGCCGT CGGTGCTGGC ACTGATCCGG CGGATGTTCC AGGACCCCAA ACAGCTCGGC
GCCGCGTTCG GGATCTGGGG CTCCTCGATC ATGCTGGGTG TGATGTTCGG CCCGGCGATC
GGCGGTCTGC TACTGAACTC GTTCTGGTGG GGCTCGGTAT TCCTGCTGGG TGTCCCGGTG
ATGCTCCTGT TGCTGGCGGT GGGCCCGGCG CTGCTGCCAG AGTCGCGCAC CGCACACGCC
AGCCGGTTGG ACCTCGTCAG CGTGTTGCTG TCGCTGGCCG CCGTGCTGCC GGTCGTCTGG
GGGCTGAAGG AGTTCGCGCG AGCCGGCTGG GGACCGGAAC CGGTCCTGGC CGTCATCGTC
GGTGTCACGC TCGCCGCGCT GTTCGTGACC CGTCAGCGCC GGCTCACCGA ACCGCTGCTG
GACCTGGAGT TGTTCCGCAA CAAGGTGTTC ACCACGGTGG TGGTCACCGG GCTGGCCATC
GGAGCGGTGA TGGCCGGCAC CGGGCTCGTG GTGACGCTGT ACCTCCAACT CGTGGTGGGC
CTCAGCCCGC TGGAGGTCGG CCTGTGGCTG CTGGTCCCGT CCTTTGCCAT GATCGTCGGC
AGCAACGTGG GCCCAGCCGT CGCCCGGGCC GTTCGGCCCG CGTACGTGAT CGGCACCGGC
CTGTTCGTCG CTGCGGCCGG CATGCTGCTG CTCTCCCAGG TGGATCCCGG CGCAACCCTC
ACCTTGCTGA TCGTCGGTCT GGTTTTGGTC TTCACCGGAA ACAGCCCCAC CGGTACTCTC
GGCAGCTTCC TGCTGATGTC TTCGACACCG CCGCACCGGG CCGGCGTCGC CGGATCGATC
TCGTCGGCCG GCGGTGAGCT GGGCATCGCC CTGGGGATCG CGCTCATGGG TAGCGTCGCC
ACCGCCAACT ACCGCAATGA CGTGGCCCTA CCGGCGGGGT TGCCGGGTGA GGCTGCCGAC
CAGGCCCGAG GGAGCATCGC CGGCGCCGCG TCGGCCGCGA ACGGCCTGCC CACGCCGGTG
GCCACCGAGG TCCTCAACGC CGCTCGGGCC GCGTTCACCG ACTCGCTGCA CACCGTCAGC
CTCGTCAACG CGGTGCTCTT CCTCGCCGTG GCCACCCTCG TCCTGGTCAC GCTCCGGCAC
GCGCCAGCGA TGGGCGCAGC GAAACGCTGA
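
The gene sequence is printed in space-separated blocks of ten bases. The sketch below shows one way to strip that layout and compute a GC fraction to compare against the reported GC content. The function names are hypothetical, and only the first printed row is used as sample input, so its GC value is not expected to equal the whole-gene figure.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First printed row of the gene sequence, copied as displayed.
first_row = "ATGACGAGCC TCGGTGCGCC GGCCCGGGCG GGGCGGCGGG AGTGGACCGG GCTGGCCGTG"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(f"{100 * gc_fraction(seq):.1f}% GC in the first row")
```

Passing the full sequence (all rows joined) to `gc_fraction` would give the value to compare with the GC content field.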
 
Protein sequence
MTSLGAPARA GRREWTGLAV LCLPTMLSQV DINVLILALP QLTADLGTSA TQQLWVTDIY 
GFMIAGFLLT MGTLGDRVGH RRVLMAGAVG FIAASLLAAY SSSTEMLLVA RALLGIAAAT
VMPSVLALIR RMFQDPKQLG AAFGIWGSSI MLGVMFGPAI GGLLLNSFWW GSVFLLGVPV
MLLLLAVGPA LLPESRTAHA SRLDLVSVLL SLAAVLPVVW GLKEFARAGW GPEPVLAVIV
GVTLAALFVT RQRRLTEPLL DLELFRNKVF TTVVVTGLAI GAVMAGTGLV VTLYLQLVVG
LSPLEVGLWL LVPSFAMIVG SNVGPAVARA VRPAYVIGTG LFVAAAGMLL LSQVDPGATL
TLLIVGLVLV FTGNSPTGTL GSFLLMSSTP PHRAGVAGSI SSAGGELGIA LGIALMGSVA
TANYRNDVAL PAGLPGEAAD QARGSIAGAA SAANGLPTPV ATEVLNAARA AFTDSLHTVS
LVNAVLFLAV ATLVLVTLRH APAMGAAKR
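
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon-to-amino-acid mapping of NCBI translation table 11, whose amino acid assignments are the same as the standard code, and translates until the first stop codon. The function name and the choice to stop at the first stop codon are assumptions made for illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a cleaned DNA string in frame 1 up to the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 15 and last 30 bases of the gene sequence, with spaces removed.
print(translate("ATGACGAGCCTCGGT"))                 # MTSLG
print(translate("GCGCCAGCGATGGGCGCAGCGAAACGCTGA"))  # APAMGAAKR
```

Both outputs match the corresponding ends of the protein sequence shown above.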