Gene Sare_2963 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_2963 
Symbol 
ID5707793 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp3370512 
End bp3372050 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content72% 
IMG OID641272412 
Productmajor facilitator transporter 
Protein accessionYP_001537780 
Protein GI159038527 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0316634 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000555172 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCCACCG CGGACGCGCC CGCGCCACGC GCCGCCCGAC TGGTACTGCT CATCCTGCTC 
GTGGCGCAGA TCATGGCGAC GATGGACAAT TCCATCGTCG CCGTGGCCAC GAAGACGATT
CGGGACGACC TGCAGACCTC CGGAGCGGCG CTCCAACTCA TCCTGTCCGG CTACACACTG
ATGTTCGCCG TACTCGTCGT CACCGGCGCC CGGCTCGGCG GCGACATGGG CCACCGCCGG
CTCTTCATGA TCGGCCTCGC CGGTTTCACC GTCAGTTCGC TGATCTGTGG GCTGGCGCCC
ACCGCCGGGA CGCTGGTCGC GGCCCGGCTC GTGCAGGGCG CCTTCGGCGC CCTGATGGTG
CCCCAGGTTC TTTCGGTCAT CCAGATCGTG TTCACCGGCG AGGCGAGGGC CCGCGCCATC
GGCCTGTACT CGATGGTGCT GGCCCTGGGC GTGGCCGCCG GCCAGATCGC CGGCGGCCTG
ATCGTCAGTG CCGACGTGCT CGGCAGCGGT TGGCGGGCGG CGTTCATCGT CAATGTGCCG
GTCGGCCTCG TTCTGTTGGC GGCTGCGCCG CGCTACCTTC CCGGTAGCCA CGAACGCGGC
GAGTTCCGAC CGGACCTCGC GGGCATCGGG CTGCTCGGCG CGTCCATGGC GGCCGTGGTG
GCGCCACTCG TCTTCGGCCG GGAGCAGGGC TGGCCGGCGT GGACACTGGC CACCGTCGCC
ACCGGTGCCG TCGGGCTCGT GCTGTTCGTC CTCTACGAGC TGCGCCTCGC CGGGCGGGGC
GGGCAGCCCG TCCTCGAGTT GGACGCGCTG CGTCCGCCCG GCGTCAAGTC CGGCCTGCTC
GCCTGCTGCA TCCTCAACTT CGCGTTCGCC GGCGTGCTGT TCCCCCTCAC CCTGCACGCC
CAGAACGGGT TGGGCTACAG CCCTCTGCAG GCAGGCCTGA TGTTCATTCC GTACCCGGTC
GGGTTCGCCA CGGTCAGTCT CACCTGGACC CGCCTACCGA AGCGGTTCCA CCAGGTGTTG
CCGGTGGTGG GTCTGGTCGT GTTCGCGATC GCCTTGGCCG CACTGGCCGT GGTGGTCGCC
GGAGGCTGGC CCGTTCCGCT CGTCGCGGCA CTTCTGATGC TTGCCGGAGC CGGCATGGCG
GCCGGGTTCA GCACGTTGGT GGAGCAGACC GCCGCCACGG TCGGGCCCCG GTACGCGGCG
GCGCTGTCCG CCCTCGTGTC CACCGGCACG CTGCTGGCCA GCGTCATCAG CGTGGTCGTC
GTCGGCGGCA TCTATCTGGC CGTCGCCGAG CAGGACCCGT CCCGGTCGGC GCAGGGGCTC
AGCCGCAGCC TCTGGGTGGA CAGTGCGCTG CTGGTCGTGG GGTGCCTGCT GGCGTACCGC
ACCTGGCGGC TGGTGGCCCG GCAGCCGCCG GTCGACGCGA CGGACACCGG TGACGGCGGC
TCCGACGCCG GGCAGGAACT GCCGGCGACG GACACCGCCA CCGCCGGGGT CACCGCCGGA
ACCGGCGACG ACCCCCCGGC CGACAGTCGC ACCGGATGA
 
Protein sequence
MPTADAPAPR AARLVLLILL VAQIMATMDN SIVAVATKTI RDDLQTSGAA LQLILSGYTL 
MFAVLVVTGA RLGGDMGHRR LFMIGLAGFT VSSLICGLAP TAGTLVAARL VQGAFGALMV
PQVLSVIQIV FTGEARARAI GLYSMVLALG VAAGQIAGGL IVSADVLGSG WRAAFIVNVP
VGLVLLAAAP RYLPGSHERG EFRPDLAGIG LLGASMAAVV APLVFGREQG WPAWTLATVA
TGAVGLVLFV LYELRLAGRG GQPVLELDAL RPPGVKSGLL ACCILNFAFA GVLFPLTLHA
QNGLGYSPLQ AGLMFIPYPV GFATVSLTWT RLPKRFHQVL PVVGLVVFAI ALAALAVVVA
GGWPVPLVAA LLMLAGAGMA AGFSTLVEQT AATVGPRYAA ALSALVSTGT LLASVISVVV
VGGIYLAVAE QDPSRSAQGL SRSLWVDSAL LVVGCLLAYR TWRLVARQPP VDATDTGDGG
SDAGQELPAT DTATAGVTAG TGDDPPADSR TG