Gene Sare_0459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0459 
Symbol 
ID5705456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp527158 
End bp528603 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content72% 
IMG OID641269984 
Productmajor facilitator transporter 
Protein accessionYP_001535379 
Protein GI159036126 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2211] Na+/melibiose symporter and related transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000150974 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGACGCGT CGCCGGCTGG GACCGGTCCA CCCGTTGCCG GCCCCACCGC CGCGTTGGCG 
TCGGCCGGTG CCGCCGCCGG TCCACCCCCG GCGGGGGTAC TGCCACGGCG GGTGCATGTG
GGTTACGCGT CCGGATCGCT GGCCACCGGA GCCTTCGGCA CCGTGCCCGG GCTGCTACTG
CTGCCCTACC TGACCGACAC GTTGGGCGTG ACGGCGGGTA TCGCCGCACT GCTCGTGCTG
CTGCCGAAGG CGTGGGACGT GCTGGTCAAC CCGGTCGCCG GGCGGATCTC CGATCGCACC
CGCTCCCGGT GGGGAGCTCG CCGTCCGTAT CTGTTCGGTG CCGGAATCGC GCTCGGTCTG
TTCTTCGCCG CGATCTTCGC GGCACCGTTC GGCACCGGGG CGGCTGCCGG CGGGTACGTG
GCGTTGACCT TCCTCGCCGC CGCGACCGCG TTCGCCTTCT TCCAGGTTCC GTATGTGGCG
ATGCCGGCTG AGCTGACCAC CGGCTACGCC GAGCGTACTC GGCTGATGAC CTGGCGGATC
GCGGTGCTGG CGCTGGCCAT TCTGGTCTCC GGCGCGGTGG CCCCACTGGT CCGCGACGCG
GGCGGCGGCG GTCTCGCGGG GCACCGCTGG ATGGGGTTCT TCGTCGCCGG ACTGATCGTG
CTGGGAGCCC TCGGGGCGTT CGTGGGCACC CGGTCCGCGC CGGTCGGCGC TGTGGCAGAG
AACGAGCCGA CGCTGCGGGC CCAACTCGCG GTGGCCGGCC GCAACCGGGC ATTCCGGGCG
TTGCTGGCCT GCTTTGTGAT TCAGTCCGCC GGAGTGGCGA CGATTCTGGC CGGGGTGAAC
TACTTCGCCA CCCAGATCCT GCGCGACCCG GAGGGCGGGC CCACCATCCT GTTCGTCTGT
TTCGTCGGGC CGGCGATCCT GGTCATGCCG GTGTGGAGCC GGGTCGGGCG GAGGTTCGGC
AAGCTCACCG GGCTGGTCGC CGCCGCGCTC ATCTTCGCTG CCGGCGCACT GGCCCTGGTC
ACCGCCGAGA TACTCCCCGC CGCCGGAATC TACCTGCTGG TCGCGCTGGT CGGCGTCGGG
TACGCCGGCC AGCAGGTCTT CGCCCTCGCC ATGCTGCCCG ACTGCATCGC GTACGACGAG
GCCCGGACCG GTCGGCGGCA GGCGGGCGTG TTCACCGGTG TGTGGACCGC CGGGGAGACC
TTCGGCCTGG CGCTGGGGCC GGGCATCTTC GGGTTGGTAC TCCAGCTCTC GGGCTACCTT
CCATCGGATA CCGGAGCCGC CGCTGCCCAG CCTGACTCCG CCCGGCTCGG CATTCTCCTC
GGCTTCACCG TCCTTCCCGC CCTCCTGGTC GCCGCCGCCA CGCTGCTGCT GCGCCCCTAC
TCCCTCACCC CGGCCGAGCT CGGCGCCGCC ACGGGCCCAC CGCCCGGCGC CAGGGACGCT
CGATGA
 
Protein sequence
MDASPAGTGP PVAGPTAALA SAGAAAGPPP AGVLPRRVHV GYASGSLATG AFGTVPGLLL 
LPYLTDTLGV TAGIAALLVL LPKAWDVLVN PVAGRISDRT RSRWGARRPY LFGAGIALGL
FFAAIFAAPF GTGAAAGGYV ALTFLAAATA FAFFQVPYVA MPAELTTGYA ERTRLMTWRI
AVLALAILVS GAVAPLVRDA GGGGLAGHRW MGFFVAGLIV LGALGAFVGT RSAPVGAVAE
NEPTLRAQLA VAGRNRAFRA LLACFVIQSA GVATILAGVN YFATQILRDP EGGPTILFVC
FVGPAILVMP VWSRVGRRFG KLTGLVAAAL IFAAGALALV TAEILPAAGI YLLVALVGVG
YAGQQVFALA MLPDCIAYDE ARTGRRQAGV FTGVWTAGET FGLALGPGIF GLVLQLSGYL
PSDTGAAAAQ PDSARLGILL GFTVLPALLV AAATLLLRPY SLTPAELGAA TGPPPGARDA
R