Gene Sare_5018 details


Gene Information

Locus tag: Sare_5018
Symbol:
ID: 5705473
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5687137
End bp: 5688810
Gene Length: 1674 bp
Protein Length: 557 aa
Translation table: 11
GC content: 67%
IMG OID: 641274411
Product: SSS family solute/sodium (Na+) symporter
Protein accession: YP_001539752
Protein GI: 159040499
COG category: [R] General function prediction only
COG ID: [COG4147] Predicted symporter
TIGRFAM ID: [TIGR00813] transporter, SSS family
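
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable names are made up for this example and are not part of the database record.

```python
# Values copied from the Gene Information fields.
start_bp = 5687137
end_bp = 5688810

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1674
print(protein_length)  # 557
```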


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000399279
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGACCACGG TCCTCGCGGC TGAGGCGGGC GACAACACCG CCCGGAACCT GACCATCACC 
CTGTTCCTGG TCTTCGTGGC GGTGACGCTG GCGATCACGG TGTGGGCCAG CCGGCAGACC
AAGACCGCCA CCGACTTCTA CGCGGGTGGC AGGTCCTTCT CCGGCTTCCA GAACGGCATG
GCGATCGGCG GCGACTACAT GTCGGCGGCG TCGTTCCTCG GCATCGCCGG CCTCATCGCG
CTCTACGGCT ACGACGGCTT CCTCTACTCG ATCGGCTTCC TGGTCGCCTG GCTGGTGGCG
CTGCTGCTCG TCGCCGAGTT ACTGCGCAAC TCCGGCCGTT ACACGATGGC CGACGTGCTC
GCCTTCCGGA TGCGGCAACG TCCGGTGCGT ACGGCCGCCG CGGCCTCCAC CATCACAGTG
TCGATCTTCT ACCTGCTCGC TCAGATGGTC GGTGCGGGGG CACTGGTCGC TCTGCTGCTC
GGCATCAAGC CGGGCACGAC ATTGCTCGGC ATGGACGCGG ACGCCGCCAA GATCGCCACA
ATCATCATGG TTGGCGCGCT GATGGTCACC TACGTCACGG TGGGGGGCAT GAAGGGCACC
ACCTATGTGC AGATCGTCAA GGCGTTTCTG CTGATGGGGG GCGCCGTCGC GATGACGCTG
CTGGTGCTGG CGAAGTACAA GTTCAACCTC TCCAGCCTGC TCGGCGACGC GGCGGACGCC
TCCGGCAAGG GAGCCGCCTT CCTCGAACCG GGGCTCCGGT ACGGCGTCGA GACACCCGGT
GACGCCCTGA AGACCTTCTA CAGCAAGCTC GATCTGCTCT CTCTCGGCAT CGCGCTGGTG
CTCGGCACGG CCGGCCTGCC GCACATCCTC ATCCGCTTCT ACACCGTGCC CACGGCGAAA
GCCGCCCGGA AGAGTGTCCT CTGGGCGATC GGCATCATCG GCACCTTCTA CCTACTCACC
CTGGCCCTGG GCTTCGGCGC CGCAGCACTC GTGGGAGGCG AGGCGATCAC TGCGCAGGAC
CGGGCTGGTA ATACCGCCGC GCCGCAGCTC GCCGAGGCGC TGGGGATGGA CTTTCTCGGG
GGCGACCTGG GCGGGGCGAC CCTGCTGGCG GTCATCGCGG CGGTCGCCTT CGCGACCATC
CTGGCGGTGG TCGCCGGCCT GACCCTGGCC TCCTCGTCCA GCCTGGCGCA CGACTTCTAT
GCCAACGTCG TCAAGGACGG GGCCGCCTCC GAACGCCAAG AGGTGGCGGT GGCGCGAATC
TCCGCCCTGG TGATCGGCGC GATCTCGATC GTCCTCGCCA TCTTCGCGCA GAACCTGAAC
GTCGCCTTCC TGGTCGCGCT CGCCTTCGCG GTAGCTGCGT CGGGCAATCT GCCGGCGATC
CTCTACAGCC TGTTCTGGCG GCGGTTCAAC ACCAACGGCG CGGTCTGGGC TATCTACGGC
GGCCTCTTCG CCGCGGTCTT CCTGGTGTTC TTCTCCCCGG TCGTCTCCGG GTCGCCGACC
GCGATGTTCC CCGACCAGGA CTGGCAGTGG TTCCCGCTGT CCAATCCGGG CCTGCTCTCC
ATCCCGTTCG GCTTCCTCTG CGGCTGGATC GGAACGGTGA TCTCCAGGGA ACGCGACGAA
CAGCGGTACG CCGAACTGGA GGTGCGCTCA CTTACCGGCG CGGGCGCGCA CTGA
 
Protein sequence
MTTVLAAEAG DNTARNLTIT LFLVFVAVTL AITVWASRQT KTATDFYAGG RSFSGFQNGM 
AIGGDYMSAA SFLGIAGLIA LYGYDGFLYS IGFLVAWLVA LLLVAELLRN SGRYTMADVL
AFRMRQRPVR TAAAASTITV SIFYLLAQMV GAGALVALLL GIKPGTTLLG MDADAAKIAT
IIMVGALMVT YVTVGGMKGT TYVQIVKAFL LMGGAVAMTL LVLAKYKFNL SSLLGDAADA
SGKGAAFLEP GLRYGVETPG DALKTFYSKL DLLSLGIALV LGTAGLPHIL IRFYTVPTAK
AARKSVLWAI GIIGTFYLLT LALGFGAAAL VGGEAITAQD RAGNTAAPQL AEALGMDFLG
GDLGGATLLA VIAAVAFATI LAVVAGLTLA SSSSLAHDFY ANVVKDGAAS ERQEVAVARI
SALVIGAISI VLAIFAQNLN VAFLVALAFA VAASGNLPAI LYSLFWRRFN TNGAVWAIYG
GLFAAVFLVF FSPVVSGSPT AMFPDQDWQW FPLSNPGLLS IPFGFLCGWI GTVISRERDE
QRYAELEVRS LTGAGAH