Gene Sare_2620 details


Gene Information

Locus tag: Sare_2620
Symbol:
ID: 5703876
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2984396
End bp: 2985994
Gene Length: 1599 bp
Protein Length: 532 aa
Translation table: 11
GC content: 67%
IMG OID: 641272081
Product: periplasmic solute binding protein
Protein accession: YP_001537451
Protein GI: 159038198
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0803] ABC-type metal ion transport system, periplasmic component/surface adhesin
TIGRFAM ID:
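
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the stop codon


gene_length = span_length(2984396, 2985994)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1599 532
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of the record.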


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGCCTGG TGGCCGTGCC ATGGCTGCGG CAGGCAATCC TGCCGCTGCT GGTCGTGGGG 
CTGGTCACTG GCTGTCGGTC GGAGGTGGTG TTGAGCGCGA ACGACGAGCG CCCGCAGGTC
ATCACGACCA CCGGAATCCT GCGGGACCTG GTGAGCAACG TCGGCGGTGA GCGGGTAGCG
GTCAGCTCGT TGGTGCCGGA TGGTGCCGAT CCGCACAGCT ACGAGCCGTC GCTGCGTGAT
ATCCGCGACG TCGTCTACGC GGACGTCGCG TTCAGCAACT ATCTGCTTCT GGAAGAGCAG
GCCATCATCA AGGCACTGGA CGCGAACCTG CGTGATGGCG TACCGAACAT CTCGCTGGCT
GAGGGAGCGG TGAAGTACGC CGCGGAGATC ATCCCGCTGG TTGAGGACGC CTCACTGGAC
ACGATCTGGC TGGGCATGCG GGTACGCGGC ACCGGTGCGA CGCACGGGGC GAACCGCGCC
TCGGACGTGT TGCTCAGCGC CACCTCGGCC GACGGTCCGG GTGGTTTGGT CGCCTACCTG
ACCGAATCAT TTGGCAACCC GTCGTTCTAC GTCGACTCCA CCGACGGGTT CGATCCCGCC
AATGGTTTCC GCGACGACAC CGCCACGCTG CCGCCCGCCG CCCACACGCA CATGAGCTGG
GCGTTCACCA AACCGGGTGT GTACCGGCTG ACCATGCAGG CGAAGCTGTC CGTTGACCCG
CAGTCGCCGC CGGTCCCGAT GGGTGATCAG GTGTTCACGT TCGCCGTCGG GGTGGATCCG
CACAAGGTGC CGGGCATGGC CGGTGCGGTG GTCCTGAATT CGGGGCACGC CGACGTCACC
GTCGACCTCG ACCAGCAGCG GCTCTACCTG TTCGCCGACC CCGGGGGCAG CGGCGAGGCC
AACCAGCGGG TGTACGACCC GGCCCGTACC GTCATCGAAG TACCGGGCAA GGCGCTGCTC
GACGTTCCCG GTGACCGGAG GTTCCGATTC CTCGGCCGGC CCGGTACCCA GGTCTACCAG
CTTCCGCAGG CGGTCCTCGG ACGGCACGTG CACGGGGAGA TCGACCCGCA CCTGTGGCAG
AACGTCCGCA ACGCCATCTC CTACACCGAG CTCATCCGCG ACACCCTCAT CGGGGTCGAC
CCCCAGGGCG CCTCGGCCTA CCAGGCGAAC GCCTCGGCCT ACATCCGGGA GCTGGAAGCA
CTCGACACCT ACGTGCGGGA CACCATCGCA CGGATTCCGC CGTCGCGACG TCACCTGGTC
ACCACACATG ACGCGTTCGG CTACCTGGGG CAGGCGTACG GCATCCAAAT CTCCGGGTTC
GTCACCCCGA ACCCAGCCGT CGAACCGAGC CTGGCCGACC GTCGCCGCCT CACCGCGACG
ATCCGCAACT TGAGGATCCC CGCGGTGTTC CTGGAACCGA ACCTGGCAGC CCGCTCCACC
ACCCTCGACG AAGTCGCCCG TGAGGAAGGG CTGCGTGTCT GTGCGATCTA CGGCGACACC
CTCGACGGCG ACATCACCAG CTACGCACAG ATGATGCGGT TCAATGCCGA ATCGCTGCAC
GACTGCCTGA CCACCGACAA GCAGGGGAGG ACCCAGTGA
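
The gene sequence is printed in blocks of ten bases, so the whitespace has to be removed before the sequence can be measured. The sketch below shows one way to join the blocks and compute a GC percentage comparable to the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the sequence is used as a short demonstration input.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used as a short demonstration.
first_line = "GTGAGCCTGG TGGCCGTGCC ATGGCTGCGG CAGGCAATCC TGCCGCTGCT GGTCGTGGGG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full 1599 bp sequence to `gc_percent` gives a value that can be compared with the GC content reported in the Gene Information section.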
 
Protein sequence
MSLVAVPWLR QAILPLLVVG LVTGCRSEVV LSANDERPQV ITTTGILRDL VSNVGGERVA 
VSSLVPDGAD PHSYEPSLRD IRDVVYADVA FSNYLLLEEQ AIIKALDANL RDGVPNISLA
EGAVKYAAEI IPLVEDASLD TIWLGMRVRG TGATHGANRA SDVLLSATSA DGPGGLVAYL
TESFGNPSFY VDSTDGFDPA NGFRDDTATL PPAAHTHMSW AFTKPGVYRL TMQAKLSVDP
QSPPVPMGDQ VFTFAVGVDP HKVPGMAGAV VLNSGHADVT VDLDQQRLYL FADPGGSGEA
NQRVYDPART VIEVPGKALL DVPGDRRFRF LGRPGTQVYQ LPQAVLGRHV HGEIDPHLWQ
NVRNAISYTE LIRDTLIGVD PQGASAYQAN ASAYIRELEA LDTYVRDTIA RIPPSRRHLV
TTHDAFGYLG QAYGIQISGF VTPNPAVEPS LADRRRLTAT IRNLRIPAVF LEPNLAARST
TLDEVAREEG LRVCAIYGDT LDGDITSYAQ MMRFNAESLH DCLTTDKQGR TQ
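
The protein sequence follows from the gene sequence through translation table 11, in which codon-to-residue assignments match the standard genetic code and GTG can act as a start codon read as methionine. The sketch below is a simplified, dependency-free illustration rather than the tool used to produce this record. The names `CODON_TABLE`, `START_CODONS` and `translate_cds` are hypothetical, and the start-codon set is deliberately limited to three common bacterial start codons.

```python
BASES = "TCAG"
# Standard amino acid assignments, ordered by first, second, third base in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {}
_index = 0
for b1 in BASES:
    for b2 in BASES:
        for b3 in BASES:
            CODON_TABLE[b1 + b2 + b3] = AMINO_ACIDS[_index]
            _index += 1

# Simplifying assumption: only these start codons are recognised.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(blocked_dna: str) -> str:
    """Translate a CDS, reading a recognised start codon as Met and stopping at the first stop."""
    dna = "".join(blocked_dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    protein = []
    for pos in range(0, usable, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAGCCTGG TGGCCGTGCC ATGGCTGCGG"))  # MSLVAVPWLR
```

The output matches the first ten residues of the protein sequence, including the initial M encoded by the GTG start codon.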