Gene Sare_0747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_0747 
Symbol 
ID5707779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp831413 
End bp832714 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content66% 
IMG OID641270266 
Productextracellular solute-binding protein 
Protein accessionYP_001535657 
Protein GI159036404 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00190936 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTCGCA CAGCCAAGGG GGTCGCCGTA CTTGCCTCCA CCACCCTTGC TCTGGCCCTC 
GCCGCCTGTG GCGGGGACAG TCAGGGCGAG GAGGAGCGCC CAGCGGCGGA TCCCGCAGCT
ATGAAGGCAG AACTGACCTG GTGGGACACG TCAGACCCGA AGAACGAGGG TCCGGTGTTC
CAGGAGCTGA TCGCACGGTT CAACGAGACC TACCCGAGTA TAAAGATCAA CTATCAGTCG
GTCCCGTTCG GTGAGGCCCA GAACAAGTTC AAGACCGCCG CGCAGGCCAA GACCGGCGCA
CCGGACATCC TGCGGGCGGA GGTGGCCTGG GTGCCGGAGT TTGCCTCGCT GGGCTACCTC
TACGCGCTGG ATGGCTCCGA GCTGCTTGCC GACGAGGCGG ACTTCCTGGC TACCCCGCTC
GCGTCGAACA AGTACGACGG CAAGACCTAC GGCGTCCCGC AGGTGACCGA CACGCTGTCG
CTCATGTACA ACAAGGAACT GTTGGCCGAG GCCGGCGTCG CCGCAGCGCC GACGACCTGG
GCCGAGCTGA AGACCGCGGC CCAGGCCGTC ACGCAGAAGA CCGGTGCCGA GGGCCTCTAC
GTCAATCCGG CCGGCTACTT CCTGCTGCCC TTCATGTACG GCGAGGGCGG CGACCTGGTC
GACGTCGAGG CCAAGAAGAT CACCGTTGGC TCGGACCGTA ACGTCGCCGG GCTGAAGATC
GCCAAGGACC TGATCGACAG CGGTGCCGCC GTCAAGCCCT CCGCGAACGA TTCCTACGGG
ACGATGATGA CGCTCTTCAA GGAGCAGCAG GTCGCCATGA TCATTAACGG TCCGTGGGAG
GTCAACAACG TTACGCAGGC GCCGAGCTTC GGTGGCGCGG AGAACCTCGG CATCGCTCCG
GTCCCGGGCG GCTCGGCCAG GGCCGGCGGC CCGGTCGGGG GGCACAACTA CACCATCTGG
TCCGGGATGC CACAGGAGAA GGTCGACGCC GCGGTCGCGT TCGTGGCCTT CATGAGTTCC
ACCGAGTCGC AGGCATTCCT CTCCGAAAAA CTCGGCCTGC TGCCGACCCG CAAGTCGGCC
TACGACCTCG ACGCGGTGCG GAACAACCCG ATCGTCACCG CCTACCAGCC CGCCGTGGAG
GCCGCCGTGG GCCGTCCCTG GATTCCCGAG GCCGGCCAGT TCTTCGAACC GCTGGACCAG
ATGGCCACCG AGGTTCTGAT CCAGAACCGG GATCCGAAGG CCGCGCTCGA CGCTGTCGCC
AAGAGGTACC AGGCGGAGGT CGTCACCTCG TTCGGGCTCT GA
 
Protein sequence
MSRTAKGVAV LASTTLALAL AACGGDSQGE EERPAADPAA MKAELTWWDT SDPKNEGPVF 
QELIARFNET YPSIKINYQS VPFGEAQNKF KTAAQAKTGA PDILRAEVAW VPEFASLGYL
YALDGSELLA DEADFLATPL ASNKYDGKTY GVPQVTDTLS LMYNKELLAE AGVAAAPTTW
AELKTAAQAV TQKTGAEGLY VNPAGYFLLP FMYGEGGDLV DVEAKKITVG SDRNVAGLKI
AKDLIDSGAA VKPSANDSYG TMMTLFKEQQ VAMIINGPWE VNNVTQAPSF GGAENLGIAP
VPGGSARAGG PVGGHNYTIW SGMPQEKVDA AVAFVAFMSS TESQAFLSEK LGLLPTRKSA
YDLDAVRNNP IVTAYQPAVE AAVGRPWIPE AGQFFEPLDQ MATEVLIQNR DPKAALDAVA
KRYQAEVVTS FGL