Gene Sare_1507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_1507 
Symbol 
ID5703492 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp1736531 
End bp1737502 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content65% 
IMG OID641271013 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001536394 
Protein GI159037141 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1175] ABC-type sugar transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000203032 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAGTCACT CGGCCGCCAC GGCTCCGCCG ACACCAACTG CACCACCAGC CCCGCCGGGC 
GGCGCCCGGG AACGTCGACG GTTCACCCTG GACCGCCTGG ACCTCACCTA CTCGCCGTAC
GTCTACATCG CGCCGTTCTT CCTGATCTTC GGCGCGTTCG GGCTGTACCC GATGGCGCGT
ACCGCCTGGA TGTCGTTGCA CGACTGGGAC ATGATCGGCG AGCGCACCTT CATCGGGTTG
GACAACTACA CCCGGTTGCT GTCCGACGAC TACTTCTGGA ACGCGCTGGT CAACACGTTC
GGCATCTTCG CCCTGTCGAC CATCCCACAG CTGCTGCTGG CGCTCTTTCT GGCGAACCTG
CTCAACCGCA CCCTCCTGCG GGCCAAGACC TTCTTCCGGA TGGCCATCTT CATCCCGAAC
GTCGTCTCGG TGGCCGCGGT CGCGATCGTC TTCGGCATGC TCTACCAGCG CGAGTACGGG
CTGGTCAACT GGCTGCTCGG CTTCGTTGGG ATCGACCAAA TTGACTGGGA TGGGCAGACC
TGGAGCTCCT GGACGGCGAT CGCGTCCATG GTCAACTGGC GGTGGACGGG GTACAACACC
CTGATCCTGC TCGCCGGCAT GCAGGCCATC CCTCGGGACC TCTACGAGGC GGCCGAGATC
GACGGTGCCG GCCAGTGGCG GCAGTTCTGG CGAATCACCC TGCCCCTGCT CAGGCCGACG
TTCGTCTTCG TGGTCATCCT CTCCACGATC GGCGGCATGC AGCTGTTCAC CGAACCGCTG
CTCTTCGCCA ACGGCAGCAT CATCGGCGGC AACCAGCGCG AGTTCCAGAC CCTGGCCATG
TACATGTACG AGATGGGGCT GGTGAACCTC AACAGTGCCG GTTACGGGGC CGCCGTCGCC
TGGGCCCTCT TCATGATTAT CGGCCTGATG TCGCTGCTCA ACTTCGTCCT CGTCCGCCGC
GCGGCCACGT GA
 
Protein sequence
MSHSAATAPP TPTAPPAPPG GARERRRFTL DRLDLTYSPY VYIAPFFLIF GAFGLYPMAR 
TAWMSLHDWD MIGERTFIGL DNYTRLLSDD YFWNALVNTF GIFALSTIPQ LLLALFLANL
LNRTLLRAKT FFRMAIFIPN VVSVAAVAIV FGMLYQREYG LVNWLLGFVG IDQIDWDGQT
WSSWTAIASM VNWRWTGYNT LILLAGMQAI PRDLYEAAEI DGAGQWRQFW RITLPLLRPT
FVFVVILSTI GGMQLFTEPL LFANGSIIGG NQREFQTLAM YMYEMGLVNL NSAGYGAAVA
WALFMIIGLM SLLNFVLVRR AAT