Gene Sare_0393 details


Gene Information

Locus tag: Sare_0393
Symbol:
ID: 5705652
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 453427
End bp: 454731
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 68%
IMG OID: 641269918
Product: extracellular solute-binding protein
Protein accession: YP_001535313
Protein GI: 159036060
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
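
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic, and a short check can make that relationship explicit. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither assumption is stated in the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 453427
end_bp = 454731
gene_length_bp = 1305
protein_length_aa = 434


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
coordinates_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
```

With these values, 454731 - 453427 + 1 gives 1305 bp, and 1305 / 3 - 1 gives 434 aa, which agrees with the listed lengths.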


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0128577
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTGCCA CCAACCGCCG CCGCCTCGCC GCGGTCGCCC TCGCAGCCGC CACCACCCTG 
CTCGTCACCA CCGCCTGCGG AAACGGCGAC GACGCGGGCG ACGGCACGAT CACACTCACC
GTCGACGTCT TCGGTCAATT CGGCTACGAC GAGCTGTACC AGCAGTACAT GGACGACAAC
CCCGGTGTGA CGATCGTCGA GCGGGGCACC GGTGGCAACC TCGACGAGTA CTCCCCCAAG
CTCACCCAGT GGCTGGCCGC CGGAAAAGGT GCCGGCGACA TCGTCGCCAT CGAGGAGGGC
CTGCTGGTCG AATACAAGGC CAACCCGGAC AACTTCGTCA ACCTGCTCGA CCACGACGCG
GGCGAGTTGC AGGGCAACTT CCTGGACTGG AAGTGGAACG CCGGGCTCAC CGCGGACGGT
GCGCACCTGA TCGGGCTCGG CACCGATGTC GGCGGCATGG CGATGTGCTA CCGCACGGAC
CTGTTCGCCG AGGCCGGGCT GCCAACCGAA CGGGACGCCG TCTCCGCACT CTGGCCGACC
TGGGCGGACT ACATCCGCGT CGGCGAGAGG TTCACCGCCG CGAAGACCGG GGCGGCGTTC
CTGGACGCCG CCACCAACAC GTTCAGCACG ATCGTGTTGC AGACGGCTGG CAACACGAAC
GGCTACCACT ACTACGACAC CAACGACGAG CTCGTCGTGG ACACCAACCC GGCCGTGCGG
CAGGCGTGGG ACACCACCAT GGACATCATC GACTCCGGCC TGTCCGGAAG GTACAGCGCG
TGGTCGGAGG AGTGGGTCTC CGCCTTCAAA CAGGCCACGT TCGCCACCAT CGCCTGCCCC
GCCTGGATGA CCGGCGTCAT CGAGGGCAAC ACCGGAACCG CGGGCGCGGG CAAGTGGGAC
ATCGCCCGGG TGCCCGGCGA CGGTGGCAAC TGGGGCGGCT CGTACCTCGC CGTGCCGAAA
CAAAGCCGGC ACCAGGCGGA AGCCGTCAAA CTGGCCATGT TCCTGACCAG CGCCGAAGGG
CAGATCGGGG CGTTCAAGGC CAAGGGCCCG CTGCCCTCGT CGCCGCAGGC ACTCGGCGAC
CCCGCGGTCG CCGAAGCCAC CAACGCGTAC TTCTCCGACG CCCCCGTCGG GCAGATCTTC
GCCGCCGGCG CCAAGGGGTT GAAGCCGGTC TACATGGGCC CGAAGAACCA GGCCGTCCGC
ACCGAGGTGG AAAACGCGGT CCGCACCGTC GAGCTCGGTC AGCGGACCCC CGAGCAGGGC
TGGAGCGACG CGGTGACGAA CGCGAAGAAG GCCGCCGCCA AGTAG
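
The gene sequence above is displayed in groups of 10 bases, so it needs its whitespace removed before any computation. The following is a minimal sketch of that cleanup together with a GC percentage calculation of the kind that would produce the GC content field. The helper names are hypothetical, and the sample input is only the first display line, so its result is not expected to match the whole-gene figure of 68%.

```python
def clean_sequence(raw: str) -> str:
    """Drop the spaces and line breaks used for 10-base display grouping."""
    return "".join(raw.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return 100.0 * gc_count / len(sequence)


# First display line of the gene sequence above, used as sample input.
first_line = "ATGGGTGCCA CCAACCGCCG CCGCCTCGCC GCGGTCGCCC TCGCAGCCGC CACCACCCTG "
cleaned = clean_sequence(first_line)
first_line_gc = gc_percent(cleaned)
```

To compare against the listed GC content, pass all lines of the gene sequence to `clean_sequence` as one string and then call `gc_percent` on the result.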
 
Protein sequence
MGATNRRRLA AVALAAATTL LVTTACGNGD DAGDGTITLT VDVFGQFGYD ELYQQYMDDN 
PGVTIVERGT GGNLDEYSPK LTQWLAAGKG AGDIVAIEEG LLVEYKANPD NFVNLLDHDA
GELQGNFLDW KWNAGLTADG AHLIGLGTDV GGMAMCYRTD LFAEAGLPTE RDAVSALWPT
WADYIRVGER FTAAKTGAAF LDAATNTFST IVLQTAGNTN GYHYYDTNDE LVVDTNPAVR
QAWDTTMDII DSGLSGRYSA WSEEWVSAFK QATFATIACP AWMTGVIEGN TGTAGAGKWD
IARVPGDGGN WGGSYLAVPK QSRHQAEAVK LAMFLTSAEG QIGAFKAKGP LPSSPQALGD
PAVAEATNAY FSDAPVGQIF AAGAKGLKPV YMGPKNQAVR TEVENAVRTV ELGQRTPEQG
WSDAVTNAKK AAAK
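
The protein sequence above can be related to the gene sequence by codon-by-codon translation. The sketch below builds a codon table from the NCBI codon ordering; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may act as starts. Alternative start codons are not handled here, which is an acceptable simplification for this gene because it begins with ATG. The `translate` function is illustrative and is not the method used by the source database.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (third base varies fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, dropping one trailing stop codon if present."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First 30 bases and last 15 bases of the gene sequence listed above.
n_terminus = translate("ATGGGTGCCA CCAACCGCCG CCGCCTCGCC")
c_terminus = translate("GCCGCCGCCA AGTAG")
```

The two fragments translate to `MGATNRRRLA` and `AAAK`, matching the start and end of the listed protein sequence.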