Gene Sare_2552 details

Gene Information

Locus tag: Sare_2552
Symbol:
ID: 5706406
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 2905350
End bp: 2906681
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 69%
IMG OID: 641272015
Product: extracellular solute-binding protein
Protein accession: YP_001537385
Protein GI: 159038132
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1653] ABC-type sugar transport system, periplasmic component
TIGRFAM ID:
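
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the stop codon is counted in the gene length but not in the protein length. The function name `check_cds_lengths` is hypothetical and is not part of any database tooling.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if the coordinates, gene length and protein length agree.

    Assumes 1-based inclusive coordinates and a CDS that ends with one
    stop codon, which does not contribute a residue to the protein.
    """
    span = end_bp - start_bp + 1                 # inclusive coordinate span
    codons, remainder = divmod(gene_length_bp, 3)
    return (
        span == gene_length_bp                   # coordinates match the stated length
        and remainder == 0                       # whole number of codons
        and codons - 1 == protein_length_aa      # drop the stop codon
    )


# Values taken from the record for Sare_2552.
ok = check_cds_lengths(2905350, 2906681, 1332, 443)
print(ok)
```

For this record the span is 2906681 - 2905350 + 1 = 1332 bp, which is 444 codons, or 443 residues once the stop codon is excluded.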


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000382941
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGAAGTT CCAGGGCGCT GCTGGCGGCG CTGCTCTCGA CAGTGCTGTT CGCCACCGGC 
TGCGGATCCG GGCTCGGGGC CGCGTCCGAC GGCACGGGGC CGGTGCGACT GTTGGTCTTC
GGCGCACCCG AGGAGTTGGC CGCGTACCGC ACGCTGATCG AGGCGTACGG TCAGGCACGG
CCCGGGAACG AGGTGCAGCT CATCGAGGCG AGCGACCGCA AGGACCTGCT GGCCCGGCTG
GCCACGTCGG TCGCCGGGGG CGCCCCGCCG GACCTGTTCC TGATGAACTA TCGCTTCTAC
GGCCAGTTCG CCGCGAAGAA CGTGGTCGAG CCGTTGGACG AGCGCATCGC CGCGTCCGAG
AAAGTGGATC CCGACGACTA CTACCCGGTG GCGATGAACG CCTTCACCTG GGGCGGCAAA
CAGCTCTGCC TGCCACAGAA CGTGTCCAGT CTCGCCGTCT ACTACAACCG CACCCTGTTC
GCCAAGTACC AGGTCCCCGA GCCGAAGGCC GGCTGGACCT GGAACGACAT GGTCGGTACC
GCCATCGCCA TGACCCGGGA CGCCCGCGGT GTGGTGGTCA AGGGCACCGA GAGCGAGGGC
GCCGCCGTCC GGCCAGCCGT ACACGGGCTC GGCGTCGAGC CGTCGATCAT TCGCGTTGCC
CCGTTCGTGT GGTCCGCCGG CGGCGAGATC GTCGACGACC CGAACCGGCC GACCCGGCTC
ACCCTGGACA CCCCGGTCGG ACGGGAGGCA CTGAAGAACC TGGTCGACCT ACGGCAGGCG
TACGGCGTGG TGCCCACGGA CGAGGAGGTC GAGGCCGAGG ATGACGAGTC CCGCTTCGCC
AACGGCCGAC TCGCCATGCT GATGTCCTCG CGGCGCTCCA CCACCACCTT CCGCTCGATC
ACTGACTTCG AGTGGGACGT CGCCCCGCTG CCGGTCTACC AGGACCAGGT CGGGGTGCTG
CACTCGGACG CGTACTGCAT GACCCGGAGC GCGAAGCGTA AGGACGCGGC ATGGCGGTTC
CTGGAGTTCG CCATCTCCGC CGAAGGACAG CGGATCATCG CCGCCACCGG AAGGACGGTA
CCGTCGCACA TCGACGTCTC GCGCTCCTCG GTGTTCCTCG GCCCGTCCCA GCCGCCGCGC
AGCGCGACGG TCTTCCTCGA CACGATTCCC ACCCTCCGGA CACTGCCGAC CGTCTCCACC
TGGCCCGAAG TCGAGGATGT GACCGCCGGG ATCCTGGAGA ACGCGCTGTA CCGGGGCGAC
CGGTTGGACG ATGTCATCCG CGCCGTCGAT GAGCAGACCC GCCCGCTGTT CGCACGTGGT
GAGCACGGGT GA
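
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. As an illustration, the sketch below computes the GC fraction of such a sequence. The helper name `gc_fraction` is hypothetical, and the code assumes the input contains only the letters A, C, G and T plus whitespace.

```python
def gc_fraction(seq):
    """Return the fraction of G and C bases in a whitespace-grouped sequence."""
    bases = "".join(seq.split()).upper()   # drop spaces and line breaks
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First row of the gene sequence, used here only as sample input.
first_row = "ATGAGAAGTT CCAGGGCGCT GCTGGCGGCG CTGCTCTCGA CAGTGCTGTT CGCCACCGGC"
print(f"{gc_fraction(first_row):.0%}")
```

Passing the full gene sequence instead of the first row gives a value that can be compared with the GC content field of the record.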
 
Protein sequence
MRSSRALLAA LLSTVLFATG CGSGLGAASD GTGPVRLLVF GAPEELAAYR TLIEAYGQAR 
PGNEVQLIEA SDRKDLLARL ATSVAGGAPP DLFLMNYRFY GQFAAKNVVE PLDERIAASE
KVDPDDYYPV AMNAFTWGGK QLCLPQNVSS LAVYYNRTLF AKYQVPEPKA GWTWNDMVGT
AIAMTRDARG VVVKGTESEG AAVRPAVHGL GVEPSIIRVA PFVWSAGGEI VDDPNRPTRL
TLDTPVGREA LKNLVDLRQA YGVVPTDEEV EAEDDESRFA NGRLAMLMSS RRSTTTFRSI
TDFEWDVAPL PVYQDQVGVL HSDAYCMTRS AKRKDAAWRF LEFAISAEGQ RIIAATGRTV
PSHIDVSRSS VFLGPSQPPR SATVFLDTIP TLRTLPTVST WPEVEDVTAG ILENALYRGD
RLDDVIRAVD EQTRPLFARG EHG
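
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal illustration and not the pipeline that produced this record. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, so the table is built from the standard 64-codon layout. The function name `translate` is hypothetical, and the first codon is translated literally, which is adequate here because the gene begins with ATG.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
# '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate a whitespace-grouped DNA sequence up to the first stop codon."""
    bases = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":          # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence in this record.
print(translate("ATGAGAAGTT CCAGGGCGCT GCTGGCGGCG"))  # MRSSRALLAA
print(translate("GAGCACGGGT GA"))                     # EHG
```

The two outputs match the first ten and last three residues of the protein sequence listed above.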