Gene Strop_3264 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_3264 
Symbol 
ID5059729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp3742197 
End bp3743447 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content68% 
IMG OID640475512 
Productextracellular solute-binding protein 
Protein accessionYP_001160076 
Protein GI145595779 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.591163 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGGA TGCGTAGGGC GGCCGTCGCC GCGGTTGGGG CGCTCGCGTT GCTGTCGCCC 
GCCGCTTGCG GTGGTGCCGA CAGCGGGGCC GACCAGGAGG TGGAGGTCTT CACCTGGTGG
GCCGACGGGG GCGAGAAGGC GGGCCTCGAC GGTCTGGTCG CTGCCTTCGA CGAGCAATGT
GACTACTCGT TCGTGAACGG GGCGGTGGCC GGCGGCGCCG GTTCGAACGC CAAGCAGGTA
CTGGCCTCCC GACTCCAACA GGGCGACGCG CCGGACACCT TCCAGGCCCA CGCCGGCGCC
GCGCTGTCGG AATACATCGC AGCCGGCCAG ATCGAGGATC TCAGCGCCCT GTACGACGAG
TGGGGCCTGA CCGAGGCGCT ACCGCCGGGA CTGATCGACA ACCTCAGCGT GGACGGCAAG
GTCTACTCGG TGCCGGCGAA CGTCCACCGG TCGAACGTGC TCTGGACGAA CACGTCGGTC
CTGGCCGATG CGGGGATCAC GGCCGAGCCG ACGACGCTGG CCGACCTCCT CGCCGCGCTC
GACACACTGA AGGCCGCGGG CATCAGTGCG CCGCTCGCGA TCGGCAAGGA CTGGTCCCAG
CTGATGCTGC TGGAGGCGGT GCTGATCAGT GACCTCGGCC CGGAGGGCTT CACCGGCCTC
TGGACCGGTG CGACCGACTG GAACAGCCCC GAGGTCACCC AGGGCCTGGA GAACTACAAG
CGGCTGCTCA GCTACACCAA CACGGACCGG GACACCTACG ACTGGACCGA CGCTGGCAAG
CTCCTCATGG ACGGCAAGGC CGGCTTCTTC CTGATGGGGG ACTGGGCGCC GAGCGACTTC
GAAGCCAAGG GCTTCACCGA CTTCGGTTAC ATCACGTTCC CGGGTAACGG GGACACCTTC
CAGTGGCTCG CCGACTCCTT CGTGTTGCCG CAGGGAGCCG ATAACCCCGA GGGCACCAAG
TGCTGGCTGA AGACGGTCGG CAGCGCCGAG GGACAGCAGG CGTTCAACCT CAAGAAGGGC
TCCATCCCCG CCCGTACCGA CGCCGTCGAG GCCGACTACC CCGCCTACCA GCAGTCGGCC
ATCCAGGCGT GGAAGACCGG CACGCAGGTC CCGTCCTGCG CCCACGGTGC CGCCTGCTCG
CAGGGTGCCA TCGAGGCCGC GAACTCCGCG ATCGGCAAGT TCTCCAGCGA CCAGGACCTG
GCGGGACTGC AAAAGGCAAT GTCCGCCGCC GCCGCGCTCG GCAAGAACTA G
 
Protein sequence
MSRMRRAAVA AVGALALLSP AACGGADSGA DQEVEVFTWW ADGGEKAGLD GLVAAFDEQC 
DYSFVNGAVA GGAGSNAKQV LASRLQQGDA PDTFQAHAGA ALSEYIAAGQ IEDLSALYDE
WGLTEALPPG LIDNLSVDGK VYSVPANVHR SNVLWTNTSV LADAGITAEP TTLADLLAAL
DTLKAAGISA PLAIGKDWSQ LMLLEAVLIS DLGPEGFTGL WTGATDWNSP EVTQGLENYK
RLLSYTNTDR DTYDWTDAGK LLMDGKAGFF LMGDWAPSDF EAKGFTDFGY ITFPGNGDTF
QWLADSFVLP QGADNPEGTK CWLKTVGSAE GQQAFNLKKG SIPARTDAVE ADYPAYQQSA
IQAWKTGTQV PSCAHGAACS QGAIEAANSA IGKFSSDQDL AGLQKAMSAA AALGKN