Gene Strop_1560 details

Gene Information

Locus tag: Strop_1560
Symbol:
ID: 5058013
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 1772680
End bp: 1774116
Gene Length: 1437 bp
Protein Length: 478 aa
Translation table: 11
GC content: 71%
IMG OID: 640473828
Product: glycoside hydrolase family protein
Protein accession: YP_001158404
Protein GI: 145594107
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase
TIGRFAM ID: [TIGR03356] beta-galactosidase
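
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1772680
END_BP = 1774116


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons minus one stop codon.

    Assumes an unspliced CDS whose final codon is a stop codon.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1437
print(expected_protein_length(length_bp))  # 478
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields of this record.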


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.677245
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGAGTAATC CCGCGAGCCC ACCCGCCGTG GGCGTTCTCG CCGAACGCCC GCCACTGGCC 
TTCCCGCCCG GCTTCCTCTG GGGCGCCGCC ACCGCGGCCT ACCAGATCGA GGGCGCGGCG
ACCGCCGGCG GTCGGACGCC GTCGATCTGG GACACCTTCA GCCACACCCC GGGCCGGGTG
GTGGCCGGGC ACACCGGTGA CGTGGCGTGC GACCACTACC ACCGCCTCGA CTCAGACGTC
GCCCTCATGG CCGAGTTGGG GCTGAGGTCG TACCGGTTCT CGGTCTCCTG GCCTCGGGTG
CAGCCCGGCG GAACCGGCCC GATCAACCAG GAGGGACTCG ACTTCTACCG GCGGCTGGTG
GACCAGCTGC TGGCGAACGG CATCGAGCCG TGGCTGACTC TCTACCACTG GGACCTGCCG
CAGCCGCTGG AGGACGCGGG CGGCTGGCCG GCCCGGGACA CCGCCGCCCG GTTCGCCGAC
TACGCCGCCC TGGTCGCCGG CGCGCTCGGC GACCGGGTGC GGTACTGGAC CACCCTCAAC
GAGCCGTGGT GCTCGGCGTT TCTCGGGTAC GGTTCCGGGG CGCACGCCCC GGGTCGGTCC
GACCCCGCCG CCGCGGTCCG GGCCGGTCAC CACCTGCTGC TCGGTCACGG GCTGGCCGTG
CCGGCGCTGC GGGCGGCCGC CCAGTCCGAG GTTGAGATCG GGGTGACCCT CAACCTGTAC
CCGGTGACCC CGGCCACCGA TTCGCCCGGC GACGCCGACG CGGCACGCCG AATCGACGGG
CTGGCGAACC GGTTCTTCCT CGACCCACTG CTGCGCGGGT CATACCCGGC GGATCTGATG
TCCGATCTCA GGCAGGTCAG CGACTTCGGG CACGTGCGTG CGGGGGACCT GGCCACCATC
GCCGCCCCGC TGGATCTGGT CGGGATCAAC TACTACAGCC GGCACGTGGT CGCCGCGCCG
ACGGCGGCGG CCCCGCCGGA GCCGTACTGG CGCACACCCT CGTGCTGGCC GGGTAGCGAA
GACGTTCGAT TTGTCGCCCG GGGCATGCCG GTCACCGACA TGGACTGGGA GATCGATCCT
TCCGGCCTGG TCGAGACGTT GCAGCGGGTG TACGAGGAGT ACACCGACCT GCCGCTCTAC
GTCACCGAGA ACGGTTCGGC CTTCGTAGAC GCGGTCGTTG ACGGGAAGGT GGACGACCCG
GACCGCGTCG CCTACTTCGA GGCGCACCTA CGCGCCGCAC ACCAGGCGAT CGCCGCGGGT
GTTCCGCTGC GCGGCTACTT CGCCTGGTCG TTGATGGATA ACTTCGAGTG GGCCTGGGGC
TACACGAAGC GGTTCGGCAT GATCCACGTC GACTACCGCA GTCAGGCCCG CACCCTGAAA
TCCAGCGGTC GGTGGTACGC CGAGACCATC CGACGTAACG GTCTGGCCGC ACAATAG
 
Protein sequence
MSNPASPPAV GVLAERPPLA FPPGFLWGAA TAAYQIEGAA TAGGRTPSIW DTFSHTPGRV 
VAGHTGDVAC DHYHRLDSDV ALMAELGLRS YRFSVSWPRV QPGGTGPINQ EGLDFYRRLV
DQLLANGIEP WLTLYHWDLP QPLEDAGGWP ARDTAARFAD YAALVAGALG DRVRYWTTLN
EPWCSAFLGY GSGAHAPGRS DPAAAVRAGH HLLLGHGLAV PALRAAAQSE VEIGVTLNLY
PVTPATDSPG DADAARRIDG LANRFFLDPL LRGSYPADLM SDLRQVSDFG HVRAGDLATI
AAPLDLVGIN YYSRHVVAAP TAAAPPEPYW RTPSCWPGSE DVRFVARGMP VTDMDWEIDP
SGLVETLQRV YEEYTDLPLY VTENGSAFVD AVVDGKVDDP DRVAYFEAHL RAAHQAIAAG
VPLRGYFAWS LMDNFEWAWG YTKRFGMIHV DYRSQARTLK SSGRWYAETI RRNGLAAQ
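
The sequences above are printed in blocks of ten characters separated by spaces, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence might be cleaned and its GC fraction computed; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Join a space/newline-blocked sequence into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence from this record, used as sample input.
sample = "GTGAGTAATC CCGCGAGCCC ACCCGCCGTG GGCGTTCTCG CCGAACGCCC GCCACTGGCC"
seq = clean_sequence(sample)
print(len(seq))                    # 60
print(round(gc_fraction(seq), 2))  # GC fraction of the sample line only
```

Passing the full gene sequence to the same functions would give a value to compare against the GC content field; the sample line alone is not expected to match it exactly.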