Gene Strop_3384 details

Gene Information

Locus tag: Strop_3384
Symbol:
ID: 5059851
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 3882858
End bp: 3884585
Gene Length: 1728 bp
Protein Length: 575 aa
Translation table: 11
GC content: 73%
IMG OID: 640475633
Product: glycoside hydrolase family 3 protein
Protein accession: YP_001160195
Protein GI: 145595898
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
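
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3882858
END_BP = 3884585
REPORTED_GENE_LENGTH = 1728      # bp
REPORTED_PROTEIN_LENGTH = 575    # aa


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


gene_length = span_length(START_BP, END_BP)
protein_length = protein_length_from_cds(gene_length)
print(gene_length, protein_length)  # 1728 575
```

Under these assumptions both computed values agree with the reported Gene Length and Protein Length.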


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGATAC CTTCACGACG CGGCGGTGTC GCGGTCATCG CGCTGGCCGC ACTCACCGCC 
CTGCTTCTTG CCGGCTGCAC CGGCGGTCCG GGGCGCGCAC AGCCGACACC GGCGGCGAGG
GGCTCGGCGA CCGGCTCGTC GCCCGTGGGC CCGGTTCAGG ATCCGGTGGC CAGCGCCGCG
GCCCTGGTCA ACTCGCTGGC GGATGAGGAC CTCGTCGGAC AGGTGTTGAT GCCCTACGCC
TACGGTGCCG CCGCCGATCA GGTCTCGACC GGTTCGGCCG CCGGCAACCA GGAGCTGGCC
GGCGTCGACA CCCCCGCCCA GATGATCGCG AAGTACCGCC TCGGCGGGCT CATCCTCGTC
GGCTTCAGCG CGGACGACCC GACCAGCGGC AACCAGGAGA CCACCAACGT CGACAACCCG
GAACAGGTCC GGGCGCTGAC CACCGGGCTG CGGTCCGCCG CTGCCGACCT GGCCACCGGT
GCGGCGCCGT TCCTGATCGG CACCGACCAG GAGTACGGAG TGGTCACCCG GATCACCGAC
GGGGTCACCA TGTTGCCCAG CGCGTTGGCC ACGGGGGCGG CCGGCAAGCC TGAGCTGACC
GAGTCCGCCT GGCGGGCTGC CGGCACCGAA CTGGCCGCGA TGGGCGTCAA CGTGGACTTC
GCTCCCGTCG CCGACGTGCT TGTCACCCCC AGCACCGTGA TCGGGTCCCG GTCGTACGGC
GCCGACCCGT CGATGGTGGC CGAGCAGGTC AGTGGGGTGG TGCGCGGTCT GCAGGCGACC
GGCGTCGCGG CGACCCTGAA GCATTTTCCC GGCCACGGGC ACAGCGCCAC CGACTCCCAC
GAGGCGCTGC CGGTGCTGCC GCAGCCGCGC ACCGTGCTCG AGCTGGAGGC CTGGCGTCCC
TTCGCGGCCG GCATCGGGGC CGGCGCCCTC GCCGTGATGT CCGGGCACCT CGACGTCCGG
GCAGTCGACC CGGGCACCCC GGCAACGTTC TCGCACACCC TGCTTACCGA GGTGCTCCGC
GGCGAGCTCG GCTTTCAGGG AGTGGTGATC ACCGACGGGA TGAACATGGC CCCCGCCAAA
CGATGGTCGC CCGGTGAGGC CGCGGTCCGT GCCCTGAAGG CCGGCAACGA CCTGATCCTC
ATGCCGCCGC ACGTCGGCCA GGCGTATGAC GGGCTGCTCG CCGCGCTGCG CGACGGCTCG
CTGCCCCGGA CCCGGCTGGT GGAGGCGGTG ACCCGGGTGC TGACCATGAA GTTCACTCTG
GCCGGTGCGG CCGTCCCGGA ACTGGATGTC GTCGGCGCGC CGGCCCACCA GGCGGCGGCC
GTCGAGCTTG CCACCGCCGC GGTGACCGCG CTGCGTGGCC CGTGTGGCAG CCCGGTCTCC
GGGCCGGTGA CCGTGACCGC CTCCACTGGC CGGGAACACA CCCGGGCGAC GCTGATCAAG
AAGCTGACCG CGGCCGGGGT GCCGGTTGCC GACAGCGGCG GTGCCGTGGT CCACCTGGTC
GGCTACGGCG ACAGCGCCGA CGACCTGAGC GCCGACGCCG CCGTGACCGT TGCCATGGAT
ACCCCGTACC TGTTGGCCGA GGCGGACTCC CCGGCGTTGC TGGCAACCTA CTCGTCGAGC
CCGGCGTCGA TGACCGGACT GGCCCAGGTG CTGGCTGGCG AGGCCACTCC GGCCGGCAGC
GCACCGGTGC CGGTCTCCGG CCTGCCCGCC ACGACCTGCG GCAGCTGA
 
Protein sequence
MSIPSRRGGV AVIALAALTA LLLAGCTGGP GRAQPTPAAR GSATGSSPVG PVQDPVASAA 
ALVNSLADED LVGQVLMPYA YGAAADQVST GSAAGNQELA GVDTPAQMIA KYRLGGLILV
GFSADDPTSG NQETTNVDNP EQVRALTTGL RSAAADLATG AAPFLIGTDQ EYGVVTRITD
GVTMLPSALA TGAAGKPELT ESAWRAAGTE LAAMGVNVDF APVADVLVTP STVIGSRSYG
ADPSMVAEQV SGVVRGLQAT GVAATLKHFP GHGHSATDSH EALPVLPQPR TVLELEAWRP
FAAGIGAGAL AVMSGHLDVR AVDPGTPATF SHTLLTEVLR GELGFQGVVI TDGMNMAPAK
RWSPGEAAVR ALKAGNDLIL MPPHVGQAYD GLLAALRDGS LPRTRLVEAV TRVLTMKFTL
AGAAVPELDV VGAPAHQAAA VELATAAVTA LRGPCGSPVS GPVTVTASTG REHTRATLIK
KLTAAGVPVA DSGGAVVHLV GYGDSADDLS ADAAVTVAMD TPYLLAEADS PALLATYSSS
PASMTGLAQV LAGEATPAGS APVPVSGLPA TTCGS
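
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the database's own method: it assumes the gene sequence is given 5' to 3' on the coding strand, uses the standard amino acid assignments (which translation table 11 shares), and treats only ATG, GTG and TTG as start codons, a simplified subset of the table 11 initiators. The names `CODON_TABLE`, `START_CODONS` and `translate_cds` are hypothetical.

```python
from itertools import product

# Standard genetic code amino acid assignments, codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a simplified subset of the table 11 start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    A recognised start codon in the first position is read as methionine.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    residues = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 60 bases of the gene sequence in this record.
first_line = (
    "GTGTCGATAC CTTCACGACG CGGCGGTGTC "
    "GCGGTCATCG CGCTGGCCGC ACTCACCGCC"
)
print(translate_cds(first_line))  # MSIPSRRGGVAVIALAALTA
```

The leading GTG codon is read as M, which is why the protein sequence starts with methionine even though the gene does not start with ATG.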