Gene Strop_4051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagStrop_4051 
Symbol 
ID5060533 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora tropica CNB-440 
KingdomBacteria 
Replicon accessionNC_009380 
Strand
Start bp4606573 
End bp4607643 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content72% 
IMG OID640476312 
Productpolyprenyl synthetase 
Protein accessionYP_001160859 
Protein GI145596562 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0142] Geranylgeranyl pyrophosphate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATCCGG CTGACAAGCT CTCGGGTGCC ACTGGCGTGG GCAGTCGCTC CGACCGGGGC 
GGCGCAGGTC AGTTGGGTGC CGTCGGGCTG CACCCAATCG ACGCGGGGCT CCCGGATTCA
GCGTTTCGGG TGCTGGAGGG GGTCGAGGCC GCGCTGCGGG CTGATGTCGC CAGCGCCGAC
CCGTTCGTCA CCGAGGCCGC CCGGCACCTC CTTGACGCCG GTGGCAAGCG GTTCCGCCCG
CTGCTGGTGG CGCTCGGCGC CCAGTTCGGG GATCCGACTC GGGAGCAGGT CGTGCCGGCC
GCCGTGGTGG TGGAGCTCAC CCACCTGGCC ACGCTTTACC ACGACGACGT CATGGACGAG
GCGCCGGTGC GCCGGGGGGC CCCGAGCGCC AACTCGCGGT GGACGAACTC GGTGGCCATC
CTGGTCGGTG ACTATCTCTT CGCCCGCGCC GCGGACATCT CCGCGGATCT GGGCACCGAG
GCGGTCCGAC TGCAGGCGCG GACCTTCGCG CGCTTGGTGC ACGGCCAGAT CGCCGAAACC
GTGGGGCCGC GTCCCGGTGT GGATCCGGTG GCGCACCACC TGCACGTGAT CGCTGAGAAG
ACCGGCTCGC TGATCGCTAC CGCGGCCCGG TTCGGTGGGA TGTTCAGCGG GGCCAGCCCG
ACGCACACCC AGGCACTGGC TGGTTACGGT GAGGCGATCG GGGTCGCCTT CCAGCTCTCC
GACGACCTGT TGGACATCTC CAGTGAGGCG GAGCGCTCCG GCAAGACGCC GGGGACCGAT
CTCCGTGAGG GTGTCCCCAC CCTGCCGGTG TTGTATGCAC TCGCCTCGGA CGACGCGGAC
GCCGCGTCGG TGCGGCTTCG GGAGGTCCTG GCGGTCGGTC CGCTGACCGA TGACGAACTG
CACGCCGAGG CGCTCGGACT GCTCCGGGAG AGCCCGGCGT TGAAGCGGGC GCGGGAGACG
GTCCGTAGCC GTGCCGAGGA AGCGCGCGCG CAGCTTGCGC CGCTGCCGCC GGGCCCGGCC
CGGCACGCGC TCGAATCCCT CTGCGACCAG ATCGCGGACC GGACCGGCTG A
 
Protein sequence
MNPADKLSGA TGVGSRSDRG GAGQLGAVGL HPIDAGLPDS AFRVLEGVEA ALRADVASAD 
PFVTEAARHL LDAGGKRFRP LLVALGAQFG DPTREQVVPA AVVVELTHLA TLYHDDVMDE
APVRRGAPSA NSRWTNSVAI LVGDYLFARA ADISADLGTE AVRLQARTFA RLVHGQIAET
VGPRPGVDPV AHHLHVIAEK TGSLIATAAR FGGMFSGASP THTQALAGYG EAIGVAFQLS
DDLLDISSEA ERSGKTPGTD LREGVPTLPV LYALASDDAD AASVRLREVL AVGPLTDDEL
HAEALGLLRE SPALKRARET VRSRAEEARA QLAPLPPGPA RHALESLCDQ IADRTG