Gene Dshi_3100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDshi_3100 
SymbolubiA 
ID5710952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDinoroseobacter shibae DFL 12 
KingdomBacteria 
Replicon accessionNC_009952 
Strand
Start bp3267233 
End bp3268201 
Gene Length969 bp 
Protein Length322 aa 
Translation table11 
GC content69% 
IMG OID641269027 
Product4-hydroxybenzoate polyprenyltransferase 
Protein accessionYP_001534434 
Protein GI159045640 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0382] 4-hydroxybenzoate polyprenyltransferase and related prenyltransferases 
TIGRFAM ID[TIGR01474] 4-hydroxybenzoate polyprenyl transferase, proteobacterial 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACG AGGCGCAGAC ACCAGACGGC GGAACCGTCG CCGACGCGCC CGCGGATAAC 
TGGGTCGACC GCTACGCGCC GCCCGCAACC CGCCCCTACC TGCGGCTCAG CCGCGCGGAC
CGGCCGATCG GCACCTGGCT TTTGCTGATC CCCTGCTGGT GGGGCGCTCT TCTGGGGGCG
GCGGCGGGTG ACGGGTTCGA CCGCGGCACT GCCTGGATCA TGGTCGGCTG CGGGATTGGC
GCCTTCCTGA TGCGCGGCGC GGGCTGCACC TGGAACGATA TCACCGACCG CGACTTCGAC
GCAAAGGTCG CACGCACGCG CTCCCGCCCG ATCCCGTCGG GTCAGGTCAG CGTGCGCGGC
GCGCTGGTCT GGATGGTGGT GCAGGCGCTG CTGGCCTTCG GCATCCTGCT GAGCTTCAAC
CTGCCCGCCA TCGGGCTGGG CATCGCCTCG CTGGCGCTGG TCTGCGTCTA TCCCTTCGCC
AAACGGTTCA CCTGGTGGCC GCAGATCTTC CTGGGGCTTG CGTTCAACTG GGGCGCGCTG
CTGGCCTGGA CGGCCGAGAC CGGTACTCTG CTCGATGCGC CCAGCGTGCT GCTCTATGTC
GCCGGGATCG CCTGGACCCT GTTCTACGAC ACGATCTACG CCCATCAGGA CACCGAGGAC
GATGCCCTGA TCGGCGTAAA ATCGACCGCG CGGCTTTTCG GCGATCGATC GCGGCGGTGG
TTGGGGCTGT TCCTGATCGC GGCAACCGTG CTGGCGGGCG GGGCCGTCAT TGCCGCGCTG
GCCCCACTGG ATGCGCCCCT GGCGCTCGCG CTCGGGCTCG GCGGGGTCTG GGCCTTCGGA
TGGCACATGG TGTGGCAGCT GCGGCAGCTC GACCCGAACG ACCCCGGCAA CTGCCTGCGG
CTGTTTCGGT CCAATCGGGA CGCCGGTCTG ATCCTTGCGC TGTTTCTCGC CCTGACCTTG
CTCGCTTGA
 
Protein sequence
MSNEAQTPDG GTVADAPADN WVDRYAPPAT RPYLRLSRAD RPIGTWLLLI PCWWGALLGA 
AAGDGFDRGT AWIMVGCGIG AFLMRGAGCT WNDITDRDFD AKVARTRSRP IPSGQVSVRG
ALVWMVVQAL LAFGILLSFN LPAIGLGIAS LALVCVYPFA KRFTWWPQIF LGLAFNWGAL
LAWTAETGTL LDAPSVLLYV AGIAWTLFYD TIYAHQDTED DALIGVKSTA RLFGDRSRRW
LGLFLIAATV LAGGAVIAAL APLDAPLALA LGLGGVWAFG WHMVWQLRQL DPNDPGNCLR
LFRSNRDAGL ILALFLALTL LA