Gene Caul_4587 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4587 
Symbol 
ID5902049 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4961453 
End bp4962703 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content71% 
IMG OID641565106 
ProductUbiH/UbiF/VisC/COQ6 family ubiquinone biosynthesis hydroxylase 
Protein accessionYP_001686205 
Protein GI167648542 
COG category[C] Energy production and conversion
[H] Coenzyme transport and metabolism 
COG ID[COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases 
TIGRFAM ID[TIGR01988] Ubiquinone biosynthesis hydroxylase, UbiH/UbiF/VisC/COQ6 family 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGACC ACGACGCCGA CGTCATCATC GCCGGAGCCG GCATGGCCGG AACTACCCTG 
GCGCTCGCCC TGGCCTCAGC CGGGGTCACG GCCGTGCTGG TCGATCCCCA GCCGTTCGAC
GCCCAGCTGG CCGAGAGCTT CGACGGGCGC TCGTCGGCGA TCGCCTATTC TTCGTTTCGC
CAGTGGCGGG CGATCGGGGC CGGCGAGGCC CTGGAGCCCC ACGCCCAGCG GATCGAGCAG
ATCCTGGTCA CCGACGGCAA GACGCCGGGT CCCGCCTCGG GCGGACCCTC GCCGTTCTTC
CTGCGCTTTG ATTCCAGCGA GATCGCCGAC CGCTCCGAGG GCGAACCCCT GGGCTACCTG
ATCGAGAACC GCCAGATCCG CACGGCCTTG GCCAAGACCG TGATCGACAA GGGGATCACG
GTGCTGGCCC CGGTCGCCGC CAAGGCGCTG GAGGTCACGG CGGCCGCCGC GACCCTGACC
CTGACCGACG GCCGCACGCT GTCGGCCCCT CTGGTGGTCA GCGCCGAGGG CCGGGGCAGC
GTGCTGCGCA AGGCCGCCGG CATCGGCGAC ATCGGCTGGA GCTACGGCCA GAGCGGAGTG
GTGGCCACCG TCCGCATGCA GCATCCGCAC CAGGGCGTGG CCCACGAATA TTTCCTGCCC
AGCGGCCCGT TCGCCATCCT TCCGCTCACG GACAATCGCG CCAGCCTGGT GTGGACCGAG
AGCACGCTGC GCGCCGAGGC CCTGCGCAAC GCGTCGCCGG AAGCGTTCCA CAGCCACCTG
ATGCGGCGGT TCGGCGAGTT CCTGGGCAAG GTCGAGATGG CCGGCCCGAC CTTCGTCTAT
CCGCTGTCGC TGTCGCTGGC CGAGCGACTG GTCGCCCCAC GCCTGGCCCT GATCGGTGAC
GCCGCCCACG GGGTGCACCC GATCGCCGGC CAGGGGCTGA ACCTGGGCCT CAAGGACGCC
GCCGCCCTGG CCGAGGTGGT GGCCGAGGCC CTGCGCAACG GCGAGGACAT CGGGGCCGAG
TCGACCCTGG AGCGCTACGC CCGCTGGCGG CGGTTCGACA ATGTCACCAA CGCCCTGGCC
TTCGACGGCT TCGTGCGCCT GTTCTCCAAC GACAACCCGC TGCTGCGCCT GGCGCGCGGC
GTGGGCCTGG CGGCGGTCAA CCGCATCGCC CCCGCGCGGC GGTTCTTCAT GCATGAAGCC
GGCGGCGGGG TGGGAGATCT GCCAAAGCTG CTGCGGGGCG TGGGGATTTA G
 
Protein sequence
MRDHDADVII AGAGMAGTTL ALALASAGVT AVLVDPQPFD AQLAESFDGR SSAIAYSSFR 
QWRAIGAGEA LEPHAQRIEQ ILVTDGKTPG PASGGPSPFF LRFDSSEIAD RSEGEPLGYL
IENRQIRTAL AKTVIDKGIT VLAPVAAKAL EVTAAAATLT LTDGRTLSAP LVVSAEGRGS
VLRKAAGIGD IGWSYGQSGV VATVRMQHPH QGVAHEYFLP SGPFAILPLT DNRASLVWTE
STLRAEALRN ASPEAFHSHL MRRFGEFLGK VEMAGPTFVY PLSLSLAERL VAPRLALIGD
AAHGVHPIAG QGLNLGLKDA AALAEVVAEA LRNGEDIGAE STLERYARWR RFDNVTNALA
FDGFVRLFSN DNPLLRLARG VGLAAVNRIA PARRFFMHEA GGGVGDLPKL LRGVGI