Gene Caul_4391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4391 
Symbol 
ID5901852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4766264 
End bp4767235 
Gene Length972 bp 
Protein Length323 aa 
Translation table11 
GC content68% 
IMG OID641564909 
Producthydroxymethylbutenyl pyrophosphate reductase 
Protein accessionYP_001686009 
Protein GI167648346 
COG category[I] Lipid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0761] Penicillin tolerance protein 
TIGRFAM ID[TIGR00216] (E)-4-hydroxy-3-methyl-but-2-enyl pyrophosphate reductase (IPP and DMAPP forming) 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.791533 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGTCC ATTCGCCTTC CCCTCGTCCT GGCCGCCCCC CGCTGCGCGT CGTCCTGGCC 
AGTCCGCGCG GCTTCTGCGC CGGGGTCGAT CGGGCGATCC AGATCGTCGA GCGGACGATC
GAAAAGTTCG GCGCGCCGGT CTTCGTGCGC CACGAGATCG TCCATAACCG TCATGTGGTC
GACCGGCTGA AGGCCCTGGG CGCGGTGTTC GTCGAGGAGC TGGACGAGGT TCCGGAAGAC
CGCCCGGTGG TGTTCTCGGC CCACGGCGTG CCCAAGACCG TGCCGGCCGC CGCCAAGGCG
CGCGAGATGA TCTATCTGGA CGCTACCTGC CCCCTGGTCT CCAAAGTCCA TGTCGAGGCC
CAGAAGCATT TCGACGCCGG TCGCGAGATC GTGCTGATCG GCCACGCCGG CCACCCCGAG
GTGGTCGGCA CCATGGGCCA ACTTCCGGAG GGCACCGTCA CCCTGATCGA GGACATCGAC
GACGCCCACG CCTGGATTCC CAAGGACCCG GCCAACGTCG CCTTCCTCAC CCAGACCACC
CTGTCGGTCG ACGACACCGC CGAGATGGTC GACCTGCTCA AGCAGCGCTT CCCGGGCATC
GCCGCCCCGC ACAAGGAAGA CATCTGCTAC GCCACCACCA ACCGCCAGGA GGCGGTGAAG
ATGCTGGCCG AAGTCTCGGA CTTGATCCTG GTGGTCGGCT CGAAGAACTC GTCTAACTCG
GTGCGCCTGA TGGAGGTCGG CAAGCGGGCC GGCGCCCGGG ACGCCAGGCT GATCGACGAC
GCCCGGGGCA TCGACTGGAG CTGGTTCCAG GGCGTGGAAC GGGTCGGCGT CACGGCCGGC
GCCTCGGCCC CGGAAGACCT GGTCCAGGGC GTGCTCGACG CCATCGCCAC GCGCTACGAC
GTCACCATCG AGGAACTGAT CGAGGCCCGC GAGACGGTGA TCTTCAAGCT ACCGAGGCTG
CTGACGGCCT AG
 
Protein sequence
MNVHSPSPRP GRPPLRVVLA SPRGFCAGVD RAIQIVERTI EKFGAPVFVR HEIVHNRHVV 
DRLKALGAVF VEELDEVPED RPVVFSAHGV PKTVPAAAKA REMIYLDATC PLVSKVHVEA
QKHFDAGREI VLIGHAGHPE VVGTMGQLPE GTVTLIEDID DAHAWIPKDP ANVAFLTQTT
LSVDDTAEMV DLLKQRFPGI AAPHKEDICY ATTNRQEAVK MLAEVSDLIL VVGSKNSSNS
VRLMEVGKRA GARDARLIDD ARGIDWSWFQ GVERVGVTAG ASAPEDLVQG VLDAIATRYD
VTIEELIEAR ETVIFKLPRL LTA