Gene Caul_0957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0957 
SymbolispG 
ID5898412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1005773 
End bp1006912 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content68% 
IMG OID641561439 
Product4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase 
Protein accessionYP_001682585 
Protein GI167644922 
COG category[I] Lipid transport and metabolism 
COG ID[COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 
TIGRFAM ID[TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.838599 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.683477 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACCG ACCACACCCA CCTCCGTCCG TGGCGCTCGA TCGAGCGCCG CAAGTCGCGC 
AAGATCCGCG TCGGCAATGT CGAGGTGGGC GGCGACGCGC CGATCACCGT CCAGTCGATG
ACCAACACCC TGACCAGCGA CGCCGCCGCG ACGCTGGAGC AGATCCGCCA ACTGGAAGAG
GCCGGCGCCG ACATTGTCCG CGTTTCGTGC CCCGACACCG ACTCGACGGC GGCCTTCAAG
ACCATCGCCC GCGAGAGCCG GGTGCCGCTC GTGGCCGACA TCCACTTCCA CTACAAGCGC
GGCATCGAGG CGGCGCAAAA CGGCGCGGCC TGCCTGCGGA TCAATCCGGG CAATATCGGC
AGCCCCGACC GCGTGCGCGA CGTCATCCAG GCGGCCCGCG ACCACGGCTG CTCGATGCGG
ATCGGCGTCA ACGCCGGCTC GCTGGAGCGC GAACTGCTGG AAAAGTACGG CGAGCCTTGC
CCCGACGCGA TGGTCGAGAG CGCCCTCAAC CACGCCCGCA TCCTGCAGGA CCACGACTTC
CACGAGTTCA AGATCAGCGT GAAGGCGTCC GACCCGTTCA TGACGGTGGC GGCCTATCAC
CAGCTGTCCG AGCGCATCGA CTGCCCGCTG CACCTGGGGG TCACCGAGGC CGGCGCCCTG
CGGACCGGCA CGGTGAAGTC GTCGATCGGC ATCGGCTCGA TGCTGTGGGC CGGCATCGGC
GACACCATCC GGGTGTCCCT GGCCGCCGAC CCGGTCGAGG AGATCAAGGT CGGCTTCGAT
ATCCTCAAGT CGCTGGGCCT GCGCCATCGC GGCGTCAACA TCATCGCCTG CCCGTCCTGC
GCCCGTCAGG GCTTCAACGT CATCAAGACG GTGGAGGCCT TGGAGCAGCG GCTGGCCCAC
ATCTCGCAAC CGATGTCGCT GTCGATCATC GGCTGCGTGG TCAACGGTCC CGGCGAGGCG
CTGATGACCG ACCTGGGTTT CACCGGCGGC GGGGCCGGGT CGGGCATGGT CTACATGGCC
GGCAAGCCCG ACCACAAGCA GTCCAACGAC GGCATGATCG ACCACATCGT CGAGCTGGTG
GAACAGCGCG CGGCCCTGCT GAAGGCCGCG GCCGACGCCG AGGCGATCGC GGCGGAGTAG
 
Protein sequence
MATDHTHLRP WRSIERRKSR KIRVGNVEVG GDAPITVQSM TNTLTSDAAA TLEQIRQLEE 
AGADIVRVSC PDTDSTAAFK TIARESRVPL VADIHFHYKR GIEAAQNGAA CLRINPGNIG
SPDRVRDVIQ AARDHGCSMR IGVNAGSLER ELLEKYGEPC PDAMVESALN HARILQDHDF
HEFKISVKAS DPFMTVAAYH QLSERIDCPL HLGVTEAGAL RTGTVKSSIG IGSMLWAGIG
DTIRVSLAAD PVEEIKVGFD ILKSLGLRHR GVNIIACPSC ARQGFNVIKT VEALEQRLAH
ISQPMSLSII GCVVNGPGEA LMTDLGFTGG GAGSGMVYMA GKPDHKQSND GMIDHIVELV
EQRAALLKAA ADAEAIAAE