Gene Syncc9605_1298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9605_1298 
SymbolispG 
ID3737241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9605 
KingdomBacteria 
Replicon accessionNC_007516 
Strand
Start bp1209698 
End bp1210894 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content60% 
IMG OID637775888 
Product4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase 
Protein accessionYP_381607 
Protein GI78212828 
COG category[I] Lipid transport and metabolism 
COG ID[COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 
TIGRFAM ID[TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGCCC TGGCTCGGCG CTACGACACC CAGATCCACC GCCGTGTGAC CCGCACTGTG 
ATGGTGGGTG ATGTGCCGGT AGGCAGCGAG CACCCGATCG TGGTGCAGTC GATGATCAAC
GAGGACACCC TCGATATCGA GGCTGCTGTA GCCGGCATCA TCCGCCTTGC CGAAGCCGGC
AGTGAGATCG TTCGGGTGAC GACGCCCTCA ATGGCCCACG CCAAGGCGAT GGGACAGATC
CGTAAGGAGC TTCGTCAGCG CGGCTGCAGC GTTCCCCTGG TGGCGGACGT TCACCACAAC
GGCGTCAAGA TCGCCCTGGA GGTCGCCCAG CACGTCGACA AAGTTCGGAT CAATCCCGGC
CTGTTCATTT TTGATAAGCC AGATCCGAAC CGCCAGGAGT TCAGCCCCGA AGAATTTGCT
GCCATCGGCC AGCGCATTCG TGAGACGTTT GAGCCCTTGG TGACCCTGCT GCGGGACCAG
AACAAAGCGC TTCGAATCGG TGTGAACCAT GGCTCCCTGG CGGAGCGGAT GCTGTTCACC
TACGGCGACA CCCCTGAGGG GATGGTCGAA TCAGCGATGG AATTCGTGCG CATCTGCCAC
GAGCTTGATT TTCACAACAT CCTGATTTCG ATGAAGGCCT CGCGGGCTCC TGTGATGCTC
GCGGCTTACC GCCTGATGGC GGACACCATG GACAAGGAAG GCTTCAATTA CCCCTTGCAC
TTAGGCGTGA CCGAAGCCGG CGATGGTGAT TACGGCCGCA TCAAGAGCAC CGCAGGCATT
GCCACTCTGC TGGCCGATGG ATTGGGAGAC ACCCTCCGGG TTTCCCTGAC GGAGGCCCCC
GAAAAAGAAA TCCCCGTCTG TTACTCGATT CTCCAATCCC TGGGTCTGCG CAAGACCATG
GTCGAGTACG TCGCCTGCCC CAGCTGCGGT CGCACCCTGT TCAATCTGGA GGAGGTGTTG
CACAAGGTTC GCAACGCCAC ATCCCACCTC ACGGGTCTGG ACATCGCCGT GATGGGGTGC
ATCGTCAATG GCCCTGGCGA AATGGCCGAC GCTGATTACG GCTACGTCGG CAAAACCCCT
GGCGTGATTT CGCTGTATCG CGGTCGTGAT GAAATCCGCA AGGTGCCTGA AGCTGAGGGC
GTTGAAGCCC TGATCCAGTT GATCAAAGAG GACGGTCGCT GGGTGGAGCC CGCCTGA
 
Protein sequence
MTALARRYDT QIHRRVTRTV MVGDVPVGSE HPIVVQSMIN EDTLDIEAAV AGIIRLAEAG 
SEIVRVTTPS MAHAKAMGQI RKELRQRGCS VPLVADVHHN GVKIALEVAQ HVDKVRINPG
LFIFDKPDPN RQEFSPEEFA AIGQRIRETF EPLVTLLRDQ NKALRIGVNH GSLAERMLFT
YGDTPEGMVE SAMEFVRICH ELDFHNILIS MKASRAPVML AAYRLMADTM DKEGFNYPLH
LGVTEAGDGD YGRIKSTAGI ATLLADGLGD TLRVSLTEAP EKEIPVCYSI LQSLGLRKTM
VEYVACPSCG RTLFNLEEVL HKVRNATSHL TGLDIAVMGC IVNGPGEMAD ADYGYVGKTP
GVISLYRGRD EIRKVPEAEG VEALIQLIKE DGRWVEPA