Gene GM21_1333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1333 
SymbolispG 
ID8136660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1566033 
End bp1567088 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content63% 
IMG OID644868947 
Product4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase 
Protein accessionYP_003021151 
Protein GI253699962 
COG category[I] Lipid transport and metabolism 
COG ID[COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 
TIGRFAM ID[TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.00000208857 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAAAAAA CGACCCGGCA GATCATGATA GGGAATATCC CCATCGGCGG AGGAGCTCCC 
TGCTCCGTCC AGTCCATGTG CTCGACCGAC ACCCGTGATG TCGCCGCGAC CCTTGGGCAG
ATCGGCCGGC TTGCCGCCGC CGGATGCGAG ATCGTGCGCT GCGCAGTACC CGATATGGAC
GCCGCCCTGG CCCTTGCCGC CATAAAATCC GGCTCCCCCA TGCCGCTCAT AGCTGATATC
CACTTCGACT ACAAGCTCGC CTTGAAGGCC CTGGAGTCCG GTGTTGGCGG GCTTCGTCTC
AATCCCGGCA ACATAGGCGA AAAGTGGAAG GTCGCCGAGG TGGTGAAAGC CGCGGCCGAG
CGCAACGTTC CCATCCGCAT CGGCGTCAAC GGCGGTTCGC TGGAAAAGGA ACTGCTGGTG
AAGTACGGGC ACCCGACTCC CGAGGCCATG GTCGAGTCGG CGCTGGGGCA CGTGCGGATC
CTGGAGGAGC TGGGCTATCA GCAGATAAAG ATATCGATCA AGGTCTCCGA CGTGTTGCGG
ACCCTGGAAG CGTACCGGCT CCTTTCCGAC GCCGTGGACT ACCCGCTGCA CATAGGCGTA
ACCGAAGCCG GAACCATCTT CGCCGGAACC GTCAAGTCCT CCGTTGGTCT CGGGATCCTT
CTGCACCAGG GGATCGGCGA CACCATGCGC GTCTCCCTCA CCGGCGACCC GGTTGACGAG
GTGCGGGTGG CGTACGACAT CCTCAAATCG CTCGGTTTGC GGACGCGCGG CATCAACTTC
GTCTCCTGCC CCACCTGCGG GCGCTGCCAG GTAAACCTGA TACCCGTAGC CGAGGAAGTA
GAGCGGCGCC TGGCGCATCT GGATACGACG ATCACCGTTG CCGTCATGGG ATGTTCCGTC
AACGGTCCAG GCGAGGCCCG CGAGGCGGAC TTCGGCATAG CCGGGGGTAG GGGAGAAGGG
CTTCTCTTCA GGCACGGCGA GATCCTGCGC AAGGTCCCCG AAGCCGAGCT AGCCGACGCC
CTGGTGGAAG AAGTTTTGAA GAACAGTCAG ACCTGA
 
Protein sequence
MKKTTRQIMI GNIPIGGGAP CSVQSMCSTD TRDVAATLGQ IGRLAAAGCE IVRCAVPDMD 
AALALAAIKS GSPMPLIADI HFDYKLALKA LESGVGGLRL NPGNIGEKWK VAEVVKAAAE
RNVPIRIGVN GGSLEKELLV KYGHPTPEAM VESALGHVRI LEELGYQQIK ISIKVSDVLR
TLEAYRLLSD AVDYPLHIGV TEAGTIFAGT VKSSVGLGIL LHQGIGDTMR VSLTGDPVDE
VRVAYDILKS LGLRTRGINF VSCPTCGRCQ VNLIPVAEEV ERRLAHLDTT ITVAVMGCSV
NGPGEAREAD FGIAGGRGEG LLFRHGEILR KVPEAELADA LVEEVLKNSQ T