Gene Nmul_A2377 details

Gene Information

Locus tag: Nmul_A2377
Symbol: ispG
ID: 3784968
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosospira multiformis ATCC 25196
Kingdom: Bacteria
Replicon accession: NC_007614
Strand:
Start bp: 2703718
End bp: 2704968
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 59%
IMG OID: 637812466
Product: 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase
Protein accession: YP_413058
Protein GI: 82703492
COG category: [I] Lipid transport and metabolism
COG ID: [COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis
TIGRFAM ID: [TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase
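
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The following is a minimal sketch of that arithmetic in Python. It assumes inclusive, 1-based coordinates and a single terminal stop codon that is not counted in the protein length; the record does not state either convention, and the helper names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length_bp = gene_length(2703718, 2704968)
print(length_bp)                  # 1251
print(protein_length(length_bp))  # 416
```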


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTATGG TTCAAAGTGC TTTTCCCCCA CGCCGGAACA GCGTGGGCGT TCAGGTAGGT 
TCGATCCGGA TCGGCGGGGG CGCGCCCATC GTAGTGCAGT CCATGACCAA TACCGATACG
GAAGATGAAA TCGCTACCAC CGTGCAAGTG GCCCAGCTTG CGCGTGCCGG ATCCGAACTC
GTGCGCATCA CCGTCAATAC GGCCGAAGCA GCCAGGGCGG TGCCGGGTAT CAGGGCGCGA
CTCGACGATA TGGGTTGCCA AGTTCCCCTG ATAGGCGATT TTCACTTCAA TGGCCATAAA
CTCGTGACTG AGTATCCCGG TTGCGCCCGT GCGCTGGCGA AATACCGTAT CAATCCCGGT
AATGTCGGGC ACGGAAAGAA ACGTGACGAA CAGTTTTCCA TTCTGATCGA AGCGGCCTGC
AAGTATGAAA AACCGGTGCG CATCGGGGTC AACTGGGGAA GCCTCGATCC AGAGCTGCTG
GCGCGCATGA TGGACGAGAA TGCCCGCTCC GGGGACCCGA GGGATGCCTC CGCGGTAATG
TACGAAGCCT TGATTACCTC TGCGCTTCAA AGCGCTGAGC GTGCGGAGGA GATCGGGCTG
GGGCGTGACA GAATCATATT GTCATGCAAG ATGAGCGGCG TGAGAGACCT CATTACCGTT
TATCGCGCCC TTGCGGCCCG CTGCGATTAT GCGCTGCACC TGGGGCTCAC CGAGGCGGGC
ATGGGTTCGA AGGGGATTGT TGCTTCCACG GCGGCATTGT CGGTACTGCT TCTCGAAGGT
ATCGGCGATA CGATACGGAT ATCGTTGACG CCCGAACCAG GCGGAGACCG CGCGCGCGAA
GTGGTCGTGG CCCAGGAGAT ACTGCAAACC ACCGGTTTGC GCGCTTTTGT GCCTCTGGTT
GCCGCCTGTC CCGGCTGCGG CCGTACCACC AGCACCTATT TTCAGGAGCT GGCGGAAAGC
ATCCAGGGCT ACGTGCGCGA GCAGATGCTG GTATGGCGCG AGGAATACGA AGGTGTGGAA
AATATGACCC TCGCTGTGAT GGGGTGCGTG GTCAATGGTC CCGGCGAAAG CAAGCATGCC
AATATCGGCA TCAGCCTGCC GGGCTCGGGG GAACGGCCTG TGGCGCCGGT ATTTGTGGAT
GGCCAGAAGG CTGTAACGCT GAAGGGCGAT AATATTGCAG GAGAGTTTCG CCAGATAGTC
GATGAATATG TGCAGATGAA ATACCCCAAG AAAGCAGTCG ATGCCCACTA G
 
Protein sequence
MSMVQSAFPP RRNSVGVQVG SIRIGGGAPI VVQSMTNTDT EDEIATTVQV AQLARAGSEL 
VRITVNTAEA ARAVPGIRAR LDDMGCQVPL IGDFHFNGHK LVTEYPGCAR ALAKYRINPG
NVGHGKKRDE QFSILIEAAC KYEKPVRIGV NWGSLDPELL ARMMDENARS GDPRDASAVM
YEALITSALQ SAERAEEIGL GRDRIILSCK MSGVRDLITV YRALAARCDY ALHLGLTEAG
MGSKGIVAST AALSVLLLEG IGDTIRISLT PEPGGDRARE VVVAQEILQT TGLRAFVPLV
AACPGCGRTT STYFQELAES IQGYVREQML VWREEYEGVE NMTLAVMGCV VNGPGESKHA
NIGISLPGSG ERPVAPVFVD GQKAVTLKGD NIAGEFRQIV DEYVQMKYPK KAVDAH
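
The sequences above are displayed in space-separated blocks of 10 residues. The following is a minimal Python sketch of stripping that grouping, computing a GC fraction, and translating DNA to protein. It is illustrative only: the function names are hypothetical, the example input is just the first displayed row of the gene sequence (60 bp), and the codon table is built from the amino-acid assignment string that NCBI publishes for translation table 11.

```python
BASES = "TCAG"
# Amino-acid assignments for NCBI translation table 11, codons ordered TTT, TTC, TTA, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display spacing and line breaks from a grouped sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_row = "ATGTCTATGG TTCAAAGTGC TTTTCCCCCA CGCCGGAACA GCGTGGGCGT TCAGGTAGGT"
print(translate(first_row))             # MSMVQSAFPPRRNSVGVQVG
print(f"{gc_fraction(first_row):.2f}")  # 0.55
```

The translated row matches the first 20 residues of the protein sequence shown above. The 0.55 GC fraction applies to this 60 bp row only, not to the full gene.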