Gene NSE_0799 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNSE_0799 
SymbolispG 
ID3931584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNeorickettsia sennetsu str. Miyayama 
KingdomBacteria 
Replicon accessionNC_007798 
Strand
Start bp710334 
End bp711578 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content42% 
IMG OID637900955 
Product4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase 
Protein accessionYP_506674 
Protein GI88608788 
COG category[I] Lipid transport and metabolism 
COG ID[COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 
TIGRFAM ID[TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGAGC TCGATAGGAA GTCGCTTGTT GTTGATGTAG CGGGCGTTAG GATAGGGGGG 
GGAAACCCGA TAGTTGTGCA GTCCATGACC TCTGGCAGCA GAACAAATCC AAATGATGTA
TCAGCAATAG CTGTAGATGA ATCCCGCGAG GCTATAGCTT TGGCTAAGGC CGGCTCTGAG
CTAATAAGGA TTGCCATAAA TAGTGAGAAG GCTGCAGCGG CGATTCCTTA TATAAGGGAA
AATCTAGATA AATCCGGCTT TGAAAAAGTT CCGCTTGTTG GATGTGGTCA GTATGAGATT
AAGCGCGTAC TTGAAACACA GACAGAAGCT ATCAGATCCT TGGGTAAGAT AAGAATTAAT
CCAGGTAATA TAGGATTTGG TTCAAAAAGA GATACAAATT TTGACAAAGT AATTGAGTTG
ATTTGCAAAC ATGATCTCCC CATTAGAATT GGCGTTAACT GGGGAAGTCT AGATCAATCG
GTGCTGCGTC AAATAATGGA TGATAATGCA AAAAATGAGA AACCCAGCTC CTATAATGGA
GTACTGAGAT GTGCATTGAT TCAGTCTGCT TTGGCTAGTG CGAAGCGTGC TGAGGAAGTT
GGACTATCAC CAGATAAAAT TATACTTTCG TGTAAGGTAA GCAGTTTTCA GGATTTAGTA
GCGGTTTACT CCTCCTTGGC CGAACAATGT CGTTATCCAC TGCACTTGGG TTTGACAGAA
GCAGGTATGG GTACTTCAGG AATAATAAAA ACTACTGCAG CACTCTCAGT ACTCCTTAGT
AGGGGTATTG GTGATACAAT ACGTGCTTCT TTAACACAGA AACCTGGTGA ATCAAGGGTA
ATAGAAGTAG AAACCTGTCA GTTGATATTG CAGTCAATGG GTTTAAGAGT CTTTGTTCCT
CAGGTCACTT CATGTCCTGG TTGTGGAAGG ACGAATGGAA ATTATTTCCA GCAGATTAGC
AGTGACCTGA ATGACTTCAT AAAGGATAAT CTAGTCGATT GGAAGAGGCT TTATCTAGGT
GTTGAGAATT TCAAACTTGC TGTAATGGGT TGTATTGTTA ATGGTCCGGG TGAGAGTAAA
CATGCAGACG TGGGTATTAG CTTGCCTGGT TATAATGAAA ATATGGTAGC GGCGGTTTTT
ATTGACGGAA AACCTTCAGC AAAACTTATT GGCGAAAACA TTCTCGAGGA ATCAAAGCGG
ATCATTTTAG AGTACATCAA GAACAAATAT GCTCCACGAA GTTAA
 
Protein sequence
MSELDRKSLV VDVAGVRIGG GNPIVVQSMT SGSRTNPNDV SAIAVDESRE AIALAKAGSE 
LIRIAINSEK AAAAIPYIRE NLDKSGFEKV PLVGCGQYEI KRVLETQTEA IRSLGKIRIN
PGNIGFGSKR DTNFDKVIEL ICKHDLPIRI GVNWGSLDQS VLRQIMDDNA KNEKPSSYNG
VLRCALIQSA LASAKRAEEV GLSPDKIILS CKVSSFQDLV AVYSSLAEQC RYPLHLGLTE
AGMGTSGIIK TTAALSVLLS RGIGDTIRAS LTQKPGESRV IEVETCQLIL QSMGLRVFVP
QVTSCPGCGR TNGNYFQQIS SDLNDFIKDN LVDWKRLYLG VENFKLAVMG CIVNGPGESK
HADVGISLPG YNENMVAAVF IDGKPSAKLI GENILEESKR IILEYIKNKY APRS