Gene Saro_0417 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0417 
SymbolispG 
ID3917563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp455622 
End bp456746 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content66% 
IMG OID640443146 
Product4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase 
Protein accessionYP_495699 
Protein GI87198442 
COG category[I] Lipid transport and metabolism 
COG ID[COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 
TIGRFAM ID[TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATCGG TACGTCCCTG GCGCGATATC GCGCGTCGCA AGAGCCGCCA GATCATGGTC 
GGCACGGTCC CCGTCGGCGG CGATGCCCCG ATCACCGTGC AGACCATGAC CAACACCCCG
ACGTCCGACG CCGTCGCCAC GATCGACCAG ATCCGTCGCT GCGAGGAAGC GGGCGCCGAT
CTTATCCGCG TGTCCTGTCC CGACGTGGAA AGCACAGCGG CCTTCCGCCA GATCGCCCGG
GCCGCCCGGG TTCCGCTGAT CGCGGACATC CACTTCCACT ACAAGCGCGC GCTCGAAGCG
GCCGATGCGG GGGCCGCGTG CCTGCGCATC AATCCGGGCA ACATCGGCAG CAGCGACCGC
GTGGCCGAAG TGGTCCGCGC CGCCAAGGCC AACGGCTGCG CGATCCGCAT CGGCGTCAAC
GCTGGCAGCC TCGAGAAAGA CCTGCTCGAA AAGTACGGCG AGCCCTGTCC CGAAGCGCTC
GTCGAATCCG CGCTCGACCA TATCAAGCTG CTGCAGGACC ACGATTTCCA CGAATACAAG
GTGGCGGTTA AGGCCTCCGA CGTGTTCCTC GCCGTCGCCG CCTACATGGG CCTTGCAGAA
GCGGTCGATT GCCCGCTGCA TCTTGGCATT ACCGAGGCAG GCGGGCTGAT CGGCGGGACG
GTGAAATCGT CCGTCGGCAT CGGCAACCTG CTCTGGGCCG GCATCGGCGA CACCTTGCGC
GTCTCGCTTT CGGCCGAACC GGAAGAGGAA GTGCGCGTCG GGTTCGAGAT CCTCAAGACG
CTGGGCCTGC GCACGCGCGG CGTCCGCGTC GTGTCGTGTC CGTCCTGCGC TCGGCAGGGT
TTCGACGTGA TCCGGACCGT GGAGGCGCTG GAAAAGCGGC TGACGCACAT CAAGACGCCG
ATCTCGCTTT CCGTACTGGG CTGCGTCGTC AATGGACCGG GCGAAGCCCG CGAGACCGAT
ATCGGCCTCA CCGGCGGCGG CAACGGCAAG CACATGGTCT ATCTTTCGGG CGTGACCGAC
CACCACGTCC AGTCGGAGGA CATGCTAGAC CACATCGTCT CGCTGGTCGA ACAGAAGGCT
GCGGAAATGG AAGCCGCTGC CGCAGAAGCG GAAGCGGCAG CCTGA
 
Protein sequence
MSSVRPWRDI ARRKSRQIMV GTVPVGGDAP ITVQTMTNTP TSDAVATIDQ IRRCEEAGAD 
LIRVSCPDVE STAAFRQIAR AARVPLIADI HFHYKRALEA ADAGAACLRI NPGNIGSSDR
VAEVVRAAKA NGCAIRIGVN AGSLEKDLLE KYGEPCPEAL VESALDHIKL LQDHDFHEYK
VAVKASDVFL AVAAYMGLAE AVDCPLHLGI TEAGGLIGGT VKSSVGIGNL LWAGIGDTLR
VSLSAEPEEE VRVGFEILKT LGLRTRGVRV VSCPSCARQG FDVIRTVEAL EKRLTHIKTP
ISLSVLGCVV NGPGEARETD IGLTGGGNGK HMVYLSGVTD HHVQSEDMLD HIVSLVEQKA
AEMEAAAAEA EAAA