Gene Apre_0597 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_0597 
Symbol 
ID8397377 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp694434 
End bp695504 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content40% 
IMG OID644994954 
Product1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 
Protein accessionYP_003152360 
Protein GI257066104 
COG category[I] Lipid transport and metabolism 
COG ID[COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 
TIGRFAM ID[TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAGA AAACTAAGAA GATTTATGTA GGAGATGTTG CAGTAGGAGG CGACTCTCCT 
ATTTCTGTTC AATCAATGAC AACAGCAAAG ACAAGCGATA TCGAAAAGGT AGTAGGGCAG
ATCAATGCCC TAGAAGAAGC GGGTTGCGAT ATAGCAAGAT CTGCTATTAA CTCCATAGAA
GATGCCAAGG CAATTGTTGA GATTAAGAAA AGAACTAATA TACCACTTGT TGCAGACATT
CAGTTCGACT ATAAGCTTGC CCTTGCAGCG GTAGAATATG GTTGTGATTG TCTAAGATAC
AATCCTGGTA ATATAGGAGG TAGTGATAAG GTTAAGCTTC TTGTAGATAA GTGCAAGGAG
AAAAACATCC CTATAAGGAT TGGGGTAAAC TCAGGCTCTA TATCGAGAGA AATCGTAGAT
AGATTCGGTG GAGTAAACGC AGACTCCCTA GTAGCAAGTG CCCTAGAGGA AGTAAAGATC
CTAGAGGAGA TGGACTTTAC CGATATCAAA ATTTCTGTTA AGTCAAGCGA TGTAAATACA
ATGATCGATG CCTACAGGAA ATTATCAGAT AAGGTTGACT ACCCACTCCA CCTTGGTGTA
ACAGAAGCAG GTCCCTTGTA CCAAGCCCTT GTCAAATCCT CTATCGGCAT AGGTTCTTTA
CTAAAAGATG GGATAGGAGA TACTATCAGA GTTTCAATCA CAGGAGATAT TCTAGAAGAA
GTTAAGGCAG GAAAGGCAAT CCTTAAGGCC CTTAACCTAA GACGAGATGG CCTAGATATA
GTATCTTGTC CAACATGCTC AAGAACTACA GTAAATCTTC ATGAAATTGT AAAGGAAGTA
GAAGAGAAGA GTGGGGGTCT AGATATCTCT GCTAAGGTTG CCATCATGGG TTGTCCAGTA
AATGGACCAG GAGAGAGTAA GGAAGCAGAA TATGGAATTT CTGCAGCAAA TGGCATGGGC
TTTCTCTTTA AGAATGGCAA GACTATTAAG AAAGTTAGAG AAGATGAGAT AGTAGATACT
TTGATAGAGA CTCTCAAAGA AAGCAAAGAA GATGAGAATA GACCTTCATA A
 
Protein sequence
MRKKTKKIYV GDVAVGGDSP ISVQSMTTAK TSDIEKVVGQ INALEEAGCD IARSAINSIE 
DAKAIVEIKK RTNIPLVADI QFDYKLALAA VEYGCDCLRY NPGNIGGSDK VKLLVDKCKE
KNIPIRIGVN SGSISREIVD RFGGVNADSL VASALEEVKI LEEMDFTDIK ISVKSSDVNT
MIDAYRKLSD KVDYPLHLGV TEAGPLYQAL VKSSIGIGSL LKDGIGDTIR VSITGDILEE
VKAGKAILKA LNLRRDGLDI VSCPTCSRTT VNLHEIVKEV EEKSGGLDIS AKVAIMGCPV
NGPGESKEAE YGISAANGMG FLFKNGKTIK KVREDEIVDT LIETLKESKE DENRPS