Gene Athe_1808 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1808 
Symbol 
ID7408595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1882950 
End bp1883996 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content38% 
IMG OID643716185 
Product1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 
Protein accessionYP_002573674 
Protein GI222529792 
COG category[I] Lipid transport and metabolism 
COG ID[COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 
TIGRFAM ID[TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.59435 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAATTAT TGACAAAAAA GGTTAGAATA GGCAATCTTT ATATTGGTGG CGGTGAGCCA 
ATTAGAATTC AGTCAATGAC AAATACAAAG ACAAAGGATG TTGAGGCTAC AGTTGAGCAG
ATACTGAGCC TTGAGAGTTT GGGATGCGAC ATCATAAGAG TTGCTGTCCC TGATTTAGAT
AGTGCTAAGG CCATAAGTAA AATAAAGTCA AGAATCCACA TTCCACTTGT TGCTGACATT
CATTTTGACT ATAAGCTCGC GCTTGAAGCT ATATACAATG GCGCTGATAA GATTAGAATA
AATCCTGGGA ACATTGGAAA TGAAAGAAAA GTCCAAGAAA TAGCTAAAGA GGCCAAAAGA
TATGGGATTG CCGTCAGAGT TGGTGCAAAT TCAGGTTCGC TCCCAAAGGA TATTTTGCAA
AAATACAAAT CTCCTGTACC AGAGGCTATT GTGGAGGCTG CAATTTATCA GGTAAAACTT
CTTGAAAAGT TTGACTTTGA CAATATTGTT GTGTCTGTCA AATCTTCAGA TGTTTTAACT
ACAATTAAGA GCTATGAAAT ACTATCCCAA AACCTAAACT ATCCTCTTCA TGTTGGTCTT
ACCGAAGCAG GAACTTTTGT TGCAGGAACT GTTAAGTCAA GTATTGCAAT TGGCTATCTT
CTTTTGAGGG GAATTGGTGA TACAATAAGA GTTTCTCTTA CAGATAGTCC AGAGAAAGAG
GTTATTGTGG CAAAAGAGAT TTTAAAAAGT TTAAATCTCA GAAAAGGTGT GAAGATAGTA
TCATGTCCCA CCTGTGCAAG ATGTAATGTT GACCTTTTAA AGATTGCAGA TGAGGTTGAA
AAGAGAATAC AAAATTTGGA CTTGGACATT ACAGTCGCAA TAATGGGCTG TGCAGTAAAC
GGCCCTGGTG AGGCAAAAGA AGCTGATGTA GGTGTGGCAT GTGGCGTTGG TGAAGGACTT
CTGTTTAAGA AAGGCAAGAT TATAAGGAAA GTGAAAGAGA ATGAGATTGT AGATGAGCTT
GTAAAGGAAA TCTATTCTCT TTCTTAA
 
Protein sequence
MKLLTKKVRI GNLYIGGGEP IRIQSMTNTK TKDVEATVEQ ILSLESLGCD IIRVAVPDLD 
SAKAISKIKS RIHIPLVADI HFDYKLALEA IYNGADKIRI NPGNIGNERK VQEIAKEAKR
YGIAVRVGAN SGSLPKDILQ KYKSPVPEAI VEAAIYQVKL LEKFDFDNIV VSVKSSDVLT
TIKSYEILSQ NLNYPLHVGL TEAGTFVAGT VKSSIAIGYL LLRGIGDTIR VSLTDSPEKE
VIVAKEILKS LNLRKGVKIV SCPTCARCNV DLLKIADEVE KRIQNLDLDI TVAIMGCAVN
GPGEAKEADV GVACGVGEGL LFKKGKIIRK VKENEIVDEL VKEIYSLS