Gene Amuc_1388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1388 
Symbol 
ID6275639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1657289 
End bp1659046 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content59% 
IMG OID642613445 
Product1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 
Protein accessionYP_001877993 
Protein GI187735881 
COG category[I] Lipid transport and metabolism 
COG ID[COG0821] Enzyme involved in the deoxyxylulose pathway of isoprenoid biosynthesis 
TIGRFAM ID[TIGR00612] 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.715567 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.0640691 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGTCTT CCTATTGTCC GAGTCCTTAT CGATATACGC GCCGCGTAAC CCGTGAAGTC 
ATGGTGGGGA ATGTGGGGGT GGGCGGATCC AATCCCATCC GGATCCAGTC CATGCTGACG
TCCGATACGC GGGATACGGA TGCCTGCGTG AAGGAGGCTT TGGAGCTGGC GGAGGCAGGG
TGCGAGATTA TCCGCCTGAC CGCCCAGACC AAGGCGTATG CCGCCAATTT GGAGAATATT
GCCCGGGAAT TGCGCGCTGC CGGCTGCCAT GTGCCTCTGG TGGCCGATAT CCACTTCAAG
CCGGATGCCG CGATGGAGGC TGCCAAATGG GTGGAGAAGA TTCGTATTAA TCCGGGCAAT
TTCGTTGATA AGAAGAAGTT TGAAGTGCGG GAGTATTCCG ACGCCGAATA CCGCGAGGAG
CTGGACCGCC TGAAGGAAGA ATTTACGCCC CTGGTTCTGT TTTGCCGGGA GCATGGCCGC
GCGATGCGCA TCGGTTCCAA CCATGGCTCC CTGTCCGACC GCATTCTGAA CCGCTTTGGC
GATACGCCGG AGGGGATGGT GGAGAGCGCG ATTGAGTTTG CCCAGATTGC CCGCGACCTG
GATTACCATT CCCTGGTGTT TTCCATGAAG GCTTCCAACG TCAAGGTGAT GGTGGCCGCT
TACCGATTGC TGGTGGAGCG CATGAACGCC CTGGGGCCGG ATTGGAATTA TCCCATTCAT
CTGGGGGTGA CGGAAGCCGG GGGCGGAGAG GACGGCCGCA TCAAGAGCGC CGTGGGCATC
GGCTCCCTGC TGACGGACGG CATTGGTGAT ACCCTGCGCG TTTCCCTGAC GGAGGACGCT
GTGAGGGAGG TGCCCGTGGC TTACCGCCTG TCCAATCCTT TCCAGCCGTC GGAGCGTTCC
GATGACCCGG TTTCCTTCCC TGAACCGGAG TTGAGCTATG ATCCCCTGAA GTTTTCCAAA
AGGCAGGGTG GGCTGGCGAT GTATTATGGC GTACGCCTGG GCTGGGAACA GCCTGTGCGC
GTGGCGGTTC CTGACGCCGG GTTTTACGCC CTGCAGACAG AACGGGAGGC GATGGGGGAC
ATGATGCCTG AATTATCCCT GGGGCAGCTG GATGCCATTG AGGTGGATCC CCGGTGCGAT
GCCGATCTGG AGCCGTTGAA GGAGCTGGCG GAACCGTCTA TTGTTACTGT GAAGAACGGG
CTGGCTATGG AGCCTGTATA TGCGTTCCGC CTTCTGGCTG CCCGTATTGA GGACAGGCAT
CTGATCCTGC TGAAGGATAC GCTGGTGCCC GGTTCCGTTT CCGGGGAAGA CGTGCCGCTG
ACGGCTGCCC GCAATATCGG CTCCCTGCTG TGCGACGGGA TTGGAGACGC CGTGCTGATT
CAGGGCGAGT CGGACCCCCG TTTGGCTTCT TTCCTGGGAT TCAATATTTT GCAGGCTACG
GGAACGAGGC TGACGCGGGC GGATTACGTT TCCTGCCCGT CCTGCGGGCG TACCCTGTAC
AATATCCAGG AGGCGACGGC CCGCATCCGG AAAGCCACGG AACATCTGAA AGGGGTGAAG
ATTGCCGTGA TGGGATGTAT TGTGAATGGC CCCGGCGAGA TGGCGGATGC GGATTTCGGT
TATGTGGGCG GCGCGCCGAA CAAGATCAAC CTGTATGTGA AGCATACGCC TGTGAAGTTC
AATATTCCCC AGGAGGAGGC TGTGGAACGG CTGGTGGATC TGATCAAGGA GTATGGGCGG
TGGGTGGACC CCAAGTGA
 
Protein sequence
MQSSYCPSPY RYTRRVTREV MVGNVGVGGS NPIRIQSMLT SDTRDTDACV KEALELAEAG 
CEIIRLTAQT KAYAANLENI ARELRAAGCH VPLVADIHFK PDAAMEAAKW VEKIRINPGN
FVDKKKFEVR EYSDAEYREE LDRLKEEFTP LVLFCREHGR AMRIGSNHGS LSDRILNRFG
DTPEGMVESA IEFAQIARDL DYHSLVFSMK ASNVKVMVAA YRLLVERMNA LGPDWNYPIH
LGVTEAGGGE DGRIKSAVGI GSLLTDGIGD TLRVSLTEDA VREVPVAYRL SNPFQPSERS
DDPVSFPEPE LSYDPLKFSK RQGGLAMYYG VRLGWEQPVR VAVPDAGFYA LQTEREAMGD
MMPELSLGQL DAIEVDPRCD ADLEPLKELA EPSIVTVKNG LAMEPVYAFR LLAARIEDRH
LILLKDTLVP GSVSGEDVPL TAARNIGSLL CDGIGDAVLI QGESDPRLAS FLGFNILQAT
GTRLTRADYV SCPSCGRTLY NIQEATARIR KATEHLKGVK IAVMGCIVNG PGEMADADFG
YVGGAPNKIN LYVKHTPVKF NIPQEEAVER LVDLIKEYGR WVDPK