Gene Amuc_1646 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1646 
Symbol 
ID6274630 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1989053 
End bp1990252 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content54% 
IMG OID642613706 
Product4-hydroxy-3-methylbut-2-enyl diphosphate reductase 
Protein accessionYP_001878247 
Protein GI187736135 
COG category[I] Lipid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0761] Penicillin tolerance protein 
TIGRFAM ID[TIGR00216] (E)-4-hydroxy-3-methyl-but-2-enyl pyrophosphate reductase (IPP and DMAPP forming) 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.182674 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCAGG ACTCAACCCG CAAAGCCAGA GTGAATGTGA GACGGGCCGA AGTAATGGAG 
CAGGTGGAAA AGGAAATCCA GCAGCATTAC CAGAGTGAAT TGATTTCCCA CATCCGTTCT
GCGGGCAATG TGTACAATCT GGGCCATACG GAATTTTTTC TCGCCCGGGA GTTCGGGTTC
TGCAACGGCG TGCGCCGTGC TATTGATATA GCATATGCGG CACGCAGAGT GTTTCCTGAC
CGGCGGATTT TTCTGATTGG GGACATTATT CACAATCCGG AAGTGAACCG GCAGCTGGAG
GAAATGGGCA TTCGAAAACT TCCCTGGAAG CAGTTGGATT CCTCCTATGA CCGGGTTGCT
CCGGACGATG TGGTGATTAT TCCCGCGTTC GGTGTTCCCA CTCCTTTCAT GGATGCGCTG
GAGGAGAAGG GCGTGCAGAT TGTGGATACG ACATGCGGGG ACGTGATGAA GGTCTGGAAA
AGAGTGAAGA ATTACGCCGC CATGGGGATT ACCTCCATTA TCCACGGTAA GGCTACCCAT
GAGGAAACCA GCGCTACGGC TTCCCGGGCT CTTGGGGAAC GGGGAAGGGG GAAATACCTG
GTGGTTTACG ATTTGGAGGA TGCCCGTATC CTGTGCGACT ACATCATGGG CCGCGGAGAC
CGCGAGGCAT TCCTGAAACG GTTTGAAGGA TGCTGTTCCC CGGGATTCGA TCCCGACCGG
GATCTGGAGG AGGTTGGCAT CGCCAACCAG ACCACCATGT TGAAAACGGA GACGCAGACG
CTCCAGAAGA TGGTGAAGGA TGCCATTGTT CAGAGGGATG GGGACGATGA TAATTTTTAT
GTGTTTGACA CCATTTGCGG TGCTACCCAG GATCGCCAGG ATGCCCTGTA TGAACTGCTT
AAAAATCCTC TGGACGTCAT GTTTGTGGTG GGTGGCTACA ACAGTTCCAA CACAACGCAT
CTGGTGGATA TTGCCAGGGA GCATGTGCCC ACGTACTTCA TTGAGTCCGC AGAATGCATC
AAGTCCATCC AGTATGTGGA TGCTTTTGAT ACGAAGACGC GGGAAGTGCG CCGCATGACT
ACGGAACCGG TAGTGCAGAA TCTGGGCAAA TCCCTGAAGG TGGGAATTAC GGCGGGCGCC
TCATGTCCGG CCAACCTGAT TGAGGCCACC ATCCTCCGCA TTGCGGATCT GCGCAAGTAG
 
Protein sequence
MNQDSTRKAR VNVRRAEVME QVEKEIQQHY QSELISHIRS AGNVYNLGHT EFFLAREFGF 
CNGVRRAIDI AYAARRVFPD RRIFLIGDII HNPEVNRQLE EMGIRKLPWK QLDSSYDRVA
PDDVVIIPAF GVPTPFMDAL EEKGVQIVDT TCGDVMKVWK RVKNYAAMGI TSIIHGKATH
EETSATASRA LGERGRGKYL VVYDLEDARI LCDYIMGRGD REAFLKRFEG CCSPGFDPDR
DLEEVGIANQ TTMLKTETQT LQKMVKDAIV QRDGDDDNFY VFDTICGATQ DRQDALYELL
KNPLDVMFVV GGYNSSNTTH LVDIAREHVP TYFIESAECI KSIQYVDAFD TKTREVRRMT
TEPVVQNLGK SLKVGITAGA SCPANLIEAT ILRIADLRK