Gene Amuc_1693 details

Gene Information

Locus tag: Amuc_1693
Symbol:
ID: 6274617
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2056673
End bp: 2059438
Gene Length: 2766 bp
Protein Length: 921 aa
Translation table: 11
GC content: 61%
IMG OID: 642613752
Product: 2-oxoglutarate dehydrogenase, E1 subunit
Protein accession: YP_001878292
Protein GI: 187736180
COG category: [C] Energy production and conversion
COG ID: [COG0567] 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes
TIGRFAM ID: [TIGR00239] 2-oxoglutarate dehydrogenase, E1 component
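
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a single terminal stop codon; both are assumptions about this record, although they agree with the listed values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, not counting the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2056673, 2059438)
print(length_bp)                  # 2766
print(protein_length(length_bp))  # 921
```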


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.00326952
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAATGCCT CCATCTTCTC CAAACTCCCG CCGGAGGAAA TTACGGCGCT CCATCAATCA 
TGGAAAAACG ATCCCCTGTC CGTGGACCCT CTCTGGGCCG CCTGGTTCGA CGGCTACGAA
CTGGGGAGCG GTGGAGCCCC CCGGGAAAAA GAAAATGCGG GGAACGCCGC GGACACATCC
CCCTACACCG TTCCTCCGGA AAGCGCGGAA CGCCGCGGGC GCGTCAACCA GCTCATCCGG
GCCTACCGGG TCATGGGACA CCAATGCGCC CGCTTTAACC CGCTGGCTCC TCCGGACCAG
ACCGGTTGCC CCGTCAATCC GGAAGATATG GGATTCCGAG AGGAGGATAT GGACCAGCCC
GTCAACATCG GCACCTTCAT GGACGGAGGA ACGTTCACGC TCCGTGAAAT CATCACTAGG
CTGCAAAAAA TCTATTGTGG AGCCATCGGA TTTGAATACC ACCACATAGA CAATCTTAAA
ATCCGCTCCT GGATTGAAGA AAAAATCCAG CTCCGGGCAA ACGGCGTGGA TTATGGCCCG
GAAGTCCGGA GAGAAGCCTT GTTCCATCTC TGCAAGGCGG AACTGTTTGA GGAATTCCTG
GGCAAGCGCT TCATGGGGGA AAAACGCTTT TCCCTGGAAG GAGGGGAAGG AGCCATCGTC
CTGCTGGACG CCATGATCAA GCGCTGCCCC GCCGCCGGAG TCTCTCACAT TGAAATGGGC
ATGGCCCACC GCGGAAGGCT GAATGTGCTC GCCAATATTC TCCACAAACC GCTGAAAACC
ATCTTCCGCG AATTCACGCC GGACTACCTG CCGGAATCCC CCATCGGCAG GAGCGACGTC
AAATACCACC TCGGTTACGC CGCCACACGT CATGTGGACG GTAAAGAACT CCATATCCGC
CTTTCCTCCA ATCCCAGCCA TCTGGAAGCC GTGTATCCTG TGGTGGAAGG CAGGGCCAGG
GCCATGCAGC ACAACCTGCA AGACGCGGAA CGCAAGCGTG TGCTGCCGCT CGTCCTGCAC
GGGGACGCCG CTTTCTCAGG ACAGGGCATC GTGGCGGAAG TACTCAATCT CTCCCTGCTG
AAAGGTTACC GCACGGGCGG CACCGTGCAC CTCGTCATCA ACAACCAGAT CGGCTTCACC
ACCAGCCCGG ACGAAGCCCG CTCCTCCCGC TACGCCACGG ACGTGGCGCA GATGCTCCAA
TCCCCCATCC TGCACATCAA CGGAGAAAGC CCGGAAGACC TGGTGTGGGC GGCGGACTTC
GCGCTCCAGT TCCGCCAGGA ATTCGGAAGG GACATCATTC TGGACATGTA CTGTTACCGC
CGTCTGGGCC ACAATGAAAC GGACCAGGCC GCCTTCACCG CGCCCATGCA GACCAAGCGG
ATTGAGGCGC GCCCCACCGC TGCCGCCCTG TATGGGACGG CGCTCAGAAA GAGAGGGGAA
TTGACGGAAC AACAGGAGCG GGACATTCGG AATGACCTTT GGGAAGGTAT GGAACAGGCC
TACCTGCAAA TGAAGGAAAA CCCCGCGGAC TACATCCTGC CCGCCACCGC GCAGGATGCG
GATGAAACGC CCCTCCCGCG CACCAGCACG CGGACGGGAA TCAGTCCGGA ACTGTTCCAG
CGCGTCGGAA GTATCCTTAC GGAACTTCCG GACAGCTTCA CGCCCCACCC CACGCTGGAA
AAACGCTTCC TGGCCCGCCG CCGGGAAGCC TTCCGGGAAG GCGGCCTGCT GGACTGGGCC
ATGGCGGAAT CCCTGGCATG GGGCAGCCTG CTCACGGAAA ACCACACCGT GCGCCTGTCC
GGACAGGACT GCCAGCGCGG CACCTTCTCC CAGCGGCACG CCGTCCTGCA CGACTTCAAT
GACGGCTCCC TGTACACTCC CCTGGAAAAA CTGAACCACG GCACCACCGC ATTCCGCATT
TACAACTCAT CCCTGTCGGA AGCCTCCGTA TTGGGCTTTG AATACGGCTA CGCGCTGGAA
AGTCCGGACG CGCTGGTCAT GTGGGAGGCC CAATTCGGGG ACTTTTCCAA CGGGGCCCAG
GTCATCGTGG ACCAATTCAT TGCTGCCGCA GAAGCCAAAT GGAACCAGAA GAACCGCATG
GTACTTCTGC TACCCCACGG TTATGAAGGA GCAGGCTCGG AACACTCCAG CGCCCGCATG
GAACGCTACC TACAGCTTTG TGCGGATGAC AACATGCAGG TCATCAATCC CACCACTCCG
GCCCAGTATT TCCACGCCCT GCGCCGCCAG GTGCACCGGA ACGTGCACAA ACCCCTGATT
ATTTTCACGC CCAAAAGCCT GCTCTCCCGT CCGGAGGCCG TCTCCCCGCA CCGGGAATTC
CTGGCCTCCA CCCGTTTCCG CGAAGTGCTG CCGGATCCGG ATACTCCCGC GCCGGACCAG
GTCACGCGCG CCGTCTTCTG CACCGGGAAA ATCTATTATG ACCTGGCTGC CTACCGCAGG
GAACGGAATA TTTCAGACAC GGTAATCATC CGTCTGGAAC AAATCTATCC TCTGGCGCAG
GAACAGCTTA CCTATCTGCT GGCGCCCTAC CAAAAAGTGC GCGACTTCGT CTGGTGCCAG
GAAGAACCCT CCAATATGGG GGCCTGGGGC CACCTGCGGA ACAGGCTGGG ACGCCTCTTC
GCCACTTCCT TCCGCTATGC GGGCCGCCCC TCCATGGCCT GCCCTGCGGA GGGGGCCAAA
GCGCTGCACG CCGCCGCACA AAAAAGACTC ATCGCCGCTG CCTTCGGGCC GCGGGCCCAA
TCCTGA
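
The gene sequence is printed in space-separated blocks of 10 bases, so the blocks need to be joined before any per-base statistic, such as the GC content listed under Gene Information, can be recomputed. A minimal sketch, assuming the listing contains only A, C, G, and T plus whitespace; the helper names are hypothetical:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence, used only as a short demo.
example = clean_sequence("ATGAATGCCT CCATCTTCTC")
print(len(example))  # 20
print(gc_percent(example))
```

Passing the full listing through `clean_sequence` and then `gc_percent` gives a value that can be compared with the rounded GC content in the record.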
 
Protein sequence
MNASIFSKLP PEEITALHQS WKNDPLSVDP LWAAWFDGYE LGSGGAPREK ENAGNAADTS 
PYTVPPESAE RRGRVNQLIR AYRVMGHQCA RFNPLAPPDQ TGCPVNPEDM GFREEDMDQP
VNIGTFMDGG TFTLREIITR LQKIYCGAIG FEYHHIDNLK IRSWIEEKIQ LRANGVDYGP
EVRREALFHL CKAELFEEFL GKRFMGEKRF SLEGGEGAIV LLDAMIKRCP AAGVSHIEMG
MAHRGRLNVL ANILHKPLKT IFREFTPDYL PESPIGRSDV KYHLGYAATR HVDGKELHIR
LSSNPSHLEA VYPVVEGRAR AMQHNLQDAE RKRVLPLVLH GDAAFSGQGI VAEVLNLSLL
KGYRTGGTVH LVINNQIGFT TSPDEARSSR YATDVAQMLQ SPILHINGES PEDLVWAADF
ALQFRQEFGR DIILDMYCYR RLGHNETDQA AFTAPMQTKR IEARPTAAAL YGTALRKRGE
LTEQQERDIR NDLWEGMEQA YLQMKENPAD YILPATAQDA DETPLPRTST RTGISPELFQ
RVGSILTELP DSFTPHPTLE KRFLARRREA FREGGLLDWA MAESLAWGSL LTENHTVRLS
GQDCQRGTFS QRHAVLHDFN DGSLYTPLEK LNHGTTAFRI YNSSLSEASV LGFEYGYALE
SPDALVMWEA QFGDFSNGAQ VIVDQFIAAA EAKWNQKNRM VLLLPHGYEG AGSEHSSARM
ERYLQLCADD NMQVINPTTP AQYFHALRRQ VHRNVHKPLI IFTPKSLLSR PEAVSPHREF
LASTRFREVL PDPDTPAPDQ VTRAVFCTGK IYYDLAAYRR ERNISDTVII RLEQIYPLAQ
EQLTYLLAPY QKVRDFVWCQ EEPSNMGAWG HLRNRLGRLF ATSFRYAGRP SMACPAEGAK
ALHAAAQKRL IAAAFGPRAQ S
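
The protein sequence can be compared with a translation of the gene sequence under translation table 11. The sketch below builds the codon table from the amino-acid string NCBI publishes for that table, with codons ordered over T, C, A, G. It is a simplified illustration: it does not handle the alternative start codons that table 11 allows (this gene begins with ATG, so that does not matter here), and it marks unrecognized codons as X, which is a choice made for this example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence up to the first stop codon."""
    seq = "".join(seq.split()).upper()  # drop block spacing and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence; compare with the start of the protein.
print(translate("ATGAATGCCT CCATCTTCTC"))  # MNASIF
```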