Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1693 |
Symbol | |
ID | 6274617 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 2056673 |
End bp | 2059438 |
Gene Length | 2766 bp |
Protein Length | 921 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 642613752 |
Product | 2-oxoglutarate dehydrogenase, E1 subunit |
Protein accession | YP_001878292 |
Protein GI | 187736180 |
COG category | [C] Energy production and conversion |
COG ID | [COG0567] 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes |
TIGRFAM ID | [TIGR00239] 2-oxoglutarate dehydrogenase, E1 component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.00326952 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATGCCT CCATCTTCTC CAAACTCCCG CCGGAGGAAA TTACGGCGCT CCATCAATCA TGGAAAAACG ATCCCCTGTC CGTGGACCCT CTCTGGGCCG CCTGGTTCGA CGGCTACGAA CTGGGGAGCG GTGGAGCCCC CCGGGAAAAA GAAAATGCGG GGAACGCCGC GGACACATCC CCCTACACCG TTCCTCCGGA AAGCGCGGAA CGCCGCGGGC GCGTCAACCA GCTCATCCGG GCCTACCGGG TCATGGGACA CCAATGCGCC CGCTTTAACC CGCTGGCTCC TCCGGACCAG ACCGGTTGCC CCGTCAATCC GGAAGATATG GGATTCCGAG AGGAGGATAT GGACCAGCCC GTCAACATCG GCACCTTCAT GGACGGAGGA ACGTTCACGC TCCGTGAAAT CATCACTAGG CTGCAAAAAA TCTATTGTGG AGCCATCGGA TTTGAATACC ACCACATAGA CAATCTTAAA ATCCGCTCCT GGATTGAAGA AAAAATCCAG CTCCGGGCAA ACGGCGTGGA TTATGGCCCG GAAGTCCGGA GAGAAGCCTT GTTCCATCTC TGCAAGGCGG AACTGTTTGA GGAATTCCTG GGCAAGCGCT TCATGGGGGA AAAACGCTTT TCCCTGGAAG GAGGGGAAGG AGCCATCGTC CTGCTGGACG CCATGATCAA GCGCTGCCCC GCCGCCGGAG TCTCTCACAT TGAAATGGGC ATGGCCCACC GCGGAAGGCT GAATGTGCTC GCCAATATTC TCCACAAACC GCTGAAAACC ATCTTCCGCG AATTCACGCC GGACTACCTG CCGGAATCCC CCATCGGCAG GAGCGACGTC AAATACCACC TCGGTTACGC CGCCACACGT CATGTGGACG GTAAAGAACT CCATATCCGC CTTTCCTCCA ATCCCAGCCA TCTGGAAGCC GTGTATCCTG TGGTGGAAGG CAGGGCCAGG GCCATGCAGC ACAACCTGCA AGACGCGGAA CGCAAGCGTG TGCTGCCGCT CGTCCTGCAC GGGGACGCCG CTTTCTCAGG ACAGGGCATC GTGGCGGAAG TACTCAATCT CTCCCTGCTG AAAGGTTACC GCACGGGCGG CACCGTGCAC CTCGTCATCA ACAACCAGAT CGGCTTCACC ACCAGCCCGG ACGAAGCCCG CTCCTCCCGC TACGCCACGG ACGTGGCGCA GATGCTCCAA TCCCCCATCC TGCACATCAA CGGAGAAAGC CCGGAAGACC TGGTGTGGGC GGCGGACTTC GCGCTCCAGT TCCGCCAGGA ATTCGGAAGG GACATCATTC TGGACATGTA CTGTTACCGC CGTCTGGGCC ACAATGAAAC GGACCAGGCC GCCTTCACCG CGCCCATGCA GACCAAGCGG ATTGAGGCGC GCCCCACCGC TGCCGCCCTG TATGGGACGG CGCTCAGAAA GAGAGGGGAA TTGACGGAAC AACAGGAGCG GGACATTCGG AATGACCTTT GGGAAGGTAT GGAACAGGCC TACCTGCAAA TGAAGGAAAA CCCCGCGGAC TACATCCTGC CCGCCACCGC GCAGGATGCG GATGAAACGC CCCTCCCGCG CACCAGCACG CGGACGGGAA TCAGTCCGGA ACTGTTCCAG CGCGTCGGAA GTATCCTTAC GGAACTTCCG GACAGCTTCA CGCCCCACCC CACGCTGGAA AAACGCTTCC TGGCCCGCCG CCGGGAAGCC TTCCGGGAAG GCGGCCTGCT GGACTGGGCC ATGGCGGAAT CCCTGGCATG GGGCAGCCTG CTCACGGAAA ACCACACCGT GCGCCTGTCC GGACAGGACT GCCAGCGCGG CACCTTCTCC CAGCGGCACG CCGTCCTGCA CGACTTCAAT GACGGCTCCC TGTACACTCC CCTGGAAAAA CTGAACCACG GCACCACCGC ATTCCGCATT TACAACTCAT CCCTGTCGGA AGCCTCCGTA TTGGGCTTTG AATACGGCTA CGCGCTGGAA AGTCCGGACG CGCTGGTCAT GTGGGAGGCC CAATTCGGGG ACTTTTCCAA CGGGGCCCAG GTCATCGTGG ACCAATTCAT TGCTGCCGCA GAAGCCAAAT GGAACCAGAA GAACCGCATG GTACTTCTGC TACCCCACGG TTATGAAGGA GCAGGCTCGG AACACTCCAG CGCCCGCATG GAACGCTACC TACAGCTTTG TGCGGATGAC AACATGCAGG TCATCAATCC CACCACTCCG GCCCAGTATT TCCACGCCCT GCGCCGCCAG GTGCACCGGA ACGTGCACAA ACCCCTGATT ATTTTCACGC CCAAAAGCCT GCTCTCCCGT CCGGAGGCCG TCTCCCCGCA CCGGGAATTC CTGGCCTCCA CCCGTTTCCG CGAAGTGCTG CCGGATCCGG ATACTCCCGC GCCGGACCAG GTCACGCGCG CCGTCTTCTG CACCGGGAAA ATCTATTATG ACCTGGCTGC CTACCGCAGG GAACGGAATA TTTCAGACAC GGTAATCATC CGTCTGGAAC AAATCTATCC TCTGGCGCAG GAACAGCTTA CCTATCTGCT GGCGCCCTAC CAAAAAGTGC GCGACTTCGT CTGGTGCCAG GAAGAACCCT CCAATATGGG GGCCTGGGGC CACCTGCGGA ACAGGCTGGG ACGCCTCTTC GCCACTTCCT TCCGCTATGC GGGCCGCCCC TCCATGGCCT GCCCTGCGGA GGGGGCCAAA GCGCTGCACG CCGCCGCACA AAAAAGACTC ATCGCCGCTG CCTTCGGGCC GCGGGCCCAA TCCTGA
|
Protein sequence | MNASIFSKLP PEEITALHQS WKNDPLSVDP LWAAWFDGYE LGSGGAPREK ENAGNAADTS PYTVPPESAE RRGRVNQLIR AYRVMGHQCA RFNPLAPPDQ TGCPVNPEDM GFREEDMDQP VNIGTFMDGG TFTLREIITR LQKIYCGAIG FEYHHIDNLK IRSWIEEKIQ LRANGVDYGP EVRREALFHL CKAELFEEFL GKRFMGEKRF SLEGGEGAIV LLDAMIKRCP AAGVSHIEMG MAHRGRLNVL ANILHKPLKT IFREFTPDYL PESPIGRSDV KYHLGYAATR HVDGKELHIR LSSNPSHLEA VYPVVEGRAR AMQHNLQDAE RKRVLPLVLH GDAAFSGQGI VAEVLNLSLL KGYRTGGTVH LVINNQIGFT TSPDEARSSR YATDVAQMLQ SPILHINGES PEDLVWAADF ALQFRQEFGR DIILDMYCYR RLGHNETDQA AFTAPMQTKR IEARPTAAAL YGTALRKRGE LTEQQERDIR NDLWEGMEQA YLQMKENPAD YILPATAQDA DETPLPRTST RTGISPELFQ RVGSILTELP DSFTPHPTLE KRFLARRREA FREGGLLDWA MAESLAWGSL LTENHTVRLS GQDCQRGTFS QRHAVLHDFN DGSLYTPLEK LNHGTTAFRI YNSSLSEASV LGFEYGYALE SPDALVMWEA QFGDFSNGAQ VIVDQFIAAA EAKWNQKNRM VLLLPHGYEG AGSEHSSARM ERYLQLCADD NMQVINPTTP AQYFHALRRQ VHRNVHKPLI IFTPKSLLSR PEAVSPHREF LASTRFREVL PDPDTPAPDQ VTRAVFCTGK IYYDLAAYRR ERNISDTVII RLEQIYPLAQ EQLTYLLAPY QKVRDFVWCQ EEPSNMGAWG HLRNRLGRLF ATSFRYAGRP SMACPAEGAK ALHAAAQKRL IAAAFGPRAQ S
|
| |