Gene Amuc_1543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1543 
Symbol 
ID6273667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1854516 
End bp1856783 
Gene Length2268 bp 
Protein Length755 aa 
Translation table11 
GC content58% 
IMG OID642613602 
Productformate acetyltransferase 
Protein accessionYP_001878145 
Protein GI187736033 
COG category[C] Energy production and conversion 
COG ID[COG1882] Pyruvate-formate lyase 
TIGRFAM ID[TIGR01255] formate acetyltransferase 1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0294136 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAGCA TAGTAAAAGA CTTGAAGGAG TCCACCGCCC TTCCGCAGGA ATGGCAGGGA 
TTCAAGCCGG GAACATGGAC GGAGTCCATT GACGTACGGG ATTTCATCCA GCATAACTAC
ACGCCCTATT CCGGCAATGA AGAGTTTCTT TCCGGCCCCT CCCAGCGCAC GCTTCGTTTA
TGGGACGAGC TGAAAGTCCT GCTGAAACGG GAAATAGACA ACGGCGGCGT GCTGGATGCT
GATGAAAAAG TGGTATCCTC CATCACATCC CACAAGCCCG GCTACATTGA CAAGGAACTG
GAAGTGGTCG TAGGCCTCCA GACGGACGCC CCCCTGAAAA GGGCCCTGAT GCCCTTTGGC
GGCTTGCGCA TGGCCCAACA GGCCCTGGAA TCCTACGGGT TCAAGATGTG CGAGAAAACC
GCGGACATTT TCAAAAAGAT ACGCAAGACG CACAATGAAG GCGTTTTCGA CGCCTACACT
TCCGATATCC GGGCGGCCCG TTCCGCCGGG ATTATTACCG GCCTTCCGGA CGCCTATGGC
CGCGGCCGCA TCATCGGGGA CTACCGCCGG GTGGCCCTGT ACGGCACGGA TAAGCTGATT
GCCGAACGCC GCAAGGACCT CAAAAACCGT GAACATTCCC CCCTTACGGA TGAACTGATC
CGTCTGCGCG AAGAAATGAG CGAACAAATA CGCGCCCTGG AAGAACTCGC CCAGCTGGGC
GCCTCCTACG GCTGCGACCT CACTCGCCCC GCCGCAAACG CCCGCGAAGC CGTGCAATGG
ACGTACCTGG GCTACCTGGC TGCCGTGAAA GAACAGAATG GAGCGGCCAT GTCCCTGGGC
CGCGTTTCCA CTTTCTTCGA CATTTATTTC ACCCGCGACC TGGAACAGGG GCTCATCACG
GAAGAGGAAG TTCAGGAAAT CATCGACCAG TTCGTGATGA AGCTGCGCAT CGTCCGTTTC
ATCCGCACGC CTGATTACAA TAACCTGTTC TCCGGAGACC CCACCTGGGT GACGGAATCC
ATCGGCGGCA TGGGAGAGGA CGGGCGCACC CTCGTCACCC GCAGTTCTTT CCGCATGCTG
CAGACCCTGT ACAACCTGGG GCCGGCTCCG GAACCGAACC TGACCGTTCT GTGGTCCCGG
AACCTGCCGG AAGCCTTCAA AAGCTTCTGC GCCAAGGTTT CCATTGAAAC ATCCTCCGTT
CAATACGAGA ACGATGACCT GATGCGCCCA CATTGGGGCG ACGACTACGG CATCGCATGC
TGCGTGTCCG CCATGCGCAT CGGCAAGCAG ATGCAGTTCT TCGGAGCGCG CGCCAACCTC
GCCAAGTGCC TTCTCTACGC CCTGAACGGC GGCGTGGATG AACTCAAGGG CAAGCAGGTG
GCCCCGCCCT CCCCCCGCTA CACGGAGGAA ATTCTCAACT ATGATGAAGT GATGACCCTG
TACGACAAGA TGCAGGACTG GCTCGCCAAA ACTTACATTG ACGCGCTGAA CATCATCCAC
TACATGCACG ACAAATACTG CTATGAACGC ATTGAAATGG CGCTGCATGA TCCGGAAATT
CTGCGCACGA TGGCTACGGG AATCGCCGGG CTTTCCGTGG CGGCGGACTC CCTGTCCGCC
ATCAAGTATG CCACCGTAAA AGCCATCCGC AATGAAGAAG GGCTGATTGT GGACTTCAAG
ACGGAAGGGG AATTCCCCTG TTACGGGAAC AATGACCCGC GCGTGGACGA CATAGCATGC
AGCCTGGTCA GCAATTTCAT GGAAAAACTG CGCAGGCTCC ACACTTACCG CAATTCCCTG
CCCACCCAGT CCATCCTGAC GATCACCTCC AACGTGGTGT ACGGCAAAAA GACGGGCAAC
ACGCCGGACG GCCGCCGAGC CGGGGAACCG TTCGCCCCCG GAGCCAACCC CATGCACGGA
AGGGACAGGA ACGGAGCCGT GGCCTCCATG CTCTCCGTAG CCAAGCTGTC CTATGACGAC
TCCCTGGACG GCATCTCCTA CACCTTCTCC ATCGTTCCTC AGGCCCTGGG CAAGGAGGAA
CGTGAACGTC GCGTCAAGCT CGTCTCCCTG CTGGACGCCT ACTTTGCCGC TACAGGTCAC
CACATTAATG TGAACGTACT GGAACGGGAA ACGCTTCTCG ACGCCATGGA TCACCCGGAA
AAATACCCGC AGCTTACCAT CCGCGTTTCC GGCTATGCCG TGAATTTCAT CAAGCTGACC
CGGGAACAGC AGCAGGAGGT CATCAACCGC ACTTTCCACA CCCGTTAA
 
Protein sequence
MISIVKDLKE STALPQEWQG FKPGTWTESI DVRDFIQHNY TPYSGNEEFL SGPSQRTLRL 
WDELKVLLKR EIDNGGVLDA DEKVVSSITS HKPGYIDKEL EVVVGLQTDA PLKRALMPFG
GLRMAQQALE SYGFKMCEKT ADIFKKIRKT HNEGVFDAYT SDIRAARSAG IITGLPDAYG
RGRIIGDYRR VALYGTDKLI AERRKDLKNR EHSPLTDELI RLREEMSEQI RALEELAQLG
ASYGCDLTRP AANAREAVQW TYLGYLAAVK EQNGAAMSLG RVSTFFDIYF TRDLEQGLIT
EEEVQEIIDQ FVMKLRIVRF IRTPDYNNLF SGDPTWVTES IGGMGEDGRT LVTRSSFRML
QTLYNLGPAP EPNLTVLWSR NLPEAFKSFC AKVSIETSSV QYENDDLMRP HWGDDYGIAC
CVSAMRIGKQ MQFFGARANL AKCLLYALNG GVDELKGKQV APPSPRYTEE ILNYDEVMTL
YDKMQDWLAK TYIDALNIIH YMHDKYCYER IEMALHDPEI LRTMATGIAG LSVAADSLSA
IKYATVKAIR NEEGLIVDFK TEGEFPCYGN NDPRVDDIAC SLVSNFMEKL RRLHTYRNSL
PTQSILTITS NVVYGKKTGN TPDGRRAGEP FAPGANPMHG RDRNGAVASM LSVAKLSYDD
SLDGISYTFS IVPQALGKEE RERRVKLVSL LDAYFAATGH HINVNVLERE TLLDAMDHPE
KYPQLTIRVS GYAVNFIKLT REQQQEVINR TFHTR