Gene Amuc_1040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1040 
Symbol 
ID6274073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1238246 
End bp1242442 
Gene Length4197 bp 
Protein Length1398 aa 
Translation table11 
GC content58% 
IMG OID642613089 
ProductDNA-directed RNA polymerase subunit beta' 
Protein accessionYP_001877647 
Protein GI187735535 
COG category[K] Transcription 
COG ID[COG0086] DNA-directed RNA polymerase, beta' subunit/160 kD subunit 
TIGRFAM ID[TIGR02386] DNA-directed RNA polymerase, beta' subunit, predominant form 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.11734 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGACA CCCCCACCAT CAGGGAAATG CACGGCCTGA GCGACAAGCC CCGGACCTTT 
GACCAGGTTG CCATCACCGT GGCGGATCCG GATACCATCC GCAGCTGGTC ATTCGGTGAA
GTCGTCAACC CGGAAACCAT CAACTACCGC ACGTTCAAGC CGGAAAAAGG CGGCCTGTTC
TGCGAACGCA TCTTCGGGCC CACCCGAGAC ATGGAATGCG CCTGCGGCAA GTACAAGCGC
ATCAAGCATA AGGGCATCAC CTGCGACCGC TGCGGCGTGG AAGTAACCAA CGCGCGCGTG
CGCCGCGAAC GAATGGGCCA TATTGAACTG GCCGTTCCGG TTTCCCATAT CTGGTTTTAC
AAATGCATGC CCAGCCGCAT TGGCCTTATG CTGGACATGA CAGCCCGCCA TTTGGAACGC
GTGATTTACT ATGAAGACTA CATCGTGGTA GATCCCGGCA GTACCCCTCT GGAAAAGGGG
GCCATCCTGA CGGAAGAAGA ATTCCGCAAT GCGGAAGACG AATACGGCTA TGACAGCTTT
GAAGCCGGCA TGGGCGCGGA AGCCATCCAG AAAATGCTGG CGGCCATTGA TCTGCCCACC
CTCGTCGCGG ATCTTCAGGA ACAGCTGGAC AATACCAACT CCAAACAGAA CAAGCGCAAG
ATTGCCAAGC GTCTGAAACT GGCCCAGGGG TTCCTTCAGT CCAACACACG CCCGGAATGG
ATGATTTTGA ACGTTCTGCC CGTCATTCCT CCGGACCTGC GCCCGCTGGT TCCTCTGGAA
GGCGGCCGTT TCGCAACGTC CGACCTGAAT GACCTGTACC GCCGCGTCAT CAACCGCAAC
AACCGCCTGA AAACCCTCCT GAGCCTCAAA ACTCCAGAAG TCATCATCCG TAATGAAAAA
CGCATGCTTC AGGAAGCCGT GGATGCCCTG TTTGACAACG GCCGCCACGG CCGTGCCGTC
ACCGGCGCCG GCAACCGCCC CCTCAAATCC CTCTCCGACA TGCTGAAGGG CAAGGGAGGC
CGTTTCCGCC AGAACCTGCT CGGCAAGCGC GTGGACTACT CCGGCCGCTC CGTTATCGTC
ATCGGCCCGG AATTGAAACT CAACCAGTGC GGTCTTCCCA AGAAGATGGC GCTCATTCTG
TTTGAACCCT TCATCATCCA CCGTCTGAAA GAGCTGGGTT ACGTGCACAC GGTGCGCTCC
GCCAAGAAGC TCATTGACCG CAAGACGCCG GAAGTGTGGG ATATTCTGGA AGAAGTGACC
AAGGGCCACC CGGTCATGCT CAACCGCGCG CCCACCCTGC ACCGCCTCTC CATCCAGGCT
TTTGAACCGG TTCTGATTGA AGGTTCCGCC ATCCGTCTGC ACCCGCTCGT CTGTAATGCG
TACAACGCGG ACTTCGACGG CGACCAGATG GCTGTGCACG TGCCTCTGTC CGTGGAAGCG
CAGATGGAAG CCCGGCAGCT CATGCTGGCG CCCAACAATA TTTTCTCCCC TGCTTCCGGC
AAGCCCATTG CCACACCCAC GCAGGACATC ATTCTGGGCG CGTACTTCCT GACGCATACC
CGTGCTGCGG AAGTACAGAA CAATCAGGAT AATCATCACC ATCTTCCCCT CTTCGAATCC
ATTGACGAGG TGGAATACGC CATTGCCGCC CGCAAAATCG GCTACCATGA CTGGATCCGC
CTGCACAACC CGGACTACGG CAAAAAGCCT TCCGAAGTAG TGTATGGGGA TGTCACCAAG
AAGGTTATCA TCACTACTGC CGGACGCGTG CGTTTCAATG AAATCTGGCC CCGGGAACTC
GGTTACATTA ACCGCAACGT AGGCAAGAAA CAGATGGGCG ACATCATCTG GCGCTGCTAC
CAGACCGTCG GCAAGGAACG TACCGTGCAG ACTCTGGACG CCCTGAAAAA CCTGGGCTTC
AAGGAAGCAA CCCGTTCCGG CTGCTCCATC GGCATCGTGG ACATGGTGGT TCCCTCCCAG
AAAAAGACGG AAATTGAAAA AGCCTATGCG GAGCTGGACA AGGTGACCCG CCAGTATAAG
AACGGTATTA TCACGGATGG GGAACGCTAC CAGAAGGTGG TGGACATCTG GACCCAGACT
ACGGATGTCA TCCAGGCGGC TCTGTACCGC AAGCTGGAAC ACAACGAAGG CTCCAAGATG
GCCAGCCCGC TCTTCATGAT GGTGGACTCC GGAGCCCGAG GCAACAAGGC GCAGATCAAG
CAGCTCTCCG GCATGCGCGG TTTGATGGCG AAACCCAGCG GCGAAATTAT CGAACGCCCC
ATCACGGCCA ACTTCCGTGA AGGCCTTTCC GTGCTGGAAT ACTTCATCTC CACCCACGGC
GCCCGCAAGG GTCTGGCAGA TACCGCGCTG AAAACGGCGG ACTCCGGCTA CATGACCCGC
AAACTCGTGG ACGTGGCCCA GGATGTCATC GTCCATGCGG AAGATTGCGG CACCAGCAAC
GGCATCACCG TTCACGCCAT CTATGACGGC GACGAAGAAG TGGCGTCCCT TTCCTCCCGT
ATCTACGGCC GGACTTCCTG CGAACGCATC GTTGACCCCG TCAGCGGCGA GGTTATCGTA
GACATCAACG ACCTCATTAA CGAAAAGCAG GCGGAACAAC TGGAAAAAAT CGGCATTGAA
CGGCTGAAAA TCCGCTCCGT ACTCACCTGC GAACTCAAAA AGGGCTGCTG TGCCAAGTGC
TACGGCCTGA ACCTGGCCAC CGGACAGGAA GTGAAGATCG GGGAAGCGGT CGGCATTATT
GCCGCCCAGT CCATCGGCGA ACCCGGCACG CAGCTCACCA TGCGTACGTT CCACGTGGGC
GGAACGGCTA CCACGGCGTT CAAGCAGCCC ATCGTGAAAG CCAAGAACGA CGGCCGCGTC
ATCTACACGG AAGATCTCCG CACGGTGGAA AACGCAGACG GCAACTTCGT CGTCCTGAAT
AAAAACTGCT CTGTCCGCAT CGAAAACGAA CAGGGCCGCG AACTGGAATC CTACCAGCCC
GTCATCGGCA CCATCCTGTA CGTGCCCAAC GGCGGCACTA TCAAGAAGGA TGAAACCCTC
GCCACCTGGG ATCCGTACAA TGTGCCCGTG ATTGCAGAAA AGGGCGGCAT CGTCGAATTC
AAGGATATGA TCGTCGGCAT CACCGTTTCC AAGGAAACGG ACCGGGAAAC CGGTGCCTCC
TCCCTTGTCG TGATGGAACA CAAGCAGGAA CTTCACCCGC AAGTGGTCAT CCGCGATGCC
AAGACCCGCG AAGTTCTGGC TCATCATGCC ATTCCCGCAG GCGCCAACCT CACTGTGAAG
GACGGAGAAA CCATCTCCGC CGGCACAATG GTGGCCAAGA CGCCCCGCAA GGTAGCCAAG
ACGAAGGACA TCACCGGCGG TCTGCCCCGC GTGGCGGAAT TGTTCGAAGC CCGCAAGCCC
AAGGACGCCT GCACCATTGC ACGCGTGGAA GGCATTGTGC GCCTCAGCAG CAAGAATACT
TCCCGCGGCA AGAAGGTCAT TACCATTGAA ACACCCACGG GCGAACTGGT GGACCATCTG
GTCCCGATGA ACAAGCACGT CATCGTTCAT GAAGACGACC ACGTGCATCT GGGCGACCAG
CTTACGGAAG GCCCCGTTTC TCCGGAAGAA ATTCTGGATG TCTGCGGCAA GGAACGTCTC
CAGGAACACC TCGTTAACGA AGTTCAGGAA GTGTACCGCC TCCAGGGGGT GGAAATCAAC
GACAAGCATG TGGAAATCAT CGTGCGCCAG ATGCTCCGCA AGGTAGTCAT CACGGAACCC
GGAAATACCG AATTCCTGTG GGGAGACCAA GTGGACAAGA CCACGTTCGA CCGCATCAAT
GAACAAACCG TAGCCCAGGG CGGCCAACCG GCCGCAGCCA AGCCCGTTCT GCTCGGTATC
ACGAAGGCCT CCCTGGAAAC GGAATCCTTC ATTTCCGCGG CATCTTTCCA GGATACCACA
CGCGTTCTGA CGGAAGCATC CACCCTCGGC AAGACCGATA CTCTGGAAGG CTTCAAGGAA
AACGTCATCA TGGGCCACCT CATTCCCGCC GGCACCGGAT TCTCCCGTTA CAGCAAGATT
GAAGTGGAAC CTGCAGAGGG CGCAGAAGAA ATCGCGGCGG CCAGCGAAGA AGAGGAAGCG
GCGGAACTTG CCGAAGACAT GTTGAACGAT ACCATCAACT TCGACAACGA ACGCTAA
 
Protein sequence
MSDTPTIREM HGLSDKPRTF DQVAITVADP DTIRSWSFGE VVNPETINYR TFKPEKGGLF 
CERIFGPTRD MECACGKYKR IKHKGITCDR CGVEVTNARV RRERMGHIEL AVPVSHIWFY
KCMPSRIGLM LDMTARHLER VIYYEDYIVV DPGSTPLEKG AILTEEEFRN AEDEYGYDSF
EAGMGAEAIQ KMLAAIDLPT LVADLQEQLD NTNSKQNKRK IAKRLKLAQG FLQSNTRPEW
MILNVLPVIP PDLRPLVPLE GGRFATSDLN DLYRRVINRN NRLKTLLSLK TPEVIIRNEK
RMLQEAVDAL FDNGRHGRAV TGAGNRPLKS LSDMLKGKGG RFRQNLLGKR VDYSGRSVIV
IGPELKLNQC GLPKKMALIL FEPFIIHRLK ELGYVHTVRS AKKLIDRKTP EVWDILEEVT
KGHPVMLNRA PTLHRLSIQA FEPVLIEGSA IRLHPLVCNA YNADFDGDQM AVHVPLSVEA
QMEARQLMLA PNNIFSPASG KPIATPTQDI ILGAYFLTHT RAAEVQNNQD NHHHLPLFES
IDEVEYAIAA RKIGYHDWIR LHNPDYGKKP SEVVYGDVTK KVIITTAGRV RFNEIWPREL
GYINRNVGKK QMGDIIWRCY QTVGKERTVQ TLDALKNLGF KEATRSGCSI GIVDMVVPSQ
KKTEIEKAYA ELDKVTRQYK NGIITDGERY QKVVDIWTQT TDVIQAALYR KLEHNEGSKM
ASPLFMMVDS GARGNKAQIK QLSGMRGLMA KPSGEIIERP ITANFREGLS VLEYFISTHG
ARKGLADTAL KTADSGYMTR KLVDVAQDVI VHAEDCGTSN GITVHAIYDG DEEVASLSSR
IYGRTSCERI VDPVSGEVIV DINDLINEKQ AEQLEKIGIE RLKIRSVLTC ELKKGCCAKC
YGLNLATGQE VKIGEAVGII AAQSIGEPGT QLTMRTFHVG GTATTAFKQP IVKAKNDGRV
IYTEDLRTVE NADGNFVVLN KNCSVRIENE QGRELESYQP VIGTILYVPN GGTIKKDETL
ATWDPYNVPV IAEKGGIVEF KDMIVGITVS KETDRETGAS SLVVMEHKQE LHPQVVIRDA
KTREVLAHHA IPAGANLTVK DGETISAGTM VAKTPRKVAK TKDITGGLPR VAELFEARKP
KDACTIARVE GIVRLSSKNT SRGKKVITIE TPTGELVDHL VPMNKHVIVH EDDHVHLGDQ
LTEGPVSPEE ILDVCGKERL QEHLVNEVQE VYRLQGVEIN DKHVEIIVRQ MLRKVVITEP
GNTEFLWGDQ VDKTTFDRIN EQTVAQGGQP AAAKPVLLGI TKASLETESF ISAASFQDTT
RVLTEASTLG KTDTLEGFKE NVIMGHLIPA GTGFSRYSKI EVEPAEGAEE IAAASEEEEA
AELAEDMLND TINFDNER