Gene Amuc_1041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1041 
SymbolrpoB 
ID6274072 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1242515 
End bp1246453 
Gene Length3939 bp 
Protein Length1312 aa 
Translation table11 
GC content57% 
IMG OID642613090 
ProductDNA-directed RNA polymerase subunit beta 
Protein accessionYP_001877648 
Protein GI187735536 
COG category[K] Transcription 
COG ID[COG0085] DNA-directed RNA polymerase, beta subunit/140 kD subunit 
TIGRFAM ID[TIGR02013] DNA-directed RNA polymerase, beta subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAAAGC GACTCTACTT CGGGAATATC AAGGAAGTCA TCGAACCTCC GAACCTTATT 
GAGATTCAAC TTCAATCATA CTTCGATTTT CTACAACAGG ACACGCCGGC CGCGGCCAGG
AAAAACATCG GGTTACAAGG CGTTCTGAAG GAAATCTTTC CTATCAAGAG CTATGACGAA
AACATTGAGC TCGACTTCGT CTCCTATGAC ATTGAACAGC CGAAGATGAG CGATTACGAG
GCCATCCGCG CGGGGGAAAC CTACAGCGCC GCCCTTCAGG TCACCTTCAA GCTCAAATCC
GACAACGAGT CCAAGGAAGA AACGGTCTAT ATGGGAGAAC TCCCTATGAT GACCAACCGC
GGCACCTTTG TCATTAACGG CGCCGAACGC GTCATTGTGT CCCAGCTGCA CCGTTCCCCG
GGCATTTGCT TTGAAAGCGC CCAGCATCTG AACGGAAAAC TGCTCCACTC CTTCCGCATC
ATCCCGGACC GGGGCTCCTG GCTGGAGGTC CAGTTTGACA CCAACGACCT GCTTTACGTT
TACCTTGACC GCCGCCGCCG CCGCCGCAAA TTCCTTGCGA CGACGTTTAT GCGCTACCTG
GGCTTCAAAA CGGACCGTGA CATCGTCAGC CAGTTCTACA ACATCCGCAC GCTTCCCCTG
AGCGAAGACA TGACGGAAGA AGATCTGCAC AACCTTGTGG CTGTGGACAC GATCAAGGAC
AAGGACCTGG TTCTTGCCAA GGCTTTCGAA CAGCTCAATA TGGGTGTGGT TCGCCAGCTC
CTCCAGTTCG GCATCAAGAA AATCGACGTC ATTGACCAGA GCGAGGACGA CGTGCTGATC
AAAACCCTGA AGAAGGATCC CGCTCACGAC GAAGAATCCG CCCTCAAGGA AATCTACAAA
CGCCTGCGTC CCGGAGATCC CGCTACTGCC GCTCAGGCAC GCACCCTGCT TAAGAGGCTT
TTTGACGATC CCAAGAAATA CGACCTCACC CGCGTGGGGC GCTACAAAAT CAACCAGAAA
CTCGGGCTGG ACACCAGCCT GGACCAGCGC CTGATGACGG CGGAAGATTT CCTGGCCGCC
CTCAAATACC TCCTGCGCCT CAAGAAGGGC GAAGGCATGG TGGACGACAT CGACCACCTG
GGCAGCCGCC GCGTACGTGC CGTAGGCGAA CTCATGGCCA ATCAATGCCG CGTAGGCCTT
GCCCGCACCG AGCGCTTGGT CAAGGAACGC ATGACCCTCA TTGACCAGAA CATTGAAGGC
GTCACGCCCA GCAAACTCAT CAATCCGAAG GCCCTCAGCG CCGTCGTGCG CGACTTCTTC
GGCCGCTCCC AGCTCTCCCA ATTCATGGAC CAGATCAACC CCCTCGCGGA ATTGACGCAC
AAGCGCCGTC TTTCCGCCCT GGGGCCCGGC GGCCTGAACC GTGACCGGGC CGGATTTGAA
GTCCGGGACG TACACCCCTC CCACTACGGG CGCATTTGCC CCATTGAAAC GCCGGAAGGT
CCCAACATCG GCCTGATCAA CTCCATGTGT ACCTACGCCC GCATCAATGA ATTCGGATTC
ATTGAAACAC CTTACCGCAA GGTGGAAAAC GGCAGGGTCA CCAATACCAT CGAATACGTC
ACCGCCGATC AGGAAGAAGG CTATCTCATC GCCCAGGCCA ATAACCCGCT GGACGAACAG
GGCAACTTCA CGACCTCCCG CGTAACCGCC CGTGAAAAAG GCGAATTCAT CGAAGTGGAT
CCGGCAGACG TGCATTACAT GGACGTTTCT CCCAAGCAGC TCGTTTCTAT CGCCGCCGGT
CTTATCCCCT TCCTGGAACA TGACGACGCC AACCGCGCCC TGATGGGCTC CAACATGCAG
CGCCAGGGCG TGCCTCTGAT GGTAGCGGAA TCTCCATACG TGGGAACCGG CATCGAAGGA
AAGTGCGCCA GGGACTCCCG TTCCGTCGTT CTGGCGGAAG CGGACGGCAT CGTGGCCTCC
GCCACGGCTG AAGTCATTAT CACCACGAAA GACGGGGAAC TGCCCGTGCG TCCGGAAGTA
TTCCTGTCCG ATCCCGACAG TGTGCGCACG GACCGCGACA ATGGCGTTTA CGTCTATCCC
CTGCGCAAGT TCATGCGTTC CAATGCTGGA ACCTGCATCA ACCAGCGTCC GATCGTGCGC
CGCGGCGACA AAATCAAAAC CGGCGACGTG CTGGCGGACG GCCCGAACAC GGACCAGGGC
GAACTGGCCC TCGGCCGCAA TGTTCTGGTG GCCTATATGC CGTGGAACGG TTACAACTTC
GAAGACGCTA TTGTCATCTC CGAAAAAACC GTGAAGGAAG ACACCTTCAC TTCCATTCAT
ATCTCCGAGT TTGAAGTTCA GGCCCGTGAC ACCAAGCTGG GTCCGGAAGA AATCACCCGT
GATATCCCGA ACGCCGGTGA TGAAGCCCTG AAAAACCTGG ATCATGACGG CGTCATCCGC
ATCGGCGCGG AAGTGAAGCC CGGCGACATC CTCGTCGGCA AAATCACCCC CAAATCGGAA
ACGGAACTGG CTCCGGAAGA ACGTCTCCTC CGCGCCATTT TCGGTGAAAA GGCCGCAGAA
GTGAAAGATA CCTCACTCCG TGTCCCCTCC GGCTGCACCG GCATCGTGAT GGATGTGCGC
ATTTCTTCCA CAGGCTCCGG CCACCACCGC GGCGACCTCG TCGTAGACAG CGCGGAAAAG
AAAAAGCAGT TCAAGAAAAT CAACGACGAA CATAAAAAGA AGAAGGAGCA GCTCATTGAC
CAGTTGACCA AGAAGCTCTC CGATATTCTT CTGGGTGAAA AAATCCCGCT GGACGTCGTC
AATGAACAGA CCGGTGAAAT CATCATCCCG GCCAACAGGA AAATCACCAA GACCCTGTTG
CGTAAACTGG CCCTGGTTCA CGACCACATC GAAATCGAAC CCAGCCCGAT TCGCAACAAG
ATTCTGGAAA TCATCACTTC GTTTGAAGGC CGCTTTACGG AACTGGATGA CGAACGGGAA
CACAGGCTGG ATCAGATGGA ATCCGGGGAC GAATCCGAAC CCGGCGGACT GAAGGAAGTT
AAAGTATATA TCGCCGCCAA GCGCAAGCTC GGCGTGGGTG ACAAGATGGC CGGTCGCCAC
GGCAACAAAG GCGTCGTTGC CAAAATCGTC CCGGAACAGG ATATGCCCTT CCTCGCGGAC
GGTACTCCGG TGGATATCGT TCTGAACCCC TTGGGCGTGC CTTCCCGAAT GAATGTGGGG
CAGGTGCTTG AAGCCCACCT CGGCATCGCT GCCAGGGCCC TTGGCTTCAA GGTGGCTACC
CCGGTGTTCG ACGGGATCAG TGAAGAAACC ATCTGGAATT ACATGTCCGA AGCCAAGAAG
GTGGACGGTT TCACCTGGAT CGGTGACGGC AAGGACGGCA CCGTGGGAGG GAAGAGTACC
CTTTATGACG GCCTGACCGG CGAACCTTTC CATAACCCGG TGGTGGTAGG CCAGACCTAC
ATGCTCAAAC TGAACCACCT GGTGGCGGAC AAGATTCACG CCCGCGCCGT GGGTCCGTAC
AGCCTGGTCA CGCAGCAGCC TCTGGGCGGC AAGGCCCAAT ACGGCGGTCA GCGTTTCGGG
GAAATGGAAG TGTGGGCACT GGAAGCCTAT GGCGCCGCCT ATACCCTCCA GGAACTTCTC
ACCGTCAAGT CCGACGACGT TCAGGGCCGT ACCCGCATTT ACGAATCCAT CGTGAAGGGG
GATAATACCC TGGAAGCCGG AACTCCGGAA TCTTTCAACG TTCTGATGAA GGAAATGCAG
TCCCTGGGGC TGAACGTACG CCCCGGCAGC AAGGATGAAC AACCCTCCCT GCAACTCGGC
GGCACGGATC TCACTCCCGT GGACGGCATG ACGGAAGGCT TTGACAGCGA CGACATGGCC
GGCCTGGCGG ACGTCGACTT CTCCGACCTC AAATTCTAA
 
Protein sequence
MSKRLYFGNI KEVIEPPNLI EIQLQSYFDF LQQDTPAAAR KNIGLQGVLK EIFPIKSYDE 
NIELDFVSYD IEQPKMSDYE AIRAGETYSA ALQVTFKLKS DNESKEETVY MGELPMMTNR
GTFVINGAER VIVSQLHRSP GICFESAQHL NGKLLHSFRI IPDRGSWLEV QFDTNDLLYV
YLDRRRRRRK FLATTFMRYL GFKTDRDIVS QFYNIRTLPL SEDMTEEDLH NLVAVDTIKD
KDLVLAKAFE QLNMGVVRQL LQFGIKKIDV IDQSEDDVLI KTLKKDPAHD EESALKEIYK
RLRPGDPATA AQARTLLKRL FDDPKKYDLT RVGRYKINQK LGLDTSLDQR LMTAEDFLAA
LKYLLRLKKG EGMVDDIDHL GSRRVRAVGE LMANQCRVGL ARTERLVKER MTLIDQNIEG
VTPSKLINPK ALSAVVRDFF GRSQLSQFMD QINPLAELTH KRRLSALGPG GLNRDRAGFE
VRDVHPSHYG RICPIETPEG PNIGLINSMC TYARINEFGF IETPYRKVEN GRVTNTIEYV
TADQEEGYLI AQANNPLDEQ GNFTTSRVTA REKGEFIEVD PADVHYMDVS PKQLVSIAAG
LIPFLEHDDA NRALMGSNMQ RQGVPLMVAE SPYVGTGIEG KCARDSRSVV LAEADGIVAS
ATAEVIITTK DGELPVRPEV FLSDPDSVRT DRDNGVYVYP LRKFMRSNAG TCINQRPIVR
RGDKIKTGDV LADGPNTDQG ELALGRNVLV AYMPWNGYNF EDAIVISEKT VKEDTFTSIH
ISEFEVQARD TKLGPEEITR DIPNAGDEAL KNLDHDGVIR IGAEVKPGDI LVGKITPKSE
TELAPEERLL RAIFGEKAAE VKDTSLRVPS GCTGIVMDVR ISSTGSGHHR GDLVVDSAEK
KKQFKKINDE HKKKKEQLID QLTKKLSDIL LGEKIPLDVV NEQTGEIIIP ANRKITKTLL
RKLALVHDHI EIEPSPIRNK ILEIITSFEG RFTELDDERE HRLDQMESGD ESEPGGLKEV
KVYIAAKRKL GVGDKMAGRH GNKGVVAKIV PEQDMPFLAD GTPVDIVLNP LGVPSRMNVG
QVLEAHLGIA ARALGFKVAT PVFDGISEET IWNYMSEAKK VDGFTWIGDG KDGTVGGKST
LYDGLTGEPF HNPVVVGQTY MLKLNHLVAD KIHARAVGPY SLVTQQPLGG KAQYGGQRFG
EMEVWALEAY GAAYTLQELL TVKSDDVQGR TRIYESIVKG DNTLEAGTPE SFNVLMKEMQ
SLGLNVRPGS KDEQPSLQLG GTDLTPVDGM TEGFDSDDMA GLADVDFSDL KF