Gene Information |
Locus tag | Amuc_2070 |
Symbol | katE |
ID | 6274146 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2515550 |
End bp | 2517805 |
Gene Length | 2256 bp |
Protein Length | 751 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642614131 |
Product | hydroperoxidase II |
Protein accession | YP_001878660 |
Protein GI | 187736548 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0753] Catalase |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
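The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2515550
END_BP = 2517805
GENE_LENGTH_BP = 2256
PROTEIN_LENGTH_AA = 751


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS: codons minus one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the table: 2517805 - 2515550 + 1 = 2256 bp, and 2256 / 3 - 1 = 751 aa.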
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.192326 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAA AGAAACCGCC CATGGAAGAC AGCGCCGCCC CTTTTCCCAA TGGAAAAGAA GCTGCTCCCC ATTATACGGA TACAATTGAT CCGGAATTGA TCAAACCCAC TCCAAAACCT ACGCCGCCCA ATGCGGAACC TTCCGCGCCC GGTTCCATGA AGATGCCGGA TAACGCTACG GAAAAAATCA GGGAGCTGGA TGCCATGCGC TCCAACGGCA TGGGACAGGC CCTCACAAGC AACCTGGGAG TTAAAATATC CGATGATCAA AACACACTTA AAGCAGGAAG CAGAGGCCCC TCTCTGCTTG AAGACTTCCA CTTTCTGGAG AAAATGGCTC ATTTTGACCA GGAGCGGATA CCGGAACGCG TGGTTCACGC AAGAGGTTCC GGGGCACATG GTTATTTTCA GGTGTACAAA TCCCTTTCCA AGTACACCAA AGCGGCTTTT TTGCAGGATC CGGGAGAAAA AACCCCGGTC TTTGTGCGTT TTTCCACCGT GCAGGGCTTC AGAGGATCTC CGGATACAGT AAGGGATATC CGCGGTTGGG CTACCAAATT CTATACCAAG GAAGGCAACT ATGATCTGGT AGGCAATAAC ACGCCCGTCT TCTTCATTCA GGATGCCATC AAATTTCCGG ATTTCGTTCA TGCCGTCAAA CCGGAACCCC ACAATGAAAT GCCCCAGGGG CAGACGGCCC ATGACTCTTT CTGGGACTAC GTCTCCCTGC AGCCGGAAAC TCTGCATAAC GTCATGTGGG CCATGTCGGA CCGCGGCATC CCCAGAAGCT TCCGTACGAT GGAAGGGTTC GGCATCCATA CGTACAAACT GGTCAATGAA GAAGGAAAAA GCACTTTCGT CCGTTTCCAC TGGAAACCGG TGTACGGGAA AAAATCCCTG GTGTGGGATG AAGCCCAGGT ATTGACAGGA CGCGATCCTG ACTTCCACCG CAAGGATCTC TGGCAATCCA TTGAAGCCGG AGATTATCCG GAATACGAGC TGGGTCTGCA GCTCATCCCG GAAGAAGACG CAGACCAGCT CGACTTCGAT ATTCTGGACG CTACCAAGCT CATTCCGGAA GCTCTGGTTC CCGTGGAAAT CGTCGGGAAA ATGGTGCTGA ACCGCAACCC GGACAACTTT TTTGCGGAAA CGGAACAGGT AGCCTTCTGC CCCGCCAATA TCGTGCCGGG CATTGATTTT TCAGACGATC CGCTGCTCCA GGGCCGTATT TTCTCTTACA GCGATACCCA GCGGCACCGG CTGGGAGGAG CCAATTTCAC GGAAATTCCC ATCAACCGCC CCATTTGCCC CTTCCACAAC AACCAGAGGG ACGGCTTCCA CCGTATGCAG ATAGACGCCT CTCCGGCCAA CTATGATCCC AATTCCATCG GGAACAACTG GCCGAGAGAA ACGCCCCCGG AGAAAGGCGG CTTCACCACC AGCCCCCAGA CGGTAAGCGG TGTCAAGGAA CGTCTGCGGA ACCCCTCCTT CGCGGAATAC TATTCCCACC CCCGGTTGTT CTGGATGAGT CAAACTCCGG TGGAACAGGA GCACATTATC AACGCGTTCA GCTTTGAGCT GGGCAAGGTG ACGCGCCCCT ATATTCGGGA ACGCGTGGTG GACCTTCTGA CACGCATTGA TCCGGACCTG GCAAGCGGAG TGGCCCGCAA CCTGGGAATT CAACTCACCA GGGAACAGCT CAGCAGGGAA CTTCCCAGGC CCGTCTGCGG CCTGGAACAA GATCCGTCAT TGAGCCTGTA TGCCCATACG GACGGCAACC TCAAAGGCCT CCGCGTCTCT 
TTACTGGCAG CGGACGGCGT CAGCCTGAAA TCCGTGAAAG AAATCTGCGA GGCCCTGCAT GAAGAAGGCA TCCACCCCCA AATCATTGCC CCGCACATGG GAAGCGTAAC AACGGAAGAA GGGGAAGATC TGCCTGTTAA CGGGACTCTG TCCGGCACTC CTTCCGTCCT GTTTGACTCC GTCATCGTCC CGGAAGGAGA ACAAAGCATC GCAGCGCTCC TGAAAGACGG AGATGCCAAG TACCATTTGC GCCAGGCCTA CAGGCACCTG AAAGCCATCG GACTGCCCGG CAACGCCAAA GCCATGCTTG AGGCAGCCTC CCTGCCCCAG GATATGGATG ATGCCGGACT GCTCATGCCG AAGGACACCA AATCCCTGAT GCCCTCCTTC ATCACGGCCA TGAAACAGCA CCGCGTCTGG AGCCGCGAGC CTAAAACCCT TGATTTCGGC GCCTAG
|
Protein sequence | MKKKKPPMED SAAPFPNGKE AAPHYTDTID PELIKPTPKP TPPNAEPSAP GSMKMPDNAT EKIRELDAMR SNGMGQALTS NLGVKISDDQ NTLKAGSRGP SLLEDFHFLE KMAHFDQERI PERVVHARGS GAHGYFQVYK SLSKYTKAAF LQDPGEKTPV FVRFSTVQGF RGSPDTVRDI RGWATKFYTK EGNYDLVGNN TPVFFIQDAI KFPDFVHAVK PEPHNEMPQG QTAHDSFWDY VSLQPETLHN VMWAMSDRGI PRSFRTMEGF GIHTYKLVNE EGKSTFVRFH WKPVYGKKSL VWDEAQVLTG RDPDFHRKDL WQSIEAGDYP EYELGLQLIP EEDADQLDFD ILDATKLIPE ALVPVEIVGK MVLNRNPDNF FAETEQVAFC PANIVPGIDF SDDPLLQGRI FSYSDTQRHR LGGANFTEIP INRPICPFHN NQRDGFHRMQ IDASPANYDP NSIGNNWPRE TPPEKGGFTT SPQTVSGVKE RLRNPSFAEY YSHPRLFWMS QTPVEQEHII NAFSFELGKV TRPYIRERVV DLLTRIDPDL ASGVARNLGI QLTREQLSRE LPRPVCGLEQ DPSLSLYAHT DGNLKGLRVS LLAADGVSLK SVKEICEALH EEGIHPQIIA PHMGSVTTEE GEDLPVNGTL SGTPSVLFDS VIVPEGEQSI AALLKDGDAK YHLRQAYRHL KAIGLPGNAK AMLEAASLPQ DMDDAGLLMP KDTKSLMPSF ITAMKQHRVW SREPKTLDFG A
|
| |
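The relationship between the gene sequence and the protein sequence above can be sketched in a few lines. This is an illustrative example, not the method used to build the record: it assumes GC content is simply (G + C) / length, and it relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code for elongation. The names `translate` and `gc_fraction` are hypothetical helpers, and only the first 30 and last 6 bases of the gene are used to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# table 11 shares these assignments for elongation.
_BASES = "TCAG"
_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def clean(seq: str) -> str:
    """Drop the whitespace used to group bases in blocks of ten and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases (assumed definition of GC content)."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 6 bases of the gene sequence above.
GENE_PREFIX = "ATGAAAAAAA AGAAACCGCC CATGGAAGAC"
GENE_SUFFIX = "GCCTAG"

if __name__ == "__main__":
    print(translate(GENE_PREFIX))  # first ten residues
    print(translate(GENE_SUFFIX))  # last residue before the stop codon
    print(gc_fraction(GENE_PREFIX))
```

The prefix translates to MKKKKPPMED, matching the start of the protein sequence, and the suffix gives the final alanine followed by a TAG stop. Running `gc_fraction` on the full gene sequence would be the way to compare against the 55% reported in the table.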