Gene Information |
Locus tag | Amuc_0623 |
Symbol | |
ID | 6274203 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 731325 |
End bp | 732635 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 642612674 |
Product | glycosyl hydrolase BNR repeat-containing protein |
Protein accession | YP_001877241 |
Protein GI | 187735129 |
COG category | |
COG ID | |
TIGRFAM ID | |
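The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS length includes the stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if includes_stop else codons


# Values taken from the record for Amuc_0623.
length_bp = gene_length(731325, 732635)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values in this record, the result agrees with the listed Gene Length (1311 bp) and Protein Length (436 aa).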
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.815723 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAATCTGC ACCTATCATC CCTGGCAGCG TTGCTCCTGG CATCATACCT TCCGGCGCAG GCCACCGTAC CGGCCCATTC CCCTTCCACA GCGTTCATCC GGAGCGGCCT CCCGATCGTT GATCTGGATC AATGGACAGA AGCCCAGGTA GTAGTCGACA AGGAAAAAGG AAAATACCTG GGACATCCAA CGACGCTATT GCTGAAAGAC GGAAAAACCA TTCTGTGCGT TTATCCTAAA GGGCACGGCT CCGGGGAAAT CATCCTGAAG AAATCCACGG ACGGAGGCAA AACATGGAGC GAAAGGCTGC CGGTTCCAGA ATCATGGAAA ACCAGCAGGG AAGTACCCAC ACTATATGAA ACGGAAGACT CCCGGGGCAA GCGCCGCATT CTGCTTTTCA GCGGCATTCA GGGGGGAAAC AGAAACACAG CCCCCAGAAA CCGGATGGCG GTCAGTGAAG ACAACGGAAC AACATGGTCC GAGCTGACCC CCATTCCCAA CCAGGTCGGA GGCATTGTTG TCATGAGTGA CCTGATTCCT CTAAAAACGG GAAAAGGGCA TTATATGGCC TCCTACCATG CCAATGCCCG GGGCAAAGAC GGACATGGAG AGTTTCACAC CATTGAACAA TATGTCACCT TTACGGAAGA CGGTGGGCTG ACCTGGACCT CCCCCCAGGT CATTTTCCCA GGGACAAGGG ACATGCACCT GTGTGAGGGA GGTTTTGTCC GCAGCCCGGA CGGAAAAACA ATCGCGCTGC TGTTACGGGA AAACAGCAGG CACCATAACT CCCAGATCAT GTTTTCCGAA GATGAAGGTA AAACATGGAC TCCCCCCAAA GAACTGCCGG CAGCCCTGTG CGGAGACCGC CACCAGATTC TTCCCCTGCC TGACGGAAGG CTTCTGGTTC AATTCAGGGA TGCTCCCCCG ACCAGGAAGA AAGGGCAGGC CGCCAGCCCG ACGGAAGGAG ACTGGGTAGC ATGGATAGGC CGGTGGGAAG ACCTGAAAAA CGGCACGGAA GGCTCATATA AAATCCGTTT TAAGGACAAC CGCAACGGTT GGGACTGCGC CTACCCGGCC GCCGAACTAT TGCCGGACCA TACCCTGGTA TGCACTACCT ACGGACACTT TGACAAAGGG GAATTGCCAT ACATCCTCTC CGTCAGATTT AAAATCAGCG ATACGGACAA GATGGTCAAA CAATATGCGG GGAACAATCA CCCCAAGATC AAAAATGACA CAGGAGCGGG AGAAACCGTT TTTGACCCCA ATGAGCCGGA CTCCGTTAAT CGCCTTCTGA AACGTCCCTG A
Protein sequence | MNLHLSSLAA LLLASYLPAQ ATVPAHSPST AFIRSGLPIV DLDQWTEAQV VVDKEKGKYL GHPTTLLLKD GKTILCVYPK GHGSGEIILK KSTDGGKTWS ERLPVPESWK TSREVPTLYE TEDSRGKRRI LLFSGIQGGN RNTAPRNRMA VSEDNGTTWS ELTPIPNQVG GIVVMSDLIP LKTGKGHYMA SYHANARGKD GHGEFHTIEQ YVTFTEDGGL TWTSPQVIFP GTRDMHLCEG GFVRSPDGKT IALLLRENSR HHNSQIMFSE DEGKTWTPPK ELPAALCGDR HQILPLPDGR LLVQFRDAPP TRKKGQAASP TEGDWVAWIG RWEDLKNGTE GSYKIRFKDN RNGWDCAYPA AELLPDHTLV CTTYGHFDKG ELPYILSVRF KISDTDKMVK QYAGNNHPKI KNDTGAGETV FDPNEPDSVN RLLKRP
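The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the gene sequence relates to the protein sequence, the helpers below strip the spacing, translate codons, and compute GC fraction. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in permitted start codons), so a standard codon table is used here. The names `clean`, `translate`, and `gc_fraction` are illustrative, and only the first and last blocks of the gene sequence are used as examples.

```python
from itertools import product

# Standard codon assignments (shared by translation table 11), in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon or incomplete codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First two blocks and last three blocks of the gene sequence in this record.
print(translate("ATGAATCTGC ACCTATCATC"))    # start of the protein
print(translate("CGCCTTCTGA AACGTCCCTG A"))  # end of the protein, before TGA
```

The two fragments translate to `MNLHLS` and `RLLKRP`, matching the start and end of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would allow the reported GC content to be compared in the same way.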