Gene Information |
Locus tag | Amuc_1914 |
Symbol | |
ID | 6275373 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 2322961 |
End bp | 2324121 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 642613974 |
Product | restriction modification system DNA specificity domain |
Protein accession | YP_001878508 |
Protein GI | 187736396 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
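
The coordinate and length fields in this record can be cross-checked against each other with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2322961
END_BP = 2324121
GENE_LENGTH_BP = 1161
PROTEIN_LENGTH_AA = 386


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon.

    Assumes an unspliced CDS whose length is a multiple of three.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold for Amuc_1914: the span covers 1161 bp, which corresponds to 387 codons, or 386 amino acids plus one stop codon.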
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 0.00381757 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATGAGA AATCGTTGAT TCCGTCTATC CGCTTTGCCG GATTTACTGA CGCATGGGAA CGGCGTAAGC TGGGGGATTT AGCGGAGTTT AGAAGAGGGC TAACCTATTC ACCAAGAGAT ATATCAACAT CTGGAATCAG GGTATTGCGC TCGTCAAATA TAGATGAGGA TTCTTTCGTT TTAGCAGAGG ATGATGTTTA TGTAAAAGAG ACGGCTGTGT GCATCCCGCT TGTTGAAAAA GGCGACATTT TAATTACCGC AGCTAATGGC TCAAGCAGAT TAGTCGGAAA GCATGCTTTG ATTATTGACG ATAAGGGTAA AATGGTACAC GGCGGGTTCA TGCTGCTCGC GCATCCGTAT ACGCATTCTG CTTTCGTTAA TGCTCTTATG CATGCACCCT GGTACTCATC GTTTATCCGC ACTAACGTTG CTGGAGGAAA TGGAGCTATA GGAAATCTGA ATAAAAGCGA TTTGGAAGAA CAAGATATTG CGGCGACCTC TGAGCAAGAG CAAGAAAGAA TCGGTTCCTT GTTTGCCTCC CTCGACCATC TCATCACCCT TCATCAGCGT AAGTATGAAA AGCTCCTTAA CATCAAAAAA TCGATGTTGG ACAAAATGTT CCCGAAAAAT GGTGAGCTTT TCCCCGAAGT TCGCTTTGCC GGATTTACTG ACGCATGGGA ACGGCAGAAG CTGGGGGATT TGGTAGAGTC TGTTCCGTTT AAGCAGTATA TAGCATCACC TGAACCTGAC GGAAAATTCG AAATTATCCA ACAAGGAAGT GAGCCTATTA TTGGATATGG AAACGGAATC CCTTGTGAAG ATTATGCAAA GATAACGATT TTCGGAGACC ATACAGTTTC AATCTACAAA CCACAAAAGC CCTTTTTTGT AGCCACTGAT GGCACAAGAC TCCTTACAGC AAGAGTTCTA GATGGAGATT TTTTTTATTT CCTCTTGGAG CGATACAAAC CAATCCCTGA AGGATATAAG CGGCATTACA CGATATTGAT TGAAAGGTAT GGATGTTTTC CTTCCCATCG AGAGCAAAAG TTAATTGCCA TATTTTTTAG GAACATCGAC CACCTCATCA CCCTTCATCA GCGTAAGTTG GAAAAACTGC AAAACATCAA GAAAGCCTGT CTGGAAAAAA TGTTTGTTTA A
|
Protein sequence | MNEKSLIPSI RFAGFTDAWE RRKLGDLAEF RRGLTYSPRD ISTSGIRVLR SSNIDEDSFV LAEDDVYVKE TAVCIPLVEK GDILITAANG SSRLVGKHAL IIDDKGKMVH GGFMLLAHPY THSAFVNALM HAPWYSSFIR TNVAGGNGAI GNLNKSDLEE QDIAATSEQE QERIGSLFAS LDHLITLHQR KYEKLLNIKK SMLDKMFPKN GELFPEVRFA GFTDAWERQK LGDLVESVPF KQYIASPEPD GKFEIIQQGS EPIIGYGNGI PCEDYAKITI FGDHTVSIYK PQKPFFVATD GTRLLTARVL DGDFFYFLLE RYKPIPEGYK RHYTILIERY GCFPSHREQK LIAIFFRNID HLITLHQRKL EKLQNIKKAC LEKMFV
|
| |