Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1659 |
Symbol | |
ID | 6274570 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 2006208 |
End bp | 2007308 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642613718 |
Product | integrase family protein |
Protein accession | YP_001878259 |
Protein GI | 187736147 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.0421076 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATGA AAAAAGCACC GAATACAAAC AAATTGAAAT ATGTCGGAGC CGCACTTCTG GAGGGGGAAT CAGTCACCCT GATCCAGGCT GCAAGGCTGG TATTAGAAAT CAAGGAAGCC CTTGGAGATG AAATCTGTAC CATTACCCGG TGCCGGGAAG TCGTCTCCCT GGGATTGAAC GCCATTAAAA ACAAACATCA AACCGTCAGT TTCGGCACGG CGGCCGTGGA ATGCCTGCGT TCCAAAAGCC ACCGCCGTCC CCGGACGCTG ACGGACATCA GGAGCATCAT CCACAAGCTT AAAAAGAGCA ATCCGGAACT GGAACACACC TCCCTGCGCA ACCTGAGCGT GGAGGAATGC CAGAACATCC TGATGAATAC CTTTACCACA TCCCGGCAGA GGCACAAAGC GAGGCTCATC TTGAGCGGAA TTTTCTCGTT CTCCGTCAAA CGCGGATGGT GTGACGAGAA TCCCATCCTC CGTGTAGACA CGCCTTTTCT GCAGGAGCAG GAAATCCCCG CTCTGACGCT GAAGGAAATA ACTCAGCTTC TCAAGGCGGC CATGGAAGAA TTTGACGGAA GCTGCGCAGC TGGAGCGGCG CTGATGATCT TTGCAGGAAT CCGCCCGCAG GAAGTGGAAC GCCTGCTCTG GGAAAACATC GCTCTCCGCG ACGGCTGTAT CATTCTGAAC TCCAAGCATA CCAAAACCGG AGGCGCCAGA CACGTCACCA TCCTGCCCGT GCTCGCCAAA TGGCTCAAAT TCTGCCGTGA CAGGACCAAA CCCGGCCCCG GAACTCCCAT CTGCCCGAAA GGGTGGACAA TCAAGTGGCG CAAAATCCGG AAAAAAGCCG GCTGGGGAGG AAGAAAAAAA TCATGGGTGC CGGACTGCCT GCGGCACACC TACGCCAGCT ACCACGCCAA GCACTTCAAG GACTACAACC TGTTGCAAAT GGAAATGGGG CACCGCTCCT CCTCCCTGCT CCGCACACGG TACTTGAACA TGAAAGGCAT CTCTCCGCAA ACGGCCACGC GCTTCTGGGC CCTGACGCCA GCCAAGGTCA TTGAAGAAAC GAAACCGCCG GAAGAACCGC CGGTCTCCTG A
|
Protein sequence | MNMKKAPNTN KLKYVGAALL EGESVTLIQA ARLVLEIKEA LGDEICTITR CREVVSLGLN AIKNKHQTVS FGTAAVECLR SKSHRRPRTL TDIRSIIHKL KKSNPELEHT SLRNLSVEEC QNILMNTFTT SRQRHKARLI LSGIFSFSVK RGWCDENPIL RVDTPFLQEQ EIPALTLKEI TQLLKAAMEE FDGSCAAGAA LMIFAGIRPQ EVERLLWENI ALRDGCIILN SKHTKTGGAR HVTILPVLAK WLKFCRDRTK PGPGTPICPK GWTIKWRKIR KKAGWGGRKK SWVPDCLRHT YASYHAKHFK DYNLLQMEMG HRSSSLLRTR YLNMKGISPQ TATRFWALTP AKVIEETKPP EEPPVS
|
| |