Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1378 |
Symbol | |
ID | 6275784 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 1642372 |
End bp | 1643622 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 642613434 |
Product | integrase family protein |
Protein accession | YP_001877983 |
Protein GI | 187735871 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.545263 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.00000000645491 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCTCCA TCTATAAAAA GCCGAACAGC CCTTACTGGT ACGCACAATA CCGCGTGAGA ACCGCTACAG GCTGGAAACT GGTCCGGCTG TCAACCAAAA TCAAGCATAC CCCCGCCACG GTAACAAGGG AAGTAAAAGA AGCCGCAGAG TCCATGGGGA AGCAGCTGAA CGTCCTGACC AGGGAACAGG CTATGACCAA GGCACAACGC CTGGCGGACG CCCTTGAATC AACGGCGCGG GCAAACCTGC CGGCCTATCA ATTACGCCGG GCCATTTCCG CATTGTCCAC GGAATTGACC GGAGAATCTA TGGAAATGCC CTCTGTCAAA TTATGGCTTG ATGACCACAT GCGGCGCATT ACGCGCAATG GGCTTAAACC CGCATCCATA GCGAACTACA AACAAGCCTT TGACAAATTT CGCGCCTCAA TGGGAGAACG TATCAACCTG CCTCTGGATC GCATTACTCC TCTGATGCTG GACGATTTCA AAAACCATCT TCTTTCCCGT GTCTCACCAT CTACCGCCAA TATTGCTCTT ACGCTGGTTT CCGCGGCGTT CCAGGCGGCA GTTGATTATA AAATTATTGA AACCAACCCC TTTACGGCGA TTACCAAGCC TCACAAGGGG AAAGCCGTCA AACGGCGGAA ATTCGAATTG GAAGAGCTTG AAAAGGTAAT GGCCGCATGC AATCCGGAAT GGCGCTCCAT GGTGAAAACG TGCCTCTATA CGGGCGGTCA AAGATTGGGA GACGTGGCAA CGCTCCGGTG GTCCCAGGTT GACGAGAAAC GAGGCGTTAT CCGGATGACC ACGCAGAAAA AGGGAAAGCC TCTGATGATT CCGATTTTTC CGGCGCTGAA AAAACACCTG CAGCAACGGA AGAAAGAAGC TCCTGGGGAC TTCCTGCATC CTGAATGCGC GAATATTTTT GAAAGCAAGG GATCCGGACG CCTGTCAAAT ATCTTTAGCC ACATCCTGTA CCAGTGTGGC CTTATTGCCA AAGACCCTCT GGCTGCAGGC AAAAAATACA AAAAGCAGGA AGGAAACGGC ACAGAGACGC GGCGCCACGT CAATGAATTG TCCTTCCACA GCCTCCGCTA TACGGCAACA ACCATGTTAC ATGACGCCGG TGTTCCCCCT GCTCTTGTGC AAGCCATTGT GGGGCACGAT TCCCGGGAAG TCCATGAAGG ATACATCGAC TTTGGAGCCA AGGAGTTTAC ACAAGCCCTT GAAAAGCTTC CCAAATTGTA G
|
Protein sequence | MASIYKKPNS PYWYAQYRVR TATGWKLVRL STKIKHTPAT VTREVKEAAE SMGKQLNVLT REQAMTKAQR LADALESTAR ANLPAYQLRR AISALSTELT GESMEMPSVK LWLDDHMRRI TRNGLKPASI ANYKQAFDKF RASMGERINL PLDRITPLML DDFKNHLLSR VSPSTANIAL TLVSAAFQAA VDYKIIETNP FTAITKPHKG KAVKRRKFEL EELEKVMAAC NPEWRSMVKT CLYTGGQRLG DVATLRWSQV DEKRGVIRMT TQKKGKPLMI PIFPALKKHL QQRKKEAPGD FLHPECANIF ESKGSGRLSN IFSHILYQCG LIAKDPLAAG KKYKKQEGNG TETRRHVNEL SFHSLRYTAT TMLHDAGVPP ALVQAIVGHD SREVHEGYID FGAKEFTQAL EKLPKL
|
| |