Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1514 |
Symbol | |
ID | 6275751 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 1806589 |
End bp | 1808571 |
Gene Length | 1983 bp |
Protein Length | 660 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642613573 |
Product | hypothetical protein |
Protein accession | YP_001878116 |
Protein GI | 187736004 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3064] Membrane protein involved in colicin uptake |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.767692 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.00000248553 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCAGAAAT GGGAAAGATT CTTTCCCAAT CCCGATTACA TGAACATTCC CTCTTTTTCC CGTAGCGTTT CCAGAGGTTC CGCCGTGGGC TGGTTTTTAG TGTTGCTGCT GGTTTGCGGT GCCGGAGCCG GATATTATCT TTACCAGGAT AACCTGGCAA AAAGAAAAGC GGCCCAGGAA TTGACCGCCG AACGTAAATT GAAGGAGAAG AAGGCCAGGG AAGCTGCGGA GAAACAGCGA ATAAAACGGG AACGGGAAAT CAGGGAGAAG AAGGAAAAGG AACGGCTGGC GGCCCAGAAG GCTTATGAGG AAGCGCAGGA AGAGAAGGCG CGGCAGGCGG CGGAAGCGGC CCGTAAGCTT CAGGAGCAGG CCGAACGCGA AGAACGGGAA AAAAGGAGAA GGGAGGAGCT GGAACGGCGC GAACGTGAGG AAGAGGCCCG CAGGCAGGAG GAAGATACTC CTGTGGAGGA AGAGCCTGAA CCGGAAGGGC GTTTTCCCCA ACCCGTGAAA AACCGGATGC CGGAGCTTTC CGTTTATTCT ATTCCTTGCA GGGATGATAT CCAGACGGAA AAAGACAAGC TGCTGGAAAC ATGGTCCTGG GACAAGGCGG AAAAAATGGA GGGTATGGAG GAATTCCCCA CGGGTTCCTC TCCGTGGAAA AAAGGGAAAG ACGCCGGACG CATGCAGGCG CTTCTGGAAA AATGCCGGGA ATGGAAGGAT GCCAAACTTG CCTCGCTGAA GGCGTGCCCG GCGGCGAAGG ATTTTCCCGG CGTTCCGGAG AATGGGGCCC AGACGGTCAG GCGGACGGTG GAGATAGATT CCAATATTGG AGGCTGGCAT AGTACGGGAC TGTATGCGCC GCCCGGGGCG GAAATTTCGT GTTCCCTGTC CGGCGCTCCC AAAGACGGTT CGATCAGCGT CCGCATAGGA TGCCATACGG ACAGCCTTCA TAAGCTGGAT GAATGGAAAA GAGTGCCGGA AATAACCATG CAGGTTCCGG CTGGCCGTGG GCGCGTGAAA ATGGTGAATC CGATGGGCGG CCTTGTTTAT GTGAATGTAG GCCAGCGTCC CAGACGGGGA AAGGTCTTCA AGGTTCAGAT TTCCGGAGCC GTGCCTTCAC CTCTGTTCGT AATGGGGAAG ACCACTCCGG AACAATGGGC CGAACAATTG GAAAATACCA AGGCCCCGTG GGGGGAAATC CGCATGCCCC GGCTTATTGT CACGATGCCC GTGGAACAGC TGAAACAATG TCCGGATGTT CAGAAGACGG CGGAATTTCT GCAAAAAAAC ATGGCTCTTC AGGACTGGAT TATGGGATGG GATACCAAGC CGGACCGCCT GCATCATCCG ATGCGCTTTG TCGTGGACAG GCAGATATCC GCCGGGGCCG GGCATTCCGG TTATCCCGCC ATGGCCACGA AGGACTGGAC GAATTCCATT GCCACCGGTT CCATCATCCA TTCCGGAAGC TGGGGTTTGT GGCATGAACT GGGGCATAAC CACCAATCCC CTCCCTTTAC GATGGAAGGC CAGACGGAGG TATCCGTCAA CATATTCTCC ATGGTGTGTG AAGTGATGGG GACTGGAAAA GACTTTGAAT CCTGCTGGGG CGGCGGCATG GGGCCGTACG GCATGAGCGC GGAAATGAAA AAATATTTTT CAGGCACCCA GACTTACAAT GAGGCTCCCA ACAAGGTGCA GCTCTTCTTC TGGGTGGAGC TGATGTACTA TCTGGGGTTT GACGCCTTCC GCCAGGTGGC TCTTCAATTC CATGACAAGC CTTATGACAA CGGCGAACTG AGTGATGAAA AGAAATGGGA ATGGGTCATG AACGCTTTTT CCAAGGTCAC GGGAAAGAAC ATGGGGCCTT TCTTTAAGAT TTGGCGTACG CCGGTTTCCG AACGCGCTAC GGGCAGAATG AAAGACCTTC CCGCCTGGCT TCCTTCCAAG GATTATCCGG CCTGTTATAC CGCAGAGGAA TAA
|
Protein sequence | MQKWERFFPN PDYMNIPSFS RSVSRGSAVG WFLVLLLVCG AGAGYYLYQD NLAKRKAAQE LTAERKLKEK KAREAAEKQR IKREREIREK KEKERLAAQK AYEEAQEEKA RQAAEAARKL QEQAEREERE KRRREELERR EREEEARRQE EDTPVEEEPE PEGRFPQPVK NRMPELSVYS IPCRDDIQTE KDKLLETWSW DKAEKMEGME EFPTGSSPWK KGKDAGRMQA LLEKCREWKD AKLASLKACP AAKDFPGVPE NGAQTVRRTV EIDSNIGGWH STGLYAPPGA EISCSLSGAP KDGSISVRIG CHTDSLHKLD EWKRVPEITM QVPAGRGRVK MVNPMGGLVY VNVGQRPRRG KVFKVQISGA VPSPLFVMGK TTPEQWAEQL ENTKAPWGEI RMPRLIVTMP VEQLKQCPDV QKTAEFLQKN MALQDWIMGW DTKPDRLHHP MRFVVDRQIS AGAGHSGYPA MATKDWTNSI ATGSIIHSGS WGLWHELGHN HQSPPFTMEG QTEVSVNIFS MVCEVMGTGK DFESCWGGGM GPYGMSAEMK KYFSGTQTYN EAPNKVQLFF WVELMYYLGF DAFRQVALQF HDKPYDNGEL SDEKKWEWVM NAFSKVTGKN MGPFFKIWRT PVSERATGRM KDLPAWLPSK DYPACYTAEE
|
| |